Lecture: DURel Annotation Tool

DURel is an annotation tool for sentence pairs of a word. The annotations are used to form sense clusters of a word and to visualize them over time.
Our web application for DURel was already presented at conferences and tested. It is now being improved in regard to usability and utility for lexicographs and linguists.
DURel provides an annotation interface for sentence pairs of a word. Annotators are asked to judge the degree of semantic relatedness of pairs of word uses. One example would be the two uses of arm in (1) and (2) on a scale of 1 (unrelated) to 4 (identical).
(1) and taking a knife from her pocket, she opened a vein in her little arm, and dipping a feather in the blood, wrote something on a piece of white cloth, which was spread before her.
(2) It stood behind a high brick wall, its back windows overlooking an arm of the sea which, at low tide, was a black and stinking mud-flat
The annotated data of a word is then represented in a Word Usage Graph (WUG). Its nodes represent word uses and weights on edges represent the (median) semantic relatedness judgment of a pair of uses as e.g. (1) and (2).
The focus of our talk will be on improvements being made in DURel, so that researchers can utilize information from clustering to explain semantic change or annotate word usages.
Users may use the tool to manage their projects, assigning them to registered annotators. The user's annotation can be stopped at any point and the finished data can be downloaded. The system also allows to directly cluster and visualize the data over time as interactive WUGs. New functionalities include the ability to switch between clustering and dimension reduction methods. This could lead to better results depending on each annotator's area of focus.
A user study will be conducted with the goal of evaluating improvements in the tool's interface for linguistics researchers and lexicographs interested in word usage changes over time.

Info

Day: 2022-11-03
Start time: 14:15
Duration: 00:30
Room: Wiwi-Bunker —Room 5050
Track: Computational Linguistics
Language: en

Links:

Feedback

Click here to let us know how you liked this event.

Concurrent Events