Details zu Publikationen

Annotation uncertainty in the context of grammatical change

verfasst von
Marie Luis Merten, Marcel Wever, Michaela Geierhos, Doris Tophinke, Eyke Hüllermeier
Abstract

This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

Externe Organisation(en)
Universität Zürich (UZH)
Universität der Bundeswehr München
Universität Paderborn
Ludwig-Maximilians-Universität München (LMU)
Typ
Artikel
Journal
International Journal of Corpus Linguistics
Band
28
Seiten
430-459
Anzahl der Seiten
30
ISSN
1384-6655
Publikationsdatum
19.07.2023
Publikationsstatus
Veröffentlicht
Peer-reviewed
Ja
ASJC Scopus Sachgebiete
Sprache und Linguistik, Linguistik und Sprache
Elektronische Version(en)
https://doi.org/10.1075/ijcl.20113.mer (Zugang: Offen)