Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS

Maitrey Mehta, Vivek Srikumar


Abstract
Good datasets are a foundation of NLP research, and form the basis for training and evaluating models of language use. While creating datasets, the standard practice is to verify the annotation consistency using a committee of human annotators. This norm assumes that multiple annotators are available, which is not the case for highly specialized tasks or low-resource languages. In this paper, we ask: Can we evaluate the quality of a dataset constructed by a single human annotator? To address this question, we propose four weak verifiers to help estimate dataset quality, and outline when each may be employed. We instantiate these strategies for the task of semantic analysis of adpositions in Gujarati, a low-resource language, and show that our weak verifiers concur with a double-annotation study. As an added contribution, we also release the first dataset with semantic annotations in Gujarati along with several model baselines.
Anthology ID:
2023.findings-acl.696
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10941–10958
Language:
URL:
https://aclanthology.org/2023.findings-acl.696
DOI:
10.18653/v1/2023.findings-acl.696
Bibkey:
Cite (ACL):
Maitrey Mehta and Vivek Srikumar. 2023. Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS. In Findings of the Association for Computational Linguistics: ACL 2023, pages 10941–10958, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS (Mehta & Srikumar, Findings 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/2023.findings-acl.696.pdf