Abstract
Good datasets are a foundation of NLP research, and form the basis for training and evaluating models of language use. While creating datasets, the standard practice is to verify the annotation consistency using a committee of human annotators. This norm assumes that multiple annotators are available, which is not the case for highly specialized tasks or low-resource languages. In this paper, we ask: Can we evaluate the quality of a dataset constructed by a single human annotator? To address this question, we propose four weak verifiers to help estimate dataset quality, and outline when each may be employed. We instantiate these strategies for the task of semantic analysis of adpositions in Gujarati, a low-resource language, and show that our weak verifiers concur with a double-annotation study. As an added contribution, we also release the first dataset with semantic annotations in Gujarati along with several model baselines.- Anthology ID:
- 2023.findings-acl.696
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2023
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 10941–10958
- Language:
- URL:
- https://aclanthology.org/2023.findings-acl.696
- DOI:
- 10.18653/v1/2023.findings-acl.696
- Cite (ACL):
- Maitrey Mehta and Vivek Srikumar. 2023. Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS. In Findings of the Association for Computational Linguistics: ACL 2023, pages 10941–10958, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS (Mehta & Srikumar, Findings 2023)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/2023.findings-acl.696.pdf