Abstract
Coreference resolution is the task of identifying all mentions in a text that refer to the same real-world entity. Collecting sufficient labelled data from expert annotators to train a high-performance coreference resolution system is time-consuming and expensive. Crowdsourcing makes it possible to obtain the required amounts of data rapidly and cost-effectively. However, crowd-sourced labels can be noisy. To ensure high-quality data, it is crucial to infer the correct labels by aggregating the noisy labels. In this paper, we split the aggregation into two subtasks, i.e, mention classification and coreference chain inference. Firstly, we predict the general class of each mention using an autoencoder, which incorporates contextual information about each mention, while at the same time taking into account the mention’s annotation complexity and annotators’ reliability at different levels. Secondly, to determine the coreference chain of each mention, we use weighted voting which takes into account the learned reliability in the first subtask. Experimental results demonstrate the effectiveness of our method in predicting the correct labels. We also illustrate our model’s interpretability through a comprehensive analysis of experimental results.- Anthology ID:
- 2020.coling-main.507
- Volume:
- Proceedings of the 28th International Conference on Computational Linguistics
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona, Spain (Online)
- Editors:
- Donia Scott, Nuria Bel, Chengqing Zong
- Venue:
- COLING
- SIG:
- Publisher:
- International Committee on Computational Linguistics
- Note:
- Pages:
- 5760–5773
- Language:
- URL:
- https://aclanthology.org/2020.coling-main.507
- DOI:
- 10.18653/v1/2020.coling-main.507
- Cite (ACL):
- Maolin Li, Hiroya Takamura, and Sophia Ananiadou. 2020. A Neural Model for Aggregating Coreference Annotation in Crowdsourcing. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5760–5773, Barcelona, Spain (Online). International Committee on Computational Linguistics.
- Cite (Informal):
- A Neural Model for Aggregating Coreference Annotation in Crowdsourcing (Li et al., COLING 2020)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/2020.coling-main.507.pdf