Abstract
When processing a text, humans and machines must disambiguate between different uses of the pronoun it, including non-referential, nominal anaphoric or clause anaphoric ones. In this paper we use eye-tracking data to learn how humans perform this disambiguation and use this knowledge to improve the automatic classification of it. We show that by using gaze data and a POS-tagger we are able to significantly outperform a common baseline and classify between three categories of it with an accuracy comparable to that of linguistic-based approaches. In addition, the discriminatory power of specific gaze features informs the way humans process the pronoun, which, to the best of our knowledge, has not been explored using data from a natural reading task.- Anthology ID:
- D18-1528
- Volume:
- Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
- Month:
- October-November
- Year:
- 2018
- Address:
- Brussels, Belgium
- Editors:
- Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
- Venue:
- EMNLP
- SIG:
- SIGDAT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 4896–4901
- Language:
- URL:
- https://aclanthology.org/D18-1528
- DOI:
- 10.18653/v1/D18-1528
- Cite (ACL):
- Victoria Yaneva, Le An Ha, Richard Evans, and Ruslan Mitkov. 2018. Classifying Referential and Non-referential It Using Gaze. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4896–4901, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- Classifying Referential and Non-referential It Using Gaze (Yaneva et al., EMNLP 2018)
- PDF:
- https://preview.aclanthology.org/add_acl24_videos/D18-1528.pdf
- Code
- victoria-ianeva/It-Classification