Abstract
NLP models are imperfect and lack intricate capabilities that humans access automatically when processing speech or reading a text. Human language processing data can be leveraged to increase the performance of models and to pursue explanatory research for a better understanding of the differences between human and machine language processing. We review recent studies leveraging different types of cognitive processing signals, namely eye-tracking, M/EEG and fMRI data recorded during language understanding. We discuss the role of cognitive data for machine learning-based NLP methods and identify fundamental challenges for processing pipelines. Finally, we propose practical strategies for using these types of cognitive signals to enhance NLP models.- Anthology ID:
- 2020.lincr-1.3
- Volume:
- Proceedings of the Second Workshop on Linguistic and Neurocognitive Resources
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Venue:
- LiNCr
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 15–27
- Language:
- English
- URL:
- https://aclanthology.org/2020.lincr-1.3
- DOI:
- Cite (ACL):
- Nora Hollenstein, Maria Barrett, and Lisa Beinborn. 2020. Towards Best Practices for Leveraging Human Language Processing Signals for Natural Language Processing. In Proceedings of the Second Workshop on Linguistic and Neurocognitive Resources, pages 15–27, Marseille, France. European Language Resources Association.
- Cite (Informal):
- Towards Best Practices for Leveraging Human Language Processing Signals for Natural Language Processing (Hollenstein et al., LiNCr 2020)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2020.lincr-1.3.pdf