Abstract
Machine Reading Comprehension (MRC) models differ from human beings mainly in their hunger for data and their lack of robustness to noise. To bridge this gap, in this paper we explore how to integrate the neural networks of MRC models with the general knowledge of human beings. On the one hand, we propose a data enrichment method, which uses WordNet to extract inter-word semantic connections as general knowledge from each given passage-question pair. On the other hand, we propose an end-to-end MRC model named Knowledge Aided Reader (KAR), which explicitly uses the extracted general knowledge to assist its attention mechanisms. Based on the data enrichment method, KAR is comparable in performance with the state-of-the-art MRC models, and significantly more robust to noise than they are. When only a subset (20%-80%) of the training examples is available, KAR outperforms the state-of-the-art MRC models by a large margin, and is still reasonably robust to noise.

- Anthology ID:
- P19-1219
- Volume:
- Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
- Month:
- July
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Anna Korhonen, David Traum, Lluís Màrquez
- Venue:
- ACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 2263–2272
- URL:
- https://aclanthology.org/P19-1219
- DOI:
- 10.18653/v1/P19-1219
- Cite (ACL):
- Chao Wang and Hui Jiang. 2019. Explicit Utilization of General Knowledge in Machine Reading Comprehension. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2263–2272, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Explicit Utilization of General Knowledge in Machine Reading Comprehension (Wang & Jiang, ACL 2019)
- PDF:
- https://preview.aclanthology.org/naacl24-info/P19-1219.pdf
- Data
- ConceptNet, SQuAD
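
The abstract describes extracting inter-word semantic connections from each passage-question pair using WordNet, but does not spell out the procedure. The following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that a word is connected to a passage word when a synset reachable from the first word within a bounded number of relation hops is also a synset of the second. That criterion, the `max_hops` parameter, all function names, and the tiny hand-written graph standing in for WordNet are hypothetical and chosen for illustration.

```python
from collections import deque

# Hypothetical toy stand-in for WordNet (NOT real WordNet data):
# word -> its synset ids, and synset id -> directly related synset ids.
WORD_SYNSETS = {
    "dog": {"dog.n.01"},
    "canine": {"canine.n.02"},
    "animal": {"animal.n.01"},
    "car": {"car.n.01"},
}
SYNSET_RELATIONS = {
    "dog.n.01": {"canine.n.02"},
    "canine.n.02": {"carnivore.n.01"},
    "carnivore.n.01": {"animal.n.01"},
    "car.n.01": {"motor_vehicle.n.01"},
}


def extended_synsets(word, max_hops):
    """Synsets of `word` plus every synset reachable within `max_hops` relation steps."""
    start = WORD_SYNSETS.get(word, set())
    seen = set(start)
    queue = deque((synset, 0) for synset in start)
    while queue:
        synset, hops = queue.popleft()
        if hops == max_hops:
            continue  # hop budget exhausted on this path
        for neighbour in SYNSET_RELATIONS.get(synset, ()):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, hops + 1))
    return seen


def is_connected(source, target, max_hops):
    """Assumed criterion: an extended synset of `source` is a synset of `target`."""
    return bool(extended_synsets(source, max_hops) & WORD_SYNSETS.get(target, set()))


def enrich(passage, question, max_hops=1):
    """Map each token position in passage + question to the passage positions it connects to."""
    tokens = passage + question
    connections = {}
    for i, word in enumerate(tokens):
        linked = [
            j for j, passage_word in enumerate(passage)
            if j != i and is_connected(word, passage_word, max_hops)
        ]
        if linked:
            connections[i] = linked
    return connections
```

In this sketch, `enrich(["dog", "animal", "car"], ["canine"], max_hops=3)` links both "dog" and the question word "canine" to the passage position of "animal", while a smaller hop budget yields fewer connections. A model such as the one described in the abstract could then consume position lists of this kind as extra input to its attention layers; how KAR actually does so is not stated here.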