Explicit Utilization of General Knowledge in Machine Reading Comprehension

Chao Wang, Hui Jiang


Abstract
To bridge the gap between Machine Reading Comprehension (MRC) models and human beings, which is mainly reflected in the hunger for data and the robustness to noise, in this paper, we explore how to integrate the neural networks of MRC models with the general knowledge of human beings. On the one hand, we propose a data enrichment method, which uses WordNet to extract inter-word semantic connections as general knowledge from each given passage-question pair. On the other hand, we propose an end-to-end MRC model named as Knowledge Aided Reader (KAR), which explicitly uses the above extracted general knowledge to assist its attention mechanisms. Based on the data enrichment method, KAR is comparable in performance with the state-of-the-art MRC models, and significantly more robust to noise than them. When only a subset (20%-80%) of the training examples are available, KAR outperforms the state-of-the-art MRC models by a large margin, and is still reasonably robust to noise.
Anthology ID:
P19-1219
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2019
Address:
Florence, Italy
Editors:
Anna Korhonen, David Traum, Lluís Màrquez
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2263–2272
Language:
URL:
https://aclanthology.org/P19-1219
DOI:
10.18653/v1/P19-1219
Bibkey:
Cite (ACL):
Chao Wang and Hui Jiang. 2019. Explicit Utilization of General Knowledge in Machine Reading Comprehension. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2263–2272, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Explicit Utilization of General Knowledge in Machine Reading Comprehension (Wang & Jiang, ACL 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/P19-1219.pdf
Data
ConceptNetSQuAD