The NarrativeQA Reading Comprehension Challenge

Tomáš Kočiský; Jonathan Schwarz; Phil Blunsom; Chris Dyer; Karl Moritz Hermann; Gábor Melis; Edward Grefenstette

doi:10.1162/tacl_a_00023

Abstract

Reading comprehension (RC)—in contrast to information retrieval—requires integrating information and reasoning about events, entities, and their relations across a full document. Question answering is conventionally used to assess RC ability, in both artificial agents and children learning to read. However, existing RC datasets and tasks are dominated by questions that can be solved by selecting answers using superficial information (e.g., local context similarity or global term frequency); they thus fail to test for the essential integrative aspect of RC. To encourage progress on deeper comprehension of language, we present a new dataset and set of tasks in which the reader must answer questions about stories by reading entire books or movie scripts. These tasks are designed so that successfully answering their questions requires understanding the underlying narrative rather than relying on shallow pattern matching or salience. We show that although humans solve the tasks easily, standard RC models struggle on the tasks presented here. We provide an analysis of the dataset and the challenges it presents.

Anthology ID: Q18-1023
Volume: Transactions of the Association for Computational Linguistics, Volume 6
Year: 2018
Address: Cambridge, MA
Editors: Lillian Lee, Mark Johnson, Kristina Toutanova, Brian Roark
Venue: TACL
Publisher: MIT Press
Pages: 317–328
URL: https://aclanthology.org/Q18-1023
DOI: 10.1162/tacl_a_00023
Cite (ACL): Tomáš Kočiský, Jonathan Schwarz, Phil Blunsom, Chris Dyer, Karl Moritz Hermann, Gábor Melis, and Edward Grefenstette. 2018. The NarrativeQA Reading Comprehension Challenge. Transactions of the Association for Computational Linguistics, 6:317–328.
Cite (Informal): The NarrativeQA Reading Comprehension Challenge (Kočiský et al., TACL 2018)
PDF: https://preview.aclanthology.org/naacl-24-ws-corrections/Q18-1023.pdf
Video: https://preview.aclanthology.org/naacl-24-ws-corrections/Q18-1023.mp4
Data: NarrativeQA, BookTest, CBT, Children's Book Test, MCTest, MS MARCO, NewsQA, SQuAD, SearchQA
