Aparna Vijayakumar
2021
Looking Beyond Sentence-Level Natural Language Inference for Question Answering and Text Summarization
Anshuman Mishra
|
Dhruvesh Patel
|
Aparna Vijayakumar
|
Xiang Lorraine Li
|
Pavan Kapanipathi
|
Kartik Talamadupula
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Natural Language Inference (NLI) has garnered significant attention in recent years; however, the promise of applying NLI breakthroughs to other downstream NLP tasks has remained unfulfilled. In this work, we use the multiple-choice reading comprehension (MCRC) and checking factual correctness of textual summarization (CFCS) tasks to investigate potential reasons for this. Our findings show that: (1) the relatively shorter length of premises in traditional NLI datasets is the primary challenge prohibiting usage in downstream applications (which do better with longer contexts); (2) this challenge can be addressed by automatically converting resource-rich reading comprehension datasets into longer-premise NLI datasets; and (3) models trained on the converted, longer-premise datasets outperform those trained using short-premise traditional NLI datasets on downstream tasks primarily due to the difference in premise lengths.
2020
Reading Comprehension as Natural Language Inference:A Semantic Analysis
Anshuman Mishra
|
Dhruvesh Patel
|
Aparna Vijayakumar
|
Xiang Li
|
Pavan Kapanipathi
|
Kartik Talamadupula
Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics
In the recent past, Natural language Inference (NLI) has gained significant attention, particularly given its promise for downstream NLP tasks. However, its true impact is limited and has not been well studied. Therefore, in this paper, we explore the utility of NLI for one of the most prominent downstream tasks, viz. Question Answering (QA). We transform one of the largest available MRC dataset (RACE) to an NLI form, and compare the performances of a state-of-the-art model (RoBERTa) on both these forms. We propose new characterizations of questions, and evaluate the performance of QA and NLI models on these categories. We highlight clear categories for which the model is able to perform better when the data is presented in a coherent entailment form, and a structured question-answer concatenation form, respectively.
Search
Co-authors
- Anshuman Mishra 2
- Dhruvesh Patel 2
- Kartik Talamadupula 2
- Pavan Kapanipathi 2
- Xiang Li (李翔) 1
- show all...