Putting the Horse before the Cart: A Generator-Evaluator Framework for Question Generation from Text
Abstract
Automatic question generation (QG) is a useful yet challenging task in NLP. Recent neural network-based approaches represent the state-of-the-art in this task. In this work, we attempt to strengthen them significantly by adopting a holistic and novel generator-evaluator framework that directly optimizes objectives that reward semantics and structure. The generator is a sequence-to-sequence model that incorporates the structure and semantics of the question being generated. The generator predicts an answer in the passage that the question can pivot on. Employing the copy and coverage mechanisms, it also acknowledges other contextually important (and possibly rare) keywords in the passage that the question needs to conform to, while not redundantly repeating words. The evaluator model evaluates and assigns a reward to each predicted question based on its conformity to the structure of ground-truth questions. We propose two novel QG-specific reward functions for text conformity and answer conformity of the generated question. The evaluator also employs structure-sensitive rewards based on evaluation measures such as BLEU, GLEU, and ROUGE-L, which are suitable for QG. In contrast, most of the previous works only optimize the cross-entropy loss, which can induce inconsistencies between training (objective) and testing (evaluation) measures. Our evaluation shows that our approach significantly outperforms state-of-the-art systems on the widely-used SQuAD benchmark as per both automatic and human evaluation.- Anthology ID:
- K19-1076
- Volume:
- Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
- Month:
- November
- Year:
- 2019
- Address:
- Hong Kong, China
- Editors:
- Mohit Bansal, Aline Villavicencio
- Venue:
- CoNLL
- SIG:
- SIGNLL
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 812–821
- Language:
- URL:
- https://aclanthology.org/K19-1076
- DOI:
- 10.18653/v1/K19-1076
- Cite (ACL):
- Vishwajeet Kumar, Ganesh Ramakrishnan, and Yuan-Fang Li. 2019. Putting the Horse before the Cart: A Generator-Evaluator Framework for Question Generation from Text. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 812–821, Hong Kong, China. Association for Computational Linguistics.
- Cite (Informal):
- Putting the Horse before the Cart: A Generator-Evaluator Framework for Question Generation from Text (Kumar et al., CoNLL 2019)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/K19-1076.pdf