QTUNA: A Corpus for Understanding How Speakers Use Quantification
Guanyi Chen, Kees van Deemter, Silvia Pagliaro, Louk Smalbil, Chenghua Lin
Abstract
A prominent strand of work in formal semantics investigates the ways in which human languages quantify over the elements of a set, as when we say “All A are B ”, “All except two A are B ”, “Only a few of the A are B ” and so on. Our aim is to build Natural Language Generation algorithms that mimic humans’ use of quantified expressions. To inform these algorithms, we conducted on a series of elicitation experiments in which human speakers were asked to perform a linguistic task that invites the use of quantified expressions. We discuss how these experiments were conducted and what corpora they gave rise to. We conduct an informal analysis of the corpora, and offer an initial assessment of the challenges that these corpora pose for Natural Language Generation. The dataset is available at: https://github.com/a-quei/qtuna.- Anthology ID:
- W19-8616
- Volume:
- Proceedings of the 12th International Conference on Natural Language Generation
- Month:
- October–November
- Year:
- 2019
- Address:
- Tokyo, Japan
- Venue:
- INLG
- SIG:
- SIGGEN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 124–129
- Language:
- URL:
- https://aclanthology.org/W19-8616
- DOI:
- 10.18653/v1/W19-8616
- Cite (ACL):
- Guanyi Chen, Kees van Deemter, Silvia Pagliaro, Louk Smalbil, and Chenghua Lin. 2019. QTUNA: A Corpus for Understanding How Speakers Use Quantification. In Proceedings of the 12th International Conference on Natural Language Generation, pages 124–129, Tokyo, Japan. Association for Computational Linguistics.
- Cite (Informal):
- QTUNA: A Corpus for Understanding How Speakers Use Quantification (Chen et al., INLG 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W19-8616.pdf
- Code
- a-quei/qtuna
- Data
- QTuna