QTUNA: A Corpus for Understanding How Speakers Use Quantification

Guanyi Chen, Kees van Deemter, Silvia Pagliaro, Louk Smalbil, Chenghua Lin


Abstract
A prominent strand of work in formal semantics investigates the ways in which human languages quantify over the elements of a set, as when we say “All A are B ”, “All except two A are B ”, “Only a few of the A are B ” and so on. Our aim is to build Natural Language Generation algorithms that mimic humans’ use of quantified expressions. To inform these algorithms, we conducted on a series of elicitation experiments in which human speakers were asked to perform a linguistic task that invites the use of quantified expressions. We discuss how these experiments were conducted and what corpora they gave rise to. We conduct an informal analysis of the corpora, and offer an initial assessment of the challenges that these corpora pose for Natural Language Generation. The dataset is available at: https://github.com/a-quei/qtuna.
Anthology ID:
W19-8616
Volume:
Proceedings of the 12th International Conference on Natural Language Generation
Month:
October–November
Year:
2019
Address:
Tokyo, Japan
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Note:
Pages:
124–129
Language:
URL:
https://aclanthology.org/W19-8616
DOI:
10.18653/v1/W19-8616
Bibkey:
Cite (ACL):
Guanyi Chen, Kees van Deemter, Silvia Pagliaro, Louk Smalbil, and Chenghua Lin. 2019. QTUNA: A Corpus for Understanding How Speakers Use Quantification. In Proceedings of the 12th International Conference on Natural Language Generation, pages 124–129, Tokyo, Japan. Association for Computational Linguistics.
Cite (Informal):
QTUNA: A Corpus for Understanding How Speakers Use Quantification (Chen et al., INLG 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/W19-8616.pdf
Supplementary attachment:
 W19-8616.Supplementary_Attachment.pdf
Code
 a-quei/qtuna
Data
QTuna