Building and Annotating a Large Comparable Corpus for Studying Semantic Quantification - Chinese, French, Japanese, Korean

Raoul Blin, Jinnam Choi, WU qishen, Yuxin Zhang, Soonhee Hwang, Takahiro Morita, Alexander Delaporte, Ilaine Wang, Chang Liu


Abstract
Quantifiers and noun quantification are well-studied topics in linguistics, but, to the best of our knowledge, there are still no dedicated multilingual resources for the study of quantification. To address this gap, we compiled a large multilingual comparable corpus (Chinese, French, Japanese, Korean) and propose to enrich it with both syntactic and “quantificational annotation” (semantic information relevant to the study of quantification). In this paper, we present both the corpus and the annotation project, and report on our initial attempt at quantificational annotation, the challenges encountered, and the linguistic observations drawn from it.
Anthology ID:
2026.lrec-main.532
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
6685–6694
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.532/
DOI:
Bibkey:
Cite (ACL):
Raoul Blin, Jinnam Choi, WU qishen, Yuxin Zhang, Soonhee Hwang, Takahiro Morita, Alexander Delaporte, Ilaine Wang, and Chang Liu. 2026. Building and Annotating a Large Comparable Corpus for Studying Semantic Quantification - Chinese, French, Japanese, Korean. International Conference on Language Resources and Evaluation, main:6685–6694.
Cite (Informal):
Building and Annotating a Large Comparable Corpus for Studying Semantic Quantification - Chinese, French, Japanese, Korean (Blin et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.532.pdf