ding-01 :ARG0: An AMR Corpus for Spontaneous French Dialogue

Jeongwoo Kang, Maria Boritchev, Maximin Coavoux


Abstract
We present our work to build a French semantic corpus by annotating French dialogue in Abstract Meaning Representation (AMR).Specifically, we annotate the DinG corpus, consisting of transcripts of spontaneous French dialogues recorded during the board game Catan. As AMR has insufficient coverage of the dynamics of spontaneous speech, we extend the framework to better represent spontaneous speech and sentence structures specific to French. Additionally, to support consistent annotation, we provide an annotation guideline detailing these extensions. We publish our corpus under a free license (CC-SA-BY). We also train and evaluate an AMR parser on our data. This model can be used as an assistance annotation tool to provide initial annotations that can be refined by human annotators. Our work contributes to the development of semantic resources for French dialogue.
Anthology ID:
2025.iwcs-1.4
Volume:
Proceedings of the 16th International Conference on Computational Semantics
Month:
September
Year:
2025
Address:
Düsseldorf, Germany
Editors:
Kilian Evang, Laura Kallmeyer, Sylvain Pogodalla
Venues:
IWCS | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40–50
Language:
URL:
https://preview.aclanthology.org/iwcs-25-ingestion/2025.iwcs-1.4/
DOI:
Bibkey:
Cite (ACL):
Jeongwoo Kang, Maria Boritchev, and Maximin Coavoux. 2025. ding-01 :ARG0: An AMR Corpus for Spontaneous French Dialogue. In Proceedings of the 16th International Conference on Computational Semantics, pages 40–50, Düsseldorf, Germany. Association for Computational Linguistics.
Cite (Informal):
ding-01 :ARG0: An AMR Corpus for Spontaneous French Dialogue (Kang et al., IWCS 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/iwcs-25-ingestion/2025.iwcs-1.4.pdf