Examining Large Language Models’ form-meaning mappings of information structure constructions in Mandarin Chinese

Shihui Li, Xiaojuan Tan, Jelke Bloem


Abstract
Construction Grammar (CxG) knowledge in language models has been extensively studied for English, but remains underexplored in other languages. In Mandarin Chinese, the ba (把, disposal) and bei (被, passive) constructions are widely used for managing information structure. They foreground topical elements (information structure) and encode systematic form-meaning mappings (CxG), particularly with respect to the semantic role of the object. We probe language models’ linguistic competence with these constructions using a new minimal-pair dataset of seven paradigms that target both syntactic constraints and verb–construction compatibility. Our results show that many models still struggle to capture the form-meaning mappings underlying the ba construction, although they achieve high accuracy on paradigms driven by surface syntactic cues.
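The minimal-pair probing mentioned in the abstract can be illustrated with a short sketch. It assumes the common convention that a model "passes" a pair when it assigns a higher score (for example, a sentence log-probability) to the acceptable sentence than to the unacceptable one; the abstract does not spell out the paper's exact scoring procedure. The function name `minimal_pair_accuracy`, the stub scorer `toy_score`, the example sentences, and the numeric scores are all hypothetical placeholders, not items or results from the paper's dataset.

```python
from typing import Callable, Iterable, Tuple

Pair = Tuple[str, str]  # (acceptable sentence, unacceptable sentence)


def minimal_pair_accuracy(pairs: Iterable[Pair],
                          score: Callable[[str], float]) -> float:
    """Fraction of pairs where the acceptable sentence gets the higher score.

    `score` is any function mapping a sentence to a number where higher
    means "more probable", e.g. a summed token log-probability.
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("at least one minimal pair is required")
    correct = sum(1 for good, bad in pairs if score(good) > score(bad))
    return correct / len(pairs)


# Hypothetical stand-in for a language model: made-up scores, used only so
# that the sketch runs without downloading a model.
TOY_SCORES = {
    "我把书看完了": -12.0,   # ba + resultative verb (illustrative acceptable item)
    "我把书喜欢了": -15.5,   # ba + stative verb (illustrative unacceptable item)
    "书被我看完了": -13.0,   # bei passive (illustrative acceptable item)
    "书被看完了我": -12.5,   # scrambled word order (illustrative unacceptable item)
}


def toy_score(sentence: str) -> float:
    return TOY_SCORES[sentence]


PAIRS = [
    ("我把书看完了", "我把书喜欢了"),
    ("书被我看完了", "书被看完了我"),
]

accuracy = minimal_pair_accuracy(PAIRS, toy_score)
print(f"accuracy = {accuracy:.2f}")
```

In practice `toy_score` would be replaced by a function that queries an actual language model, and accuracy would be reported separately for each paradigm rather than pooled.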
Anthology ID:
2026.conll-main.37
Volume:
Proceedings of the 30th Conference on Computational Natural Language Learning
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Claire Bonial, Yevgeni Berzak
Venues:
CoNLL | WS
Publisher:
Association for Computational Linguistics
Pages:
613–625
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.37/
Cite (ACL):
Shihui Li, Xiaojuan Tan, and Jelke Bloem. 2026. Examining Large Language Models’ form-meaning mappings of information structure constructions in Mandarin Chinese. In Proceedings of the 30th Conference on Computational Natural Language Learning, pages 613–625, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Examining Large Language Models’ form-meaning mappings of information structure constructions in Mandarin Chinese (Li et al., CoNLL 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.37.pdf