Code-Switching and Syntax: A Large-Scale Experiment

Igor Sterner, Simone Teufel


Abstract
The theoretical code-switching (CS) literature provides numerous pointwise investigations that aim to explain patterns in CS, i.e. why bilinguals switch language in certain positions in a sentence more often than in others. A resulting consensus is that CS can be explained by the syntax of the contributing languages. There is however no large-scale, multi-language, cross-phenomena experiment that tests this claim. When designing such an experiment, we need to make sure that the system that is predicting where bilinguals tend to switch has access only to syntactic information. We provide such an experiment here. Results show that syntax alone is sufficient for an automatic system to distinguish between sentences in minimal pairs of CS, to the same degree as bilingual humans. Furthermore, the learnt syntactic patterns generalise well to unseen language pairs.
Anthology ID:
2025.findings-acl.600
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
11526–11533
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.600/
DOI:
Bibkey:
Cite (ACL):
Igor Sterner and Simone Teufel. 2025. Code-Switching and Syntax: A Large-Scale Experiment. In Findings of the Association for Computational Linguistics: ACL 2025, pages 11526–11533, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Code-Switching and Syntax: A Large-Scale Experiment (Sterner & Teufel, Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.600.pdf