Code-switching in Context: Investigating the Role of Discourse Topic in Bilingual Speech Production

Debasmita Bhattacharya, Anxin Yi, Siying Ding, Julia Hirschberg


Abstract
Code-switching (CSW) in speech is motivated by conversational factors across levels of linguistic analysis. While we know much about why speakers code-switch, there remains great scope for exploring how CSW occurs in speech, particularly within the discourse-level linguistic context. We build on prior work by asking: how are patterns of CSW influenced by different conversational contexts spanning Academic, Cultural, Personal, and Professional discourse topics? To answer this, we annotate a Mandarin-English spontaneous speech corpus, and analyze its discourse topics alongside various aspects of CSW production. We show that discourse topics interact significantly with utterance-level CSW, resulting in distinctive patterns of CSW presence, richness, language direction, and syntax that are uniquely associated with different contexts. Our work is the first to take such a context-sensitive approach to studying CSW, contributing to a broader understanding of the discourse topics that motivate speakers to code-switch in diverse ways.
Anthology ID:
2025.codi-1.6
Volume:
Proceedings of the 6th Workshop on Computational Approaches to Discourse, Context and Document-Level Inferences (CODI 2025)
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Michael Strube, Chloe Braud, Christian Hardmeier, Junyi Jessy Li, Sharid Loaiciga, Amir Zeldes, Chuyuan Li
Venues:
CODI | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
64–80
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.codi-1.6/
DOI:
Bibkey:
Cite (ACL):
Debasmita Bhattacharya, Anxin Yi, Siying Ding, and Julia Hirschberg. 2025. Code-switching in Context: Investigating the Role of Discourse Topic in Bilingual Speech Production. In Proceedings of the 6th Workshop on Computational Approaches to Discourse, Context and Document-Level Inferences (CODI 2025), pages 64–80, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Code-switching in Context: Investigating the Role of Discourse Topic in Bilingual Speech Production (Bhattacharya et al., CODI 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.codi-1.6.pdf