Improving Language Identification for Code-Switched Speech: The Pivotal Role of Accented English

Adyasha Patra; Dhiraj Kumar Sah; Preethi Jyothi

Improving Language Identification for Code-Switched Speech: The Pivotal Role of Accented English

Adyasha Patra, Dhiraj Kumar Sah, Preethi Jyothi

Abstract

Code-switching, where speakers alternate between languages within a single utterance, poses unique challenges for language identification (LID). Existing LID models often fail to reliably identify English spoken with the accent of the matrix (dominant) language. We show that finetuning LID models with small amounts of such accented English significantly improves code-switched LID, without degrading performance on standard monolingual speech—a limitation observed with direct finetuning on code-switched utterances. This is achieved via low-rank adaptation (LoRA) on limited accented data, which allows models to adapt efficiently. To better evaluate performance, we introduce LangRank, a metric that captures the relative ranking of identified languages often overlooked by traditional metrics. Our method generalizes across multiple language pairs, including Hindi-English, Bengali-English, Mandarin-English, and Arabic-English, providing robust LID in code-switched multilingual contexts.

Anthology ID:: 2026.findings-eacl.242
Volume:: Findings of the Association for Computational Linguistics: EACL 2026
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4643–4656
Language:
URL:: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.242/
DOI:
Bibkey:
Cite (ACL):: Adyasha Patra, Dhiraj Kumar Sah, and Preethi Jyothi. 2026. Improving Language Identification for Code-Switched Speech: The Pivotal Role of Accented English. In Findings of the Association for Computational Linguistics: EACL 2026, pages 4643–4656, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Improving Language Identification for Code-Switched Speech: The Pivotal Role of Accented English (Patra et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.242.pdf
Checklist:: 2026.findings-eacl.242.checklist.pdf

PDF Cite Search Checklist Fix data