Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?

Deokhyung Kang; Seonjeong Hwang; Daehui Kim; Hyounghun Kim; Gary Geunbae Lee

Abstract

Reasoning language models (RLMs) achieve strong performance on complex reasoning tasks, yet they still exhibit a multilingual reasoning gap, performing better in high-resource languages than in low-resource ones. While recent efforts have been made to address this gap, its underlying causes remain largely unexplored. In this work, we show that this gap primarily stems from failures in language understanding—specifically, the model’s inability to translate multilingual inputs into the language dominating its reasoning traces (typically English). As identifying understanding failures can enable targeted mitigation of the gap, we evaluate a range of detection methods and find that understanding failures are detectable to a meaningful extent, with supervised approaches performing best. Building on this, we propose Selective Translation, a strategy that incorporates an English translation into the initial reasoning trace when an understanding failure is detected. Experimental results using Qwen3-4B show that Selective Translation substantially bridges the multilingual reasoning gap, achieving near full-translation performance while translating only about 20% of inputs. Together, our results show that failures in language understanding are the primary driver of the multilingual reasoning gap and can be detected and selectively mitigated, clarifying its origin and suggesting a path toward more equitable multilingual reasoning.

Anthology ID: 2026.findings-acl.1586
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 31684–31716
URL: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1586/
Cite (ACL): Deokhyung Kang, Seonjeong Hwang, Daehui Kim, Hyounghun Kim, and Gary Lee. 2026. Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models? In Findings of the Association for Computational Linguistics: ACL 2026, pages 31684–31716, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models? (Kang et al., Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1586.pdf
Checklist: 2026.findings-acl.1586.checklist.pdf
