Enhancing Multilingual Reasoning via Steerable Model Merging

Zhuoran Li, Rui Xu, Jian Yang, Junnan Liu, Zhijun Chen, Qianren Mao, Hongcheng Guo, Jiaheng Liu, Likang Xiao, Ming LI, Xiaojie Wang


Abstract
Model merging is an effective technique for composing the capabilities of a multilingual model and a reasoning model. It has achieved promising generalization in multilingual reasoning tasks by aligning feature spaces of different models. However, the merged single model often fails to address the conflicts between source models, leading to suboptimal performance. In other words, the one-size-fits-all merging strategy may not align with the characteristics of different inputs which may require prioritizing certain models over others. To this end, we propose a Steerable Model Merging (**ST-Merge**) framework to modulate the contribution of each source model. To realize this idea, we introduce a gated cross-attention mechanism to weight or filter the two attended source models in an adaptive manner. Extensive experiments demonstrate that ST-Merge consistently outperforms multiple strong baselines on four multilingual reasoning benchmarks across 21 different languages.
Anthology ID:
2026.findings-acl.1856
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
37266–37277
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1856/
DOI:
Bibkey:
Cite (ACL):
Zhuoran Li, Rui Xu, Jian Yang, Junnan Liu, Zhijun Chen, Qianren Mao, Hongcheng Guo, Jiaheng Liu, Likang Xiao, Ming LI, and Xiaojie Wang. 2026. Enhancing Multilingual Reasoning via Steerable Model Merging. In Findings of the Association for Computational Linguistics: ACL 2026, pages 37266–37277, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Enhancing Multilingual Reasoning via Steerable Model Merging (Li et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1856.pdf
Checklist:
 2026.findings-acl.1856.checklist.pdf