@inproceedings{tezuka-inoue-2025-transfer,
title = "The Transfer Neurons Hypothesis: An Underlying Mechanism for Language Latent Space Transitions in Multilingual {LLM}s",
author = "Tezuka, Hinata and
Inoue, Naoya",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/ingest-luhme/2025.emnlp-main.1618/",
doi = "10.18653/v1/2025.emnlp-main.1618",
pages = "31730--31780",
ISBN = "979-8-89176-332-6",
    abstract = "Recent studies have suggested a processing framework for multilingual inputs in decoder-based LLMs: early layers convert inputs into English-centric and language-agnostic representations; middle layers perform reasoning within an English-centric latent space; and final layers generate outputs by transforming these representations back into language-specific latent spaces. However, the internal dynamics of this transformation and its underlying mechanism remain underexplored. Towards a deeper understanding of this framework, we propose and empirically validate the Transfer Neurons Hypothesis: certain neurons in the MLP module are responsible for transferring representations between language-specific latent spaces and a shared semantic latent space. Furthermore, we show that one function of language-specific neurons, as identified in recent studies, is to facilitate movement between latent spaces. Finally, we show that transfer neurons are critical for reasoning in multilingual LLMs."
}

Markdown (Informal)
[The Transfer Neurons Hypothesis: An Underlying Mechanism for Language Latent Space Transitions in Multilingual LLMs](https://preview.aclanthology.org/ingest-luhme/2025.emnlp-main.1618/) (Tezuka & Inoue, EMNLP 2025)