The Role of Outgoing Connection Heterogeneity in Feedforward Layers of Large Language Models

Felix Stahlberg, Shankar Kumar


Abstract
We report on investigations into the characteristics of outgoing connections in feedforward layers of large language models. Our findings show that inner neurons with diverse outgoing connection strengths are more critical to model performance than those with uniform connections. We propose a new fine-tuning loss that takes advantage of this observation by decreasing the outgoing connection entropy in feedforward layers. Using this loss yields gains over standard fine-tuning across two different model families (PaLM-2 and Gemma-2) for downstream tasks in math, coding, and language understanding. To further elucidate the role of outgoing connection heterogeneity, we develop a data-free structured pruning method, which uses entropy to identify and remove neurons. This method is considerably more effective than removing neurons either randomly or based on their magnitude.
Anthology ID:
2025.emnlp-main.1143
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
22487–22495
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1143/
DOI:
Bibkey:
Cite (ACL):
Felix Stahlberg and Shankar Kumar. 2025. The Role of Outgoing Connection Heterogeneity in Feedforward Layers of Large Language Models. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 22487–22495, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
The Role of Outgoing Connection Heterogeneity in Feedforward Layers of Large Language Models (Stahlberg & Kumar, EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1143.pdf
Checklist:
 2025.emnlp-main.1143.checklist.pdf