A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

Yuval Ran-Milo; Hila Ofek; Shahar Mendel

A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

Yuval Ran-Milo, Hila Ofek, Shahar Mendel

Abstract

Transformers commonly exhibit an attention sink: disproportionately high attention to the first position. We study this behavior in GPT-2–style models with learned query biases and absolute positional embeddings. Combining structural analysis with causal interventions, validated across natural-language, mathematical, and code inputs, we find that the sink arises from the interaction among (i) a learned query bias, (ii) the first-layer MLP transformation of the positional encoding, and (iii) structure in the key projection. Crucially, each component we identify is individually dispensable: architectures omitting each of them robustly exhibit sinks. This indicates that attention sinks may arise through distinct circuits across architectures. These findings inform mitigation of sinks, and motivate broader investigation into why sinks emerges.

Anthology ID:: 2026.acl-short.9
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 90–98
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-short.9/
DOI:
Bibkey:
Cite (ACL):: Yuval Ran-Milo, Hila Ofek, and Shahar Mendel. 2026. A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 90–98, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation (Ran-Milo et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-short.9.pdf
Checklist:: 2026.acl-short.9.checklist.pdf

PDF Cite Search Checklist Fix data