Task-driven Layerwise Additive Activation Intervention

Hieu Trung Nguyen, Bao Nguyen, Binh Nguyen, Viet Anh Nguyen


Abstract
Modern language models (LMs) have significantly advanced generative modeling in natural language processing (NLP). Despite their success, LMs often struggle with adaptation to new contexts in real-time applications. A promising approach to task adaptation is activation intervention, which steers the LMs’ generation process by identifying and manipulating the activations. However, existing interventions rely heavily on heuristic rules or require many prompt inputs to determine effective interventions. In this paper, we propose a layer-wise additive activation intervention framework that optimizes the intervention process, thereby enhancing sample efficiency. We evaluate our framework on various datasets, demonstrating accuracy improvements over both pretrained LMs and competing intervention baselines.
Anthology ID:
2025.naacl-short.43
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
NAACL
Publisher:
Association for Computational Linguistics
Pages:
506–513
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.43/
Cite (ACL):
Hieu Trung Nguyen, Bao Nguyen, Binh Nguyen, and Viet Anh Nguyen. 2025. Task-driven Layerwise Additive Activation Intervention. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 506–513, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Task-driven Layerwise Additive Activation Intervention (Nguyen et al., NAACL 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.43.pdf