Soft Head Selection for Injecting ICL-Derived Task Embeddings

Jungwon Park, Jimyeong Kim, Changin Choi, Wonjong Rhee


Abstract
Large language models (LLMs) are commonly adapted to downstream tasks using parameter-efficient fine-tuning (PEFT) or in-context learning (ICL). Recently, ICL-driven embedding-based adaptation has been proposed as a distinct task adaptation paradigm. It derives task-specific embeddings from intermediate activations using few-shot prompts and injects them during inference. Despite its conceptual appeal, this approach has not demonstrated consistent performance gains over PEFT or ICL, and its empirical advantages have been limited in practice. We propose Soft head-selection for ICL-derived Task Embeddings (SITE), a gradient-based method that identifies task-relevant attention heads to enable effective task embedding injection. Across various types of open-ended generation, reasoning, and natural language understanding tasks, SITE significantly outperforms prior embedding-based adaptation methods and few-shot ICL, while using substantially fewer trainable parameters than PEFT. Experiments on 12 LLMs ranging from 4B to 70B parameters demonstrate the generality of our approach, and intra-task and inter-task activation patching analyses further provide new mechanistic insights by revealing strong task dependence in attention head functionality.
Anthology ID:
2026.findings-acl.1355
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27161–27214
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1355/
DOI:
Bibkey:
Cite (ACL):
Jungwon Park, Jimyeong Kim, Changin Choi, and Wonjong Rhee. 2026. Soft Head Selection for Injecting ICL-Derived Task Embeddings. In Findings of the Association for Computational Linguistics: ACL 2026, pages 27161–27214, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Soft Head Selection for Injecting ICL-Derived Task Embeddings (Park et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1355.pdf
Checklist:
 2026.findings-acl.1355.checklist.pdf