AHEAD: Attention Head Energy-Aware Dynamics for Hallucination Mitigation in MLLMs

Jiale Chang, Ying Li, Siliang Tang, Yueting Zhuang


Abstract
Multimodal large language models excel at vision-language tasks but remain prone to hallucinations that undermine their reliability. Existing approaches predominantly treat hallucinations as classification errors, overlooking the heterogeneous behaviors of attention heads and their dynamic influences during inference. We revisit MLLM reasoning from an energy perspective and identify that hallucinations stem from imbalances between visual potential and language prior potential: when visual information is ambiguous or language priors dominate, attention heads tend to be driven by linguistic statistical patterns, generating content inconsistent with visual evidence. We propose AHEAD, a framework that quantifies the energetic properties of each attention head during object generation through two potential networks—the Visual Grounding Potential Network and the Language Prior Potential Network—and dynamically adjusts their contributions at inference time. Specifically, we amplify attention heads with strong visual grounding capacity while suppressing those overly reliant on language priors. Experiments across multiple benchmarks demonstrate that AHEAD significantly reduces hallucination rates without fine-tuning the base MLLM while maintaining generation quality.
Anthology ID:
2026.findings-acl.425
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8728–8739
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.425/
DOI:
Bibkey:
Cite (ACL):
Jiale Chang, Ying Li, Siliang Tang, and Yueting Zhuang. 2026. AHEAD: Attention Head Energy-Aware Dynamics for Hallucination Mitigation in MLLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 8728–8739, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
AHEAD: Attention Head Energy-Aware Dynamics for Hallucination Mitigation in MLLMs (Chang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.425.pdf
Checklist:
 2026.findings-acl.425.checklist.pdf