When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs
Zhongxiang Sun, Yi Zhan, Chenglei Shen, Weijie Yu, Xiao Zhang, Ming He, Jun Xu
Abstract
Personalized large language models (LLMs) adapt model behavior to individual users to enhance user satisfaction, yet personalization can inadvertently distort factual reasoning. We show that when personalized LLMs face factual queries, there exists a phenomenon where the model generates answers aligned with a user’s prior history rather than the objective truth, resulting in **personalization-induced hallucinations** that degrade factual reliability and may propagate incorrect beliefs, due to representational entanglement between personalization and factual representations. To address this issue, we propose **Factuality-Preserving Personalized Steering (FPPS)**, a lightweight inference-time approach that mitigates personalization-induced factual distortions while preserving personalized behavior. We further introduce **PFQABench**, the first benchmark designed to jointly evaluate factual and personalized question answering under personalization. Experiments across multiple LLM backbones and personalization methods show that FPPS substantially improves factual accuracy while maintaining personalized performance.- Anthology ID:
- 2026.findings-acl.395
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 8041–8060
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.395/
- DOI:
- Cite (ACL):
- Zhongxiang Sun, Yi Zhan, Chenglei Shen, Weijie Yu, Xiao Zhang, Ming He, and Jun Xu. 2026. When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 8041–8060, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs (Sun et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.395.pdf