Uncovering Strategic Egoism Behaviors in Large Language Models
Yaoyuan Zhang, Zonghao Ying, Aishan Liu, Jian Yang, Tianlin Li, Yaodong Yang, Xianglong Liu
Abstract
Large language models (LLMs) exhibit growing safety and alignment risks, hindering their deployment in high-stakes decision-making scenarios. In this paper, we identify a previously underexplored risk: similar to humans, LLMs can exhibit egoistic decision-making, in which they pursue short-term self-benefits through improper means while disregarding collective welfare and ethical constraints. We term this phenomenon Strategic Egoism (SE). To systematically evaluate SE, we introduce SEBench, a benchmark comprising 880 decision-making scenarios across 11 domains involving explicit profit temptations, which measures egoistic behavior along 6 psychologically grounded dimensions (e.g., rule circumvention). Each scenario adopts a single-role decision-making setting with carefully designed choice options to elicit self-serving strategies. Extensive experiments on 9 proprietary LLMs reveal that SE behaviors are widespread, with an average occurrence rate of 67.96%, and frequently manifest as manipulative coercion. Notably, we find that models more susceptible to profit temptations also exhibit broader safety deficiencies, including higher toxicity, lower truthfulness, increased jailbreak vulnerability, and elevated Dark Triad–style trait scores. Drawing inspiration from psychological interventions, we further propose SEGuard, a lightweight mitigation that reinforces situational constraints and suppresses egoistic tactics.- Anthology ID:
- 2026.findings-acl.249
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 5074–5087
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.249/
- DOI:
- Cite (ACL):
- Yaoyuan Zhang, Zonghao Ying, Aishan Liu, Jian Yang, Tianlin Li, Yaodong Yang, and Xianglong Liu. 2026. Uncovering Strategic Egoism Behaviors in Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 5074–5087, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Uncovering Strategic Egoism Behaviors in Large Language Models (Zhang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.249.pdf