Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations

Yunzhe Wang; Gale Lucas; Burcin Becerik-Gerber; Volkan Ustun

Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations

Yunzhe Wang, Gale Lucas, Burcin Becerik-Gerber, Volkan Ustun

Abstract

Language-driven generative agents have enabled large-scale social simulations with transformative uses, from interpersonal training to aiding global policy-making. However, recent studies indicate that generative agent behaviors often deviate from expert expectations and real-world data—a phenomenon we term the *Behavior-Realism Gap*. To address this, we introduce a theoretical framework called Persona-Environment Behavioral Alignment (PEBA), formulated as a distribution matching problem grounded in Lewin’s behavior equation stating that behavior is a function of the person and their environment. Leveraging PEBA, we propose PersonaEvolve (PEvo), an LLM-based optimization algorithm that iteratively refines agent personas, implicitly aligning their collective behaviors with realistic expert benchmarks within a specified environmental context. We validate PEvo in an active shooter incident simulation we developed, achieving an 84% average reduction in distributional divergence compared to no steering and a 34% improvement over explicit instruction baselines. Results also show PEvo-refined personas generalize to novel, related simulation scenarios. Our method greatly enhances behavioral realism and reliability in high-stakes social simulations. More broadly, the PEBA-PEvo framework provides a principled approach to developing trustworthy LLM-driven social simulations.

Anthology ID:: 2025.emnlp-main.1562
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 30669–30686
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1562/
DOI:
Bibkey:
Cite (ACL):: Yunzhe Wang, Gale Lucas, Burcin Becerik-Gerber, and Volkan Ustun. 2025. Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 30669–30686, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Implicit Behavioral Alignment of Language Agents in High-Stakes Crowd Simulations (Wang et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1562.pdf
Checklist:: 2025.emnlp-main.1562.checklist.pdf

PDF Cite Search Checklist Fix data