Jiajun Chen

Other people with similar names: Jiajun Chen

Unverified author pages with similar names: Jiajun Chen

2026

VALUE ALIGNMENT TAX: Measuring Value Trade-offs in LLM Alignment
Jiajun Chen | Hua Shen
Findings of the Association for Computational Linguistics: ACL 2026

Existing work on value alignment typically characterizes value relations statically, ignoring how alignment interventions—such as prompting, fine-tuning, or preference optimization—reshape the broader value system. In practice, aligning a target value can implicitly shift other values, creating value trade-offs that remain largely unmeasured.We introduce the VAT, a framework that quantifies value trade-offs by measuring how alignment-induced changes propagate across interconnected values relative to achieved on-target gain. VAT captures the system-level dynamics of value expression under alignment intervention, enabling evaluation of both intended improvements and unintended side effects.Using a controlled scenario–action dataset grounded in Schwartz value theory, we collect paired pre–post normative judgments and analyze alignment effects across models, values, and interventions. Results show that alignment often produces uneven and structured co-movement among values, revealing systematic trade-offs between target and non-target values. These effects are largely invisible under conventional target-only evaluation, but become evident via VAT, highlighting process-level alignment risks and offering new insights into the dynamic nature of value alignment in LLMs.Dataset and code are open-sourced.

Co-authors

Hua Shen 1

Venues

Findings1

Fix author