Distributional Alignment for Large Language Models under Domain Shift

Viet Thanh Pham; Lizhen Qu; Zhuang Li; Gholamreza Haffari

Distributional Alignment for Large Language Models under Domain Shift

Viet Thanh Pham, Lizhen Qu, Zhuang Li, Gholamreza Haffari

Abstract

Distributional alignment enables large language models (LLMs) to predict how a target population distributes its responses across answer options, rather than collapsing disagreement into a single consensus answer. However, existing LLM-based distribution prediction is often unstable and degrades under cultural and domain shift. Token score-based estimates can change with minor option wording or formatting, response sampling-based estimates are expensive and sensitive to prompts and decoding settings, and directly generated distributions are frequently miscalibrated.We propose Evi-DA, an evidence-based alignment technique that improves the fidelity and robustness of LLM-based distribution estimation under domain and cultural shift. Given a target country and a multiple-choice question, Evi-DA retrieves related World Values Survey items and their answer distributions, predicts a coarse Welzel value signature for each option, and infers the country-conditioned answer distribution in a structured format. We train the LLMs using a two-stage pipeline, where reinforcement learning optimizes survey-derived rewards that encourage accurate intermediate value predictions, faithful final distributions, well-formed structured outputs, and reduced cultural bias. Across in-domain and out-of-domain benchmarks and multiple open-source backbones, Evi-DA reduces Jensen-Shannon divergence between predicted and gold distributions relative to strong baselines, with average relative improvements of up to 44%.

Anthology ID:: 2026.findings-acl.1026
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 20514–20528
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1026/
DOI:
Bibkey:
Cite (ACL):: Viet Thanh Pham, Lizhen Qu, Zhuang Li, and Gholamreza Haffari. 2026. Distributional Alignment for Large Language Models under Domain Shift. In Findings of the Association for Computational Linguistics: ACL 2026, pages 20514–20528, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Distributional Alignment for Large Language Models under Domain Shift (Pham et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1026.pdf
Checklist:: 2026.findings-acl.1026.checklist.pdf

PDF Cite Search Checklist Fix data