Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs

Tristan Williams, Franziska Weeber, Sebastian Padó, Alan Akbik


Abstract
Large language models are increasingly used to represent human opinions, values, or beliefs, and their steerability towards these ideals is an active area of research. Existing work focuses predominantly on aligning marginal response distributions, treating each alignment evaluation example independently. While essential, this may overlook deeper latent structures that characterise real populations and underpin cultural values theories. We propose a framework for evaluating the representativeness of aligned models through multivariate correlation patterns in addition to marginal distributions. We show the value of our evaluation scheme by comparing two model steering techniques (persona prompting and demographic fine-tuning) and evaluating them against human responses from the World Values Survey. While the demographic fine-tuned model better approximates marginal response distributions, persona prompting performs marginally better at reproducing the empirical correlation structure between survey items. Despite this reversal, neither technique aligns with human correlation patterns. We conclude that representativeness is a distinct aspect of value alignment and an evaluation focused on marginals can mask structural failures, leading to overly optimistic conclusions about model representativeness.
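To make the distinction between marginal and structural fit concrete, the following is a minimal sketch and not the authors' implementation. It assumes responses are coded as non-negative integers in a respondents-by-items array, and it uses total variation distance and the mean absolute gap between Pearson correlations as stand-in metrics. The function names `marginal_distance` and `correlation_gap` and the toy data are hypothetical; the paper's actual metrics may differ.

```python
import numpy as np

def marginal_distance(human, model, n_options):
    """Mean total variation distance between per-item answer distributions."""
    dists = []
    for j in range(human.shape[1]):
        # Empirical answer distribution for item j in each sample.
        p = np.bincount(human[:, j], minlength=n_options) / human.shape[0]
        q = np.bincount(model[:, j], minlength=n_options) / model.shape[0]
        dists.append(0.5 * np.abs(p - q).sum())
    return float(np.mean(dists))

def correlation_gap(human, model):
    """Mean absolute difference between off-diagonal item correlations."""
    c_h = np.corrcoef(human, rowvar=False)
    c_m = np.corrcoef(model, rowvar=False)
    iu = np.triu_indices_from(c_h, k=1)  # each item pair counted once
    return float(np.mean(np.abs(c_h[iu] - c_m[iu])))

# Toy data (assumed): two binary items, four respondents per sample.
# Both samples have identical marginals, but the items are positively
# correlated in `human` and negatively correlated in `model`.
human = np.array([[0, 0], [1, 1], [0, 0], [1, 1]])
model = np.array([[0, 1], [1, 0], [0, 1], [1, 0]])

print(marginal_distance(human, model, n_options=2))
print(correlation_gap(human, model))
```

In this toy case the marginal distance is zero while the correlation gap is at its maximum, which is the kind of structural failure a marginals-only evaluation would not surface.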
Anthology ID:
2026.findings-acl.236
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
4806–4826
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.236/
Cite (ACL):
Tristan Williams, Franziska Weeber, Sebastian Padó, and Alan Akbik. 2026. Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 4806–4826, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs (Williams et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.236.pdf
Checklist:
 2026.findings-acl.236.checklist.pdf