Convergent Demographic Utility Hierarchies: Geometry of Intersectional Values in LLMs

Pravish Sainath


Abstract
Recent work has shown that LLMs develop internally coherent utility functions that emerge with scale, yet whether these value systemsencode systematic demographic hierarchies remains unexplored. We elicit pairwise preferences across 15 intersectional demographic groups (defined by race, gender, and their combinations) and 8 policy domains on three 7–8B instruction-tuned LLMs, fitting Thurstonian utility models to the resulting preference matrices. All three models converge on a compensatory hierarchy that invertsreal-world structural advantage, consistently ranking marginalized groups, the highest and dominant groups are lowest. Intersectional utilities do not combine additively: single-axis audits that measure gender and race gaps independently overestimate the most extreme intersectional gap by 26- 40% in our experiments. Geometrically, we identify a linear direction in the representation space that predicts the full utility hierarchy from neutral sentences alone, and show that this direction is substantially aligned with gender encoding but not with race encoding. Orthogonalization reveals that gender separation in representations is not fully explained by utility encoding. The hierarchy is already present in base (pre-alignment) models and is amplified several-fold by instruction tuning, suggesting it originates in pre-training data rather than alignment procedures.
Anthology ID:
2026.acl-srw.122
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1376–1390
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.122/
DOI:
Bibkey:
Cite (ACL):
Pravish Sainath. 2026. Convergent Demographic Utility Hierarchies: Geometry of Intersectional Values in LLMs. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 1376–1390, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Convergent Demographic Utility Hierarchies: Geometry of Intersectional Values in LLMs (Sainath, ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.122.pdf