Neural network embeddings recover value dimensions from psychometric survey items on par with human data

Max Pellert, Clemens M Lechner, Indira Sen, Markus Strohmaier


Abstract
We demonstrate that embeddings derived from large language models, when processed with "Survey and Questionnaire Item Embeddings Differentials" (SQuID), can recover the structure of human values obtained from human rater judgments on the Revised Portrait Value Questionnaire (PVQ-RR). We compare multiple embedding models across a number of evaluation metrics including internal consistency, dimension correlations and multidimensional scaling configurations. Unlike previous approaches, SQuID addresses the challenge of obtaining negative correlations between dimensions without requiring domain-specific fine-tuning or training data re-annotation. Quantitative analysis reveals that our embedding-based approach explains 55% of variance in dimension-dimension similarities compared to human data. Multidimensional scaling configurations show alignment with pooled human data from 49 different countries. Generalizability tests across three personality inventories (IPIP, BFI-2, HEXACO) demonstrate that SQuID consistently increases correlation ranges, suggesting applicability beyond value theory. These results show that semantic embeddings can effectively replicate psychometric structures previously established through extensive human surveys. The approach offers substantial advantages in cost, scalability and flexibility while maintaining comparable quality to traditional methods. Our findings have significant implications for psychometrics and social science research, providing a complementary methodology that could expand the scope of human behavior and experience represented in measurement tools.
Anthology ID:
2026.findings-eacl.303
Volume:
Findings of the Association for Computational Linguistics: EACL 2026
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5738–5752
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.303/
DOI:
Bibkey:
Cite (ACL):
Max Pellert, Clemens M Lechner, Indira Sen, and Markus Strohmaier. 2026. Neural network embeddings recover value dimensions from psychometric survey items on par with human data. In Findings of the Association for Computational Linguistics: EACL 2026, pages 5738–5752, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Neural network embeddings recover value dimensions from psychometric survey items on par with human data (Pellert et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.303.pdf
Checklist:
 2026.findings-eacl.303.checklist.pdf