GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing

Ming Wang; Shuang Wu; Bixuan Wang; Lu Lin; Yuxin Chen; Xiaocui Yang; Daling Wang; Shi Feng; Yifei Zhang; Yufan Sun

GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing

Ming Wang, Shuang Wu, Bixuan Wang, Lu Lin, Yuxin Chen, Xiaocui Yang, Daling Wang, Shi Feng, Yifei Zhang, Yufan Sun

Abstract

Self-report questionnaires remain the default tool for probing the psychological characteristics of Large Language Model (LLM) agents, yet classical instruments (BFI, BDI, MBTI, BSS) inherit three well-known threats under LLMs: contamination from training corpora, directional bias under social-desirability framing, and limited responsiveness to context beyond the item text. We ask whether a *projective* paradigm can be adapted into a usable psychometric tool for LLM agents. We introduce **GenPT** (Generative Projective Testing), which reformulates TAT, Rorschach, and SCT with newly generated stimuli and organises assessment as a three-stage pipeline (Behavior Collection → Interpretation → Diagnosis) grounded in SCORS-G and a Simplified Rorschach Analysis System. On personality traits (Big Five, MBTI) and mental-health risks (depression, suicide ideation), questionnaires exhibit systematic directional shifts under social-desirability framing, most strongly on suicide ideation, whereas GenPT’s collected behavioral patterns stay near the symmetric baseline; under a longitudinal counselling context, GenPT-based depression assessment shifts by roughly an order of magnitude more than its questionnaire counterpart. Questionnaires remain competitive on clean-persona trait tasks where items align lexically with the persona description. Overall, GenPT complements rather than replaces self-report when contamination resistance, bias asymmetry, and context sensitivity matter. Code and stimuli: https://github.com/sci-m-wang/GenPT.

Anthology ID:: 2026.acl-long.1901
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 40958–40974
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1901/
DOI:
Bibkey:
Cite (ACL):: Ming Wang, Shuang Wu, Bixuan Wang, Lu Lin, Yuxin Chen, Xiaocui Yang, Daling Wang, Shi Feng, Yifei Zhang, and Yufan Sun. 2026. GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 40958–40974, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing (Wang et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1901.pdf
Checklist:: 2026.acl-long.1901.checklist.pdf

PDF Cite Search Checklist Fix data