@inproceedings{huang-hadfi-2025-beyond,
title = "Beyond Self-Reports: Multi-Observer Agents for Personality Assessment in Large Language Models",
author = "Huang, Yin Jou and
Hadfi, Rafik",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2025",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.findings-emnlp.1150/",
doi = "10.18653/v1/2025.findings-emnlp.1150",
pages = "21086--21101",
ISBN = "979-8-89176-335-7",
    abstract = "Self-report questionnaires have long been used to assess LLM personality traits, yet they fail to capture behavioral nuances due to biases and meta-knowledge contamination. This paper proposes a novel multi-observer framework for personality trait assessment in LLM agents that draws on informant-report methods in psychology. Instead of relying on self-assessments, we employ multiple observer LLM agents, each of which is configured with a specific relationship (e.g., family member, friend, or coworker). The observer agents interact with the subject LLM agent before assessing its Big Five personality traits. We show that observer-report ratings align more closely with human judgments than traditional self-reports and reveal systematic biases in LLM self-assessments. Further analysis shows that aggregating the ratings of multiple observers provides more reliable results, reflecting a wisdom-of-the-crowd effect for up to 5 to 7 observers."
}