Sycophancy Negatively Affects LLM-as-a-Judge in Conflict Evaluation

Naghmeh Farzi; Laura Dietz; Samuel Carton

Sycophancy Negatively Affects LLM-as-a-Judge in Conflict Evaluation

Naghmeh Farzi, Laura Dietz, Samuel Carton

Abstract

LLM-as-Judge systems are increasingly used to generate labels and evaluate conversational data, yet their susceptibility to narrative framing remains underexplored. We study whether replacing one speaker’s username with the first-person identifier ’Me’ systematically biases model judgments independent of the underlying evidence. Using the Conversations Gone Awry corpus, we evaluate four LLMs across three judgment tasks (attack detection, attacker identification, and blame attribution), three perspective conditions, and two evidence visibility settings. Our results show that narrative perspective induces strong, task-dependent distortions, particularly in more subjective judgment tasks. We find that models systematically favor the narrator when a speaker is presented as ’Me’, reducing blame and responsibility attribution toward that speaker even when the underlying evidence is unchanged. These findings raise concerns about using LLMs to judge or moderate first-person conversational data.

Anthology ID:: 2026.gem-main.45
Volume:: Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Simon Mille, Sebastian Gehrmann, Patrícia Schmidtová, Ondřej Dušek, Marzieh Fadaee, Kyle Lo, Enrico Santus, Gabriel Stanovsky
Venues:: GEM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 490–501
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.45/
DOI:
Bibkey:
Cite (ACL):: Naghmeh Farzi, Laura Dietz, and Samuel Carton. 2026. Sycophancy Negatively Affects LLM-as-a-Judge in Conflict Evaluation. In Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM), pages 490–501, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Sycophancy Negatively Affects LLM-as-a-Judge in Conflict Evaluation (Farzi et al., GEM 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.45.pdf

PDF Cite Search Fix data