Echoes of Agreement: Argument Driven Sycophancy in Large Language models

Avneet Kaur

doi:10.18653/v1/2025.findings-emnlp.1241

Echoes of Agreement: Argument Driven Sycophancy in Large Language models

Abstract

Existing evaluation of political biases in Large Language Models (LLMs) outline the high sensitivity to prompt formulation. Furthermore, Large Language Models are known to exhibit sycophancy, a tendency to align their outputs with a user’s stated belief, which is often attributed to human feedback during fine-tuning. However, such bias in the presence of explicit argumentation within a prompt remains underexplored. This paper investigates how argumentative prompts induce sycophantic behaviour in LLMs in a political context. Through a series of experiments, we demonstrate that models consistently alter their responses to mirror the stance present expressed by the user. This sycophantic behaviour is observed in both single and multi-turn interactions, and its intensity correlates with argument strength. Our findings establish a link between user stance and model sycophancy, revealing a critical vulnerability that impacts model reliability. Thus has significant implications for models being deployed in real-world settings and calls for developing robust evaluations and mitigations against manipulative or biased interactions.

Anthology ID:: 2025.findings-emnlp.1241
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22803–22812
Language:
URL:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.1241/
DOI:: 10.18653/v1/2025.findings-emnlp.1241
Bibkey:
Cite (ACL):: Avneet Kaur. 2025. Echoes of Agreement: Argument Driven Sycophancy in Large Language models. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 22803–22812, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Echoes of Agreement: Argument Driven Sycophancy in Large Language models (Kaur, Findings 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.1241.pdf
Checklist:: 2025.findings-emnlp.1241.checklist.pdf

PDF Cite Search Checklist Fix data