Anthropocentric Bias in Language Model Evaluation

Raphaël Millière, Charles Rathkopf


Abstract
Evaluating the cognitive capacities of large language models (LLMs) requires overcoming not only anthropomorphic but also anthropocentric biases. This article identifies two types of anthropocentric bias that have been neglected: overlooking how auxiliary factors can impede LLM performance despite competence (auxiliary oversight), and dismissing LLM mechanistic strategies that differ from those of humans as not genuinely competent (mechanistic chauvinism). Mitigating these biases requires an empirical, iterative approach to mapping cognitive tasks to LLM-specific capacities and mechanisms, achieved by supplementing behavioral experiments with mechanistic studies.
Anthology ID:
2026.cl-1.11
Volume:
Computational Linguistics, Volume 52, Issue 1 - March 2026
Month:
March
Year:
2026
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
379–388
Language:
URL:
https://preview.aclanthology.org/ingest-latest-mitpress-cl-tacl/2026.cl-1.11/
DOI:
10.1162/coli.a.582
Bibkey:
Cite (ACL):
Raphaël Millière and Charles Rathkopf. 2026. Anthropocentric Bias in Language Model Evaluation. Computational Linguistics, 52(1):379–388.
Cite (Informal):
Anthropocentric Bias in Language Model Evaluation (Millière & Rathkopf, CL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-latest-mitpress-cl-tacl/2026.cl-1.11.pdf