A Scalable Entity-Based Framework for Auditing Bias in Large Language Models

Akram Elbouanani; Aboubacar Tuo; Adrian Popescu

A Scalable Entity-Based Framework for Auditing Bias in Large Language Models

Akram Elbouanani, Aboubacar Tuo, Adrian Popescu

Abstract

Existing approaches to bias evaluation in large language models (LLMs) trade ecological validity for statistical control, relying on artificial prompts that poorly reflect real-world use, or on naturalistic tasks that lack scale and rigor. We introduce a scalable bias-auditing framework using named entities as probes to measure structural disparities in model behavior. We show that synthetic data reliably reproduces bias patterns observed in natural text, enabling large-scale analysis. Using this approach, we conduct the largest bias audit to date, comprising 1.9 billion data points across multiple entity types, tasks, languages, models, and prompting strategies. Our results reveal systematic biases: models penalize right-wing politicians, favor left-wing politicians, prefer Western and wealthy nations over the Global South, favor Western companies, and penalize firms in the defense and pharmaceutical sectors. While instruction tuning reduces bias, increasing model scale amplifies it, and prompting in Chinese or Russian does not attenuate Western-aligned preferences. These results indicate that LLMs should undergo rigorous auditing before deployment in high-stakes applications.

Anthology ID:: 2026.findings-acl.1689
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 33823–33852
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1689/
DOI:
Bibkey:
Cite (ACL):: Akram Elbouanani, Aboubacar Tuo, and Adrian Popescu. 2026. A Scalable Entity-Based Framework for Auditing Bias in Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 33823–33852, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: A Scalable Entity-Based Framework for Auditing Bias in Large Language Models (Elbouanani et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1689.pdf
Checklist:: 2026.findings-acl.1689.checklist.pdf

PDF Cite Search Checklist Fix data