With a Grain of SALT: Are LLMs Fair Across Social Dimensions?

Samee Arif, Zohaib Khan, Maaidah Kaleem Butt, Muhammad Suhaib Rashid, Agha Ali Raza, Awais Athar


Abstract
In this paper we present a systematic study of social bias in small- to mid-scale Large Language Models (LLMs), focusing on gender, religion, and race. Using our SALT (Social Appropriateness in LLM Text) dataset, we explore two bias categories—Theoretical and Practical. Theoretical bias covers General Debate and Positioned Debate while practical bias includes Career Advice, Personal Advice, and Resume Generation. We quantify bias using win-rate gaps in general debate, and negative-role assignments in positioned debate. For Practical bias, we anonymize model outputs to remove explicit demographic cues and use DeepSeek-R1 as an automated evaluator, measuring outcome disparities across groups. We also examine systemic issues in LLM-based evaluation including evaluation bias, positional bias, and length bias and validate our findings through human annotation. Our results show consistent disadvantages for White, Christian, and male-associated outputs across multiple tasks. Larger models often amplify these disparities, highlighting that scale does not guarantee fairness.
Anthology ID:
2026.trustnlp-main.48
Volume:
Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026)
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Kai-Wei Chang, Ninareh Mehrabi, Satyapriya Krishna, Anubrata Das, Jwala Dhamala, Yang Trista Cao, Tharindu Kumarage, Anil Ramakrishna, Christos Christodoulopoulos, Yixin Wan, Aram Galystan, Anoop Kumar, Rahul Gupta
Venues:
TrustNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
618–636
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.48/
DOI:
Bibkey:
Cite (ACL):
Samee Arif, Zohaib Khan, Maaidah Kaleem Butt, Muhammad Suhaib Rashid, Agha Ali Raza, and Awais Athar. 2026. With a Grain of SALT: Are LLMs Fair Across Social Dimensions?. In Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026), pages 618–636, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
With a Grain of SALT: Are LLMs Fair Across Social Dimensions? (Arif et al., TrustNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.48.pdf