Identifying Bias in Machine-generated Text Detection

Kevin Stowe; Svetlana Afanaseva; Rodolfo C. Raimundo; Yitao Sun; Kailash Patil

Identifying Bias in Machine-generated Text Detection

Kevin Stowe, Svetlana Afanaseva, Rodolfo C. Raimundo, Yitao Sun, Kailash Patil

Abstract

The meteoric rise in text generation capability has been accompanied by parallel growth in interest in machine-generated text detection: the capability to identify whether a given text was generated using a model or written by a person. While detection models show strong performance, they have the capacity to cause significant negative impacts. We explore potential biases in English machine-generated text detection systems. We curate a dataset of student essays and assess 16 different detection systems for bias across four attributes: gender, race/ethnicity, English-language learner (ELL) status, and economic status. We evaluate these attributes using regression-based models to determine the significance and power of the effects, as well as performing subgroup analysis. We find that while biases are generally inconsistent across systems, there are several key issues: several models tend to classify disadvantaged groups as machine-generated, ELL essays are more likely to be classified as machine-generated, economically disadvantaged students’ essays are less likely to be classified as machine-generated, and non-White ELL essays are disproportionately classified as machine-generated relative to their White counterparts. Finally, we perform human annotation and find that while humans perform generally poorly at the detection task, they show no significant biases on the studied attributes.

Anthology ID:: 2026.acl-long.109
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2383–2395
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.109/
DOI:
Bibkey:
Cite (ACL):: Kevin Stowe, Svetlana Afanaseva, Rodolfo C. Raimundo, Yitao Sun, and Kailash Patil. 2026. Identifying Bias in Machine-generated Text Detection. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2383–2395, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Identifying Bias in Machine-generated Text Detection (Stowe et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.109.pdf
Checklist:: 2026.acl-long.109.checklist.pdf

PDF Cite Search Checklist Fix data