Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop

Yaxuan Wang; Zhongteng Cai; Yujia Bao; Xueru Zhang; Yang Liu

Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop

Yaxuan Wang, Zhongteng Cai, Yujia Bao, Xueru Zhang, Yang Liu

Abstract

The rapid advancement of large language models (LLMs) has led to growing interest in using synthetic data to train future models. However, this creates a self-consuming retraining loop, where models are trained on their own outputs and may cause performance drops and induce emerging biases. In real-world applications, previously deployed LLMs may influence the data they generate, leading to a dynamic system driven by user feedback. For example, if a model continues to underserve users from a group, less query data will be collected from this particular demographic of users. In this study, we introduce the concept of Self-Consuming Performative Loop (SCPL) and investigate the role of synthetic data in shaping bias during these dynamic iterative training processes under controlled performative feedback. This controlled setting is motivated by the inaccessibility of real-world user preference data from dynamic production systems, and enables us to isolate and analyze feedback-driven bias evolution in a principled manner. We focus on two types of loops, including the typical retraining setting and the incremental fine-tuning setting, which is largely underexplored. Through experiments on three real-world tasks, we find that the performative loop increases preference bias and decreases disparate bias. We design a reward-based rejection sampling strategy to mitigate the bias, moving towards more trustworthy self-improving systems. The code is available at https://github.com/UCSC-REAL/SCPL.git.

Anthology ID:: 2026.acl-long.1561
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 33862–33882
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1561/
DOI:
Bibkey:
Cite (ACL):: Yaxuan Wang, Zhongteng Cai, Yujia Bao, Xueru Zhang, and Yang Liu. 2026. Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 33862–33882, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop (Wang et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1561.pdf
Checklist:: 2026.acl-long.1561.checklist.pdf

PDF Cite Search Checklist Fix data