Position: From Noise to Signal to Selbstzweck - Reframing Human Label Variation in the Era of Post-training in NLP

Shanshan Xu; Santosh T.Y.S.S; Barbara Plank

Abstract

Human Label Variation (HLV) refers to legitimate disagreement in annotation that reflects the diversity of human perspectives rather than mere error. Long treated in NLP as noise to be eliminated, HLV has only recently been reframed as a signal for improving model robustness. With the rise of large language models (LLMs) and post-training methods such as human feedback-based alignment, the role of HLV has become increasingly consequential. Yet current preference-learning datasets routinely collapse multiple annotations into a single label, flattening diverse perspectives into artificial consensus. Preserving HLV is necessary not only for pluralistic alignment but also for sociotechnical safety evaluation, where model behavior must be assessed in relation to human interaction and societal context. This position paper argues that preserving HLV as an embodiment of human pluralism must be treated as a Selbstzweck, an end in itself with intrinsic value. We analyze the limitations of existing preference datasets and propose actionable strategies for incorporating HLV into dataset construction to better preserve pluralistic human values.

Anthology ID: 2026.findings-acl.1190
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 23762–23772
URL: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1190/
Cite (ACL): Shanshan Xu, Santosh T.y.s.s, and Barbara Plank. 2026. Position: From Noise to Signal to Selbstzweck - Reframing Human Label Variation in the Era of Post-training in NLP. In Findings of the Association for Computational Linguistics: ACL 2026, pages 23762–23772, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Position: From Noise to Signal to Selbstzweck - Reframing Human Label Variation in the Era of Post-training in NLP (Xu et al., Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1190.pdf
Checklist: 2026.findings-acl.1190.checklist.pdf
