HEDS 3.0: The Human Evaluation Data Sheet Version 3.0

Anya Belz, Craig Thomson


Abstract
This paper presents a new version of the Human Evaluation Datasheet (HEDS), numbered 3.0. This update is the result of our experience using HEDS in the context of numerous recent human evaluation experiments, including reproduction studies, and of feedback collected from other researchers. Our main goal was to improve clarity and to enable users to complete the datasheet more consistently and comparably. The HEDS 3.0 package consists of the digital datasheet, documentation, and code for exporting completed datasheets as LaTeX files, all available from the HEDS 3.0 GitHub.
Anthology ID:
2025.gem-1.6
Volume:
Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Month:
July
Year:
2025
Address:
Vienna, Austria and virtual meeting
Editors:
Kaustubh Dhole, Miruna Clinciu
Venues:
GEM | WS
Publisher:
Association for Computational Linguistics
Pages:
60–81
URL:
https://preview.aclanthology.org/corrections-2025-08/2025.gem-1.6/
Cite (ACL):
Anya Belz and Craig Thomson. 2025. HEDS 3.0: The Human Evaluation Data Sheet Version 3.0. In Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²), pages 60–81, Vienna, Austria and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
HEDS 3.0: The Human Evaluation Data Sheet Version 3.0 (Belz & Thomson, GEM 2025)
PDF:
https://preview.aclanthology.org/corrections-2025-08/2025.gem-1.6.pdf