ReproHum #0124-03: Reproducing Human Scores on Neural REG Models

Maurice Langner


Abstract
In the context of the ReproNLP’26 shared task, I report on a single-criterion reproduction study of a human evaluation experiment for neuralreferring expression generation models (Castro Ferreira et al., 2018a), which has already been reproduced once by Mahamood (2024)for the ReproHum 2024 shared task. The experiments reported on in this paper therefore seek to second the findings from both previousexperiments.
Anthology ID:
2026.gem-main.86
Volume:
Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Simon Mille, Sebastian Gehrmann, Patrícia Schmidtová, Ondřej Dušek, Marzieh Fadaee, Kyle Lo, Enrico Santus, Gabriel Stanovsky
Venues:
GEM | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1094–1103
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.86/
DOI:
Bibkey:
Cite (ACL):
Maurice Langner. 2026. ReproHum #0124-03: Reproducing Human Scores on Neural REG Models. In Proceedings of the Fifth Workshop on Generation, Evaluation and Metrics (GEM), pages 1094–1103, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
ReproHum #0124-03: Reproducing Human Scores on Neural REG Models (Langner, GEM 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.gem-main.86.pdf