A Position Paper on the Automatic Generation of Machine Learning Leaderboards

Roelien C. Timmer, Yufang Hou, Stephen Wan


Abstract
An important task in machine learning (ML) research is comparing prior work, which is often performed via ML leaderboards: a tabular overview of experiments with comparable conditions (e.g. same task, dataset, and metric). However, the growing volume of literature creates challenges in creating and maintaining these leaderboards. To ease this burden, researchers have developed methods to extract leaderboard entries from research papers for automated leaderboard curation. Yet, prior work varies in problem framing, complicating comparisons and limiting real-world applicability. In this position paper, we present the first overview of Automatic Leaderboard Generation (ALG) research, identifying fundamental differences in assumptions, scope, and output formats. We propose an ALG unified conceptual framework to standardise how the ALG task is defined. We offer ALG benchmarking guidelines, including recommendations for datasets and metrics that promote fair, reproducible evaluation. Lastly, we outline challenges and new directions for ALG, advocating for broader coverage by including all reported results and richer metadata.
Anthology ID:
2025.emnlp-main.1566
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
30749–30772
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1566/
DOI:
Bibkey:
Cite (ACL):
Roelien C. Timmer, Yufang Hou, and Stephen Wan. 2025. A Position Paper on the Automatic Generation of Machine Learning Leaderboards. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 30749–30772, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
A Position Paper on the Automatic Generation of Machine Learning Leaderboards (Timmer et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1566.pdf
Checklist:
 2025.emnlp-main.1566.checklist.pdf