FreshTab: Sourcing Fresh Data for Table-to-Text Generation Evaluation

Kristýna Onderková, Ondrej Platek, Zdeněk Kasner, Ondrej Dusek


Abstract
Table-to-text generation (insight generation from tables) is a challenging task that requires precision in analyzing the data. In addition, the evaluation of existing benchmarks is affected by contamination of Large Language Model (LLM) training data as well as domain imbalance. We introduce FreshTab, an on-the-fly table-to-text benchmark generation from Wikipedia, to combat the LLM data contamination problem and enable domain-sensitive evaluation. While non-English table-to-text datasets are limited, FreshTab collects datasets in different languages on demand (we experiment with German, Russian and French in addition to English). We find that insights generated by LLMs from recent tables collected by our method appear clearly worse by automatic metrics, but this does not translate into LLM and human evaluations. Domain effects are visible in all evaluations, showing that a domain-balanced benchmark is more challenging.
Anthology ID:
2025.inlg-main.7
Volume:
Proceedings of the 18th International Natural Language Generation Conference
Month:
October
Year:
2025
Address:
Hanoi, Vietnam
Editors:
Lucie Flek, Shashi Narayan, Lê Hồng Phương, Jiahuan Pei
Venue:
INLG
SIG:
SIGGEN
Publisher:
Association for Computational Linguistics
Note:
Pages:
108–121
Language:
URL:
https://preview.aclanthology.org/author-page-you-zhang-rochester/2025.inlg-main.7/
DOI:
Bibkey:
Cite (ACL):
Kristýna Onderková, Ondrej Platek, Zdeněk Kasner, and Ondrej Dusek. 2025. FreshTab: Sourcing Fresh Data for Table-to-Text Generation Evaluation. In Proceedings of the 18th International Natural Language Generation Conference, pages 108–121, Hanoi, Vietnam. Association for Computational Linguistics.
Cite (Informal):
FreshTab: Sourcing Fresh Data for Table-to-Text Generation Evaluation (Onderková et al., INLG 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/author-page-you-zhang-rochester/2025.inlg-main.7.pdf