SemEval-2025 Task 8: Question Answering over Tabular Data

Jorge Osés Grijalba, L. Alfonso Ureñ - López, Eugenio Martínez Cámara, Jose Camacho - Collados


Abstract
We introduce the findings and results of SemEval-2025 Task 8: Question Answering over Tabular Data. We featured two subtasks, DataBench and DataBench Lite. DataBench consists on question answering over tabular data, and DataBench Lite small comprising small datasets that might be easier to manage by current models by for example fitting them into a prompt. The task was open for any approach, but their answer has to conform to a required typing format. In this paper we present the task, analyze a number of system submissions and discuss the results. The results show how approaches leveraging LLMs dominated the task, with larger models exhibiting a considerably superior performance compared to small models.
Anthology ID:
2025.semeval-1.324
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2512–2522
Language:
URL:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.324/
DOI:
Bibkey:
Cite (ACL):
Jorge Osés Grijalba, L. Alfonso Ureñ - López, Eugenio Martínez Cámara, and Jose Camacho - Collados. 2025. SemEval-2025 Task 8: Question Answering over Tabular Data. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 2512–2522, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
SemEval-2025 Task 8: Question Answering over Tabular Data (Osés Grijalba et al., SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.324.pdf