SemEval-2025 Task 8: Question Answering over Tabular Data
Jorge Osés Grijalba, L. Alfonso Ureñ - López, Eugenio Martínez Cámara, Jose Camacho - Collados
Abstract
We introduce the findings and results of SemEval-2025 Task 8: Question Answering over Tabular Data. We featured two subtasks, DataBench and DataBench Lite. DataBench consists on question answering over tabular data, and DataBench Lite small comprising small datasets that might be easier to manage by current models by for example fitting them into a prompt. The task was open for any approach, but their answer has to conform to a required typing format. In this paper we present the task, analyze a number of system submissions and discuss the results. The results show how approaches leveraging LLMs dominated the task, with larger models exhibiting a considerably superior performance compared to small models.- Anthology ID:
- 2025.semeval-1.324
- Volume:
- Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
- Month:
- July
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
- Venues:
- SemEval | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2512–2522
- Language:
- URL:
- https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.324/
- DOI:
- Cite (ACL):
- Jorge Osés Grijalba, L. Alfonso Ureñ - López, Eugenio Martínez Cámara, and Jose Camacho - Collados. 2025. SemEval-2025 Task 8: Question Answering over Tabular Data. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 2512–2522, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- SemEval-2025 Task 8: Question Answering over Tabular Data (Osés Grijalba et al., SemEval 2025)
- PDF:
- https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.324.pdf