CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs
Tomas Turek, Shakila Mahjabin Tonni, Vincent Nguyen, Huichen Yang, Sarvnaz Karimi
Abstract
Question Answering over large tables is challenging due to the difficulty of reasoning required in linking information from different parts of a table, such as heading and metadata to the values in the table and information needs. We investigate using Large Language Models (LLM) for tabular reasoning, where, given a pair of a table and a question from the DataBench benchmark, the models generate answers. We experiment with three techniques that enables symbolic reasoning through code execution: a direct code prompting (DCP) approach, ‘DCP_Py’, which uses Python, multi-step code (MSC) prompting ‘MSC_SQL+FS’ using SQL and ReAct prompting, ‘MSR_Py+FS’, which combines multi-step reasoning (MSR), few-shot (FS) learning and Python tools. We also conduct an analysis exploring the impact of answer types, data size, and multi-column dependencies on LLMs’ answer generation performance, including an assessment of the models’ limitations and the underlying challenges of tabular reasoning in LLMs.- Anthology ID:
- 2025.semeval-1.222
- Volume:
- Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
- Month:
- July
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
- Venues:
- SemEval | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1690–1701
- Language:
- URL:
- https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.222/
- DOI:
- Cite (ACL):
- Tomas Turek, Shakila Mahjabin Tonni, Vincent Nguyen, Huichen Yang, and Sarvnaz Karimi. 2025. CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1690–1701, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs (Turek et al., SemEval 2025)
- PDF:
- https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.222.pdf