CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs

Tomas Turek; Shakila Mahjabin Tonni; Vincent Nguyen; Huichen Yang; Sarvnaz Karimi

CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs

Tomas Turek, Shakila Mahjabin Tonni, Vincent Nguyen, Huichen Yang, Sarvnaz Karimi

Abstract

Question Answering over large tables is challenging due to the difficulty of reasoning required in linking information from different parts of a table, such as heading and metadata to the values in the table and information needs. We investigate using Large Language Models (LLM) for tabular reasoning, where, given a pair of a table and a question from the DataBench benchmark, the models generate answers. We experiment with three techniques that enables symbolic reasoning through code execution: a direct code prompting (DCP) approach, ‘DCP_Py’, which uses Python, multi-step code (MSC) prompting ‘MSC_SQL+FS’ using SQL and ReAct prompting, ‘MSR_Py+FS’, which combines multi-step reasoning (MSR), few-shot (FS) learning and Python tools. We also conduct an analysis exploring the impact of answer types, data size, and multi-column dependencies on LLMs’ answer generation performance, including an assessment of the models’ limitations and the underlying challenges of tabular reasoning in LLMs.

Anthology ID:: 2025.semeval-1.222
Volume:: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1690–1701
Language:
URL:: https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.222/
DOI:
Bibkey:
Cite (ACL):: Tomas Turek, Shakila Mahjabin Tonni, Vincent Nguyen, Huichen Yang, and Sarvnaz Karimi. 2025. CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1690–1701, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: CSIRO LT at SemEval-2025 Task 8: Answering Questions over Tabular Data using LLMs (Turek et al., SemEval 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.222.pdf

PDF Cite Search Fix data