Samsung Research Poland at SemEval-2025 Task 8: LLM ensemble methods for QA over tabular data

Pawel Bujnowski, Tomasz Dryjanski, Christian Goltz, Bartosz Swiderski, Natalia Paszkiewicz, Bartlomiej Kuzma, Jacek Rutkowski, Jakub Stepka, Milosz Dudek, Wojciech Siemiatkowski, Weronika Plichta, Bartłomiej Paziewski, Maciej Grabowski, Katarzyna Beksa, Zuzanna Bordzicka, Filip Ostrowski, Grzegorz Sochacki


Abstract
Question answering using Large Language Models has gained significant popularity inboth everyday communication and at the workplace. However, certain tasks, such as querying tables, still pose challenges for commercial and open-source chatbots powered by advanceddeep learning models. Addressing these challenges requires specialized approaches.During the SemEval-2025 Task 8 competition focused on tabular data, our solution achieved86.21% accuracy and took 2nd place out of 100 teams. In this paper we present ten methodsthat significantly improve the baseline solution. Our code is available as open-source.
Anthology ID:
2025.semeval-1.163
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1223–1232
Language:
URL:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.163/
DOI:
Bibkey:
Cite (ACL):
Pawel Bujnowski, Tomasz Dryjanski, Christian Goltz, Bartosz Swiderski, Natalia Paszkiewicz, Bartlomiej Kuzma, Jacek Rutkowski, Jakub Stepka, Milosz Dudek, Wojciech Siemiatkowski, Weronika Plichta, Bartłomiej Paziewski, Maciej Grabowski, Katarzyna Beksa, Zuzanna Bordzicka, Filip Ostrowski, and Grzegorz Sochacki. 2025. Samsung Research Poland at SemEval-2025 Task 8: LLM ensemble methods for QA over tabular data. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1223–1232, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Samsung Research Poland at SemEval-2025 Task 8: LLM ensemble methods for QA over tabular data (Bujnowski et al., SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.163.pdf