Termite Italian Text-to-SQL: A CALAMITA Challenge
Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Fabio Massimo Zanzotto, Leonardo Ranaldi
Abstract
We introduce Termite, which is a definitely unseen resource for evaluating Text-to-SQL in Italian. Specifically,we transfer evaluation pipelines beyond English, proposing novel, definitely unseen resources that avoid data-contamination phenomena while assessing the ability of models to perform Text-to-SQL tasks when natural language queries are written in Italian. We establish an evaluation grid based on execution accuracy.- Anthology ID:
- 2024.clicit-1.130
- Volume:
- Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
- Month:
- December
- Year:
- 2024
- Address:
- Pisa, Italy
- Editors:
- Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
- Venue:
- CLiC-it
- SIG:
- Publisher:
- CEUR Workshop Proceedings
- Note:
- Pages:
- 1176–1183
- Language:
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.clicit-1.130/
- DOI:
- Cite (ACL):
- Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Fabio Massimo Zanzotto, and Leonardo Ranaldi. 2024. Termite Italian Text-to-SQL: A CALAMITA Challenge. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 1176–1183, Pisa, Italy. CEUR Workshop Proceedings.
- Cite (Informal):
- Termite Italian Text-to-SQL: A CALAMITA Challenge (Ranaldi et al., CLiC-it 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.clicit-1.130.pdf