Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark
Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, Daniele P. Radicioni
- Anthology ID:
- 2025.clicit-1.69
- Volume:
- Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
- Month:
- September
- Year:
- 2025
- Address:
- Cagliari, Italy
- Editors:
- Cristina Bosco, Elisabetta Jezek, Marco Polignano, Manuela Sanguinetti
- Venue:
- CLiC-it
- SIG:
- Publisher:
- CEUR Workshop Proceedings
- Note:
- Pages:
- 722–734
- Language:
- URL:
- https://preview.aclanthology.org/sigarab-more-entries-6621/2025.clicit-1.69/
- DOI:
- Cite (ACL):
- Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, and Daniele P. Radicioni. 2025. Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), pages 722–734, Cagliari, Italy. CEUR Workshop Proceedings.
- Cite (Informal):
- Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark (Mensa et al., CLiC-it 2025)
- PDF:
- https://preview.aclanthology.org/sigarab-more-entries-6621/2025.clicit-1.69.pdf