Alice Suozzi


Fixing paper assignments

  1. Please select all papers that belong to the same person.
  2. Indicate below which author they should be assigned to.
Provide a valid ORCID iD here. This will be used to match future papers to this author.
Provide the name of the school or the university where the author has received or will receive their highest degree (e.g., Ph.D. institution for researchers, or current affiliation for students). This will be used to form the new author page ID, if needed.

TODO: "submit" and "cancel" buttons here


2025

pdf bib
BAMBI Goes to School: Evaluating Italian BabyLMs with Invalsi-ITA
Luca Capone | Alice Suozzi | Gianluca Lebani | Alessandro Lenci
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)

pdf bib
Language Models and the Magic of Metaphor: A Comparative Evaluation with Human Judgments
Simone Mazzoli | Alice Suozzi | Gianluca Lebani
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)

2024

pdf bib
BaBIEs: A Benchmark for the Linguistic Evaluation of Italian Baby Language Models
Luca Capone | Alice Suozzi | Gianluca Lebani | Alessandro Lenci
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)

The possibility of comparing the linguistic competence of Language Models (LMs) to that of children has gained growing attention lately, raising the need for effective tools for evaluating both the former and the latter. To this purpose, we developed a resource for the linguistic evaluation of BabyLMs, which are LMs trained on datasets that comparable to the linguistic stimulus received by children. This resource adapts four standardized tests for the evaluation of linguistic skills of Italian-speaking children (BVL, TROG-2, TCGB-2 and Peabody). To verify the effectiveness of our benchmark, we administered it to Minerva, a LLM pretrained from scratch on Italian. Our results indicate that Minerva struggles to master certain linguistic aspects, achieving an age-equivalent score of 4 years, and that the type of task administered affects the model’s performance.

2020

pdf bib
L’impatto emotivo della comunicazione istituzionale durante la pandemia di COVID-19: uno studio di Twitter Sentiment Analysis
Gloria Gagliardi | Lorenzo Gregori | Alice Suozzi
Proceedings of the Seventh Italian Conference on Computational Linguistics (CLiC-it 2020)