Controllable Sentence Simplification in Italian: Fine-Tuning Large Language Models on Automatically Generated Resources

Michele Papucci, Giulia Venturi, Felice Dell'Orletta


Abstract
This paper presents a study on readability-controlled Sentence Simplification for Italian, addressing the scarcity of annotated resources for low-resource languages. We introduce IMPaCTS (Italian Multilevel Parallel Corpus for Text Simplification), the first fully automatically created corpus of 1,444,160 original–simple sentence pairs automatically annotated with readability levels and linguistic features. It was generated using an Italian LLM prompted in zero-shot to produce multiple simplifications per input sentence. Increasing portions of the resource are used to fine-tune mono- and multilingual open-weight LLMs, conditioning them to generate simplifications at a target readability level. Results from automatic and human evaluations show that fine-tuning on IMPaCTS improves performance both in terms of task completion and adherence to the targeted readability levels compared to few-shot baselines.
Anthology ID:
2026.lrec-main.570
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
7178–7191
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.570/
DOI:
Bibkey:
Cite (ACL):
Michele Papucci, Giulia Venturi, and Felice Dell'Orletta. 2026. Controllable Sentence Simplification in Italian: Fine-Tuning Large Language Models on Automatically Generated Resources. International Conference on Language Resources and Evaluation, main:7178–7191.
Cite (Informal):
Controllable Sentence Simplification in Italian: Fine-Tuning Large Language Models on Automatically Generated Resources (Papucci et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.570.pdf