Simplifying Lithuanian text into Easy-to-Read language using large language models

Simona Kuoraitė, Valentas Gružauskas


Abstract
This paper explores the task of simplifying Lithuanian text into Easy-to-Read language. Easy-to-Read language is a text written in short, clear sentences and simple words, adapted for people with intellectual disabilities or limited language skills. The aim of this work is to investigate how the large language model Lt-Llama-2-7b-hf, pre-trained on Lithuanian language data, can be adapted to the task of simplifying Lithuanian texts into Easy-to-Read language. To achieve this goal, specialized datasets were developed to fine-tune the model, and experiments were carried out. The model was tested by presenting the texts in their original language and the texts with a prompt adapted to the task. The results were evaluated using the SARI metric for assessing the quality of simplified texts and a qualitative evaluation of the large language model. The results show that the fine-tuned model sometimes simplifies text better than a not fine-tuned model, but that a larger and more extensive dataset would be needed to achieve significant results, and that more research should be carried out on fine-tuning the model for this task.
Anthology ID:
2025.aielpl-1.4
Volume:
Proceedings of the 1st Workshop on Artificial Intelligence and Easy and Plain Language in Institutional Contexts (AI & EL/PL)
Month:
June
Year:
2025
Address:
Geneva, Switzerland
Editors:
María Isabel Rivas Ginel, Patrick Cadwell, Paolo Canavese, Silvia Hansen-Schirra, Martin Kappus, Anna Matamala, Will Noonan
Venue:
AIELPL
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
30–37
Language:
URL:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.aielpl-1.4/
DOI:
Bibkey:
Cite (ACL):
Simona Kuoraitė and Valentas Gružauskas. 2025. Simplifying Lithuanian text into Easy-to-Read language using large language models. In Proceedings of the 1st Workshop on Artificial Intelligence and Easy and Plain Language in Institutional Contexts (AI & EL/PL), pages 30–37, Geneva, Switzerland. European Association for Machine Translation.
Cite (Informal):
Simplifying Lithuanian text into Easy-to-Read language using large language models (Kuoraitė & Gružauskas, AIELPL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.aielpl-1.4.pdf