Application of Existing Readability Methods to the Ukrainian Language: A Comprehensive Study

Serhii D. Prykhodchenko, Oksana Yu. Prykhodchenko


Abstract
The Ukrainian language currently lacks a well-developed framework for assessing text readability. This study addresses this gap by focusing on three key contributions. First, we present the creation of UkrTB, a Ukrainian-language corpus of texts categorized by reader age. Second, we conduct a statistical analysis of the corpus, evaluating key linguistic features such as sentence length, word complexity, and part-of-speech distribution. Third, we systematically assess the applicability of existing readability formulas, including Flesch, Flesch-Kincaid, Matskovskii, Pisarek, and Solnyshkina et al., to Ukrainian texts. Our findings indicate that readability models developed for English and other Slavic languages exhibit significant limitations when applied to Ukrainian. While some methods demonstrate partial correlation with expected readability levels, others produce inconsistent results, underscoring the need for a specialized readability metric tailored to Ukrainian. This work lays the foundation for further research in Ukrainian readability assessment and the development of language-specific models
Anthology ID:
2025.quasy-1.4
Volume:
Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025)
Month:
August
Year:
2025
Address:
Ljubljana, Slovenia
Editors:
Xinying Chen, Yaqin Wang
Venues:
Quasy | WS | SyntaxFest
SIG:
SIGPARSE
Publisher:
Association for Computational Linguistics
Note:
Pages:
17–25
Language:
URL:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.quasy-1.4/
DOI:
Bibkey:
Cite (ACL):
Serhii D. Prykhodchenko and Oksana Yu. Prykhodchenko. 2025. Application of Existing Readability Methods to the Ukrainian Language: A Comprehensive Study. In Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025), pages 17–25, Ljubljana, Slovenia. Association for Computational Linguistics.
Cite (Informal):
Application of Existing Readability Methods to the Ukrainian Language: A Comprehensive Study (Prykhodchenko & Prykhodchenko, Quasy-SyntaxFest 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.quasy-1.4.pdf