Generation of Russian Poetry of Different Genres and Styles Using Neural Networks with Character-Level Tokenization

Ilya Koziev, Alena Fenogenova


Abstract
Automatic poetry generation is an immensely complex task, even for the most advanced Large Language Models (LLMs) that requires a profound understanding of intelligence, world and linguistic knowledge, and a touch of creativity.This paper investigates the use of LLMs in generating Russian syllabo-tonic poetry of various genres and styles. The study explores a character-level tokenization architectures and demonstrates how a language model can be pretrained and finetuned to generate poetry requiring knowledge of a language’s phonetics. Additionally, the paper assesses the quality of the generated poetry and the effectiveness of the approach in producing different genres and styles. The study’s main contribution is the introduction of two end-to-end architectures for syllabo-tonic Russian poetry: pretrained models, a comparative analysis of the approaches, and poetry evaluation metrics.
Anthology ID:
2025.latechclfl-1.6
Volume:
Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Anna Kazantseva, Stan Szpakowicz, Stefania Degaetano-Ortlieb, Yuri Bizzoni, Janis Pagel
Venues:
LaTeCHCLfL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
47–63
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.latechclfl-1.6/
DOI:
Bibkey:
Cite (ACL):
Ilya Koziev and Alena Fenogenova. 2025. Generation of Russian Poetry of Different Genres and Styles Using Neural Networks with Character-Level Tokenization. In Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025), pages 47–63, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Generation of Russian Poetry of Different Genres and Styles Using Neural Networks with Character-Level Tokenization (Koziev & Fenogenova, LaTeCHCLfL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.latechclfl-1.6.pdf