Analyzing Large Language Models’ pastiche ability: a case study on a 20th century Romanian author

Anca Dinu, Andra-Maria Florescu, Liviu Dinu


Abstract
This study evaluated the ability of several Large Language Models (LLMs) to pastiche the literary style of the 20th-century Romanian author Mateiu Caragiale by continuing a novel he left unfinished at his death. We assembled a dataset consisting of six texts by Mateiu Caragiale, including the unfinished novel; six texts by Radu Albala, including his continuation of Caragiale’s novel; and six LLM-generated texts that attempt to pastiche it. We compared the LLM-generated texts with Albala’s continuation using several methods: automatic evaluation with standard metrics (ROUGE, BLEU, and METEOR); stylometric analysis, clustering, and authorship attribution; and manual analysis. Both the computational and the manual analyses indicated that LLMs can produce pastiches of fairly good quality, though without matching the professional writer’s performance. The study also showed that classical machine learning (ML) techniques outperformed more recent deep learning (DL) ones in both the clustering and the authorship attribution tasks, probably because the dataset consists of only a few archaic literary texts in Romanian. In addition, linguistically informed features proved competitive with automatically extracted features.
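As a rough illustration of the kind of overlap metric named in the abstract, the following is a minimal sketch of a ROUGE-1-style score (unigram precision, recall, and F1) between a candidate text and a reference text. It is not the authors' implementation: the function name `rouge1`, the whitespace tokenization, the lowercasing, and the toy English strings are all assumptions made for this example, and real evaluations typically rely on an established metric library with proper tokenization.

```python
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict:
    """Unigram-overlap precision, recall, and F1 (a ROUGE-1-style score).

    Hypothetical helper for illustration only; tokenization is a plain
    lowercase whitespace split, which is an assumption of this sketch.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()

    # Clipped overlap: a token counts at most as many times as it
    # occurs in both the candidate and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Toy strings standing in for an LLM pastiche and a human continuation.
scores = rouge1("the cat sat", "the cat sat on the mat")
print(scores)
```

A score of this kind measures only surface word overlap with the reference continuation, which is presumably one reason the study pairs such metrics with stylometric and manual analysis.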
Anthology ID:
2025.nlp4dh-1.3
Volume:
Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities
Month:
May
Year:
2025
Address:
Albuquerque, USA
Editors:
Mika Hämäläinen, Emily Öhman, Yuri Bizzoni, So Miyagawa, Khalid Alnajjar
Venues:
NLP4DH | WS
Publisher:
Association for Computational Linguistics
Pages:
20–32
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.nlp4dh-1.3/
Cite (ACL):
Anca Dinu, Andra-Maria Florescu, and Liviu Dinu. 2025. Analyzing Large Language Models’ pastiche ability: a case study on a 20th century Romanian author. In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pages 20–32, Albuquerque, USA. Association for Computational Linguistics.
Cite (Informal):
Analyzing Large Language Models’ pastiche ability: a case study on a 20th century Romanian author (Dinu et al., NLP4DH 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.nlp4dh-1.3.pdf