Rethinking Scene Segmentation. Advancing Automated Detection of Scene Changes in Literary Texts

Svenja Guhr, Huijun Mao, Fengyi Lin


Abstract
Automated scene segmentation is an ongoing challenge in computational literary studies (CLS) to approach literary texts by analyzing comparable units. In this paper, we present our approach (work in progress) to text segmentation using a classifier that identifies the position of a scene change in English-language fiction. By manually annotating novels from a 20th-century US-English romance fiction corpus, we prepared training data for fine-tuning transformer models, yielding promising preliminary results for improving automated text segmentation in CLS.
Anthology ID:
2025.latechclfl-1.8
Volume:
Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Anna Kazantseva, Stan Szpakowicz, Stefania Degaetano-Ortlieb, Yuri Bizzoni, Janis Pagel
Venues:
LaTeCHCLfL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
79–86
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.latechclfl-1.8/
DOI:
Bibkey:
Cite (ACL):
Svenja Guhr, Huijun Mao, and Fengyi Lin. 2025. Rethinking Scene Segmentation. Advancing Automated Detection of Scene Changes in Literary Texts. In Proceedings of the 9th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2025), pages 79–86, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Rethinking Scene Segmentation. Advancing Automated Detection of Scene Changes in Literary Texts (Guhr et al., LaTeCHCLfL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.latechclfl-1.8.pdf