Automatic Annotation of Direct Speech in Written French Narratives

Noé Durandard, Viet Anh Tran, Gaspard Michel, Elena Epure


Abstract
The automatic annotation of direct speech (AADS) in written text has been often used in computational narrative understanding. Methods based on either rules or deep neural networks have been explored, in particular for English or German languages. Yet, for French, our target language, not many works exist. Our goal is to create a unified framework to design and evaluate AADS models in French. For this, we consolidated the largest-to-date French narrative dataset annotated with DS per word; we adapted various baselines for sequence labelling or from AADS in other languages; and we designed and conducted an extensive evaluation focused on generalisation. Results show that the task still requires substantial efforts and emphasise characteristics of each baseline. Although this framework could be improved, it is a step further to encourage more research on the topic.
Anthology ID:
2023.acl-long.393
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7129–7147
Language:
URL:
https://aclanthology.org/2023.acl-long.393
DOI:
10.18653/v1/2023.acl-long.393
Bibkey:
Cite (ACL):
Noé Durandard, Viet Anh Tran, Gaspard Michel, and Elena Epure. 2023. Automatic Annotation of Direct Speech in Written French Narratives. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7129–7147, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Automatic Annotation of Direct Speech in Written French Narratives (Durandard et al., ACL 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/2023.acl-long.393.pdf
Video:
 https://preview.aclanthology.org/dois-2013-emnlp/2023.acl-long.393.mp4