Francesca Dell’Oro


2022

Setting Up Bilingual Comparable Corpora with Non-Contemporary Languages
Helena Bermudez Sabel | Francesca Dell’Oro | Cyrielle Montrichard | Corinne Rossari
Proceedings of the BUCC Workshop within LREC 2022

This paper presents the project “Les corpora latins et français: une fabrique pour l’accès à la représentation des connaissances” (Latin and French Corpora: a Factory For Accessing Knowledge Representation), whose focus is the study of modality in both Latin and French by means of multi-genre, diachronic comparable corpora. Setting up such corpora involves a number of conceptualisation challenges, in particular with regard to how to compare two asynchronous textual productions corresponding to different cultural frameworks. In this paper we outline the rationale for designing comparable corpora to explore our research questions and then focus on some of the issues that arise when comparing different diachronic spans of Latin and French. We also explain how these issues were dealt with, thus providing some grounds upon which other projects could build their methodology.