ParlaMint-RO: Chamber of the Eternal Future

Petru Rebeja, Mădălina Chitez, Roxana Rogobete, Andreea Dincă, Loredana Bercuci


Abstract
The present paper aims to describe the collection of ParlaMint-RO corpus and to analyse several trends in parliamentary debates (plenary sessions of the Lower House) held in between 2000 and 2020). After a short description of the data collection (of existing transcripts), the workflow of data processing (text extraction, conversion, encoding, linguistic annotation), and an overview of the corpus, the paper will move on to a multi-layered linguistic analysis to validate interdisciplinary perspectives. We use computational methods and corpus linguistics approaches to scrutinize the future tense forms used by Romanian speakers, in order to create a data-supported profile of the parliamentary group strategies and planning.
Anthology ID:
2022.parlaclarin-1.19
Volume:
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Venue:
ParlaCLARIN
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
131–134
Language:
URL:
https://aclanthology.org/2022.parlaclarin-1.19
DOI:
Bibkey:
Cite (ACL):
Petru Rebeja, Mădălina Chitez, Roxana Rogobete, Andreea Dincă, and Loredana Bercuci. 2022. ParlaMint-RO: Chamber of the Eternal Future. In Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 131–134, Marseille, France. European Language Resources Association.
Cite (Informal):
ParlaMint-RO: Chamber of the Eternal Future (Rebeja et al., ParlaCLARIN 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.parlaclarin-1.19.pdf
Code
 romanian-parlamint/future-tense-usage