ParlaMint II: The Show Must Go On
Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Meden Katja
Abstract
In ParlaMint I, a CLARIN-ERIC supported project in pandemic times, a set of comparable and uniformly annotated multilingual corpora for 17 national parliaments were developed and released in 2021. For 2022 and 2023, the project has been extended to ParlaMint II, again with the CLARIN ERIC financial support, in order to enhance the existing corpora with new data and metadata; upgrade the XML schema; add corpora for 10 new parliaments; provide more application scenarios and carry out additional experiments. The paper reports on these planned steps, including some that have already been taken, and outlines future plans.- Anthology ID:
- 2022.parlaclarin-1.1
- Volume:
- Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Editors:
- Darja Fišer, Maria Eskevich, Jakob Lenardič, Franciska de Jong
- Venue:
- ParlaCLARIN
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 1–6
- Language:
- URL:
- https://aclanthology.org/2022.parlaclarin-1.1
- DOI:
- Cite (ACL):
- Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, and Meden Katja. 2022. ParlaMint II: The Show Must Go On. In Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 1–6, Marseille, France. European Language Resources Association.
- Cite (Informal):
- ParlaMint II: The Show Must Go On (Ogrodniczuk et al., ParlaCLARIN 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/2022.parlaclarin-1.1.pdf