Beata Wójtowicz
2024
Polish Round Table Corpus
Maciej Ogrodniczuk | Ryszard Tuora | Beata Wójtowicz
Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024
Maciej Ogrodniczuk | Ryszard Tuora | Beata Wójtowicz
Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024
The paper describes the process of preparation of the Polish Round Table Corpus (Pol. Korpus Okrągłego Stołu), a new resource documenting negotiations taking place in 1989 between the representatives of the communist government of the People’s Republic of Poland and the Solidarity opposition. The process consisted of OCR of graphical transcripts of the talks stored in the form of parliament-like stenographic transcripts, carrying out their manual correction and making them available for search in a concordancer currently used for standard parliamentary transcripts.
UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology
Agata Savary | Daniel Zeman | Verginica Barbu Mititelu | Anabela Barreiro | Olesea Caftanatov | Marie-Catherine de Marneffe | Kaja Dobrovoljc | Gülşen Eryiğit | Voula Giouli | Bruno Guillaume | Stella Markantonatou | Nurit Melnik | Joakim Nivre | Atul Kr. Ojha | Carlos Ramisch | Abigail Walsh | Beata Wójtowicz | Alina Wróblewska
Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024
Agata Savary | Daniel Zeman | Verginica Barbu Mititelu | Anabela Barreiro | Olesea Caftanatov | Marie-Catherine de Marneffe | Kaja Dobrovoljc | Gülşen Eryiğit | Voula Giouli | Bruno Guillaume | Stella Markantonatou | Nurit Melnik | Joakim Nivre | Atul Kr. Ojha | Carlos Ramisch | Abigail Walsh | Beata Wójtowicz | Alina Wróblewska
Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024
This paper presents the objectives, organization and activities of the UniDive COST Action, a scientific network dedicated to universality, diversity and idiosyncrasy in language technology. We describe the objectives and organization of this initiative, the people involved, the working groups and the ongoing tasks and activities. This paper is also an pen call for participation towards new members and countries.
2022
Error Correction Environment for the Polish Parliamentary Corpus
Maciej Ogrodniczuk | Michał Rudolf | Beata Wójtowicz | Sonia Janicka
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
Maciej Ogrodniczuk | Michał Rudolf | Beata Wójtowicz | Sonia Janicka
Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference
The paper introduces the environment for detecting and correcting various kinds of errors in the Polish Parliamentary Corpus. After performing a language model-based error detection experiment which resulted in too many false positives, a simpler rule-based method was introduced and is currently used in the process of manual verification of corpus texts. The paper presents types of errors detected in the corpus, the workflow of the correction process and the tools newly implemented for this purpose. To facilitate comparison of a target corpus XML file with its usually graphical PDF source, a new mechanism for inserting PDF page markers into XML was developed and is used for displaying a single source page corresponding to a given place in the resulting XML directly in the error correction environment.
2009
A Repository of Free Lexical Resources for African Languages: The Project and the Method
Piotr Bański | Beata Wójtowicz
Proceedings of the First Workshop on Language Technologies for African Languages
Piotr Bański | Beata Wójtowicz
Proceedings of the First Workshop on Language Technologies for African Languages
2007
Search
Fix author
Co-authors
- Maciej Ogrodniczuk 2
- Verginica Barbu Mititelu 1
- Anabela Barreiro 1
- Piotr Bański 1
- Olesea Caftanatov 1
- Łukasz Degórski 1
- Kaja Dobrovoljc 1
- Gülşen Eryiğit 1
- Voula Giouli 1
- Bruno Guillaume 1
- Sonia Janicka 1
- Vladislav Kubon 1
- Lothar Lemnitzer 1
- Stella Markantonatou 1
- Nurit Melnik 1
- Joakim Nivre 1
- Atul Kr. Ojha 1
- Petya Osenova 1
- Adam Przepiórkowski 1
- Carlos Ramisch 1
- Michał Rudolf 1
- Agata Savary 1
- Kiril Simov 1
- Miroslav Spousta 1
- Ryszard Tuora 1
- Abigail Walsh 1
- Alina Wróblewska 1
- Daniel Zeman 1
- Marie-Catherine de Marneffe 1