Vladimir Petkevic

Also published as: Vladimír Petkevič


2016

pdf
SYN2015: Representative Corpus of Contemporary Written Czech
Michal Křen | Václav Cvrček | Tomáš Čapka | Anna Čermáková | Milena Hnátková | Lucie Chlumská | Tomáš Jelínek | Dominika Kováříková | Vladimír Petkevič | Pavel Procházka | Hana Skoumalová | Michal Škrabal | Petr Truneček | Pavel Vondřička | Adrian Jan Zasina
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

The paper concentrates on the design, composition and annotation of SYN2015, a new 100-million representative corpus of contemporary written Czech. SYN2015 is a sequel of the representative corpora of the SYN series that can be described as traditional (as opposed to the web-crawled corpora), featuring cleared copyright issues, well-defined composition, reliability of annotation and high-quality text processing. At the same time, SYN2015 is designed as a reflection of the variety of written Czech text production with necessary methodological and technological enhancements that include a detailed bibliographic annotation and text classification based on an updated scheme. The corpus has been produced using a completely rebuilt text processing toolchain called SynKorp. SYN2015 is lemmatized, morphologically and syntactically annotated with state-of-the-art tools. It has been published within the framework of the Czech National Corpus and it is available via the standard corpus query interface KonText at http://kontext.korpus.cz as well as a dataset in shuffled format.

2015

pdf bib
Analytic Morphology – Merging the Paradigmatic and Syntagmatic Perspective in a Treebank
Vladimír Petkevič | Alexandr Rosen | Hana Skoumalová | Přemysl Vítovec
The 5th Workshop on Balto-Slavic Natural Language Processing

2003

pdf
The MULTEXT-East Morphosyntactic Specification for Slavic Languages
Tomaž Erjavec | Cvetana Krstev | Vladimír Petkevič | Kiril Simov | Marko Tadić | Duško Vitas
Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages

2001

pdf
Serial Combination of Rules and Statistics: A Case Study in Czech Tagging
Jan Hajic | Pavel Krbec | Pavel Kveton | Karel Oliva | Vladimir Petkevic
Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics

1998

pdf
Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
Ludmila Dimitrova | Tomaz Erjavec | Nancy Ide | Heiki Jaan Kaalep | Vladimir Petkevic | Dan Tufis
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 1

pdf
Multext-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
Ludmila Dimitrova | Tomaz Erjavec | Nancy Ide | Heiki Jaan Kaalep | Vladimir Petkevic | Dan Tufis
COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics

1988

pdf
New Dependency Based Specification of Underlying Representations of Sentences
Vladimir Petkevic
Coling Budapest 1988 Volume 2: International Conference on Computational Linguistics