Alternative Lexicalizations of Discourse Connectives in Czech

Magdaléna Rysová


Abstract
The paper examines which linguistic means may be included in the annotation of discourse relations in the Prague Dependency Treebank (PDT), focusing on the so-called alternative lexicalizations of discourse markers (AltLex's) in Czech. The analysis proceeds from the annotated data of PDT and compares the Czech AltLex's from PDT with the English AltLex's from PDTB (the Penn Discourse Treebank). The paper presents a lexico-syntactic and semantic characterization of the Czech AltLex's and comments on the current stage of their annotation in PDT. In the current version, PDT contains 306 expressions (within a total of 43,955 sentences) that annotators labeled as AltLex's. However, as the analysis demonstrates, this number is not final. We expect it to increase after further elaboration, as AltLex's are not restricted to a limited set of syntactic classes and some of them exhibit a great degree of variation.
Anthology ID:
L12-1216
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2800–2807
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/420_Paper.pdf
Cite (ACL):
Magdaléna Rysová. 2012. Alternative Lexicalizations of Discourse Connectives in Czech. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2800–2807, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Alternative Lexicalizations of Discourse Connectives in Czech (Rysová, LREC 2012)
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/420_Paper.pdf