CzeDLex 0.6 and its Representation in the PML-TQ

Jiří Mírovský, Lucie Poláková, Pavlína Synková


Abstract
CzeDLex is an electronic lexicon of Czech discourse connectives with its data coming from a large treebank annotated with discourse relations. Its new version CzeDLex 0.6 (as compared with the previous version 0.5, which was published in 2017) is significantly larger with respect to manually processed entries. Also, its structure has been modified to allow for primary connectives to appear with multiple entries for a single discourse sense. The lexicon comes in several formats, being both human and machine readable, and is available for searching in PML Tree Query, a user-friendly and powerful search tool for all kinds of linguistically annotated treebanks. The main purpose of this paper/demo is to present the new version of the lexicon and to demonstrate possibilities of mining various types of information from the lexicon using PML Tree Query; we present several examples of search queries over the lexicon data along with their results. The new version of the lexicon, CzeDLex~0.6, is available on-line and was officially released in December 2019 under the Creative Commons License.
Anthology ID:
2020.lrec-1.142
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1128–1134
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.142
DOI:
Bibkey:
Cite (ACL):
Jiří Mírovský, Lucie Poláková, and Pavlína Synková. 2020. CzeDLex 0.6 and its Representation in the PML-TQ. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1128–1134, Marseille, France. European Language Resources Association.
Cite (Informal):
CzeDLex 0.6 and its Representation in the PML-TQ (Mírovský et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2020.lrec-1.142.pdf