2020
On WordNet Semantic Classes: Is the Sum Always Bigger?
Tsvetana Dimitrova
Proceedings of the 4th International Conference on Computational Linguistics in Bulgaria (CLIB 2020)
The paper offers an approach to validating the data resulting from a previous effort to expand WordNet noun semantic classes by mapping them to the semantic types within the Corpus Pattern Analysis (CPA) ontology employed by the framework of the Pattern Dictionary of English Verbs (PDEV). A case study is presented along with a set of conditions to be checked when validating the combined data.
2019
On Hidden Semantic Relations between Nouns in WordNet
Tsvetana Dimitrova | Valentina Stefanova
Proceedings of the 10th Global Wordnet Conference
The paper presents an effort to transfer noun–verb and noun–adjective derivative and semantic relations to noun–noun relations. The approach relies on information from semantic classes and existing inter-POS derivative and (morpho)semantic relations between noun and verb, and noun and adjective synsets. We have added semantic relations between nouns in WordNet that are indirectly linked via verbs and adjectives. Observations on how the relations combine with the semantic classes of the nouns they link may facilitate further efforts in assigning semantic properties to nouns that point to their ability to participate in predicate-argument structures.
Hear about Verbal Multiword Expressions in the Bulgarian and the Romanian Wordnets Straight from the Horse’s Mouth
Verginica Barbu Mititelu | Ivelina Stoyanova | Svetlozara Leseva | Maria Mitrofan | Tsvetana Dimitrova | Maria Todorova
Proceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019)
In this paper we focus on verbal multiword expressions (VMWEs) in Bulgarian and Romanian as reflected in the wordnets of the two languages. The annotation of VMWEs relies on the classification defined within the PARSEME COST Action. After outlining the properties of various types of VMWEs, we draw a cross-language comparison aimed at highlighting the similarities and differences between Bulgarian and Romanian with respect to the lexicalization and distribution of VMWEs. The contribution of this work lies in outlining key features of the description and classification of VMWEs and in the cross-language comparison at the lexical level, which is essential for understanding the need for uniform annotation guidelines and a viable procedure for validating the annotation.
2016
Automatic Prediction of Morphosemantic Relations
Svetla Koeva | Svetlozara Leseva | Ivelina Stoyanova | Tsvetana Dimitrova | Maria Todorova
Proceedings of the 8th Global WordNet Conference (GWC)
This paper presents a machine learning method for automatic identification and classification of morphosemantic relations (MSRs) between verb and noun synset pairs in the Bulgarian WordNet (BulNet). The core training data comprise 6,641 morphosemantically related verb–noun literal pairs from BulNet. The quality of the core dataset was improved in preprocessing by applying validation and reorganisation procedures. Further, the data were supplemented with negative examples of literal pairs not linked by an MSR. The designed supervised machine learning method uses the RandomTree algorithm and is implemented in Java with the Weka package. A set of experiments was performed to test various approaches to the task. Future work on improving the classifier includes adding more training data, employing more features, and fine-tuning. Apart from the language-specific information about derivational processes, the proposed method is language-independent.
Hydra for Web: A Browser for Easy Access to Wordnets
Borislav Rizov | Tsvetana Dimitrova
Proceedings of the 8th Global WordNet Conference (GWC)
This paper presents a web interface for wordnets named Hydra for Web, which is built on top of Hydra, an open-source tool for wordnet development, by means of modern web technologies. It is a Single Page Application with a simple but powerful and convenient GUI. It has two modes for visualising the language correspondences of searched (and found) wordnet synsets: single and parallel. Hydra for Web is available at: http://dcl.bas.bg/bulnet/.
2015
Automatic Classification of WordNet Morphosemantic Relations
Svetlozara Leseva | Ivelina Stoyanova | Maria Todorova | Tsvetana Dimitrova | Borislav Rizov | Svetla Koeva
The 5th Workshop on Balto-Slavic Natural Language Processing
2014
Coping with Derivation in the Bulgarian Wordnet
Tsvetana Dimitrova | Ekaterina Tarpomanova | Borislav Rizov
Proceedings of the Seventh Global Wordnet Conference
2012
Application of Clause Alignment for Statistical Machine Translation
Svetla Koeva | Svetlozara Leseva | Ivelina Stoyanova | Rositsa Dekova | Angel Genov | Borislav Rizov | Tsvetana Dimitrova | Ekaterina Tarpomanova | Hristina Kukova
Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
2011
The Tenth-Century Cyrillic Manuscript Codex Suprasliensis: the creation of an electronic corpus. UNESCO project (2010–2011)
Hanne Martine Eckhoff | David Birnbaum | Anissava Miltenova | Tsvetana Dimitrova
Proceedings of the Workshop on Language Technologies for Digital Humanities and Cultural Heritage