Abstract
We present an attempt to automatically identify Czech deverbative nouns using several methods that use large corpora as well as existing lexical resources. The motivation for the task is to extend a verbal valency (i.e., predicate-argument) lexicon by adding nouns that share the valency properties with the base verb, assuming their properties can be derived (even if not trivially) from the underlying verb by deterministic grammatical rules. At the same time, even in inflective languages, not all deverbatives are simply created from their underlying base verb by regular lexical derivation processes. We have thus developed hybrid techniques that use both large parallel corpora and several standard lexical resources. Thanks to the use of parallel corpora, the resulting sets contain also synonyms, which the lexical derivation rules cannot get. For evaluation, we have manually created a small, 100-verb gold data since no such dataset was initially available for Czech.- Anthology ID:
- W16-3810
- Volume:
- Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex)
- Month:
- December
- Year:
- 2016
- Address:
- Osaka, Japan
- Editors:
- Eva Hajičová, Igor Boguslavsky
- Venue:
- GramLex
- SIG:
- Publisher:
- The COLING 2016 Organizing Committee
- Note:
- Pages:
- 71–80
- Language:
- URL:
- https://aclanthology.org/W16-3810
- DOI:
- Cite (ACL):
- Eva Fučíková, Jan Hajič, and Zdeňka Urešová. 2016. Enriching a Valency Lexicon by Deverbative Nouns. In Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex), pages 71–80, Osaka, Japan. The COLING 2016 Organizing Committee.
- Cite (Informal):
- Enriching a Valency Lexicon by Deverbative Nouns (Fučíková et al., GramLex 2016)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/W16-3810.pdf
- Data
- NomBank, Penn Treebank