Abstract
We present a new freely available dictionary of paraphrases of Czech complex predicates with light verbs, ParaDi. Candidates for single predicative paraphrases of selected complex predicates have been extracted automatically from large monolingual data using word2vec. They have been manually verified and further refined. We demonstrate one of many possible applications of ParaDi in an experiment with improving machine translation quality.- Anthology ID:
- W17-1701
- Volume:
- Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Venue:
- MWE
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1–10
- Language:
- URL:
- https://aclanthology.org/W17-1701
- DOI:
- 10.18653/v1/W17-1701
- Cite (ACL):
- Petra Barančíková and Václava Kettnerová. 2017. ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 1–10, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs (Barančíková & Kettnerová, MWE 2017)
- PDF:
- https://preview.aclanthology.org/auto-file-uploads/W17-1701.pdf