Pavel Stepachev
2021
Preserving high MT quality for content with inline tags
Konstantin Savenkov
|
Grigory Sapunov
|
Pavel Stepachev
Proceedings of Machine Translation Summit XVIII: Users and Providers Track
Attendees will learn about how we use machine translation to provide targeted, high MT quality for content with inline tags. We offer a new and innovative approach to inserting tags into the translated text in a way that reliably preserves their quality. This process can achieve better MT quality and lower costs, as it is MT-independent, and can be used for all languages, MT engines, and use cases.
2018
Multi-source synthetic treebank creation for improved cross-lingual dependency parsing
Francis Tyers
|
Mariya Sheyanova
|
Aleksandra Martynova
|
Pavel Stepachev
|
Konstantin Vinogorodskiy
Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
This paper describes a method of creating synthetic treebanks for cross-lingual dependency parsing using a combination of machine translation (including pivot translation), annotation projection and the spanning tree algorithm. Sentences are first automatically translated from a lesser-resourced language to a number of related highly-resourced languages, parsed and then the annotations are projected back to the lesser-resourced language, leading to multiple trees for each sentence from the lesser-resourced language. The final treebank is created by merging the possible trees into a graph and running the spanning tree algorithm to vote for the best tree for each sentence. We present experiments aimed at parsing Faroese using a combination of Danish, Swedish and Norwegian. In a similar experimental setup to the CoNLL 2018 shared task on dependency parsing we report state-of-the-art results on dependency parsing for Faroese using an off-the-shelf parser.
Search
Co-authors
- Konstantin Savenkov 1
- Grigory Sapunov 1
- Francis Tyers 1
- Mariya Sheyanova 1
- Aleksandra Martynova 1
- show all...