Martin Cmejrek
Also published as: Martin Čmejrek
2014
Reordering Model for Forest-to-String Machine Translation
Martin Čmejrek
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
2013
Flexible and Efficient Hypergraph Interactions for Joint Hierarchical and Forest-to-String Decoding
Martin Čmejrek | Haitao Mi | Bowen Zhou
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
2010
Two Methods for Extending Hierarchical Rules from the Bilingual Chart Parsing
Martin Čmejrek | Bowen Zhou
Coling 2010: Posters
Soft Syntactic Constraints for Hierarchical Phrase-Based Translation Using Latent Syntactic Distributions
Zhongqiang Huang | Martin Čmejrek | Bowen Zhou
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
2009
Enriching SCFG rules directly from efficient bilingual chart parsing
Martin Čmejrek | Bowen Zhou | Bing Xiang
Proceedings of the 6th International Workshop on Spoken Language Translation: Papers
In this paper, we propose a new method for training translation rules for a Synchronous Context-free Grammar. A bilingual chart parser is used to generate the parse forest, and the EM algorithm is used to estimate expected counts for each rule of the ruleset. Additional rules are constructed as combinations of reliable rules occurring in the parse forest. The new method of proposing additional translation rules is independent of word alignments. We present the theoretical background for this method, and initial experimental results on German-English translations of Europarl data.
2005
Prague Czech-English dependency treebank: resource for structure-based MT
Martin Čmejrek | Jan Cuřín | Jan Hajič | Jiří Havelka
Proceedings of the 10th EAMT Conference: Practical applications of machine translation
2004
Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation
Martin Čmejrek | Jan Cuřín | Jiří Havelka | Jan Hajič | Vladislav Kuboň
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Prague Czech-English Dependency Treebank: Any Hopes for a Common Annotation Scheme?
Martin Čmejrek | Jan Cuřín | Jiří Havelka
Proceedings of the Workshop Frontiers in Corpus Annotation at HLT-NAACL 2004
2003
Czech-English Dependency Tree-based Machine Translation
Martin Čmejrek | Jan Cuřín | Jiří Havelka
10th Conference of the European Chapter of the Association for Computational Linguistics