Martin Cmejrek

Also published as: Martin Čmejrek


2014

Reordering Model for Forest-to-String Machine Translation
Martin Čmejrek
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

2013

Flexible and Efficient Hypergraph Interactions for Joint Hierarchical and Forest-to-String Decoding
Martin Čmejrek | Haitao Mi | Bowen Zhou
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

2010

Soft Syntactic Constraints for Hierarchical Phrase-Based Translation Using Latent Syntactic Distributions
Zhongqiang Huang | Martin Čmejrek | Bowen Zhou
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing

Two Methods for Extending Hierarchical Rules from the Bilingual Chart Parsing
Martin Čmejrek | Bowen Zhou
Coling 2010: Posters

2009

Enriching SCFG rules directly from efficient bilingual chart parsing
Martin Čmejrek | Bowen Zhou | Bing Xiang
Proceedings of the 6th International Workshop on Spoken Language Translation: Papers

In this paper, we propose a new method for training translation rules for a Synchronous Context-free Grammar. A bilingual chart parser is used to generate the parse forest, and the EM algorithm is used to estimate expected counts for each rule in the rule set. Additional rules are constructed as combinations of reliable rules occurring in the parse forest. This method of proposing additional translation rules is independent of word alignments. We present the theoretical background for the method and initial experimental results on German-English translation of Europarl data.

2005

Prague Czech-English dependency treebank: resource for structure-based MT
Martin Čmejrek | Jan Cuřín | Jan Hajič | Jiří Havelka
Proceedings of the 10th EAMT Conference: Practical applications of machine translation

2004

Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation
Martin Čmejrek | Jan Cuřín | Jiří Havelka | Jan Hajič | Vladislav Kuboň
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Prague Czech-English Dependency Treebank: Any Hopes for a Common Annotation Scheme?
Martin Čmejrek | Jan Cuřín | Jiří Havelka
Proceedings of the Workshop Frontiers in Corpus Annotation at HLT-NAACL 2004

2003

Czech-English Dependency Tree-based Machine Translation
Martin Čmejrek | Jan Cuřín | Jiří Havelka
10th Conference of the European Chapter of the Association for Computational Linguistics

1999

Automatic translation lexicon extraction from English-Czech parallel texts
Martin Čmejrek | Jan Cuřín
EAMT Workshop: EU and the new languages