1 |
Statistical post-editing and quality estimation for machine translation systems
|
|
|
|
In: Béchara, Hanna (2014) Statistical post-editing and quality estimation for machine translation systems. Master of Science thesis, Dublin City University. (2014)
|
|
BASE
|
|
Show details
|
|
5 |
Statistical Analysis of Alignment Characteristics for Phrase-based Machine Translation
|
|
|
|
In: Proceedings of the 14th European Association for Machine Translation ; https://hal.archives-ouvertes.fr/hal-00525181 ; Proceedings of the 14th European Association for Machine Translation, May 2010, Saint-Raphaël, France. no page number (2010)
|
|
BASE
|
|
Show details
|
|
6 |
HMM word-to-phrase alignment with dependency constraints
|
|
|
|
In: Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2010) HMM word-to-phrase alignment with dependency constraints. In: SSST 2010 - 4th Workshop on Syntax and Structure in Statistical Translation, 28 August 2010, Beijing, China. (2010)
|
|
BASE
|
|
Show details
|
|
7 |
Statistical analysis of alignment characteristics for phrase-based machine translation
|
|
|
|
In: Lambert, Patrik, Petitrenaud, Simon, Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2010) Statistical analysis of alignment characteristics for phrase-based machine translation. In: EAMT 2010 - 14th Annual Conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. (2010)
|
|
BASE
|
|
Show details
|
|
8 |
Accuracy-based scoring for phrase-based statistical machine translation
|
|
|
|
In: Penkale, Sergio, Ma, Yanjun, Galron, Daniel and Way, Andy orcid:0000-0001-5736-5930 (2010) Accuracy-based scoring for phrase-based statistical machine translation. In: AMTA 2010 - 9th Conference of the Association for Machine Translation in the Americas, 31 October - 4 November 2010, Denver, CO, USA. (2010)
|
|
BASE
|
|
Show details
|
|
9 |
Integrating N-best SMT outputs into a TM system
|
|
|
|
In: He, Yifan, Ma, Yanjun, Way, Andy orcid:0000-0001-5736-5930 and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Integrating N-best SMT outputs into a TM system. In: COLING 2010 - 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing, China. (2010)
|
|
BASE
|
|
Show details
|
|
10 |
Bridging SMT and TM with translation recommendation
|
|
|
|
In: He, Yifan, Ma, Yanjun, van Genabith, Josef orcid:0000-0003-1322-7944 and Way, Andy orcid:0000-0001-5736-5930 (2010) Bridging SMT and TM with translation recommendation. In: ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, 11-16 July 2010, Uppsala, Sweden. (2010)
|
|
BASE
|
|
Show details
|
|
13 |
Constrained word alignment models for statistical machine translation
|
|
Ma, Yanjun. - : Dublin City University. National Centre for Language Technology (NCLT), 2009. : Dublin City University. School of Computing, 2009
|
|
In: Ma, Yanjun (2009) Constrained word alignment models for statistical machine translation. PhD thesis, Dublin City University. (2009)
|
|
BASE
|
|
Show details
|
|
14 |
Using supertags as source language context in SMT
|
|
|
|
In: Haque, Rejwanul orcid:0000-0003-1680-0099 , Naskar, Sudip Kumar, Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2009) Using supertags as source language context in SMT. In: EAMT 2009 - 13th Annual Conference of the European Association for Machine Translation, 13-15 May 2009, Barcelona, Spain. (2009)
|
|
BASE
|
|
Show details
|
|
15 |
Source-side context-informed hypothesis alignment for combining outputs from machine translation systems
|
|
|
|
In: Du, Jinhua orcid:0000-0002-3267-4881 , Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2009) Source-side context-informed hypothesis alignment for combining outputs from machine translation systems. In: MT Summit XII - The twelfth Machine Translation Summit, 26-30 August 2009, Ottawa, Canada. (2009)
|
|
BASE
|
|
Show details
|
|
16 |
Bilingually motivated word segmentation for statistical machine translation
|
|
|
|
In: Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2009) Bilingually motivated word segmentation for statistical machine translation. ACM Transactions on Asian Language Information Processing, 8 (2). ISSN 1530-0226 (2009)
|
|
Abstract:
We introduce a bilingually motivated word segmentation approach to languages where word boundaries are not orthographically marked, with application to Phrase-Based Statistical Machine Translation (PB-SMT). Our approach is motivated from the insight that PB-SMT systems can be improved by optimizing the input representation to reduce the predictive power of translation models. We firstly present an approach to optimize the existing segmentation of both source and target languages for PB-SMT and demonstrate the effectiveness of this approach using a Chinese–English MT task, that is, to measure the influence of the segmentation on the performance of PB-SMT systems. We report a 5.44% relative increase in Bleu score and a consistent increase according to other metrics. We then generalize this method for Chinese word segmentation without relying on any segmenters and show that using our segmentation PB-SMT can achieve more consistent state-of-the-art performance across two domains. There are two main advantages of our approach. First of all, it is adapted to the specific translation task at hand by taking the corresponding source (target) language into account. Second, this approach does not rely on manually segmented training data so that it can be automatically adapted for different domains.
|
|
Keyword:
Machine translating
|
|
URL: http://doras.dcu.ie/15814/
|
|
BASE
|
|
Hide details
|
|
17 |
Tracking relevant alignment characteristics for machine translation
|
|
|
|
In: Lambert, Patrik, Ma, Yanjun, Ozdowska, Sylwia and Way, Andy orcid:0000-0001-5736-5930 (2009) Tracking relevant alignment characteristics for machine translation. In: MT Summit XII - The twelfth Machine Translation Summit, 26-30 August 2009, Ottawa, Canada. (2009)
|
|
BASE
|
|
Show details
|
|
18 |
Bilingually motivated domain-adapted word segmentation for statistical machine translation
|
|
|
|
In: Ma, Yanjun and Way, Andy orcid:0000-0001-5736-5930 (2009) Bilingually motivated domain-adapted word segmentation for statistical machine translation. In: EACL 2009 Workshop on Computational Approaches to Semitic Languages, 31 March 2009, Athens, Greece. (2009)
|
|
BASE
|
|
Show details
|
|
19 |
Improving word alignment using syntactic dependencies
|
|
|
|
In: Ma, Yanjun, Ozdowska, Sylwia, Sun, Yanli and Way, Andy orcid:0000-0001-5736-5930 (2008) Improving word alignment using syntactic dependencies. In: ACL08-SSST - Proceedings of ACL08 workshop on Syntax and Structure in Statistical Translation, 20 June 2008, Columbus, Ohio, USA. (2008)
|
|
BASE
|
|
Show details
|
|
20 |
Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008
|
|
|
|
In: Ma, Yanjun, Tinsley, John, Hassan, Hany, Du, Jinhua orcid:0000-0002-3267-4881 and Way, Andy orcid:0000-0001-5736-5930 (2008) Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. In: IWSLT 2008 - International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA. (2008)
|
|
BASE
|
|
Show details
|
|
|
|