1 |
Improving Multilingual Translation by Representation and Gradient Regularization ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Discovering Representation Sprachbund For Multilingual Pre-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Multi-Source Cross-Lingual Model Transfer: Learning What to Share ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Universal Neural Machine Translation for Extremely Low Resource Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Synthetic Data for Neural Machine Translation of Spoken-Dialects ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
A syntactified direct translation model with linear-time decoding
|
|
|
|
In: Hassan, Hany, Sima'an, Khalil and Way, Andy orcid:0000-0001-5736-5930 (2009) A syntactified direct translation model with linear-time decoding. In: EMNLP 2009 - Conference on Empirical Methods in Natural Language Processing, 6-7 August 2009, Singapore. (2009)
|
|
BASE
|
|
Show details
|
|
9 |
Lexical syntax for statistical machine translation
|
|
Hassan, Hany. - : Dublin City University. School of Computing, 2009
|
|
In: Hassan, Hany (2009) Lexical syntax for statistical machine translation. PhD thesis, Dublin City University. (2009)
|
|
Abstract:
Statistical Machine Translation (SMT) is by far the most dominant paradigm of Machine Translation. This can be justified by many reasons, such as accuracy, scalability, computational efficiency and fast adaptation to new languages and domains. However, current approaches of Phrase-based SMT lacks the capabilities of producing more grammatical translations and handling long-range reordering while maintaining the grammatical structure of the translation output. Recently, SMT researchers started to focus on extending Phrase-based SMT systems with syntactic knowledge; however, the previous techniques have limited capabilities due to introducing redundantly ambiguous syntactic structures and using decoders with limited language models, and with a high computational cost. In this thesis, we extend Phrase-based SMT with lexical syntactic descriptions that localize global syntactic information on the word without introducing syntactic redundant ambiguity. We presente a novel model of Phrase-based SMT which integrates linguistic lexical descriptions —supertags— into the target language model and the target side of the translation model. We conduct extensive experiments in two language pairs, Arabic– English and German–English, which show significant improvements over the state-ofthe- art Phrase-based SMT systems. Moreover, we introduce a novel Incremental Dependency-based Syntactic Language Model (IDLM) based on wide-coverage CCG incremental parsing which we integrate into a direct translation SMT system. Our proposed approach is the first to integrate full dependency parsing in SMT systems with a very attractive computational cost since it deploys the linear decoders widely used in Phrase–based SMT systems. The experimental results show a good improvement over a top-ranked state-of-the-art system.
|
|
Keyword:
Computational linguistics; Machine learning; Machine translating
|
|
URL: http://doras.dcu.ie/2320/
|
|
BASE
|
|
Hide details
|
|
10 |
Lexicalized semi-incremental dependency parsing
|
|
|
|
In: Hassan, Hany, Sima'an, Khalil and Way, Andy orcid:0000-0001-5736-5930 (2009) Lexicalized semi-incremental dependency parsing. In: RANLP-2009 - Recent Advances in Natural Language Processing, 14-16 September 2009, Borovets, Bulgaria. (2009)
|
|
BASE
|
|
Show details
|
|
12 |
Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008
|
|
|
|
In: Ma, Yanjun, Tinsley, John, Hassan, Hany, Du, Jinhua orcid:0000-0002-3267-4881 and Way, Andy orcid:0000-0001-5736-5930 (2008) Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. In: IWSLT 2008 - International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA. (2008)
|
|
BASE
|
|
Show details
|
|
13 |
Supertagged phrase-based statistical machine translation
|
|
|
|
In: Hassan, Hany, Sima'an, Khalil and Way, Andy orcid:0000-0001-5736-5930 (2007) Supertagged phrase-based statistical machine translation. In: ACL 2007 - 45th Annual Meeting of the Association for Computational Linguistics, 25-27 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
|
|