1 |
Deep interactive text prediction and quality estimation in translation interfaces
|
|
|
|
In: Hokamp, Christopher M. (2018) Deep interactive text prediction and quality estimation in translation interfaces. PhD thesis, Dublin City University. (2018)
|
|
BASE
|
|
|
|
2 |
Statistical post-editing and quality estimation for machine translation systems
|
|
|
|
In: Béchara, Hanna (2014) Statistical post-editing and quality estimation for machine translation systems. Master of Science thesis, Dublin City University. (2014)
|
|
BASE
|
|
|
|
3 |
Predicting sentence translation quality using extrinsic and language independent features
|
|
|
|
In: Bicici, Ergun, Groves, Declan and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Predicting sentence translation quality using extrinsic and language independent features. Machine Translation, 27 (3-4). pp. 171-192. ISSN 0922-6567 (2013)
|
|
BASE
|
|
|
|
4 |
Domain adaptation for statistical machine translation of corporate and user-generated content
|
|
|
|
In: Banerjee, Pratyush (2013) Domain adaptation for statistical machine translation of corporate and user-generated content. PhD thesis, Dublin City University. (2013)
|
|
Abstract:
The growing popularity of Statistical Machine Translation (SMT) techniques in recent years has led to the development of multiple domain-specific resources and adaptation scenarios. In this thesis we address two important and industrially relevant adaptation scenarios, each suited to different kinds of content. Initially focussing on professionally edited 'enterprise-quality' corporate content, we address a specific scenario of translating data from a mixture of different domains where domain-specific data is available for each domain. We utilise an automatic classifier to combine multiple domain-specific models and empirically show that such a configuration results in better translation quality compared to both traditional and state-of-the-art techniques for handling mixed-domain translation. In the second phase of our research, we shift our focus to the translation of possibly 'noisy' user-generated content in web forums created around the products and services of a multinational company. Using professionally edited translation memory (TM) data for training, we use different normalisation and data selection techniques to adapt SMT models to noisy forum content. In this scenario, we also study the effect of mixture adaptation using a combination of in-domain and out-of-domain data at different component levels of an SMT system. Finally, we focus on the task of optimal supplementary training data selection from out-of-domain corpora, using a novel incremental model merging mechanism to adapt TM-based models to improve forum-content translation quality.
|
|
Keyword:
Computational linguistics; Machine learning; Machine translating; SMT; Statistical Machine Translation
|
|
URL: http://doras.dcu.ie/17722/
|
|
BASE
|
|
|
|
5 |
CNGL: Grading student answers by acts of translation
|
|
|
|
In: Bicici, Ergun orcid:0000-0002-2293-2031 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) CNGL: Grading student answers by acts of translation. In: SEMEVAL, 14-15 Jun 2013, Atlanta, Georgia. (2013)
|
|
BASE
|
|
|
|
6 |
Definition of interfaces
|
|
|
|
In: Almaghout, Hala, Bicici, Ergun, Doherty, Stephen orcid:0000-0003-0887-1049 , Gaspari, Federico, Groves, Declan, Toral, Antonio orcid:0000-0003-2357-2960 , van Genabith, Josef orcid:0000-0003-1322-7944 , Popović, Maja orcid:0000-0001-8234-8745 and Piperidis, Stelios (2013) Definition of interfaces. Project Report. QTLaunchPad. (2013)
|
|
BASE
|
|
|
|
7 |
Mapping the industry I: Findings on translation technologies and quality assessment
|
|
|
|
In: Doherty, Stephen orcid:0000-0003-0887-1049 , Gaspari, Federico, Groves, Declan and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Mapping the industry I: Findings on translation technologies and quality assessment. Technical Report. GALA. (2013)
|
|
BASE
|
|
|
|
8 |
Quality metrics for human and machine translation.
|
|
|
|
In: Doherty, Stephen orcid:0000-0003-4864-5986 , Gaspari, Federico, Groves, Declan, Srivastava, Ankit Kumar and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Quality metrics for human and machine translation. Project Report. UNSPECIFIED. (2013)
|
|
BASE
|
|
|
|
9 |
CNGL-CORE: Referential translation machines for measuring semantic similarity
|
|
|
|
In: Bicici, Ergun orcid:0000-0002-2293-2031 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) CNGL-CORE: Referential translation machines for measuring semantic similarity. In: *SEM, 13-14 Jun 2013, Atlanta, Georgia. (2013)
|
|
BASE
|
|
|
|
10 |
Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities
|
|
|
|
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities. In: Multiword Expressions: from Parsing and Generation to the Real World (MWE). Workshop at ACL 2011, 19-24 June 2011, Portland, Oregon. (2011)
|
|
BASE
|
|
|
|
11 |
Deep Syntax in Statistical Machine Translation
|
|
Graham, Yvette. - : Dublin City University. National Centre for Language Technology (NCLT), 2011. : Dublin City University. School of Computing, 2011
|
|
In: Graham, Yvette (2011) Deep Syntax in Statistical Machine Translation. PhD thesis, Dublin City University. (2011)
|
|
BASE
|
|
|
|
12 |
The integration of machine translation and translation memory
|
|
He, Yifan. - : Dublin City University. Centre for Next Generation Localisation (CNGL), 2011. : Dublin City University. School of Computing, 2011
|
|
In: He, Yifan (2011) The integration of machine translation and translation memory. PhD thesis, Dublin City University. (2011)
|
|
BASE
|
|
|
|
13 |
Improving dependency label accuracy using statistical post-editing: A cross-framework study
|
|
|
|
In: Cetinoglu, Ozlem, Bryl, Anton, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Improving dependency label accuracy using statistical post-editing: A cross-framework study. In: International Conference on Dependency Linguistics (DepLing), 5-7 Sept 2011, Barcelona, Spain. (2011)
|
|
BASE
|
|
|
|
14 |
An automatically built named entity lexicon for Arabic
|
|
|
|
In: Attia, Mohammed, Toral, Antonio orcid:0000-0003-2357-2960 , Tounsi, Lamia, Monachini, Monica and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) An automatically built named entity lexicon for Arabic. In: LREC 2010 - 7th conference on International Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta. (2010)
|
|
BASE
|
|
|
|
15 |
Seeding statistical machine translation with translation memory output through tree-based structural alignment
|
|
|
|
In: Zhechev, Ventsislav and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Seeding statistical machine translation with translation memory output through tree-based structural alignment. In: SSST-4 - 4th Workshop on Syntax and Structure in Statistical Translation, 28 August 2010, Beijing, China. (2010)
|
|
BASE
|
|
|
|
16 |
Arabic parsing using grammar transforms
|
|
|
|
In: Tounsi, Lamia and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Arabic parsing using grammar transforms. In: LREC 2010 - 7th conference on International Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta. (2010)
|
|
BASE
|
|
|
|
17 |
LFG without C-structures
|
|
|
|
In: Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853 , Nivre, Joakim, Hogan, Deirdre, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) LFG without C-structures. In: the 9th International Workshop on Treebanks and Linguistic Theories, 3-4 Dec. 2010, Tartu, Estonia. (2010)
|
|
BASE
|
|
|
|
18 |
Closing the gap between stochastic and rule-based LFG grammars
|
|
|
|
In: Hautli, Annette, Cetinoglu, Ozlem and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Closing the gap between stochastic and rule-based LFG grammars. In: the LFG10 Conference, 18-20 July 2010, Ottawa, Canada. (2010)
|
|
BASE
|
|
|
|
19 |
Lemmatization and lexicalized statistical parsing of morphologically rich languages: the case of French
|
|
|
|
In: Seddah, Djamé, Chrupała, Grzegorz, Cetinoglu, Ozlem, van Genabith, Josef orcid:0000-0003-1322-7944 and Candito, Marie (2010) Lemmatization and lexicalized statistical parsing of morphologically rich languages: the case of French. In: SPMRL 2010 - 1st Workshop on Statistical Parsing of Morphologically-Rich Languages at NAACL HLT 2010, 5 June 2010, Los Angeles, CA, USA. (2010)
|
|
BASE
|
|
|
|
20 |
Deep syntax language models and statistical machine translation
|
|
|
|
In: Graham, Yvette and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Deep syntax language models and statistical machine translation. In: SSST-4 - 4th Workshop on Syntax and Structure in Statistical Translation at COLING 2010, 28 August 2010, Beijing, China. (2010)
|
|
BASE
|
|
|
|