1 |
Deep interactive text prediction and quality estimation in translation interfaces
|
|
|
|
In: Hokamp, Christopher M. (2018) Deep interactive text prediction and quality estimation in translation interfaces. PhD thesis, Dublin City University. (2018)
|
|
Abstract:
The output of automatic translation systems is usually destined for human consumption. In most cases, translators use machine translation (MT) as the first step in the process of creating a fluent translation in a target language given a text in a source language. However, there are many possible ways for translators to interact with MT. The goal of this thesis is to investigate new interactive designs and interfaces for translation. In the first part of the thesis, we present pilot studies which investigate aspects of the interactive translation process, building upon insights from Human-Computer Interaction (HCI) and Translation Studies. We developed HandyCAT, an open-source platform for translation process research, which was used to conduct two user studies: an investigation into interactive machine translation and evaluation of a novel component for post-editing. We then propose new models for quality estimation (QE) of MT, and new models for es- timating the confidence of prefix-based neural interactive MT (IMT) systems. We present a series of experiments using neural sequence models for QE and IMT. We focus upon token-level QE models, which can be used as standalone components or integrated into post-editing pipelines, guiding users in selecting phrases to edit. We introduce a strong recurrent baseline for neural QE, and show how state of the art automatic post-editing (APE) models can be re-purposed for word-level QE. We also propose an auxiliary con- fidence model, which can be attached to (I)-MT systems to use the model’s internal state to estimate confidence about the model’s predictions. The third part of the thesis introduces lexically constrained decoding using grid beam search (GBS), a means of expanding prefix-based interactive translation to general lexical constraints. By integrating lexically constrained decoding with word-level QE, we then suggest a novel interactive design for translation interfaces, and test our hypotheses using simulated editing. The final section focuses upon designing an interface for interactive post-editing, incorporating both GBS and QE. We design components which introduce a new way of interacting with translation models, and test these components in a user-study.
|
|
Keyword:
Computational linguistics; Machine learning; Machine translating
|
|
URL: http://doras.dcu.ie/22664/
|
|
BASE
|
|
Hide details
|
|
2 |
Predicting sentence translation quality using extrinsic and language independent features
|
|
|
|
In: Bicici, Ergun, Groves, Declan and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Predicting sentence translation quality using extrinsic and language independent features. Machine Translation, 27 (3-4). pp. 171-192. ISSN 0922-6567 (2013)
|
|
BASE
|
|
Show details
|
|
3 |
Domain adaptation for statistical machine translation of corporate and user-generated content
|
|
|
|
In: Banerjee, Pratyush (2013) Domain adaptation for statistical machine translation of corporate and user-generated content. PhD thesis, Dublin City University. (2013)
|
|
BASE
|
|
Show details
|
|
4 |
CNGL: Grading student answers by acts of translation
|
|
|
|
In: Bicici, Ergun orcid:0000-0002-2293-2031 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) CNGL: Grading student answers by acts of translation. In: SEMEVAL, 14-15 Jun 2013, Atlanta, Georgia. (2013)
|
|
BASE
|
|
Show details
|
|
5 |
CNGL-CORE: Referential translation machines for measuring semantic similarity
|
|
|
|
In: Bicici, Ergun orcid:0000-0002-2293-2031 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) CNGL-CORE: Referential translation machines for measuring semantic similarity. In: *SEM, 13-14 Jun 2013, Atlanta, Georgia. (2013)
|
|
BASE
|
|
Show details
|
|
6 |
Detecting grammatical errors with treebank-induced, probabilistic parsers
|
|
|
|
In: Wagner, Joachim orcid:0000-0002-8290-3849 (2012) Detecting grammatical errors with treebank-induced, probabilistic parsers. PhD thesis, Dublin City University. (2012)
|
|
BASE
|
|
Show details
|
|
7 |
Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification
|
|
|
|
In: Tu, Zhaopeng, He, Yifan, Foster, Jennifer orcid:0000-0002-7789-4853 , van Genabith, Josef orcid:0000-0003-1322-7944 , Liu, Qun and Shouxun, Lin (2012) Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification. In: Annual Meeting of the Association for Computational Linguistics (ACL 2012), 9-11 Jul 2012, Jelu, Korea. (2012)
|
|
BASE
|
|
Show details
|
|
8 |
Judging grammaticality: experiments in sentence classification
|
|
|
|
In: Wagner, Joachim orcid:0000-0002-8290-3849 , Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Judging grammaticality: experiments in sentence classification. CALICO Journal, 26 (3). pp. 474-490. ISSN 0742-7778 (2009)
|
|
BASE
|
|
Show details
|
|
9 |
Towards a machine-learning architecture for lexical functional grammar parsing
|
|
|
|
In: Chrupała, Grzegorz (2008) Towards a machine-learning architecture for lexical functional grammar parsing. PhD thesis, Dublin City University. (2008)
|
|
BASE
|
|
Show details
|
|
10 |
Using machine-learning to assign function labels to parser output for Spanish
|
|
|
|
In: Chrupała, Grzegorz and van Genabith, Josef (2006) Using machine-learning to assign function labels to parser output for Spanish. In: COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, 17-21 July 2006, Sydney, Australia. (2006)
|
|
BASE
|
|
Show details
|
|
|
|