11 |
Informationsextraktion aus Wirtschaftsnachrichten über Unternehmenszusammenschlüsse mit lokalen Grammatiken
|
|
|
|
IDS Mannheim
|
|
Show details
|
|
13 |
Learning translations via images with a massively multilingual image dataset
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Losing Shahrazad: A Distant Reading of 1001 Nights
|
|
|
|
In: Senior Projects Spring 2018 (2018)
|
|
BASE
|
|
Show details
|
|
15 |
Bangime: Secret Language, Language Isolate, or Language Island?
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01867003 ; 2018 (2018)
|
|
BASE
|
|
Show details
|
|
16 |
A database of German definitory contexts from selected web sources
|
|
|
|
In: 11th International Conference on Language Resources and Evaluation (LREC 2018) ; https://hal.archives-ouvertes.fr/hal-01798704 ; 11th International Conference on Language Resources and Evaluation (LREC 2018), May 2018, Miyazaki, Japan. pp.3068-3073 (2018)
|
|
BASE
|
|
Show details
|
|
17 |
Deep interactive text prediction and quality estimation in translation interfaces
|
|
|
|
In: Hokamp, Christopher M. (2018) Deep interactive text prediction and quality estimation in translation interfaces. PhD thesis, Dublin City University. (2018)
|
|
Abstract:
The output of automatic translation systems is usually destined for human consumption. In most cases, translators use machine translation (MT) as the first step in the process of creating a fluent translation in a target language given a text in a source language. However, there are many possible ways for translators to interact with MT. The goal of this thesis is to investigate new interactive designs and interfaces for translation. In the first part of the thesis, we present pilot studies which investigate aspects of the interactive translation process, building upon insights from Human-Computer Interaction (HCI) and Translation Studies. We developed HandyCAT, an open-source platform for translation process research, which was used to conduct two user studies: an investigation into interactive machine translation and evaluation of a novel component for post-editing. We then propose new models for quality estimation (QE) of MT, and new models for es- timating the confidence of prefix-based neural interactive MT (IMT) systems. We present a series of experiments using neural sequence models for QE and IMT. We focus upon token-level QE models, which can be used as standalone components or integrated into post-editing pipelines, guiding users in selecting phrases to edit. We introduce a strong recurrent baseline for neural QE, and show how state of the art automatic post-editing (APE) models can be re-purposed for word-level QE. We also propose an auxiliary con- fidence model, which can be attached to (I)-MT systems to use the model’s internal state to estimate confidence about the model’s predictions. The third part of the thesis introduces lexically constrained decoding using grid beam search (GBS), a means of expanding prefix-based interactive translation to general lexical constraints. By integrating lexically constrained decoding with word-level QE, we then suggest a novel interactive design for translation interfaces, and test our hypotheses using simulated editing. The final section focuses upon designing an interface for interactive post-editing, incorporating both GBS and QE. We design components which introduce a new way of interacting with translation models, and test these components in a user-study.
|
|
Keyword:
Computational linguistics; Machine learning; Machine translating
|
|
URL: http://doras.dcu.ie/22664/
|
|
BASE
|
|
Hide details
|
|
18 |
Machine translation evaluation resources and methods: a survey
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 (2018) Machine translation evaluation resources and methods: a survey. In: IPRC- Irish Postgraduate Research Conference, 8-9 Nov 2018, Dublin, Ireland. (2018)
|
|
BASE
|
|
Show details
|
|
19 |
Semantic reranking of CRF label sequences for verbal multiword expression identification
|
|
|
|
In: Moreau, Erwan, Alsulaimani, Ashjan, Maldonado, Alfredo orcid:0000-0001-8426-5249 , Han, Lifeng, Vogel, Carl orcid:0000-0001-8928-8546 and Dutta Chowdhury, Koel (2018) Semantic reranking of CRF label sequences for verbal multiword expression identification. In: Markantonatou, Stella, Ramisch, Carlos orcid:0000-0001-7466-9039 , Savary, Agata and Vincze, Veronika orcid:0000-0002-9844-2194 , (eds.) Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop. Language Science Press, Berlin, pp. 177-207. ISBN 978-3-96110-124-5 (2018)
|
|
BASE
|
|
Show details
|
|
20 |
Unlocking cultural conceptualisation in indigenous language resources: collaborative computing methodologies
|
|
|
|
In: Dorn, Amelie, Wandl-Vogt, Eveline orcid:0000-0002-0802-0255 , Abgaz, Yalemisew orcid:0000-0002-3887-5342 , Benito Santos, Alejandro orcid:0000-0001-5317-6390 and Therón, Roberto orcid:0000-0001-6739-8875 (2018) Unlocking cultural conceptualisation in indigenous language resources: collaborative computing methodologies. In: 11th Language Resources and Evaluation Conference, 7-12 May 2018, Miyazaki, Japan. ISBN 979-10-95546-22-1 (2018)
|
|
BASE
|
|
Show details
|
|
|
|