21 |
INFODENS: An Open-source Framework for Learning Text Representations ...
BASE
22 |
Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych ...
BASE
23 |
Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych ...
BASE
24 |
A Hybrid Machine Translation Framework for an Improved Translation Workflow
Pal, Santanu. - : Saarländische Universitäts- und Landesbibliothek, 2018
BASE
30 |
Massively Multilingual Neural Grapheme-to-Phoneme Conversion ...
BASE
31 |
An Empirical Analysis of NMT-Derived Interlingual Embeddings and their Use in Parallel Sentence Identification ...
BASE
32 |
Predicting the Law Area and Decisions of French Supreme Court Cases ...
BASE
33 |
Pluricentric languages : automatic identification and linguistic variation ; Plurizentrische Sprachen : automatische Spracherkennung und linguistische Variation
BASE
34 |
Improving translation memory matching and retrieval using paraphrases
In: Machine Translation, 30 (1), pp. 19-40 (2016)
Abstract:
This is an accepted manuscript of an article published by Springer Nature in Machine Translation on 02/11/2016, available online: https://doi.org/10.1007/s10590-016-9180-0 The accepted version of the publication may differ from the final published version.
Most of the current Translation Memory (TM) systems work on string level (character or word level) and lack semantic knowledge while matching. They use simple edit-distance calculated on surface-form or some variation on it (stem, lemma), which does not take into consideration any semantic aspects in matching. This paper presents a novel and efficient approach to incorporating semantic information in the form of paraphrasing in the edit-distance metric. The approach computes edit-distance while efficiently considering paraphrases using dynamic programming and greedy approximation. In addition to using automatic evaluation metrics like BLEU and METEOR, we have carried out an extensive human evaluation in which we measured post-editing time, keystrokes, HTER, HMETEOR, and carried out three rounds of subjective evaluations. Our results show that paraphrasing substantially improves TM matching and retrieval, resulting in translation performance increases when translators use paraphrase-enhanced TMs.
Keyword:
Computer aided translation (CAT); Dynamic programming; Edit distance; Greedy approximation; Paraphrasing; Translation memory (TM)
URL: http://hdl.handle.net/2436/620274 ; https://doi.org/10.1007/s10590-016-9180-0
BASE
35 |
A Minimally Supervised Approach for Synonym Extraction with Word Embeddings
In: Prague Bulletin of Mathematical Linguistics, Vol 105, Iss 1, pp. 111-142 (2016)
BASE
36 |
Statistical post-editing and quality estimation for machine translation systems
In: Béchara, Hanna (2014) Statistical post-editing and quality estimation for machine translation systems. Master of Science thesis, Dublin City University.
BASE
37 |
Predicting sentence translation quality using extrinsic and language independent features
In: Bicici, Ergun, Groves, Declan and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Predicting sentence translation quality using extrinsic and language independent features. Machine Translation, 27 (3-4), pp. 171-192. ISSN 0922-6567
BASE
38 |
Working with a small dataset - semi-supervised dependency parsing for Irish
In: Lynn, Teresa, Foster, Jennifer orcid:0000-0002-7789-4853, Dras, Mark orcid:0000-0001-9908-7182 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Working with a small dataset - semi-supervised dependency parsing for Irish. In: Fourth Workshop on Statistical Parsing of Morphologically Rich Languages, 18 Oct 2013, Seattle, WA, USA.
BASE
39 |
Computer assisted (language) learning (CA(L)L) for the inclusive classroom
Greene, Cara N.. - : Dublin City University. Centre for Next Generation Localisation (CNGL), 2013. : Dublin City University. National Centre for Language Technology (NCLT), 2013. : Dublin City University. School of Computing, 2013
In: Greene, Cara N. (2013) Computer assisted (language) learning (CA(L)L) for the inclusive classroom. PhD thesis, Dublin City University.
BASE
40 |
Domain adaptation for statistical machine translation of corporate and user-generated content
In: Banerjee, Pratyush (2013) Domain adaptation for statistical machine translation of corporate and user-generated content. PhD thesis, Dublin City University.
BASE