
Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
Semantic Relatedness and Taxonomic Word Embeddings ...
BASE
2
English WordNet Taxonomic Random Walk Pseudo-Corpora
In: Conference papers (2020)
BASE
3
Language related issues for machine translation between closely related south Slavic languages
Arcan, Mihael; Klubicka, Filip; Popovic, Maja. - : The COLING 2016 Organizing Committee, 2019
Abstract: Machine translation between closely related languages is less challenging and exhibits a smaller number of translation errors than translation between distant languages, but there are still obstacles which should be addressed in order to improve such systems. This work explores the obstacles for machine translation systems between closely related South Slavic languages, namely Croatian, Serbian and Slovenian. Statistical systems for all language pairs and translation directions are trained using parallel texts from different domains, although mainly on spoken language, i.e. subtitles. For translation between Serbian and Croatian, a rule-based system is also explored. It is shown that for all language pairs and for both translation systems, the main obstacles are the differences between syntactic properties.
Funding: This work has emerged from research supported by TRAMOOC project (Translation for Massive Open Online Courses) partially funded by the European Commission under H2020-ICT-2014/H2020-ICT2014-1 under grant agreement number 644333 and by the Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight). The research leading to these results has also received funding from the European Union Seventh Framework Programme FP7/2007-2013 under grant agreement PIAP-GA2012-324414 (Abu-MaTran) and the Swiss National Science Foundation grant IZ74Z0 160501 (ReLDI).
Note: non-peer-reviewed
Keyword: Language; Machine translation; South Slavic languages
URL: http://hdl.handle.net/10379/14887
BASE
4
Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
In: Conference papers (2019)
BASE
5
Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
In: Articles (2019)
BASE
6
Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
In: Articles (2018)
BASE
7
Is it worth it? Budget-related evaluation metrics for model selection
In: Conference papers (2018)
BASE
8
hr500k – A Reference Training Corpus of Croatian.
In: Conference papers (2018)
BASE

© 2013 - 2024 Lin|gu|is|tik