
Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 29

1
The effect of domain and diacritics in Yorùbá-English neural machine translation
In: 18th Biennial Machine Translation Summit, Aug 2021, Orlando, United States (2021) ; https://hal.inria.fr/hal-03350967
BASE
2
Europarl Direct Translationese Dataset ...
BASE
3
Europarl Direct Translationese Dataset ...
BASE
4
Europarl Direct Translationese Dataset ...
BASE
5
A Data Augmentation Approach for Sign-Language-To-Text Translation In-The-Wild ...
Nunnari, Fabrizio; España-Bonet, Cristina; Avramidis, Eleftherios. - : Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021
BASE
6
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation ...
BASE
7
Integrating Unsupervised Data Generation into Self-Supervised Neural Machine Translation for Low-Resource Languages ...
BASE
8
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification ...
BASE
9
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification ...
BASE
10
Automatic classification of human translation and machine translation : a study from the perspective of lexical diversity
Fu, Yingxue; Nederhof, Mark Jan. - : Linkoping University Electronic Press, 2021
BASE
11
Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction ...
Abstract: We propose an automatic language-independent graph-based method to build à-la-carte article collections on user-defined domains from the Wikipedia. The core model is based on the exploration of the encyclopaedia's category graph and can produce both monolingual and multilingual comparable collections. We run thorough experiments to assess the quality of the obtained corpora in 10 languages and 743 domains. According to an extensive manual evaluation, our graph-based model outperforms a retrieval-based approach and reaches an average precision of 84% on in-domain articles. As manual evaluations are costly, we introduce the concept of "domainness" and design several automatic metrics to account for the quality of the collections. Our best metric for domainness shows a strong correlation with the human-judged precision, representing a reasonable automatic alternative to assess the quality of domain-specific corpora. We release the WikiTailor toolkit with the implementation of the extraction methods, the ... : 26 pages, 8 figures, 6 tables ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Information Retrieval cs.IR
URL: https://arxiv.org/abs/2005.01177
https://dx.doi.org/10.48550/arxiv.2005.01177
BASE
12
WTC1.1 (WikiTailor corpus v. 1.1) ...
BASE
13
MT models for multilingual CLuBS engine (en-de-fr-es) ...
BASE
14
WTC1.0 (WikiTailor corpus v. 1.0) ...
BASE
15
WTC1.1 (WikiTailor corpus v. 1.1) ...
BASE
16
MT models for multilingual CLuBS engine (en-de-fr-es) ...
BASE
17
Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction
In: Computational Linguistics, Vol 46, Iss 2, Pp 249-255 (2020)
BASE
18
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies ...
BASE
19
Massive vs. Curated Word Embeddings for Low-Resourced Languages. The Case of Yorùbá and Twi ...
BASE
20
Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych ...
BASE

