1 |
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation ...
|
|
|
|
Abstract:
This paper studies the effects of word-level linguistic annotations in under-resourced neural machine translation, for which there is incomplete evidence in the literature. The study covers eight language pairs, different training corpus sizes, two architectures and three types of annotation: dummy tags (with no linguistic information at all), part-of-speech tags, and morpho-syntactic description tags, which consist of part of speech and morphological features. These linguistic annotations are interleaved in the input or output streams as a single tag placed before each word. In order to measure the performance under each scenario, we use automatic evaluation metrics and perform automatic error classification. Our experiments show that, in general, source-language annotations are helpful and morpho-syntactic descriptions outperform part of speech for some language pairs. On the contrary, when words are annotated in the target language, part-of-speech tags systematically outperform morpho-syntactic ...
|
|
Keyword:
Computer and Information Science; Natural Language Processing; Neural Network
|
|
URL: https://dx.doi.org/10.48448/82n5-qd30 https://underline.io/lecture/6283-understanding-the-effects-of-word-level-linguistic-annotations-in-under-resourced-neural-machine-translation
|
|
BASE
|
|
Hide details
|
|
3 |
Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
|
|
|
|
In: Articles (2018)
|
|
BASE
|
|
Show details
|
|
|
|