1. AlloVera: a multilingual allophone database
In: LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; https://lrec2020.lrec-conf.org/ (2020)
3. Towards Minimal Supervision BERT-based Grammar Error Correction ...
4. SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection ...
Vylomova, Ekaterina; White, Jennifer; Salesky, Elizabeth; Mielke, Sabrina J.; Wu, Shijie; Ponti, Edoardo; Maudslay, Rowan Hall; Zmigrod, Ran; Valvoda, Josef; Toldova, Svetlana; Tyers, Francis; Klyachko, Elena; Yegorov, Ilya; Krizhanovsky, Natalia; Czarnowska, Paula; Nikkarinen, Irene; Krizhanovsky, Andrew; Pimentel, Tiago; Hennigen, Lucas Torroba; Kirov, Christo; Nicolai, Garrett; Williams, Adina; Anastasopoulos, Antonios; Cruz, Hilaria; Chodroff, Eleanor; Cotterell, Ryan; Silfverberg, Miikka; Hulden, Mans. arXiv, 2020
Abstract:
A broad goal in natural language processing (NLP) is to develop a system that has the capacity to process any natural language. Most systems, however, are developed using data from just one language such as English. The SIGMORPHON 2020 shared task on morphological reinflection aims to investigate systems' ability to generalize across typologically distinct languages, many of which are low resource. Systems were developed using data from 45 languages and just 5 language families, fine-tuned with data from an additional 45 languages and 10 language families (13 in total), and evaluated on all 90 languages. A total of 22 systems (19 neural) from 10 teams were submitted to the task. All four winning systems were neural (two monolingual transformers and two massively multilingual RNN-based models with gated attention). Most teams demonstrate utility of data hallucination and augmentation, ensembles, and multilingual training for low-resource languages. Non-neural learners and manually designed grammars showed ... : 39 pages, SIGMORPHON ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2006.11572 https://arxiv.org/abs/2006.11572
5. It's not a Non-Issue: Negation as a Source of Error in Machine Translation ...
6. Automatic Extraction of Rules Governing Morphological Agreement ...
7. A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization ...
8. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ...
9. Universal Phone Recognition with a Multilingual Allophone System ...
10. It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
11. X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models ...
12. AlloVera: a multilingual allophone database
In: LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; https://lrec2020.lrec-conf.org/ (2020)