
Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation ...
Abstract: Recent progress in neural machine translation (NMT) has made it possible to translate successfully between monolingual language pairs where large parallel data exist, with pre-trained models improving performance even further. Although there exists work on translating in code-mixed settings (where one of the pairs includes text from two or more languages), it is still unclear what recent success in NMT and language modeling exactly means for translating code-mixed text. We investigate one such context, namely MT from code-mixed Modern Standard Arabic and Egyptian Arabic (MSAEA) into English. We develop models under different conditions, employing both (i) standard end-to-end sequence-to-sequence (S2S) Transformers trained from scratch and (ii) pre-trained S2S language models (LMs). We are able to acquire reasonable performance using only MSA-EN parallel data with S2S models trained from scratch. We also find LMs fine-tuned on data from various Arabic dialects to help the MSAEA-EN task. Our work is in the ... : CALCS2021, colocated with NAACL-2021 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
URL: https://dx.doi.org/10.48550/arxiv.2105.13573
https://arxiv.org/abs/2105.13573
BASE
2
NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task ...
BASE
3
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic ...
BASE
4
AraT5: Text-to-Text Transformers for Arabic Language Generation ...
BASE
5
DiaLex: A Benchmark for Evaluating Multidialectal Arabic Word Embeddings ...
BASE
6
Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19 ...
BASE
7
Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments ...
BASE
8
DiaNet: BERT and Hierarchical Attention Multi-Task Learning of Fine-Grained Dialect ...
BASE
9
Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level ...
BASE
10
JANA: A Human-Human Dialogues Corpus for Egyptian Dialect
Elmadany, AbdelRahim A.; Abdou, Sherif M.; Gheith, Mervat. Linguistic Data Consortium, 2016. https://www.ldc.upenn.edu
BASE
11
JANA: A Human-Human Dialogues Corpus for Egyptian Dialect ...
Elmadany, AbdelRahim A.; Abdou, Sherif M.; Gheith, Mervat. Linguistic Data Consortium, 2016
BASE
12
Turn Segmentation into Utterances for Arabic Spontaneous Dialogues and Instance Messages ...
BASE
13
Towards Understanding Egyptian Arabic Dialogues ...
BASE

© 2013 - 2024 Lin|gu|is|tik