41 |
Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
In: 10th Language Resources and Evaluation Conference (LREC 2016) ; https://hal.archives-ouvertes.fr/hal-01349201 ; 10th Language Resources and Evaluation Conference (LREC 2016), May 2016, Portoroz, Slovenia (2016)
42 |
A Large Scale Corpus of Gulf Arabic
In: Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-01349204 ; Language Resources and Evaluation Conference, 2016, Portoroz, Slovenia (2016)
43 |
Exploiting Arabic Diacritization for High Quality Automatic Annotation
In: Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-01349206 ; Language Resources and Evaluation Conference, 2016, Portoroz, Slovenia (2016)
44 |
DALILA: The Dialectal Arabic Linguistic Learning Assistant
In: Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-01349203 ; Language Resources and Evaluation Conference, 2016, Portoroz, Slovenia (2016)
45 |
Egyptian Arabic to English Statistical Machine Translation System for NIST OpenMT'2015 ...
47 |
A Conventional Orthography for Algerian Arabic
In: Proceedings of the Second Workshop on Arabic Natural Language ; the Second Workshop on Arabic Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-02012254 ; the Second Workshop on Arabic Natural Language Processing, 2015, Beijing, China. pp.69 - 79 (2015)
48 |
POS-tagging of Tunisian Dialect Using Standard Arabic Resources and Tools
In: Proceedings of the Second Workshop on Arabic Natural Language Processing ; Workshop on Arabic Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01464860 ; Workshop on Arabic Natural Language Processing, Jul 2015, Beijing, China. pp.59 - 68, ⟨10.18653/v1/W15-3207⟩ (2015)
49 |
Conventional Orthography for Dialectal Arabic (CODA): Principles and Guidelines -- Egyptian Arabic - Version 0.7 - March 2012
50 |
A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
In: The 9th edition of the Language Resources and Evaluation Conference (LREC 2014) ; https://hal.archives-ouvertes.fr/hal-01433247 ; The 9th edition of the Language Resources and Evaluation Conference (LREC 2014), 2014, Reykjavik, Iceland (2014)
51 |
Conventional Orthography for Dialectal Arabic (CODA): Principles and Guidelines -- Egyptian Arabic - Version 0.7 - March 2012 ...
52 |
Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic ...
53 |
Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic ...
56 |
Un système de traduction de verbes entre arabe standard et arabe dialectal par analyse morphologique profonde
In: Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-00908795 ; Traitement Automatique des Langues Naturelles, Jun 2013, France. pp.396 - 406 (2013)
57 |
Overview of the SPMRL 2013 shared task: cross-framework evaluation of parsing morphologically rich languages
In: Seddah, Djamé, Tsarfaty, Reut, Kübler, Sandra, Candito, Marie, Choi, Jinho, Farkas, Richard, Foster, Jennifer orcid:0000-0002-7789-4853 , Goenaga, Iakes, Gojenola, Koldo, Goldberg, Yoav, Green, Spence, Habash, Nizar, Kuhlmann, Marco, Maier, Wolfgang, Nivre, Joakim, Przepiórkowski, Adam, Roth, Ryan, Seeker, Wolfgang, Versley, Yannick, Vincze, Veronika, Wolinski, Marcin, Wróblewska, Alina and Villemonte de la Clérgerie, Eric (2013) Overview of the SPMRL 2013 shared task: cross-framework evaluation of parsing morphologically rich languages. In: Fourth Workshop on Statistical Parsing of Morphologically Rich Languages, 18 Oct 2013, Seattle, WA. (2013)
58 |
Annotation Guidelines for Arabic Nominal Gender, Number, and Rationality
59 |
LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual
60 |
The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian - Standard Arabic Machine Translation
In: MT Summit 2013 ; https://hal.archives-ouvertes.fr/hal-00908761 ; MT Summit 2013, Sep 2013, France. No print edition (2013)
Abstract:
The development of natural language processing tools for dialects faces a severe lack of resources. In cases of diglossia, as in Arabic, one variant, Modern Standard Arabic (MSA), has many resources that can be used to build natural language processing tools, whereas the other variants, the Arabic dialects, are resource-poor. Taking advantage of the closeness of MSA and its dialects, one way to address the problem of limited resources is to translate the dialect into MSA in order to use the tools developed for MSA. In this paper we describe an architecture for such a translation and evaluate it on Tunisian Arabic verbs. Our approach models the translation process over the deep morphological representations of roots and patterns commonly used to model Semitic morphology. We compare different techniques for performing the cross-lingual mapping. Our evaluation demonstrates that using a root+pattern lexicon of Tunisian and MSA with decent coverage, together with a backoff that assumes independence of the root and pattern mappings, is optimal in reducing overall ambiguity and increasing recall.
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
URL: https://hal.archives-ouvertes.fr/hal-00908761/file/mts2013_verbs.pdf https://hal.archives-ouvertes.fr/hal-00908761/document https://hal.archives-ouvertes.fr/hal-00908761