1 |
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization ...
|
|
|
|
Abstract:
Like most natural language understanding and generation tasks, state-of-the-art models for summarization are transformer-based sequence-to-sequence architectures that are pretrained on large corpora. While most existing models focused on English, Arabic remained understudied. In this paper we propose AraBART, the first Arabic model in which the encoder and the decoder are pretrained end-to-end, based on BART. We show that AraBART achieves the best performance on multiple abstractive summarization datasets, outperforming strong baselines including a pretrained Arabic BERT-based model and multilingual mBART and mT5 models. ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2203.10945 https://dx.doi.org/10.48550/arxiv.2203.10945
|
|
BASE
|
|
Hide details
|
|
2 |
NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Multitask Easy-First Dependency Parsing: Exploiting Complementarities of Different Dependency Representations
|
|
|
|
In: Proceedings of the 28th International Conference on Computational Linguistics ; 28th International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03168039 ; 28th International Conference on Computational Linguistics, Dec 2020, Barcelona (on line), Spain. ⟨10.18653/v1/2020.coling-main.225⟩ (2020)
|
|
BASE
|
|
Show details
|
|
12 |
NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
A Panoramic Survey of Natural Language Processing in the Arab World ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Gender-Aware Reinflectionusing Linguistically Enhanced Neural Models ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
The Paradigm Discovery Problem
|
|
|
|
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
|
|
BASE
|
|
Show details
|
|
|
|