1 |
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics: Dagstuhl Seminar 21351
|
|
|
|
In: Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03507948 ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics, Aug 2021, pp.89--138, 2021, 2192-5283. ⟨10.4230/DagRep.11.7.89⟩ ; https://gitlab.com/unlid/dagstuhl-seminar/-/wikis/home (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351)
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Evaluating the Efficacy of Summarization Evaluation across Languages ...
|
|
|
|
Abstract:
While automatic summarization evaluation methods developed for English are routinely applied to other languages, this is the first attempt to systematically quantify their panlinguistic efficacy. We take a summarization corpus for eight different languages, and manually annotate generated summaries for focus (precision) and coverage (recall). Based on this, we evaluate 19 summarization evaluation metrics, and find that using multilingual BERT within BERTScore performs well across all languages, at a level above that for English. ... : Findings of ACL 2021 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2106.01478 https://arxiv.org/abs/2106.01478
|
|
BASE
|
|
Hide details
|
|
6 |
Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Evaluating the Efficacy of Summarization Evaluation across Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Dagstuhl Seminar 21351) ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Balancing out Bias: Achieving Fairness Through Training Reweighting ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
ChEMU 2020: Natural Language Processing Methods Are Effective for Information Extraction From Chemical Patents
|
|
|
|
In: Front Res Metr Anal (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Liputan6: A Large-scale Indonesian Dataset for Text Summarization ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
WikiUMLS: Aligning UMLS to Wikipedia via Cross-lingual Neural Ranking ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|