5 |
Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Genre as Weak Supervision for Cross-lingual Dependency Parsing ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Genre as Weak Supervision for Cross-lingual Dependency Parsing ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
DaN+: Danish Nested Named Entities and Lexical Normalization ...
|
|
|
|
Abstract:
This paper introduces DaN+, a new multi-domain corpus and annotation guidelines for Danish nested named entities (NEs) and lexical normalization to support research on cross-lingual cross-domain learning for a less-resourced language. We empirically assess three strategies to model the two-layer Named Entity Recognition (NER) task. We compare transfer capabilities from German versus in-language annotation from scratch. We examine language-specific versus multilingual BERT, and study the effect of lexical normalization on NER. Our results show that 1) the most robust strategy is multi-task learning which is rivaled by multi-label decoding, 2) BERT-based NER models are sensitive to domain shifts, and 3) in-language BERT and lexical normalization are the most beneficial on the least canonical data. Our results also show that an out-of-domain setup remains challenging, while performance on news plateaus quickly. This highlights the importance of cross-domain evaluation of cross-lingual transfer. ... : COLING 2020 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2105.11301 https://arxiv.org/abs/2105.11301
|
|
BASE
|
|
Hide details
|
|
11 |
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
From Masked-Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Lexical Normalization for Code-switched Data and its Effect on POS-tagging ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Fair Is Better than Sensational: Man Is to Doctor as Woman Is to Doctor
|
|
|
|
In: Computational Linguistics, Vol 46, Iss 2, Pp 487-497 (2020) (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Bleaching Text: Abstract Features for Cross-lingual Gender Prediction ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|