1 |
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Cross-lingual semantic specialization via lexical relation induction ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Do we really need fully unsupervised cross-lingual embeddings? ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
On the relation between linguistic typology and (limitations of) multilingual language modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Cross-lingual semantic specialization via lexical relation induction
|
|
Ponti, Edoardo; Vulić, I; Glavaš, G. - : EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference, 2020
|
|
BASE
|
|
Show details
|
|
9 |
On the relation between linguistic typology and (limitations of) multilingual language modeling
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Do we really need fully unsupervised cross-lingual embeddings?
|
|
Vulić, I; Glavaš, G; Reichart, R. - : EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference, 2020
|
|
BASE
|
|
Show details
|
|
12 |
Towards zero-shot language modeling
|
|
|
|
Abstract:
Can we construct a neural language model which is inductively biased towards learning human language? Motivated by this question, we aim at constructing an informative prior for held-out languages on the task of character-level, open-vocabulary language modeling. We obtain this prior as the posterior over network weights conditioned on the data from a sample of training languages, which is approximated through Laplace’s method. Based on a large and diverse sample of languages, the use of our prior outperforms baseline models with an uninformative prior in both zero-shot and few-shot settings, showing that the prior is imbued with universal linguistic knowledge. Moreover, we harness broad language-specific information available for most languages of the world, i.e., features from typological databases, as distant supervision for held-out languages. We explore several language modeling conditioning techniques, including concatenation and meta-networks for parameter generation. They appear beneficial in the few-shot setting, but ineffective in the zero-shot setting. Since the paucity of even plain digital text affects the majority of the world’s languages, we hope that these insights will broaden the scope of applications for language technology.
|
|
URL: https://www.repository.cam.ac.uk/handle/1810/296685 https://doi.org/10.17863/CAM.43733
|
|
BASE
|
|
Hide details
|
|
14 |
Zero-shot language transfer for cross-lingual sentence retrieval using bidirectional attention model ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Learning unsupervised multilingual word embeddings with incremental multilingual hubs ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Specializing distributional vectors of allwords for lexical entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Investigating cross-lingual alignment methods for contextualized embeddings with Token-level evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Specializing distributional vectors of allwords for lexical entailment
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Investigating cross-lingual alignment methods for contextualized embeddings with Token-level evaluation
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Learning unsupervised multilingual word embeddings with incremental multilingual hubs
|
|
Heyman, G; Verreet, B; Vulić, I. - : NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 2019
|
|
BASE
|
|
Show details
|
|
|
|