1 |
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling? ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Environmental factors affect the evolution of linguistic subgroups in Borneo ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Probing Multilingual BERT for Genetic and Typological Signals ...
|
|
|
|
Abstract:
We probe the layers in multilingual BERT (mBERT) for phylogenetic and geographic language signals across 100 languages and compute language distances based on the mBERT representations. We 1) employ the language distances to infer and evaluate language trees, finding that they are close to the reference family tree in terms of quartet tree distance, 2) perform distance matrix regression analysis, finding that the language distances can be best explained by phylogenetic and worst by structural factors and 3) present a novel measure for measuring diachronic meaning stability (based on cross-lingual representation variability) which correlates significantly with published ranked lists based on linguistic approaches. Our results contribute to the nascent field of typological interpretability of cross-lingual text representations. ... : COLING 2020 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2011.02070 https://arxiv.org/abs/2011.02070
|
|
BASE
|
|
Hide details
|
|
11 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
A test of Generalized Bayesian dating: A new linguistic dating method
|
|
|
|
In: PLoS One (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping
|
|
|
|
In: Cathcart, Chundra; Rama, Taraka (2020). Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping. In: Fernández, Raquel; Linzen, Tal. Proceedings of the 24th Conference on Computational Natural Language Learning. Online: Association for Computational Linguistics, 620-630. (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Towards unsupervised extraction of linguistic typological features from language descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
An automated framework for fast cognate detection and Bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
An automated framework for fast cognate detection and bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|