1 |
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity ...
|
|
|
|
Abstract:
We present an analysis of eight measures used for quantifying morphological complexity of natural languages. The measures we study are corpus-based measures of morphological complexity with varying requirements for corpus annotation. We present similarities and differences between these measures visually and through correlation analyses, as well as their relation to the relevant typological variables. Our analysis focuses on whether these `measures' are measures of the same underlying variable, or whether they measure more than one dimension of morphological complexity. The principal component analysis indicates that the first principal component explains 92.62 % of the variation in eight measures, indicating a strong linear dependence between the complexity measures studied. ... : Submitted to Linguistics Vanguard ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2204.05056 https://dx.doi.org/10.48550/arxiv.2204.05056
|
|
BASE
|
|
Hide details
|
|
5 |
Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling? ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Environmental factors affect the evolution of linguistic subgroups in Borneo ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Probing Multilingual BERT for Genetic and Typological Signals ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
A test of Generalized Bayesian dating: A new linguistic dating method
|
|
|
|
In: PLoS One (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping
|
|
|
|
In: Cathcart, Chundra; Rama, Taraka (2020). Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping. In: Fernández, Raquel; Linzen, Tal. Proceedings of the 24th Conference on Computational Natural Language Learning. Online: Association for Computational Linguistics, 620-630. (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Towards unsupervised extraction of linguistic typological features from language descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
An automated framework for fast cognate detection and Bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
An automated framework for fast cognate detection and bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|