1 |
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling? ...
|
|
|
|
Abstract:
Development of language proficiency models for non-native learners has been an active area of interest in NLP research for the past few years. Although language proficiency is multidimensional in nature, existing research typically considers a single "overall proficiency" while building models. Further, existing approaches also considers only one language at a time. This paper describes our experiments and observations about the role of pre-trained and fine-tuned multilingual embeddings in performing multi-dimensional, multilingual language proficiency classification. We report experiments with three languages -- German, Italian, and Czech -- and model seven dimensions of proficiency ranging from vocabulary control to sociolinguistic appropriateness. Our results indicate that while fine-tuned embeddings are useful for multilingual proficiency modeling, none of the features achieve consistently best performance for all dimensions of language proficiency. All code, data and related supplementary material can ... : 10 pages ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2102.12971 https://dx.doi.org/10.48550/arxiv.2102.12971
|
|
BASE
|
|
Hide details
|
|
6 |
Environmental factors affect the evolution of linguistic subgroups in Borneo ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Probing Multilingual BERT for Genetic and Typological Signals ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
A test of Generalized Bayesian dating: A new linguistic dating method
|
|
|
|
In: PLoS One (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping
|
|
|
|
In: Cathcart, Chundra; Rama, Taraka (2020). Disentangling dialects: a neural approach to Indo-Aryan historical phonology and subgrouping. In: Fernández, Raquel; Linzen, Tal. Proceedings of the 24th Conference on Computational Natural Language Learning. Online: Association for Computational Linguistics, 620-630. (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Towards unsupervised extraction of linguistic typological features from language descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
An automated framework for fast cognate detection and Bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
An automated framework for fast cognate detection and bayesian phylogenetic inference in computational historical linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Towards unsupervised extraction of linguistic typological features from language descriptions - SI ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|