121 |
Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions ...
|
|
|
|
BASE
|
|
Show details
|
|
124 |
Examining Gender Bias in Languages with Grammatical Gender ...
|
|
|
|
BASE
|
|
Show details
|
|
126 |
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection ...
|
|
|
|
BASE
|
|
Show details
|
|
127 |
Uncovering Probabilistic Implications in Typological Knowledge Bases ...
|
|
|
|
BASE
|
|
Show details
|
|
128 |
On the Distribution of Deep Clausal Embeddings: A Large Cross-linguistic Study ...
|
|
|
|
BASE
|
|
Show details
|
|
129 |
On the Idiosyncrasies of the Mandarin Chinese Classifier System ...
|
|
|
|
Abstract:
While idiosyncrasies of the Chinese classifier system have been a richly studied topic among linguists (Adams and Conklin, 1973; Erbaugh, 1986; Lakoff, 1986), not much work has been done to quantify them with statistical methods. In this paper, we introduce an information-theoretic approach to measuring idiosyncrasy; we examine how much the uncertainty in Mandarin Chinese classifiers can be reduced by knowing semantic information about the nouns that the classifiers modify. Using the empirical distribution of classifiers from the parsed Chinese Gigaword corpus (Graff et al., 2005), we compute the mutual information (in bits) between the distribution over classifiers and distributions over other linguistic quantities. We investigate whether semantic classes of nouns and adjectives differ in how much they reduce uncertainty in classifier choice, and find that it is not fully idiosyncratic; while there are no obvious trends for the majority of semantic classes, shape nouns reduce uncertainty in classifier ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.1902.10193 https://arxiv.org/abs/1902.10193
|
|
BASE
|
|
Hide details
|
|
130 |
Combining Sentiment Lexica with a Multi-View Variational Autoencoder ...
|
|
|
|
BASE
|
|
Show details
|
|
131 |
Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
133 |
Unsupervised Discovery of Gendered Language through Latent-Variable Modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
135 |
Are All Languages Equally Hard to Language-Model?
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2019)
|
|
BASE
|
|
Show details
|
|
136 |
Rethinking Phonotactic Complexity
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2019)
|
|
BASE
|
|
Show details
|
|
137 |
On the Complexity and Typology of Inflectional Morphological Systems
|
|
|
|
In: Transactions of the Association for Computational Linguistics, Vol 7, Pp 327-342 (2019) (2019)
|
|
BASE
|
|
Show details
|
|
138 |
Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
139 |
Marrying Universal Dependencies and Universal Morphology ...
|
|
|
|
BASE
|
|
Show details
|
|
140 |
On the Complexity and Typology of Inflectional Morphological Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|