1 |
Finding Concept-specific Biases in Form--Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Finding Concept-specific Biases in Form–Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Disambiguatory Signals are Stronger in Word-initial Positions ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Finding Concept-specific Biases in Form–Meaning Associations
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Disambiguatory Signals are Stronger in Word-initial Positions
|
|
|
|
In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Disambiguatory Signals are Stronger in Word-initial Positions ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Finding Concept-specific Biases in Form--Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Phonotactic Complexity and its Trade-offs ...
|
|
|
|
Abstract:
We present methods for calculating a measure of phonotactic complexity---bits per phoneme---that permits a straightforward cross-linguistic comparison. When given a word, represented as a sequence of phonemic segments such as symbols in the international phonetic alphabet, and a statistical model trained on a sample of word types from the language, we can approximately measure bits per phoneme using the negative log-probability of that word under the model. This simple measure allows us to compare the entropy across languages, giving insight into how complex a language's phonotactics are. Using a collection of 1016 basic concept words across 106 languages, we demonstrate a very strong negative correlation of -0.74 between bits per phoneme and the average length of words. ... : Published in TACL: https://doi.org/10.1162/tacl_a_00296 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2005.03774 https://dx.doi.org/10.48550/arxiv.2005.03774
|
|
BASE
|
|
Hide details
|
|
11 |
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Phonotactic Complexity and Its Trade-offs
|
|
|
|
In: Transactions of the Association for Computational Linguistics, 8 (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Explaining vowel inventory tendencies via simulation: finding a role for quantal locations and formant normalization
|
|
|
|
In: North East Linguistics Society (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Are All Languages Equally Hard to Language-Model?
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Rethinking Phonotactic Complexity
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Graph-Based Word Alignment for Clinical Language Evaluation
|
|
|
|
In: Comput Linguist Assoc Comput Linguist (2015)
|
|
BASE
|
|
Show details
|
|
18 |
COMPUTATIONAL ANALYSIS OF TRAJECTORIES OF LINGUISTIC DEVELOPMENT IN AUTISM
|
|
|
|
BASE
|
|
Show details
|
|
|
|