1 |
Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Global Syntactic Variation in Seven Languages: Towards a Computational Dialectology ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Modeling Global Syntactic Variation in English Using Dialect Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Mapping Languages and Demographics with Georeferenced Corpora
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Global Syntactic Variation in Seven Languages: Toward a Computational Dialectology
|
|
|
|
In: Front Artif Intell (2019)
|
|
Abstract:
The goal of this paper is to provide a complete representation of regional linguistic variation on a global scale. To this end, the paper focuses on removing three constraints that have previously limited work within dialectology/dialectometry. First, rather than assuming a fixed and incomplete set of variants, we use Computational Construction Grammar to provide a replicable and falsifiable set of syntactic features. Second, rather than assuming a specific area of interest, we use global language mapping based on web-crawled and social media datasets to determine the selection of national varieties. Third, rather than looking at a single language in isolation, we model seven major languages together using the same methods: Arabic, English, French, German, Portuguese, Russian, and Spanish. Results show that models for each language are able to robustly predict the region-of-origin of held-out samples better using Construction Grammars than using simpler syntactic features. These global-scale experiments are used to argue that new methods in computational sociolinguistics are able to provide more generalized models of regional variation that are essential for understanding language variation and change at scale.
|
|
Keyword:
Artificial Intelligence
|
|
URL: https://doi.org/10.3389/frai.2019.00015 http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861279/
|
|
BASE
|
|
Hide details
|
|
17 |
Modeling the Complexity and Descriptive Adequacy of Construction Grammars
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Learnability and falsifiability of Construction Grammars
|
|
|
|
In: Proceedings of the Linguistic Society of America; Vol 2 (2017): Proceedings of the Linguistic Society of America; 1:1–15 ; 2473-8689 (2017)
|
|
BASE
|
|
Show details
|
|
19 |
The Linguistic Status of Predictions and Feature Ranks from SVM Text Classifiers
|
|
|
|
In: LSA Annual Meeting Extended Abstracts; Vol 6: LSA Annual Meeting Extended Abstracts 2015; 5:1-5 ; 2377-3367 (2015)
|
|
BASE
|
|
Show details
|
|
|
|