1. Building and curating conversational corpora for diversity-aware language science and technology ...
Abstract:
We present a pipeline and tools to build a maximally natural data set of conversational interaction that covers 66 languages and varieties from 32 phyla. We describe the curation and compilation process moving from diverse language documentation corpora to a unified format and describe an open-source tool "convo-parse" to help in quality control and assessment of conversational data. We conclude with two case studies of how diverse data sets can inform interactional linguistics and speech recognition technology and thus contribute to broadening the empirical foundations of language sciences and technologies of the future. ...
Keywords:
Computation and Language (cs.CL); Computer and information sciences
URL: https://arxiv.org/abs/2203.03399 https://dx.doi.org/10.48550/arxiv.2203.03399
2. Computational challenges in explaining communication: How deep the rabbit hole goes
In: Proceedings of the Annual Meeting of the Cognitive Science Society, vol 43, iss 43 (2021)
3. Data and code for "An inverse relation between expressiveness and grammatical integration" ...
4. A Systematic Investigation of Gesture Kinematics in Evolving Manual Languages in the Lab
In: Cogn Sci (2021)
5. Recruiting assistance and collaboration: A West-African corpus study ...
6. Recruiting assistance and collaboration: A West-African corpus study ...
7. Sequence organization: a universal infrastructure for social action
8. Iconicity in Word Learning and Beyond: A Critical Review
In: Lang Speech (2020)
9. Alignment in Multimodal Interaction: An Integrative Framework
In: Cogn Sci (2020)
10. Sequence organization: A universal infrastructure for social action
11. Construals of iconicity: experimental approaches to form-meaning resemblances in language
12. Cross-modal associations and synesthesia: Categorical perception and structure in vowel–color mappings in a large online sample
14. Differential coding of perception in the world’s languages
In: Proceedings of the National Academy of Sciences of the United States of America, 115 (45), pp. 11369-11376 (2018); ISSN: 0027-8424; EISSN: 1091-6490; https://hal.archives-ouvertes.fr/hal-01984190
15. Supplementary material from "Universals and cultural diversity in the expression of gratitude" ...
16. Supplementary material from "Universals and cultural diversity in the expression of gratitude" ...
17. Supplementary material from "Universals and cultural diversity in the expression of gratitude" ...
18. Iconicity in word learning and beyond: A critical review ...
20. Redrawing the margins of language: Lessons from research on ideophones
In: Glossa: a journal of general linguistics, Vol 3, No 1, 4 (2018); ISSN: 2397-1835