1. Frequency, Informativity and Word Length: Insights from Typologically Diverse Corpora
In: Entropy; Volume 24; Issue 2; Pages: 280 (2022)
2. Loose and tight languages: A typology based on associations between constructions and lexemes ...
3. Loose and tight languages: A typology based on associations between constructions and lexemes ...
4. Yongning Na for Natural Language Processing: a single-speaker audio corpus with transcriptions ...
5. Yongning Na for Natural Language Processing: a single-speaker audio corpus with transcriptions ...
Abstract:
(French version below) This archive contains a dataset (audio files and transcriptions) of a minority language, Yongning Na (ISO 639-3 code: nru). The archive contains a subset of the Na corpus of the Pangloss Collection: it is a single-speaker corpus consisting of all the transcribed audio resources for the main speaker of this corpus (Ms. LATAMI Dashilame). The corpus is versioned, so that experiments carried out on these resources (for linguistic research or for Natural Language Processing) are fully reproducible. All relevant information is contained in YAML files (.yml extension; one in French, one in English). The data sub-folder contains the converted and demultiplexed audio files, as well as the annotations associated with each channel of the audio files. The summary files contain, among other things, the list of graphemes used in the language (complex graphemes are particularly important), as well as information on the various resources (audio and annotations), such as their identifiers (DOIs) ...
Keywords:
audio corpora; endangered languages; interdisciplinary research; language conservation; language documentation; multimedia linguistic resources; Naish languages; Sino-Tibetan languages
URL: https://zenodo.org/record/5336698 https://dx.doi.org/10.5281/zenodo.5336698
8. Jeu des termes et chronodiversité. Examen polydiachronique de quelques termes de sémantique et de lexicologie ...
9. The Use of Corpora in Language Education. An Overview of the Italian Language Corpora ...
10. Del atlas lingüístico tradicional al corpus geolingüístico digital: diseño de un proyecto
11. You’re a bitch, the stallion said: estudio contrastivo inglés-español sobre el uso sexista del lenguaje.
12. An analysis of the centrality of intuition talk in the discussion on taste disagreements
13. How software features and linguistic analyses add value to orthographic markup in transcription of multilingual recordings for digital archives
14. Linguistic analysis, ethical practice, and quality assurance in anonymizing recordings of spoken language for deposit in digital archives
15. The Use of Corpora in Language Education. An Overview of the Italian Language Corpora
In: Studi di glottodidattica; V. 6, N. 1 (2021); 103-117; 1970-1861
16. Dependency Lengths in Speech and Writing: A Cross-Linguistic Comparison via YouDePP, a Pipeline for Scraping and Parsing YouTube Captions
In: Proceedings of the Society for Computation in Linguistics (2021)
17. On the Use of Corpora in Second Language Acquisition – Chinese as an Example
In: Acta Linguistica Asiatica, Vol 11, Iss 2 (2021)
18. Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers
In: Language Resources and Evaluation for Language Technologies (LREC), May 2020, Marseille, France; https://hal.archives-ouvertes.fr/hal-02503986 (2020)
19. RefCo: An initiative to develop a set of quality criteria for fieldwork corpora
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), 2020, Montrouge, France, pp. 95-101; https://hal.archives-ouvertes.fr/hal-03047143 (2020)
20. ERRATAS database of editorial principles and practices in printed editions of historical correspondence ...