1 |
Final devoicing in the 'pool of variation': A large-scale corpora approach with automatic alignment
|
|
|
|
In: Phonetics and Phonology in Europe Conference ; https://hal.archives-ouvertes.fr/hal-02336112 ; Phonetics and Phonology in Europe Conference, Jun 2019, Lecce, Italy (2019)
|
|
BASE
|
|
|
|
2 |
"Gra[f]e!" Word-final devoicing of obstruents in Standard French: An acoustic study based on large corpora
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02336119 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2019, Graz, Austria. DOI:10.21437/Interspeech.2019-2329 (2019)
|
|
BASE
|
|
|
|
3 |
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech
|
|
|
|
In: Interspeech 2018 ; https://halshs.archives-ouvertes.fr/halshs-01969143 ; Interspeech 2018, Sep 2018, Hyderabad, India. ⟨10.21437/interspeech.2018-2381⟩ (2018)
|
|
BASE
|
|
|
|
4 |
The French-Algerian Code-Switching Triggered audio corpus (FACST)
|
|
|
|
In: LREC 2018, Eleventh International Conference on Language Resources and Evaluation ; https://halshs.archives-ouvertes.fr/halshs-01969152 ; LREC 2018 11th edition of the Language Resources and Evaluation Conference, May 2018, Miyazaki, Japan (2018)
|
|
Abstract:
The French-Algerian Code-Switching Triggered corpus (FACST) was created to support a variety of studies in phonetics, prosody and natural language processing. The first aim of the FACST corpus is to collect spontaneous code-switching (CS) speech. To obtain a large quantity of spontaneous CS utterances in natural conversations, experiments were carried out on how to elicit CS. Applying a triggering protocol by means of code-switched questions was found to be effective in eliciting CS in the responses. To ensure good audio quality, all recordings were made in a soundproof room or in a very quiet room. This paper describes the FACST corpus, along with the principal steps for building a CS speech corpus in French and Algerian Arabic and for collecting the data. We also explain the selection criteria for the CS speakers and the recording protocols used. We present the methods used for data segmentation and annotation, and propose a conventional transcription of this type of speech in each language, intended to suit both computational linguistic and acoustic-phonetic studies. We provide a quantitative description of the FACST corpus along with results of linguistic studies, and discuss some of the challenges we faced in collecting CS data.
|
|
Keyword:
[INFO]Computer Science [cs]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Arabic; bilingual speakers; Code-switching; French; oral speech data
|
|
URL: https://halshs.archives-ouvertes.fr/halshs-01969152/file/DAMAZOUZ_lrec2018.pdf https://halshs.archives-ouvertes.fr/halshs-01969152 https://halshs.archives-ouvertes.fr/halshs-01969152/document
|
|
BASE
|
|
|
|
5 |
Adaptor Grammars for the Linguist: Word Segmentation Experiments for Very Low-Resource Languages
|
|
|
|
In: Workshop on Computational Research in Phonetics, Phonology, and Morphology ; https://hal.archives-ouvertes.fr/hal-01910757 ; Workshop on Computational Research in Phonetics, Phonology, and Morphology, Oct 2018, Bruxelles, Belgium. pp.32 - 42, ⟨10.18653/v1/P17⟩ (2018)
|
|
BASE
|
|
|
|
6 |
Studying variation in Romanian: deletion of the definite article -l in continuous speech
|
|
|
|
In: Linguistics Vanguard ; https://hal.archives-ouvertes.fr/hal-01837197 ; Linguistics Vanguard, 2018, 5 (1), 17p (2018)
|
|
BASE
|
|
|
|
7 |
A corpus based study of morpheme deletion in a low resourced language: A case study for Embosi
|
|
|
|
In: Annual Meeting of the Linguistic Society of America ; https://hal.archives-ouvertes.fr/hal-01837164 ; Annual Meeting of the Linguistic Society of America, Jan 2018, Salt Lake City, United States (2018)
|
|
BASE
|
|
|
|
8 |
The French-Algerian Code-Switching Triggered audio corpus (FACST)
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01837163 ; International Conference on Language Resources and Evaluation, ELRA, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
|
|
9 |
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02387386 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2018, Hyderabad, India (2018)
|
|
BASE
|
|
|
|
10 |
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837178 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
11 |
Schwa Realization in French: Using Automatic Speech Processing to Study Phonological and Socio-linguistic Factors in Large Corpora
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837179 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
12 |
Addressing Code-Switching in French/Algerian Arabic Speech
|
|
|
|
In: Interspeech 2017 ; https://halshs.archives-ouvertes.fr/halshs-01969148 ; Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.62-66, ⟨10.21437/interspeech.2017-1373⟩ (2017)
|
|
BASE
|
|
|
|
13 |
Addressing Code-Switching in French/Algerian Arabic Speech
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837206 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
|
|
BASE
|
|
|
|
14 |
Corpus based linguistic exploration via forced alignments with a ‘light-weight’ ASR tool
|
|
|
|
In: Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics ; https://hal.archives-ouvertes.fr/hal-01837174 ; Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznań, Poland (2017)
|
|
BASE
|
|
|
|
15 |
BULB: Breaking the Unwritten Language Barrier
|
|
|
|
In: Procedia Computer Science ; Computational Methods for Endangered Language Documentation and Description ; https://hal.archives-ouvertes.fr/hal-01836496 ; Computational Methods for Endangered Language Documentation and Description, May 2016, Yogyakarta, Indonesia. pp.8-14, ⟨10.1016/j.procs.2016.04.023⟩ (2016)
|
|
BASE
|
|
|
|
17 |
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01843401 ; International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
|
|
18 |
Modélisation acoustico-phonétique de langues peu dotées : Études phonétiques et travaux de reconnaissance automatique en luxembourgois
|
|
|
|
In: Journées d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843399 ; Journées d'Etude sur la Parole, Jan 2014, Le Mans, France (2014)
|
|
BASE
|
|
|
|
19 |
Recent Evolution of Non Standard Consonantal Variants in French Broadcast News
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843431 ; Annual Conference of the International Speech Communication Association, International Speech Communication Association, F. Bimbot, C. Cerisara, C. Fougeron, G. Gravier, L. Lamel, F. Pellegrino, P. Perrier, Jan 2013, Lyon, France (2013)
|
|
BASE
|
|
|
|
20 |
What we can learn from ASR errors about low-resourced languages: a case-study of Luxembourgish and Austrian
|
|
|
|
In: Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing ; https://hal.archives-ouvertes.fr/hal-01843440 ; Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing, Jan 2013, Ermenonville, France (2013)
|
|
BASE
|
|
|
|
|
|