1 |
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
|
|
|
|
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Low-latency speaker spotting with online diarization and detection
|
|
|
|
In: The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-01836490 ; The Speaker and Language Recognition Workshop, ISCA, Jun 2018, Les Sables d'Olonne, France (2018)
|
|
BASE
|
|
Show details
|
|
3 |
Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization
|
|
|
|
In: Interspeech 2017, 18th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01690162 ; Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Aug 2017, Stockholm, Sweden. ⟨10.21437/Interspeech.2017-1067⟩ (2017)
|
|
BASE
|
|
Show details
|
|
4 |
Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
|
|
BASE
|
|
Show details
|
|
5 |
Benchmarking Multimedia Technologies with the CAMOMILE Platform: the Case of Multimodal Person Discovery at MediaEval 2015
|
|
|
|
In: LREC 2016 ; https://hal.archives-ouvertes.fr/hal-01690277 ; LREC 2016, May 2016, Portorož, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
6 |
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
|
|
|
|
In: Proceedings of LREC 2016 ; LREC 2016 Conference ; https://hal.archives-ouvertes.fr/hal-01350096 ; LREC 2016 Conference, May 2016, Portoroz, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
7 |
Lexical speaker identification in TV shows
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
|
|
BASE
|
|
Show details
|
|
8 |
Analysing rhythm in ritual discourse in Yucatec Maya using automatic speech alignment
|
|
|
|
In: Interspeech 2015 Speech beyond speech ; https://halshs.archives-ouvertes.fr/halshs-01250490 ; Interspeech 2015 Speech beyond speech, Sep 2015, Dresden, Germany ; http://interspeech2015.org/ (2015)
|
|
Abstract:
International audience ; Over the years, research in ethno-linguistics contributed to gather corpora in a wide range of languages, cultures and topics. In the present work, we are investigating ritual speech in Yu-catec Maya. The ritual discourse tends to have a cyclic structure with repetitive patterns and various types of parallelisms between speech sections. Previous studies have revealed an intricate connexion between a speech's structure and vocal productions , in particular through temporal aspects including rhythm, pauses and durations of different speech sections. To further investigate our findings by relying more strongly on the acoustic recordings, automatic speech recognition tools may become of great help, in particular to test various linguistic and ethno-linguistic hypotheses. Unfortunately, Yucatec Maya, with less than one million native speakers, is an under-resourced language with respect to digital resources. As a total, 24 minutes of ritual speech from three performances were manually transcribed by expert linguists in Yucatec and a basic pronunciation dictionary for Yucatec was created accordingly. The transcribed acoustic recordings were then automatically time-aligned on a phonetic and lexical basis. Automatic segmentations were used to measure tempo changes, durations of breath units as well as to examine their link with the structure of the ritual text.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; automatic alignment; Index Terms: ethnolinguistic; phonetic segmentation; ritual discourse; tempo; Yucatec Maya
|
|
URL: https://halshs.archives-ouvertes.fr/halshs-01250490/document https://halshs.archives-ouvertes.fr/halshs-01250490 https://halshs.archives-ouvertes.fr/halshs-01250490/file/i15_0344_Vapnarsky.pdf
|
|
BASE
|
|
Hide details
|
|
9 |
Collaborative Annotation for Person Identification in TV Shows
|
|
|
|
In: Interspeech 2015 (short demo paper) ; https://hal.archives-ouvertes.fr/hal-01170513 ; Interspeech 2015 (short demo paper), Sep 2015, Dresden, Germany (2015)
|
|
BASE
|
|
Show details
|
|
10 |
TVD: a reproducible and multiply aligned TV series dataset
|
|
|
|
In: LREC 2014 ; https://hal.archives-ouvertes.fr/hal-01690279 ; LREC 2014, May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
11 |
Study of vowels and Voice Strength by Discriminant Analysis ; Etude des voyelles et de la force de voix par analyse discriminante
|
|
|
|
In: ISCA JEP2014 ; 30emes Journees d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01885618 ; 30emes Journees d'Etude sur la Parole, ISCA AFCP, Jun 2014, Le Mans, France (2014)
|
|
BASE
|
|
Show details
|
|
12 |
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
|
|
|
|
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
|
|
BASE
|
|
Show details
|
|
13 |
Impact of overlapping speech detection on speaker diarization for broadcast news and debates
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836475 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
|
|
BASE
|
|
Show details
|
|
14 |
Towards a better integration of written names for unsupervised speakers identification in videos
|
|
|
|
In: First Workshop on Speech, Language and Audio in Multimedia, SLAM ; https://hal.inria.fr/hal-00953089 ; First Workshop on Speech, Language and Audio in Multimedia, SLAM, 2013, Marseille, France (2013)
|
|
BASE
|
|
Show details
|
|
15 |
Une étude quantitative des marqueurs discursifs, disfluences et chevauchements de parole dans des interviews politiques
|
|
|
|
In: ISSN: 2118-870X ; EISSN: 2264-7082 ; Travaux Interdisciplinaires du Laboratoire Parole et Langage d'Aix-en-Provence (TIPA) ; https://hal.archives-ouvertes.fr/hal-01135042 ; Travaux Interdisciplinaires du Laboratoire Parole et Langage d'Aix-en-Provence (TIPA), Laboratoire Parole et Langage, 2013, pp.18. ⟨10.4000/tipa.830⟩ (2013)
|
|
BASE
|
|
Show details
|
|
16 |
Lattice MLLR based m-vector system for speaker verification
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
|
|
BASE
|
|
Show details
|
|
17 |
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
|
|
|
|
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
|
|
BASE
|
|
Show details
|
|
18 |
Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization
|
|
|
|
In: Interspeech 2011 ; https://hal.archives-ouvertes.fr/hal-01690265 ; Interspeech 2011, Aug 2011, Florence, Italy (2011)
|
|
BASE
|
|
Show details
|
|
19 |
Time structure and detection of the multivoiced segments in mixed speech
|
|
|
|
In: International Congress of Phonetic Sciences ; https://hal.archives-ouvertes.fr/hal-01836479 ; International Congress of Phonetic Sciences, Jan 2011, Hong Kong, China (2011)
|
|
BASE
|
|
Show details
|
|
|
|