
Search in the Catalogues and Directories

Hits 1 – 20 of 23

1
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
BASE
2
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
3
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
4
Language Recognition for Dialects and Closely Related Languages
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
BASE
5
Improving Data Selection for Low Resource STT and KWS
BASE
6
Lexical speaker identification in TV shows
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
Abstract: The final publication is available at https://link.springer.com/article/10.1007/s11042-014-1940-3 ; International audience ; It is possible to use lexical information extracted from speech transcripts for speaker identification (SID), either on its own or to improve the performance of standard cepstral-based SID systems upon fusion. This was previously established, typically using isolated speech from single speakers (NIST SRE corpora, parliamentary speeches). In contrast, this work applies lexical approaches for SID to a different type of data. It uses the REPERE corpus, which consists of unsegmented multiparty conversations, mostly debates, discussions and Q&A sessions from TV shows. It is hypothesized that people give out clues to their identity when speaking in such settings, which this work aims to exploit. The impact on SID performance of the diarization front-end required to pre-process the unsegmented data is also measured. Four lexical SID approaches are studied in this work, including TFIDF, BM25 and LDA-based topic modeling. Results are analysed in terms of TV shows and speaker roles. Lexical approaches achieve low error rates for certain speaker roles such as anchors and journalists, sometimes lower than a standard cepstral-based Gaussian Supervector-Support Vector Machine (GSV-SVM) system. Also, in certain cases, the lexical system shows modest improvement over the cepstral-based system performance using score-level sum fusion. To highlight the potential of using lexical information not just to improve upon cepstral-based SID systems but as an independent approach in its own right, initial studies on crossmedia SID are briefly reported. Instead of using speech data as all cepstral systems require, this approach uses Wikipedia texts to train lexical speaker models which are then tested on speech transcripts to identify speakers.
Keyword: [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO]Computer Science [cs]
URL: https://hal.archives-ouvertes.fr/hal-01690342/file/paper_v0.pdf
https://hal.archives-ouvertes.fr/hal-01690342/document
https://doi.org/10.1007/s11042-014-1940-3
https://hal.archives-ouvertes.fr/hal-01690342
BASE
7
Traduction de la parole dans le projet RAPMAT
In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
BASE
8
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2014, Singapore, Singapore (2014)
BASE
9
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
BASE
10
Lattice MLLR based m-vector system for speaker verification
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
BASE
11
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
BASE
12
Recherche par le contenu dans des documents audiovisuels multilingues
In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
BASE
13
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
BASE
14
Automatic speech recognition for under-resourced languages: application to Vietnamese language
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 8, 1471-1482
BLLDB
OLC Linguistik
15
Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
In: TALN ; TALN 2009, Senlis, 24-26 juin 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 juin 2009, 2009, Unknown, pp.x-x (2009)
BASE
16
Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
BASE
17
Recherche par le contenu dans des documents audiovisuels multilingues
In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
BASE
18
Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
BASE
19
Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
In: Actes JATD 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
BASE
20
Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
BASE

