Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 7 of 7

1	RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
	Mdhaffar, Salima; Bonastre, Jean-François; Tommasi, Marc...
	In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
	BASE
	Show details

2	Where are we in semantic concept extraction for Spoken Language Understanding? ⋆
	Ghannay, Sahar; Caubrière, Antoine; Mdhaffar, Salima; Laperrière, Gaëlle; Jabaian, Bassam; Estève, Yannick
	In: SPECOM 2021 23rd International Conference on Speech and Computer ; https://hal.archives-ouvertes.fr/hal-03372494 ; SPECOM 2021 23rd International Conference on Speech and Computer, Sep 2021, Saint Petersburg, Russia (2021)
	Abstract: International audience ; Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with the emergence of end-to-end neural approaches. Spoken language understanding refers to natural language processing tasks related to semantic extraction from speech signal, like named entity recognition from speech or slot filling task in a context of human-machine dialogue. Classically, SLU tasks were processed through a cascade approach that consists in applying, firstly, an automatic speech recognition process, followed by a natural language processing module applied to the automatic transcriptions. These three last years, end-toend neural approaches, based on deep neural networks, have been proposed in order to directly extract the semantics from speech signal, by using a single neural model. More recent works on self-supervised training with unlabeled data open new perspectives in term of performance for automatic speech recognition and natural language processing. In this paper, we present a brief overview of the recent advances on the French MEDIA benchmark dataset for SLU, with or without the use of additional data. We also present our last results that significantly outperform the current state-of-the-art with a Concept Error Rate (CER) of 11.2%, instead of 13.6% for the last state-of-the-art system presented this year.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Cascade approach; End-to-end approach; Self supervised training; Spoken language understanding
	URL: https://hal.archives-ouvertes.fr/hal-03372494/file/2106.13045.pdf https://hal.archives-ouvertes.fr/hal-03372494/document https://hal.archives-ouvertes.fr/hal-03372494
	BASE
	Hide details

3	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

4	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

5	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

6	ON-TRAC' systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks
	Le, Hang; Barbier, Florentin; Nguyen, Ha...
	In: Proceedings of the 18th International Conference on Spoken Language Translation, ; International Conference on Spoken Language Translation (IWSLT) ; https://hal.archives-ouvertes.fr/hal-03298854 ; International Conference on Spoken Language Translation (IWSLT), Aug 2021, Bangkok (virtual), Thailand. ⟨10.18653/v1/2021.iwslt-1.20⟩ (2021)
	BASE
	Show details

7	Speech recognition in the context of lectures : assessment, progress and enrichment ; Reconnaissance de la parole dans un contexte de cours magistraux : évaluation, avancées et enrichissement
	Mdhaffar, Salima. - : HAL CCSD, 2020
	In: https://tel.archives-ouvertes.fr/tel-02928451 ; Informatique et langage [cs.CL]. Le Mans Université, 2020. Français. ⟨NNT : 2020LEMA1008⟩ (2020)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern