Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 1 – 20 of 47

1	First DIHARD Challenge -- System Submissions and Scores ...
	Ryant, Neville; Church, Kenneth; Cieri, Christopher. - : Zenodo, 2021
	BASE
	Show details

2	First DIHARD Challenge -- System Submissions and Scores ...
	Ryant, Neville; Church, Kenneth; Cieri, Christopher. - : Zenodo, 2021
	BASE
	Show details

3	Adaptations in Speech Processing
	Xu, Jue. - : Humboldt-Universität zu Berlin, 2021
	BASE
	Show details

4	Learning speech embeddings for speaker adaptation and speech understanding
	Sari, Leda. - 2021
	BASE
	Show details

5	Towards unsupervised learning of speech features in the wild
	Rivière, Morgane; Dupoux, Emmanuel
	In: SLT 2020 : IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03070411 ; SLT 2020 : IEEE Spoken Language Technology Workshop, Dec 2020, Shenzhen / Virtual, China (2020)
	Abstract: International audience ; Recent work on unsupervised contrastive learning of speech representation has shown promising results, but so far has mostly been applied to clean, curated speech datasets. Can it also be used with unprepared audio data "in the wild"? Here, we explore three potential problems in this setting: (i) presence of non-speech data, (ii) noisy or low quality speech data, and (iii) imbalance in speaker distribution. We show that on the Libri-light train set, which is itself a relatively clean speech-only dataset, these problems combined can already have a performance cost of up to 30% relative for the ABX score. We show that the first two problems can be alleviated by data filtering, with voice activity detection selecting speech segments, while perplexity of a model trained with clean data helping to discard entire files. We show that the third problem can be alleviated by learning a speaker embedding in the predictive branch of the model. We show that these techniques build more robust speech features that can be transferred to an ASR task in the low resource setting.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO]Computer Science [cs]; Contrastive predictive coding; Data filtering; Speaker adaptation; Speech recognition; Unsupervised representation learning
	URL: https://hal.archives-ouvertes.fr/hal-03070411 https://hal.archives-ouvertes.fr/hal-03070411/document https://hal.archives-ouvertes.fr/hal-03070411/file/Riviere_D_2020_Towards_CPC_in_the_wild.SLT.pdf
	BASE
	Hide details

6	Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation
	Turan, Mehmet Ali Tuğtekin; Vincent, Emmanuel; Jouvet, Denis
	In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02907929 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
	BASE
	Show details

7	Preschoolers' Attention to Emotional Prosody as a Function of Speaker Conventionality ...
	Wieczorek, Karolina Marta. - : Arts, 2020
	BASE
	Show details

8	Learning to adapt: meta-learning approaches for speaker adaptation
	Klejch, Ondrej. - : The University of Edinburgh, 2020
	BASE
	Show details

9	Introducing Phonetic Information to Speaker Embedding for Speaker Verification
	Liu, Yi; He, Liang; Johnson, Michael T.
	In: Electrical and Computer Engineering Faculty Publications (2019)
	BASE
	Show details

10	Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks
	Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto. - : Institute of Electrical and Electronics Engineers, 2018
	BASE
	Show details

11	Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping
	Girin, Laurent; Hueber, Thomas; Alameda-Pineda, Xavier
	In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01485540 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩ (2017)
	BASE
	Show details

12	Articulatory representations to address acoustic variability in speech ...
	Sivaraman, Ganesh. - : Digital Repository at the University of Maryland, 2017
	BASE
	Show details

13	Articulatory representations to address acoustic variability in speech
	Sivaraman, Ganesh. - 2017
	BASE
	Show details

14	Adaptation au locuteur pour la séparation de la parole par NMF
	Doras, Guillaume
	In: https://hal.sorbonne-universite.fr/hal-01482183 ; [Stage] STMS - Sciences et Technologies de la Musique et du Son UMR 9912 IRCAM-CNRS-UPMC. 2016 (2016)
	BASE
	Show details

15	Iterative PLDA Adaptation for Speaker Diarization
	Le Lan, Gaël; Charlet, Delphine; Larcher, Anthony...
	In: Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01433172 ; Interspeech 2016, Sep 2016, San Francisco, United States. pp.2175 - 2179, ⟨10.21437/Interspeech.2016-572⟩ (2016)
	BASE
	Show details

16	Speaker-dependent Multipitch Tracking Using Deep Neural Networks
	Liu,Yuzhou; Wang,DeLiang. - 2015
	BASE
	Show details

17	Phonetic reduction in spontaneous speech by children aged 9-14 years
	Tuomainen, O; Lee, C; Granlund, S...
	In: Presented at: 18th International Congress of Phonetic Sciences, Glasgow, UK. (2015) (2015)
	BASE
	Show details

18	The Acquisition of Vowel Normalization during Early Infancy: Theory and Computational Framework
	Plummer, Andrew R
	In: http://rave.ohiolink.edu/etdc/view?acc_num=osu1388689249 (2014)
	BASE
	Show details

19	'All the better for not seeing you': effects of communicative context on the speech of an individual with acquired communication difficulties.
	Bruce, C; Braidwood, U; Newton, C
	In: J Commun Disord , 46 (5-6) 475 - 483. (2013) (2013)
	BASE
	Show details

20	Contributions to Adaptation on Automatic Speech Recognition and Multilingual Handwritten Text Recognition
	Del Agua Teba, Miguel Angel. - : Universitat Politècnica de València, 2013
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern