Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 9 of 9

1	An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
	Dey, Spandan; Sahidullah, Md; Saha, Goutam
	In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
	BASE
	Show details

2	Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
	Sen, Nirmalya; Sahidullah, Md; Patil, Hemant...
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
	BASE
	Show details

3	Privacy and utility of x-vector based speaker anonymization
	Srivastava, Brij Mohan Lal; Maouche, Mohamed; Sahidullah, Md...
	In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
	BASE
	Show details

4	Benchmarking and challenges in security and privacy for voice biometrics
	Bonastre, Jean-Francois; Delgado, Hector; Evans, Nicholas...
	In: SPSC 2021, 1st ISCA Symposium on Security and Privacy in Speech Communication ; https://hal.archives-ouvertes.fr/hal-03346196 ; SPSC 2021, 1st ISCA Symposium on Security and Privacy in Speech Communication, ISCA, Nov 2021, Magdeburg, Germany. ⟨10.21437/SPSC.2021-11⟩ ; https://spsc-symposium2021.de/#home (2021)
	BASE
	Show details

5	Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
	Duroselle, Raphaël; Sahidullah, Md; Jouvet, Denis; Illina, Irina
	In: Interspeech ; https://hal.archives-ouvertes.fr/hal-03228823 ; Interspeech, Aug 2021, Brno, Czech Republic (2021)
	Abstract: International audience ; We describe the LORIA-Inria-MULTISPEECH system submitted to the Oriental Language Recognition AP20-OLR Challenge. This system has been specifically designed to be robust to unknown conditions: channel mismatch (task 1) and noisy conditions (task 3). Three sets of studies have been carried out for elaborating the system: design of multilingual bottleneck features, selection of robust features by evaluating language recognition performance on an unobserved channel, and design of the final models with different loss functions which exploit channel diversity within the training set. Key factors for robustness to unknown conditions are data augmentation techniques, stochastic weight averaging, and regularization of TDNNs with domain robustness loss functions. The final system is the combination of four TDNNs using bottleneck features and one GMM using SDC-MFCC features. Within the AP20-OLR Challenge, it achieves the top performance for tasks 1 and 3 with a $C_{avg}$ of respectively 0.0239 and 0.0374. This validates the approach for generalization to unknown conditions.
	Keyword: [INFO]Computer Science [cs]; channel mismatch; domain generalization; Language recognition
	URL: https://hal.archives-ouvertes.fr/hal-03228823 https://hal.archives-ouvertes.fr/hal-03228823v2/file/OLR20_interspeech2021_publication.pdf https://hal.archives-ouvertes.fr/hal-03228823v2/document
	BASE
	Hide details

6	Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
	Duroselle, Raphaël; Sahidullah, Md; Jouvet, Denis...
	In: https://hal.archives-ouvertes.fr/hal-03228823 ; 2021 (2021)
	BASE
	Show details

7	Privacy and utility of x-vector based speaker anonymization
	Srivastava, Brij Mohan Lal; Maouche, Mohamed; Sahidullah, Md...
	In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
	BASE
	Show details

8	Evaluating Voice Conversion-based Privacy Protection against Informed Attackers
	Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Sahidullah, Md...
	In: ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing ; https://hal.inria.fr/hal-02355115 ; ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2020, Barcelona, Spain. pp.2802-2806 (2020)
	BASE
	Show details

9	Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
	Saha, Goutam; Sahidullah, Md.
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 4, 543-565
	BLLDB
	OLC Linguistik
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern