Hits 1 – 9 of 9

1	An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
	Dey, Spandan; Sahidullah, Md; Saha, Goutam
	In: ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, in press (2022); ISSN: 2375-4699; EISSN: 2375-4702; DOI: 10.1145/3523179; https://hal.inria.fr/hal-03616853
	Abstract: Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the best of our knowledge, there are not many attempts to analytically review them collectively. In this work, we have conducted one of the very first attempts to present a comprehensive review of the Indian spoken language recognition research field. In-depth analysis has been presented to emphasize the unique challenges of low-resource and mutual influences for developing LID systems in the Indian contexts. Several essential aspects of the Indian LID research, such as the detailed description of the available speech corpora, the major research contributions, including the earlier attempts based on statistical modeling to the recent approaches based on different neural network architectures, and the future research trends are discussed. This review work will help assess the state of the present Indian LID research by any active researcher or any research enthusiasts from related fields.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; acoustic phonetics; code-switching; corpora development; discriminative model; Indian language identification; Language resources; language similarity; Machine learning; Signal processing systems; Low-resourced languages
	URL: https://hal.inria.fr/hal-03616853/file/TALLIP_Overview.pdf https://doi.org/10.1145/3523179 https://hal.inria.fr/hal-03616853 https://hal.inria.fr/hal-03616853/document
	BASE

2	Detection of Lungs Status Using Morphological Complexities of Respiratory Sounds
	Mondal, Ashok; Bhattacharya, Parthasarathi; Saha, Goutam. - : Hindawi Publishing Corporation, 2014
	BASE

3	Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
	Saha, Goutam; Sahidullah, Md.
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 4, 543-565
	BLLDB
	OLC Linguistik

4	Bayesian marginal statistics for speech enhancement using log Gabor wavelet
	Saha, Goutam; Senapati, Suman; Bhende, Neeraj
	In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 14 (2011) 3, 193-210
	BLLDB
	OLC Linguistik

5	Spectral entropy and spectral shape based pre-quantization for real time speaker identification system
	Saha, Goutam; Sarkar, Gourav
	In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 13 (2010) 4, 189-199
	BLLDB
	OLC Linguistik

6	Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification
	Saha, Goutam; Chakroborty, Sandipan
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 9, 693-709
	BLLDB
	OLC Linguistik

7	Speech enhancement by joint statistical characterization in the Log Gabor Wavelet domain
	Chakroborty, Sandipan; Senapati, Suman; Saha, Goutam
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 50 (2008) 6, 504-518
	BLLDB
	OLC Linguistik

8	Speech enhancement by joint statistical characterization in the Log Gabor Wavelet domain
	Senapati, Suman; Chakroborty, Sandipan; Saha, Goutam
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 50 (2008) 6, 504-518
	OLC Linguistik

9	The EB-ANUBAD translator: A hybrid scheme
	Saha, Goutam Kumar
	In: http://www.zju.edu.cn/jzus/2005/A0510/A051007.pdf (2005)
	BASE
