1 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
Abstract:
Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the best of our knowledge, there are not many attempts to analytically review them collectively. In this work, we have conducted one of the very first attempts to present a comprehensive review of the Indian spoken language recognition research field. In-depth analysis has been presented to emphasize the unique challenges of low-resource and mutual influences for developing LID systems in the Indian contexts. Several essential aspects of the Indian LID research, such as the detailed description of the available speech corpora, the major research contributions, including the earlier attempts based on statistical modeling to the recent approaches based on different neural network architectures, and the future research trends are discussed. This review work will help assess the state of the present Indian LID research by any active researcher or any research enthusiasts from related fields.
Keywords:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; acoustic phonetics; code-switching; corpora development; discriminative model; Indian language identification; Language resources; language similarity; Machine learning; Signal processing systems; Low-resourced languages
URL: https://hal.inria.fr/hal-03616853/file/TALLIP_Overview.pdf ; https://doi.org/10.1145/3523179 ; https://hal.inria.fr/hal-03616853 ; https://hal.inria.fr/hal-03616853/document
BASE
2 |
Language identification, a tool for Corsican and for the evaluation of linguistic resources ; L'identification de langue, un outil au service du corse et de l'évaluation des ressources linguistiques
In: Traitement Automatique des Langues ; https://hal.archives-ouvertes.fr/hal-03633290 ; Traitement Automatique des Langues, 2022, Diversité Linguistique, 62 (3), pp.13-37 ; https://www.atala.org/content/diversité-linguistique-linguistic-diversity-natural-language-processing (2022)
3 |
The Twitter user dataset for discriminating between Bosnian, Croatian, Montenegrin and Serbian Twitter-HBS 1.0
4 |
The news dataset for discriminating between Bosnian, Croatian and Serbian SETimes.HBS 1.0
5 |
Об истории речевых исследований в России ... : About the history of speech research in Russia ...
6 |
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITCHED SPEECH ...
7 |
INTEGRATION OF PHONOTACTIC FEATURES FOR LANGUAGE IDENTIFICATION ON CODE-SWITCHED SPEECH ...
8 |
Leveraging lyrics from audio for MIR ; Exploiter les paroles de chansons à partir de l'audio pour le MIR
In: https://tel.archives-ouvertes.fr/tel-03558515 ; Signal and Image processing. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAT027⟩ (2021)
9 |
Privacy and utility of x-vector based speaker anonymization
In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
10 |
Privacy and utility of x-vector based speaker anonymization
In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
11 |
Machine Learning of Motion Statistics Reveals the Kinematic Signature of the Identity of a Person in Sign Language
In: ISSN: 2296-4185 ; Frontiers in Bioengineering and Biotechnology ; https://hal.archives-ouvertes.fr/hal-03298752 ; Frontiers in Bioengineering and Biotechnology, Frontiers, 2021, 9, ⟨10.3389/fbioe.2021.710132⟩ (2021)
12 |
Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
13 |
Presentation matters: Evaluating speaker identification tasks
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267089 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
14 |
Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
15 |
From the Stage to the Audience: Propaganda on Reddit
In: EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03351621 ; EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Online, France ; https://2021.eacl.org/ (2021)
16 |
ДИСКУРС ЯЗЫКОВОЙ ДИФФЕРЕНЦИАЦИИ КАК ФАКТОР ЭТНИЧЕСКОЙ ИДЕНТИФИКАЦИИ ... : DISCOURSE OF LANGUAGE DIFFERENTIATION AS A FACTOR OF ETHNIC IDENTIFICATION ...
17 |
Automatic Loanword Identification Using Tree Reconciliation ...
18 |
Automatic Language Identification in Code-Switched Hindi-English Social Media Text
In: Journal of Open Humanities Data; Vol 7 (2021); 7 ; 2059-481X (2021)
19 |
The Unsolved Problem of Language Identification: A GMM-based Approach ...
20 |
The Unsolved Problem of Language Identification: A GMM-based Approach ...