1 |
Multilingual Audio-Visual Smartphone Dataset And Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Multilingual and Multimode Phone Recognition System for Indian Languages ...
|
|
|
|
Abstract:
The aim of this paper is to develop a flexible framework capable of automatically recognizing phonetic units present in a speech utterance of any language spoken in any mode. In this study, we considered two modes of speech: conversation, and read modes in four Indian languages, namely, Telugu, Kannada, Odia, and Bengali. The proposed approach consists of two stages: (1) Automatic speech mode classification (SMC) and (2) Automatic phonetic recognition using mode-specific multilingual phone recognition system (MPRS). In this work, the vocal tract and excitation source features are considered for speech mode classification (SMC) task. SMC systems are developed using multilayer perceptron (MLP). Further, vocal tract, excitation source, and tandem features are used to build the deep neural network (DNN)-based MPRSs. The performance of the proposed approach is compared with mode-dependent MPRSs. Experimental results show that the proposed approach which combines both SMC and MPRS into a single system outperforms ... : 33 pages, 5 figures, 6 tables, article ...
|
|
Keyword:
Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Signal Processing eess.SP; Sound cs.SD
|
|
URL: https://arxiv.org/abs/1908.09634 https://dx.doi.org/10.48550/arxiv.1908.09634
|
|
BASE
|
|
Hide details
|
|
3 |
Speech recognition using articulatory and excitation source features
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Language identification using spectral and prosodic features
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Robust emotion recognition using spectral and prosodic features
|
|
|
|
BASE
|
|
Show details
|
|
|
|