
Search in the Catalogues and Directories

Hits 1 – 20 of 28

1
Multilingual Audio-Visual Smartphone Dataset And Evaluation ...
BASE
2
Multilingual and Multimode Phone Recognition System for Indian Languages ...
Abstract: The aim of this paper is to develop a flexible framework capable of automatically recognizing the phonetic units present in a speech utterance of any language spoken in any mode. This study considers two modes of speech, conversation and read, in four Indian languages: Telugu, Kannada, Odia, and Bengali. The proposed approach consists of two stages: (1) automatic speech mode classification (SMC) and (2) automatic phonetic recognition using a mode-specific multilingual phone recognition system (MPRS). Vocal tract and excitation source features are used for the SMC task, and the SMC systems are developed using multilayer perceptrons (MLPs). Further, vocal tract, excitation source, and tandem features are used to build the deep neural network (DNN)-based MPRSs. The performance of the proposed approach is compared with that of mode-dependent MPRSs. Experimental results show that the proposed approach, which combines both SMC and MPRS into a single system, outperforms ... : 33 pages, 5 figures, 6 tables, article ...
Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Signal Processing eess.SP; Sound cs.SD
URL: https://arxiv.org/abs/1908.09634
https://dx.doi.org/10.48550/arxiv.1908.09634
BASE
3
Speech recognition using articulatory and excitation source features
Rao, K Sreenivasa; K E, Manjunath. - : Springer, 2017
BASE
4
Source and system features for phone recognition
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 18 (2015) 2, 257-270
BLLDB
5
Language identification using excitation source features
Rao, K Sreenivasa; Nandi, Dipanjan. - : Springer, 2015
BASE
6
Language identification using spectral and prosodic features
BASE
7
Segmentation, indexing and retrieval of TV broadcast news bulletins using Gaussian mixture models and vector quantization codebooks
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 17 (2014) 3, 259-269
OLC Linguistik
8
Film segmentation and indexing using autoassociative neural networks
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 17 (2014) 1, 65-74
OLC Linguistik
9
Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 27 (2013) 5, 1105-1126
OLC Linguistik
10
Emotion recognition from speech using global and local prosodic features
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 16 (2013) 2, 143-160
OLC Linguistik
11
Vowel onset point detection for noisy speech using spectral energy at formant frequencies
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 16 (2013) 2, 229-235
OLC Linguistik
12
Pitch synchronous and glottal closure based speech analysis for language recognition
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 16 (2013) 4, 413-430
OLC Linguistik
13
Non-uniform time scale modification using instants of significant excitation and vowel onset points
In: Speech communication. - Amsterdam [u.a.] : Elsevier 55 (2013) 6, 745-756
OLC Linguistik
14
Robust emotion recognition using spectral and prosodic features
Rao, K. Sreenivasa; Koolagudi, Shashidhar G.. - New York, NY [u.a.] : Springer, 2013
BLLDB
UB Frankfurt Linguistik
15
Emotion Recognition using Speech Features
BASE
16
Robust emotion recognition using spectral and prosodic features
BASE
17
Emotion recognition from speech: a review
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 15 (2012) 2, 99-117
BLLDB
OLC Linguistik
18
Emotion recognition from speech using source, system, and prosodic features
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 15 (2012) 2, 265-289
BLLDB
OLC Linguistik
19
Predicting Prosody from Text for Text-to-Speech Synthesis
Rao, K Sreenivasa. - : Springer, 2012
BASE
20
Development of syllable-based text to speech synthesis system in Bengali
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 14 (2011) 3, 167-181
BLLDB
OLC Linguistik
