Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows ...
BASE
2
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention ...
BASE
3
I Feel You: The Design and Evaluation of a Domotic Affect-Sensitive Spoken Conversational Agent
Lutfi, Syaheerah Lebai; Fernández-Martínez, Fernando; Lorenzo-Trueba, Jaime. - : Molecular Diversity Preservation International (MDPI), 2013
BASE
4
Speaker diarization features: the UPM contribution to the RT09 evaluation
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 2, 426-435
BLLDB
OLC Linguistik
5
Towards Glottal Source Controllability in Expressive Speech Synthesis
In: Interspeech ; https://hal.archives-ouvertes.fr/hal-01161011 ; Interspeech, 2012, Portland, United States. pp.1-1 (2012)
Abstract: IRCAM internal reference: LorenzoTrueba12a ; National audience ; In order to obtain more human-like sounding human-machine interfaces, we must first be able to give them expressive capabilities in the way of emotional and stylistic features so as to closely adapt them to the intended task. If we want to replicate those features, it is not enough to merely replicate the prosodic information of fundamental frequency and speaking rhythm. The proposed additional layer is the modification of the glottal model, for which we make use of the GlottHMM parameters. This paper analyzes the viability of such an approach by verifying that the expressive nuances are captured by the aforementioned features, obtaining 95% recognition rates on styled speaking and 82% on emotional speech. Then we evaluate the effect of speaker bias and recording environment on the source modeling in order to quantify possible problems when analyzing multi-speaker databases. Finally, we propose a speaking styles separation for Spanish based on prosodic features and check its perceptual significance.
Keyword: [SCCO.NEUR]Cognitive science/Neuroscience; [SPI.ACOU]Engineering Sciences [physics]/Acoustics [physics.class-ph]; [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing; expressive speech synthesis; glottal source modeling; computer music (Informatique musicale); speaking style
URL: https://hal.archives-ouvertes.fr/hal-01161011
https://hal.archives-ouvertes.fr/hal-01161011/file/index.pdf
https://hal.archives-ouvertes.fr/hal-01161011/document
BASE
6
Speaker diarization based on intensity channel contribution
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 4, 754-761
BLLDB
OLC Linguistik
7
Analysis of Statistical Parametric and Unit Selection Speech Synthesis Systems Applied to Emotional Speech
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-00627926 ; Speech Communication, Elsevier : North-Holland, 2010, 52 (5), pp.394. ⟨10.1016/j.specom.2009.12.007⟩ (2010)
BASE
8
Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech
In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 5, 394-404
BLLDB
OLC Linguistik
9
Aplicación de métodos estadísticos para la traducción de voz a lengua de signos ; Using statistical methods for translating speech into sign language
Gallo Gutiérrez, Beatriz; San Segundo Hernández, Rubén; Lucas Cuesta, Juan Manuel. - : Sociedad Española para el Procesamiento del Lenguaje Natural, 2008
BASE

© 2013 - 2024 Lin|gu|is|tik