Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...96

Hits 1 – 20 of 1.916

1	REYD Yiddish TTS Corpus ...
	Unkn Unknown. - : Centre for Speech Technology Research (CSTR), 2022
	BASE
	Show details

2	Об истории речевых исследований в России ... : About the history of speech research in Russia ...
	Потапова, Р.К.; Потапов, В.В.. - : Издательство ГЕОС, 2022
	BASE
	Show details

3	Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer
	Krzysztof Szklanny; Jakub Lachowicz
	In: Sensors; Volume 22; Issue 9; Pages: 3188 (2022)
	BASE
	Show details

4	Evaluation of Tacotron Based Synthesizers for Spanish and Basque
	Víctor García; Inma Hernáez; Eva Navas
	In: Applied Sciences; Volume 12; Issue 3; Pages: 1686 (2022)
	BASE
	Show details

5	Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels
	Marc Freixes; Joan Claudi Socoró; Francesc Alías
	In: Applied Sciences; Volume 12; Issue 4; Pages: 2055 (2022)
	BASE
	Show details

6	Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
	Axel Roebel; Frederik Bous
	In: Information; Volume 13; Issue 3; Pages: 103 (2022)
	Abstract: The use of the mel spectrogram as a signal parameterization for voice generation is quite recent and linked to the development of neural vocoders. These are deep neural networks that allow reconstructing high-quality speech from a given mel spectrogram. While initially developed for speech synthesis, now neural vocoders have also been studied in the context of voice attribute manipulation, opening new means for voice processing in audio production. However, to be able to apply neural vocoders in real-world applications, two problems need to be addressed: (1) To support use in professional audio workstations, the computational complexity should be small, (2) the vocoder needs to support a large variety of speakers, differences in voice qualities, and a wide range of intensities potentially encountered during audio production. In this context, the present study will provide a detailed description of the Multi-band Excited WaveNet, a fully convolutional neural vocoder built around signal processing blocks. It will evaluate the performance of the vocoder when trained on a variety of multi-speaker and multi-singer databases, including an experimental evaluation of the neural vocoder trained on speech and singing voices. Addressing the problem of intensity variation, the study will introduce a new adaptive signal normalization scheme that allows for robust compensation for dynamic and static gain variations. Evaluations are performed using objective measures and a number of perceptual tests including different neural vocoder algorithms known from the literature. The results confirm that the proposed vocoder compares favorably to the state-of-the-art in its capacity to generalize to unseen voices and voice qualities. The remaining challenges will be discussed.
	Keyword: adversarial training; mel spectrogram; neural vocoder; singing synthesis; singing transformation; speech synthesis; speech transformation
	URL: https://doi.org/10.3390/info13030103
	BASE
	Hide details

7	Affect Expression: Global and Local Control of Voice Source Parameters ; Speech Prosody
	Yanushevskaya, Irena; Gobl, Christer; Murphy, Andrew. - 2022
	BASE
	Show details

8	Applying phonetics : speech science in everyday life
	Munro, Murray J.. - Chichester, West Sussex : Wiley Blackwell, 2021
	BLLDB
	UB Frankfurt Linguistik
	Show details

9	Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
	Trang, Nguyen Thi Thu; Ky, Nguyen,; Rilliard, Albert...
	In: Proc. Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03329116 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ (2021)
	BASE
	Show details

10	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

11	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

12	The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
	BASE
	Show details

13	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

14	The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
	BASE
	Show details

15	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

16	Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
	BASE
	Show details

17	The VoicePrivacy 2020 Challenge: Results and findings
	Tomashenko, Natalia; Wang, Xin; Vincent, Emmanuel...
	In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
	BASE
	Show details

18	Impact of Segmentation and Annotation in French end-to-end Synthesis
	Lenglet, Martin; Perrotin, Olivier; Bailly, Gérard
	In: Proc. 11th ISCA Speech Synthesis Workshop (SSW 11) ; SSW 11th ISCA Speech Synthesis Workshop ; https://hal.archives-ouvertes.fr/hal-03362000 ; SSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩ ; https://ssw11.hte.hu/ (2021)
	BASE
	Show details

19	Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
	O'Brien, Benjamin; Tomashenko, Natalia; Chanclu, Anaïs...
	In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

20	Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
	Dahmani, Sara; Colotte, Vincent; Girard, Valérian...
	In: ISSN: 0893-6080 ; Neural Networks ; https://hal.inria.fr/hal-03204193 ; Neural Networks, Elsevier, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩ (2021)
	BASE
	Show details

Page: 1 2 3 4 5...96

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern