1 |
A Bottleneck Auto-Encoder for F0 Transformations on Speech and Singing Voice
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599085 ; Information, MDPI, 2022, 13 (3), pp.102. ⟨10.3390/info13030102⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-03599076 ; Information, MDPI, 2022, 13 (3), pp.103. ⟨10.3390/info13030103⟩ (2022)
|
|
Abstract:
International audience ; The use of the mel spectrogram as a signal parameterization for voice generation is quite recent and linked to the development of neural vocoders. These are deep neural networks that allow reconstructing high-quality speech from a given mel spectrogram. While initially developed for speech synthesis, now neural vocoders have also been studied in the context of voice attribute manipulation, opening new means for voice processing in audio production. However, to be able to apply neural vocoders in real-world applications, two problems need to be addressed: (1) To support use in professional audio workstations, the computational complexity should be small, (2) the vocoder needs to support a large variety of speakers, differences in voice qualities, and a wide range of intensities potentially encountered during audio production. In this context, the present study will provide a detailed description of the Multi-band Excited WaveNet, a fully convolutional neural vocoder built around signal processing blocks. It will evaluate the performance of the vocoder when trained on a variety of multi-speaker and multi-singer databases, including an experimental evaluation of the neural vocoder trained on speech and singing voices. Addressing the problem of intensity variation, the study will introduce a new adaptive signal normalization scheme that allows for robust compensation for dynamic and static gain variations. Evaluations are performed using objective measures and a number of perceptual tests including different neural vocoder algorithms known from the literature. The results confirm that the proposed vocoder compares favorably to the state-of-the-art in its capacity to generalize to unseen voices and voice qualities. The remaining challenges will be discussed.
|
|
Keyword:
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
|
|
URL: https://doi.org/10.3390/info13030103 https://hal.archives-ouvertes.fr/hal-03599076
|
|
BASE
|
|
Hide details
|
|
3 |
From Biological Synapses to “Intelligent” Robots
|
|
|
|
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03590998 ; Electronics, MDPI, 2022, 11 (5), pp.707. ⟨10.3390/electronics11050707⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Meta-Analysis of the Functional Neuroimaging Literature with Probabilistic Logic Programming
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03590714 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Linguistic resources for paraphrase generation in Portuguese: a Lexicon-Grammar approach
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03548861 ; Language Resources and Evaluation, Springer Verlag, 2022, ⟨10.1007/s10579-021-09561-5⟩ ; https://link.springer.com/article/10.1007/s10579-021-09561-5 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Caveats of Measuring Semantic Change of Cognates and Borrowings using Multilingual Word Embeddings
|
|
|
|
In: LChange'22 - 3rd International Workshop on Computational Approaches to Historical Language Change 2022 ; https://hal.inria.fr/hal-03635005 ; LChange'22 - 3rd International Workshop on Computational Approaches to Historical Language Change 2022, May 2022, Dublin, Ireland (2022)
|
|
BASE
|
|
Show details
|
|
7 |
DeepL et Google Translate face à l'ambiguïté phraséologique
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03583995 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Preprint Citation Praxis in PLOS
|
|
|
|
In: ISSN: 0138-9130 ; EISSN: 1588-2861 ; Scientometrics ; https://hal.archives-ouvertes.fr/hal-03506094 ; In press (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Emotion on a textual level: the structuring function of emotions observed from annotations ; L'émotion à un niveau textuel : la fonction structurante des émotions observée à partir d'annotations
|
|
|
|
In: ISSN: 1963-1723 ; Discours - Revue de linguistique, psycholinguistique et informatique ; https://hal.archives-ouvertes.fr/hal-03607564 ; Discours - Revue de linguistique, psycholinguistique et informatique, Laboratoire LATTICE, A paraître (2022)
|
|
BASE
|
|
Show details
|
|
10 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
11 |
The contextual logic
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03195162 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
12 |
Morphology in the Corsican Language Database (BDLC) : assessment and perspectives ; La morphologie dans la Banque de Données Langue Corse : bilan et perspectives
|
|
|
|
In: ISSN: 1638-9808 ; EISSN: 1765-3126 ; Corpus ; https://hal.archives-ouvertes.fr/hal-03591866 ; Corpus, Bases, Corpus, Langage - UMR 7320, 2022, Corpus et données en morpholgie, ⟨10.4000/corpus.7115⟩ ; https://journals.openedition.org/corpus/7115 (2022)
|
|
BASE
|
|
Show details
|
|
13 |
Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.univ-lorraine.fr/hal-03650212 ; Speech Communication, Elsevier : North-Holland, 2022, ⟨10.1016/j.specom.2022.04.004⟩ (2022)
|
|
BASE
|
|
Show details
|
|
14 |
Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0
|
|
|
|
In: Proceedings of the International Workshop on Challenges & Perspectives in Creating Large Language Models 2022 (BigScience 2022) ; https://hal.inria.fr/hal-03639144 ; Proceedings of the International Workshop on Challenges & Perspectives in Creating Large Language Models 2022 (BigScience 2022), May 2022, Dublin, France (2022)
|
|
BASE
|
|
Show details
|
|
15 |
Probing Multilingual Cognate Prediction Models
|
|
|
|
In: Findings of the Association for Computational Linguistics: ACL 2022 ; https://hal.inria.fr/hal-03614691 ; Findings of the Association for Computational Linguistics: ACL 2022, May 2022, Dublin, Ireland (2022)
|
|
BASE
|
|
Show details
|
|
16 |
Automatic Speech Recognition and Query By Example for Creole Languages Documentation
|
|
|
|
In: Findings of the Association for Computational Linguistics: ACL 2022 ; https://hal.archives-ouvertes.fr/hal-03625303 ; Findings of the Association for Computational Linguistics: ACL 2022, May 2022, Dublin, Ireland (2022)
|
|
BASE
|
|
Show details
|
|
17 |
Annotation of Morphological Errors in L2 Russian Corpus Analysis
|
|
|
|
In: 21st Annual Second Language Acquisition and Teaching Interdisciplinary Roundtable ; https://hal.archives-ouvertes.fr/hal-03620469 ; 21st Annual Second Language Acquisition and Teaching Interdisciplinary Roundtable, University of Arizona, Feb 2022, Tucson, United States (2022)
|
|
BASE
|
|
Show details
|
|
18 |
Usages du Dictionnaire Électronique des Synonymes (DES) du CRISCO : focus sur les mots inexistants
|
|
|
|
In: ISSN: 2607-0987 ; Le carnet de la MRSH ; https://halshs.archives-ouvertes.fr/halshs-03606075 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
19 |
Identifier l’ironie ?
|
|
|
|
In: ISSN: 1774-7988 ; EISSN: 2261-3455 ; Synergies Pologne ; https://halshs.archives-ouvertes.fr/halshs-03552205 ; Synergies Pologne, 2022 (2022)
|
|
BASE
|
|
Show details
|
|
20 |
Cross-Situational Learning Towards Robot Grounding
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03628290 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
|
|