2 |
Об истории речевых исследований в России ... : About the history of speech research in Russia ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer
|
|
|
|
In: Sensors; Volume 22; Issue 9; Pages: 3188 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Evaluation of Tacotron Based Synthesizers for Spanish and Basque
|
|
|
|
In: Applied Sciences; Volume 12; Issue 3; Pages: 1686 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels
|
|
|
|
In: Applied Sciences; Volume 12; Issue 4; Pages: 2055 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: Information; Volume 13; Issue 3; Pages: 103 (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Affect Expression: Global and Local Control of Voice Source Parameters ; Speech Prosody
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
|
|
|
|
In: Proc. Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03329116 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
14 |
The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
17 |
The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03332224 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
18 |
Impact of Segmentation and Annotation in French end-to-end Synthesis
|
|
|
|
In: Proc. 11th ISCA Speech Synthesis Workshop (SSW 11) ; SSW 11th ISCA Speech Synthesis Workshop ; https://hal.archives-ouvertes.fr/hal-03362000 ; SSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩ ; https://ssw11.hte.hu/ (2021)
|
|
BASE
|
|
Show details
|
|
19 |
Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
|
|
|
|
In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
|
|
Abstract:
International audience ; Our study examined the performance of evaluators tasked to group natural and anonymised speech recordings into clusters based on their perceived similarities. Speech stimuli were selected from the VCTK corpus; two systems developed for the VoicePrivacy 2020 Challenge were used for anonymisation. The Baseline-1 (B1) system was developed by using x-vectors and neural waveform models, while the Baseline-2 (B2) system relied on digital-signal-processing techniques. 74 evaluators completed three trials composed of 16 recordings with either natural or anonymised speech generated from a single system. F-measure and cluster purity metrics were used to assess evaluator accuracy. Probabilistic linear discriminant analysis (PLDA) scores from an automatic speaker verification system were generated to quantify similarity between recordings and used to correlate subjective results. Our findings showed that non-native English speaking evaluators significantly lowered their F-measure means when presented anonymised recordings. We observed no significance for cluster purity. Pearson correlation procedures revealed that PLDA scores generated from natural and B2-anonymised speech recordings correlated positively to F-measure and cluster purity metrics. These findings show evaluators were able to use the interface to cluster natural and anonymised speech recordings and suggest anonymisation systems modelled like B1 are more effective at suppressing identifiable speech characteristics.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SCCO.LING]Cognitive science/Linguistics; anonymisation; clustering; privacy; speaker identification; speech synthesis; subjective evaluation
|
|
URL: https://hal.archives-ouvertes.fr/hal-03267084 https://hal.archives-ouvertes.fr/hal-03267084v1/file/Linkablity_INTERSPEECH_2021.pdf https://hal.archives-ouvertes.fr/hal-03267084v1/document
|
|
BASE
|
|
Hide details
|
|
20 |
Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
|
|
|
|
In: ISSN: 0893-6080 ; Neural Networks ; https://hal.inria.fr/hal-03204193 ; Neural Networks, Elsevier, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩ (2021)
|
|
BASE
|
|
Show details
|
|
|
|