1 |
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Hippocampal and auditory contributions to speech segmentation
|
|
|
|
In: ISSN: 0010-9452 ; Cortex ; https://hal.archives-ouvertes.fr/hal-03604957 ; Cortex, Elsevier, 2022, ⟨10.1016/j.cortex.2022.01.017⟩ (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Cross-lingual few-shot hate speech and offensive language detection using meta learning
|
|
|
|
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-03559484 ; IEEE Access, IEEE, 2022, 10, pp.14880-14896. ⟨10.1109/ACCESS.2022.3147588⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
МОНОЛОГИЧЕСКАЯ РЕЧЬ С ТОЧКИ ЗРЕНИЯ УЧЁНЫХ ... : MONOLOGICAL SPEECH FROM THE POINT OF VIEW OF SCIENTISTS ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
A comparative study of several parameterizations for speaker recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
REYD Yiddish TTS Corpus ...
|
|
Unkn Unknown. - : Centre for Speech Technology Research (CSTR), 2022
|
|
Abstract:
* The Reading Electronic Yiddish Documents (REYD) Dataset. The REYD TTS dataset is a speech dataset for Yiddish consisting of 4,892 short audio clips, with a total duration of 475.7 minutes. The recordings are of three speakers, two of whom speak the Lithuanian Yiddish dialect and one who speaks the Polish Yiddish dialect. The source texts are in standard literary Yiddish. The text sources are mostly works of fiction from the late 19th and early 20th centuries. Audio was recorded at the Montreal Jewish Public Library and the University of Haifa. All source texts and audio are public domain. Permission has been granted by the surviving relatives of the three readers for this work to be made public. This work has been used to train a TTS system. For an interactive demo and other information, please see our GitHub project page at https://github.com/REYD-TTS. A paper describing the work of assembling this dataset has been submitted for publication and will be linked to on the project page if accepted. * ...
|
|
Keyword:
Machine Learning; Mathematical and Computer Sciences; Speech Synthesis; Yiddish
|
|
URL: https://dx.doi.org/10.7488/ds/3424 https://datashare.ed.ac.uk/handle/10283/4383
|
|
BASE
|
|
Hide details
|
|
8 |
Data From: A Protracted Developmental Trajectory for English-Learning Children’s Detection of Consonant Mispronunciations in Newly Learned Words
|
|
|
|
In: Speech and Hearing Sciences Faculty Datasets (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Intoxication and pitch control in tonal and non-tonal language speakers ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Learning and controlling the source-filter representation of speech with a variational autoencoder ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Correcting Misproducted Speech using Spectrogram Inpainting ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Fine-grained Noise Control for Multispeaker Speech Synthesis ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Emotion Intensity and its Control for Emotional Voice Conversion ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|