1 |
Neural Speech Decoding During Audition, Imagination and Production
|
|
|
|
In: IEEE (2021)
|
|
BASE
|
|
Show details
|
|
2 |
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images ...
|
|
Lim, Yongwan; Toutios, Asterios; Bliesener, Yannick; Tian, Ye; Lingala, Sajan Goud; Vaz, Colin; Sorensen, Tanner; Oh, Miran; Harper, Sarah; Chen, Weiyi; Lee, Yoonjeong; Töger, Johannes; Montesserin, Mairym Lloréns; Smith, Caitlin; Godinez, Bianca; Goldstein, Louis; Byrd, Dani; Nayak, Krishna S.; Narayanan, Shrikanth S.. - : arXiv, 2021
|
|
Abstract:
Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is however limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to-date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing ... : 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Data ...
|
|
Keyword:
Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Image and Video Processing eess.IV; Signal Processing eess.SP; Sound cs.SD
|
|
URL: https://dx.doi.org/10.48550/arxiv.2102.07896 https://arxiv.org/abs/2102.07896
|
|
BASE
|
|
Hide details
|
|
3 |
Deblurring for Spiral Real-Time MRI Using Convolutional Neural Networks ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Vocal tract shaping of emotional speech
|
|
|
|
In: Comput Speech Lang (2020)
|
|
BASE
|
|
Show details
|
|
5 |
Data from: Speed-accuracy tradeoffs in human speech production ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Dynamic Off-resonance Correction for Spiral Real-Time MRI of Speech
|
|
|
|
BASE
|
|
Show details
|
|
9 |
A technology prototype system for rating therapist empathy from audio recordings in addiction counseling
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research
|
|
|
|
BASE
|
|
Show details
|
|
11 |
"Rate My Therapist": Automated Detection of Empathy in Drug and Alcohol Counseling via Speech and Language Processing
|
|
|
|
BASE
|
|
Show details
|
|
12 |
On Quantifying Facial Expression-Related Atypicality of Children with Autism Spectrum Disorder
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Dynamic 3-D visualization of vocal tract shaping during speech
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Analyzing the Language of Therapist Empathy in Motivational Interview based Psychotherapy
|
|
|
|
BASE
|
|
Show details
|
|
|
|