2 |
Speech Synthesis from ECoG using Densely Connected 3D Convolutional Neural Networks
|
|
|
|
In: J Neural Eng (2019)
|
|
BASE
|
|
Show details
|
|
3 |
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Automatic Speech Recognition from Neural Signals: A Focused Review
|
|
|
|
Abstract:
Speech interfaces have become widely accepted and are nowadays integrated in various real-life applications and devices. They have become a part of our daily life. However, speech interfaces presume the ability to produce intelligible speech, which might be impossible due to either loud environments, bothering bystanders or incapabilities to produce speech (i.e., patients suffering from locked-in syndrome). For these reasons it would be highly desirable to not speak but to simply envision oneself to say words or sentences. Interfaces based on imagined speech would enable fast and natural communication without the need for audible speech and would give a voice to otherwise mute people. This focused review analyzes the potential of different brain imaging techniques to recognize speech from neural signals by applying Automatic Speech Recognition technology. We argue that modalities based on metabolic processes, such as functional Near Infrared Spectroscopy and functional Magnetic Resonance Imaging, are less suited for Automatic Speech Recognition from neural signals due to low temporal resolution but are very useful for the investigation of the underlying neural mechanisms involved in speech processes. In contrast, electrophysiologic activity is fast enough to capture speech processes and is therefor better suited for ASR. Our experimental results indicate the potential of these signals for speech recognition from neural data with a focus on invasively measured brain activity (electrocorticography). As a first example of Automatic Speech Recognition techniques used from neural signals, we discuss the Brain-to-text system.
|
|
Keyword:
Neuroscience
|
|
URL: http://www.ncbi.nlm.nih.gov/pubmed/27729844 https://doi.org/10.3389/fnins.2016.00429 http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5037201/
|
|
BASE
|
|
Hide details
|
|
5 |
Brain-to-text: decoding spoken phrases from phone representations in the brain
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation
|
|
|
|
In: http://infoscience.epfl.ch/record/198446 (2014)
|
|
BASE
|
|
Show details
|
|
10 |
Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
An Investigation on Initialization Schemes for Multilayer Perceptron Training Using Multilingual Data and Their Effect on ASR Performance ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
An Investigation on Initialization Schemes for Multilayer Perceptron Training Using Multilingual Data and Their Effect on ASR Performance ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Multilingual Bottle-Neck Features and its Application for Under-Resourced Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Multilingual Bottle-Neck Features and its Application for Under-Resourced Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Modeling Coarticulation in EMG-based Continuous Speech Recognition
|
|
|
|
In: Speech Communication, 52 (4), 341-353 ; ISSN: 0167-6393 (2012)
|
|
BASE
|
|
Show details
|
|
|
|