1 |
Japanese perceptual epenthesis is modulated by transitional probability
|
|
|
|
BASE
|
|
Show details
|
|
2 |
When more is more : the mixed language Light Warlpiri amalgamates source language phonologies to form a near-maximal inventory
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A Happy Marriage: The Stop and Affricate Inventory of the Mixed Language Light Warlpiri (Australia)
|
|
|
|
In: https://assta.org/proceedings/ICPhS2019/ (2020)
|
|
BASE
|
|
Show details
|
|
4 |
Voice Onset Time and Constriction Duration in Warlpiri Stops (Australia)
|
|
|
|
In: Proceedings of the 19th International Congress of Phonetic Sciences ; https://www.icphs2019.org/ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
Pause acceptability indicates word-internal structure in Wubuy
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Japanese co-occurrence restrictions influence second language perception
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Using deep neural networks to estimate tongue movements from speech face motion
|
|
|
|
Abstract:
This study concludes a tripartite investigation into the indirect visibility of the moving tongue in human speech as reflected in co-occurring changes of the facial surface. We were in particular interested in how the shared information is distributed over the range of contributing frequencies. In the current study we examine the degree to which tongue movements during speech can be reliably estimated from face motion using artificial neural networks. We simultaneously acquired data for both movement types; tongue movements were measured with Electromagnetic Articulography (EMA), face motion with a passive marker-based motion capture system. A multiresolution analysis using wavelets provided the desired decomposition into frequency subbands. In the two earlier studies of the project we established linear and non-linear relations between lingual and facial speech motions, as predicted and compatible with previous research in auditory-visual speech. The results of the current study using a Deep Neural Network (DNN) for prediction show that a substantive amount of variance can be recovered (between 13.9 and 33.2% dependent on the speaker and tongue sensor location). Importantly, however, the recovered variance values and the root mean squared error values of the Euclidean distances between the measured and the predicted tongue trajectories are in the range of the linear estimations of our earlier study.
|
|
Keyword:
speech processing systems; XXXXXX - Unknown
|
|
URL: http://avsp2017.loria.fr/proceedings/ http://handle.westernsydney.edu.au:8081/1959.7/uws:44755
|
|
BASE
|
|
Hide details
|
|
10 |
Child Kriol has stop distinctions based on VOT and constriction duration
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Consonantal timing and release burst acoustics distinguish multiple coronal stop place distinctions in Wubuy (Australia)
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Pause acceptability is predicted by morphological transparency in Wubuy
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Discrimination of Multiple Coronal Stop Contrasts in Wubuy (Australia): A Natural Referent Consonant Account
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Perception of voicing in the absence of native voicing experience
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Discrimination of multiple coronal stop contrasts in Wubuy (Australia) : a natural referent consonant account
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Wubuy coronal stop perception by speakers of three dialects of Bangla
|
|
|
|
BASE
|
|
Show details
|
|
18 |
A comparison of the acoustics of nonsense and real word stimuli : coronal stops in Bengali
|
|
|
|
BASE
|
|
Show details
|
|
|
|