1 |
A prospective study of associations between early fearfulness and perceptual sensitivity and later restricted and repetitive behaviours in infants with typical and elevated likelihood of Autism
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Semi-supervised cycle-consistency training for end-to-end ASR using unpaired speech
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Incorporating Temporal Information in Entailment Graph Mining ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Investigating the Mechanisms Driving Referent Selection and Retention in Toddlers at Typical and Elevated Likelihood for Autism Spectrum Disorder. ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Enforcing constraints for multi-lingual and cross-lingual speech-to-text systems
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Knowledge base integration in biomedical natural language processing applications
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Learning speech embeddings for speaker adaptation and speech understanding
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Modeling phones, keywords, topics and intents in spoken languages
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Investigating the Mechanisms Driving Referent Selection and Retention in Toddlers at Typical and Elevated Likelihood for Autism Spectrum Disorder.
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Infant EEG theta modulation predicts childhood intelligence
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Neural and behavioural indices of face processing in siblings of children with autism spectrum disorder (ASD): a longitudinal study from infancy to mid-childhood
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Speech technology for unwritten languages
|
|
|
|
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
|
|
BASE
|
|
Show details
|
|
18 |
Incorporating Temporal Information in Entailment Graph Mining ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
How Phonotactics Affect Multilingual and Zero-shot ASR Performance ...
|
|
|
|
Abstract:
The idea of combining multiple languages' recordings to train a single automatic speech recognition (ASR) model brings the promise of the emergence of universal speech representation. Recently, a Transformer encoder-decoder model has been shown to leverage multilingual data well in IPA transcriptions of languages presented during training. However, the representations it learned were not successful in zero-shot transfer to unseen languages. Because that model lacks an explicit factorization of the acoustic model (AM) and language model (LM), it is unclear to what degree the performance suffered from differences in pronunciation or the mismatch in phonotactics. To gain more insight into the factors limiting zero-shot ASR transfer, we replace the encoder-decoder with a hybrid ASR system consisting of a separate AM and LM. Then, we perform an extensive evaluation of monolingual, multilingual, and crosslingual (zero-shot) acoustic and language models on a set of 13 phonetically diverse languages. We show that ... : Accepted for publication in IEEE ICASSP 2021. The first 2 authors contributed equally to this work ...
|
|
Keyword:
Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
|
|
URL: https://arxiv.org/abs/2010.12104 https://dx.doi.org/10.48550/arxiv.2010.12104
|
|
BASE
|
|
Hide details
|
|
20 |
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|