Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Pseudo-Labeling for Massively Multilingual Speech Recognition ...
BASE
2
LIBRI-LIGHT: a benchmark for ASR with limited or no supervision
In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, May 2020, Barcelona / Virtual, Spain. pp. 7669-7673, ⟨10.1109/ICASSP40776.2020.9052942⟩ ; https://hal.archives-ouvertes.fr/hal-02959460 (2020)
BASE
3
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters ...
Abstract: We study training a single acoustic model for multiple languages with the aim of improving automatic speech recognition (ASR) performance on low-resource languages and simplifying the overall deployment of ASR systems that support diverse languages. We perform an extensive benchmark on 51 languages, with varying amounts of training data per language (from 100 hours to 1100 hours). We compare three variants of multilingual training: a single joint model that does not know the input language, a joint model that uses this information, and a model with multiple heads (one per language cluster). We show that multilingual training of ASR models on several languages can improve recognition performance, in particular on low-resource languages. We see average relative WER reductions of 20.9%, 23% and 28.8% compared to monolingual baselines for the joint model, the joint model with language input and the multi-head model, respectively. To our knowledge, this is the first work studying multilingual ASR at massive scale, with more than 50 languages and more than 16,000 ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://dx.doi.org/10.48550/arxiv.2007.03001
https://arxiv.org/abs/2007.03001
BASE
4
MLS: A Large-Scale Multilingual Dataset for Speech Research ...
BASE
5
Unsupervised Cross-lingual Representation Learning for Speech Recognition ...
BASE
6
End-to-End Acoustic Modeling using Convolutional Neural Networks for HMM-based Automatic Speech Recognition
In: http://infoscience.epfl.ch/record/264125 (2019)
BASE
7
End-to-End Speech Recognition From the Raw Waveform
In: Interspeech 2018, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2414⟩ ; https://hal.archives-ouvertes.fr/hal-01888739 (2018)
BASE
8
Fully Convolutional Speech Recognition ...
BASE
9
Learning linearly separable features for speech recognition using convolutional neural networks ...
BASE
10
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks
In: http://infoscience.epfl.ch/record/192756 (2013)
BASE
11
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks
In: http://infoscience.epfl.ch/record/192560 (2013)
BASE
12
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks ...
BASE
13
Towards Understanding Situated Natural Language
In: 13th International Conference on Artificial Intelligence and Statistics, May 2010, Chia Laguna Resort, Sardinia, Italy. pp. 65-72 ; https://hal.archives-ouvertes.fr/hal-00750937 (2010)
BASE
14
Large Scale Application of Neural Network Based Semantic Role Labeling for Automated Relation Extraction from Biomedical Texts
Barnickel, Thorsten; Weston, Jason; Collobert, Ronan. - : Public Library of Science, 2009
BASE

Hits by source type: Open access documents 14; Catalogues 0; Bibliographies 0; Linked Open Data catalogues 0; Online resources 0
© 2013 - 2024 Lin|gu|is|tik