DE eng

Search in the Catalogues and Directories

Hits 1 – 20 of 20

1
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems ...
BASE
Show details
2
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard ...
BASE
Show details
3
Challenging the Boundaries of Speech Recognition: The MALACH Corpus ...
BASE
Show details
4
Building competitive direct acoustics-to-word models for English conversational speech recognition ...
Abstract: Direct acoustics-to-word (A2W) models in the end-to-end paradigm have received increasing attention compared to conventional sub-word based automatic speech recognition models using phones, characters, or context-dependent hidden Markov model states. This is because A2W models recognize words from speech without any decoder, pronunciation lexicon, or externally-trained language model, making training and decoding with such models simple. Prior work has shown that A2W models require orders of magnitude more training data in order to perform comparably to conventional models. Our work also showed this accuracy gap when using the English Switchboard-Fisher data set. This paper describes a recipe to train an A2W model that closes this gap and is at-par with state-of-the-art sub-word based models. We achieve a word error rate of 8.8%/13.9% on the Hub5-2000 Switchboard/CallHome test sets without any decoder or language model. We find that model initialization, training data order, and regularization have the most ... : Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 ...
Keyword: Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning stat.ML; Neural and Evolutionary Computing cs.NE
URL: https://dx.doi.org/10.48550/arxiv.1712.03133
https://arxiv.org/abs/1712.03133
BASE
Hide details
5
Bayesian sensing hidden Markov models
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 1, 43-54
BLLDB
OLC Linguistik
Show details
6
Boosting systems for large vocabulary continuous speech recognition
In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 2, 212-218
BLLDB
OLC Linguistik
Show details
7
Advances in Arabic speech transcription at IBM under the DARPA GALE program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 5, 884-894
BLLDB
OLC Linguistik
Show details
8
Advances in speech transcription at IBM under the DARPA EARS program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1596-1608
BLLDB
OLC Linguistik
Show details
9
Arc minimization in finite-state decoding graphs with cross-word acoustic context
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 18 (2004) 4, 397-415
BLLDB
Show details
10
Arc minimization in finite-state decoding graphs with cross-word acoustic context
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 18 (2004) 4, 397-416
OLC Linguistik
Show details
11
Automatic speech recognition performance on a voicemail transcription task
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 10 (2002) 7, 433-442
BLLDB
Show details
12
Voicemail Corpus Part II
Padmanabhan, Mukund; Kingsbury, Brian; Ramabhadran, Bhuvana. - : Linguistic Data Consortium, 2002. : https://www.ldc.upenn.edu, 2002
BASE
Show details
13
Voicemail Corpus Part II ...
Padmanabhan, Mukund; Kingsbury, Brian; Ramabhadran, Bhuvana. - : Linguistic Data Consortium, 2002
BASE
Show details
14
Cursive word recognition using a random field based hidden Markov model
In: International journal on document analysis and recognition. - Berlin ; Heidelberg : Springer 1 (1999) 4, 199-208
BLLDB
Show details
15
Cursive word recognition using a random field based hidden Markov model
In: International journal on document analysis and recognition. - Berlin ; Heidelberg : Springer 1 (1998) 4, 199-208
OLC Linguistik
Show details
16
Binary Pattern Recognition Using Markov Random Fields and HMMs
In: IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP 1997 ; https://hal.inria.fr/inria-00537357 ; IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP 1997, Apr 1997, Munich, Germany. pp.3725 - 3728, ⟨10.1109/ICASSP.1997.604678⟩ ; http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=604678 (1997)
BASE
Show details
17
Off-line Handwritten Word Recognition Using a Mixed HMM-MRF Approach
In: 4th International Conference on Document Analysis and Recognition - ICDAR'97 ; https://hal.inria.fr/inria-00537568 ; 4th International Conference on Document Analysis and Recognition - ICDAR'97, Aug 1997, Ulm, Germany. pp.118 - 122, ⟨10.1109/ICDAR.1997.619825⟩ ; http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=619825 (1997)
BASE
Show details
18
One and two-dimensional Markov models for off-line handwriting recognition ; Modèles markoviens uni- et bidimensionnels pour la reconnaissance de l'écriture manuscrite hors-ligne
Saon, George. - : HAL CCSD, 1997
In: https://hal.univ-lorraine.fr/tel-01747325 ; Autre [cs.OH]. Université Henri Poincaré - Nancy 1, 1997. Français. ⟨NNT : 1997NAN10299⟩ (1997)
BASE
Show details
19
Modèles markoviens uni- et bidimensionnels pour la reconnaissance de l'écriture manuscrite hors-ligne ; One and two-dimensional Markov models for off-line handwriting recognition
Saon, George. - 1997
BASE
Show details
20
Off-Line Handwriting Recognition by Statistical Correlation
In: IAPR Workshop on Machine Vision Applications - MVA'94 ; https://hal.inria.fr/inria-00533959 ; IAPR Workshop on Machine Vision Applications - MVA'94, IAPR, Dec 1994, Kawasaki, Japan. pp.371-374 ; http://b2.cvl.iis.u-tokyo.ac.jp/mva/proceedings/CommemorativeDVD/1994/papers/1994371.pdf (1994)
BASE
Show details

Catalogues
0
0
6
0
0
0
0
Bibliographies
7
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
11
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern