
Search in the Catalogues and Directories

Hits 1 – 11 of 11

1
Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition ...
Abstract: Modeling code-switched speech is an important problem in automatic speech recognition (ASR). Labeled code-switched data are rare, so monolingual data are often used to model code-switched speech. These monolingual data may be more closely matched to one of the languages in the code-switch pair. We show that such asymmetry can bias prediction toward the better-matched language and degrade overall model performance. To address this issue, we propose a semi-supervised approach for code-switched ASR. We consider the case of English-Mandarin code-switching, and the problem of using monolingual data to build bilingual "transcription models" for annotation of unlabeled code-switched data. We first build multiple transcription models so that their individual predictions are variously biased toward either English or Mandarin. We then combine these biased transcriptions using confidence-based selection. This strategy generates a superior transcript for semi-supervised training, and obtains a 19% relative improvement ... : 5 pages ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://dx.doi.org/10.48550/arxiv.2106.07699
https://arxiv.org/abs/2106.07699
BASE
2
Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1541-1556
BLLDB
OLC Linguistik
3
Advances in Transcription of Broadcast News and Conversational Telephone Speech Within the Combined EARS BBN/LIMSI System
In: ISSN: 1558-7916 ; IEEE Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01299058 ; IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006 (2006)
BASE
4
Fisher English Training Part 2, Speech
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2005. : https://www.ldc.upenn.edu, 2005
BASE
5
Fisher English Training Part 2, Transcripts
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2005. : https://www.ldc.upenn.edu, 2005
BASE
6
Fisher English Training Part 2, Speech ...
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2005
BASE
7
Fisher English Training Part 2, Transcripts ...
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2005
BASE
8
Fisher English Training Speech Part 1 Transcripts
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2004. : https://www.ldc.upenn.edu, 2004
BASE
9
Fisher English Training Speech Part 1 Speech
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2004. : https://www.ldc.upenn.edu, 2004
BASE
10
Fisher English Training Speech Part 1 Speech ...
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2004
BASE
11
Fisher English Training Speech Part 1 Transcripts ...
Cieri, Christopher; Graff, David; Kimball, Owen. - : Linguistic Data Consortium, 2004
BASE

© 2013 - 2024 Lin|gu|is|tik