Search in the Catalogues and Directories

Hits 1 – 20 of 53

1
ASR-Aware End-to-end Neural Diarization ...
BASE
2
Attention-based Contextual Language Model Adaptation for Speech Recognition ...
Abstract: Language modeling (LM) for automatic speech recognition (ASR) does not usually incorporate utterance level contextual information. For some domains like voice assistants, however, additional context, such as the time at which an utterance was spoken, provides a rich input signal. We introduce an attention mechanism for training neural speech recognition language models on both text and non-linguistic contextual data. When applied to a large de-identified dataset of utterances collected by a popular voice assistant platform, our method reduces perplexity by 7.0% relative over a standard LM that does not incorporate contextual information. When evaluated on utterances extracted from the long tail of the dataset, our method improves perplexity by 9.0% relative over a standard LM and by over 2.8% relative when compared to a state-of-the-art model for contextual LM. ...
Keyword: Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/2106.01451
https://dx.doi.org/10.48550/arxiv.2106.01451
BASE
3
Attention-based Contextual Language Model Adaptation for Speech Recognition ...
BASE
4
Reranking Machine Translation Hypotheses with Structured and Web-based Language Models ...
BASE
5
Combining Acoustics, Content and Interaction Features to Find Hot Spots in Meetings ...
BASE
6
Mispronunciation Detection in Children's Reading of Sentences
Proença, Jorge; Lopes, Carla Alexandra; Tjalve, Michael. - : Institute of Electrical and Electronics Engineers, 2018
BASE
7
The SRI NIST 2010 Speaker Recognition Evaluation System (PREPRINT)
In: DTIC (2011)
BASE
8
The CALO meeting assistant system
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 6, 1601-1611
BLLDB
9
Improving robustness of MLLR adaptation with speaker-clustered regression class trees
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 23 (2009) 2, 176-199
BLLDB
OLC Linguistik
10
Speaker recognition with session variability normalization based on MLLR adaptation transforms
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 7, 1987-1998
BLLDB
OLC Linguistik
11
Morphology-based language modeling for conversational Arabic speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 20 (2006) 4, 589-608
BLLDB
OLC Linguistik
12
A study in machine learning from imbalanced data for sentence boundary detection in speech
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 20 (2006) 4, 468-494
BLLDB
OLC Linguistik
13
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1526-1540
BLLDB
OLC Linguistik
14
Recent innovations in speech-to-text transcription at SRI-ICSI-UW
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1729-1744
BLLDB
OLC Linguistik
15
Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection
Hirschberg, Julia Bell; Enos, Frank; Graciarena, Martin. - : Proceedings IEEE ICASSP 2006, 2006
BASE
16
Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection ...
Hirschberg, Julia Bell; Enos, Frank; Graciarena, Martin. - : Columbia University, 2006
BASE
17
Modeling prosodic feature sequences for speaker recognition
In: Speech communication. - Amsterdam [u.a.] : Elsevier 46 (2005) 3-4, 455-472
BLLDB
OLC Linguistik
18
Distinguishing Deceptive from Non-Deceptive Speech
Hirschberg, Julia Bell; Benus, Stefan; Brenier, Jason M.. - : Proceedings of Eurospeech'05, 2005
BASE
19
Distinguishing Deceptive from Non-Deceptive Speech ...
Hirschberg, Julia Bell; Benus, Stefan; Brenier, Jason M.. - : Columbia University, 2005
BASE
20
Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings
In: DTIC (2005)
BASE
