DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4
Hits 1 – 20 of 63

1
Familiar Speaker Recognition
In: DTIC (2012)
BASE
Show details
2
Machine Recognition vs Human Recognition of Voices
In: DTIC (2012)
BASE
Show details
3
Speaker Clustering for a Mixture of Singing and Reading (Preprint)
In: DTIC (2012)
BASE
Show details
4
Compressed Domain Automatic Level Control Based on ITU-T G.722.2
In: DTIC (2012)
BASE
Show details
5
Segregation of Whispered Speech Interleaved with Noise or Speech Maskers
In: DTIC (2011)
BASE
Show details
6
The SRI NIST 2010 Speaker Recognition Evaluation System (PREPRINT)
In: DTIC (2011)
BASE
Show details
7
Recovering Asynchronous Watermark Tones from Speech
In: DTIC (2009)
BASE
Show details
8
The Multi-Session Audio Research Project (MARP) Corpus: Goals, Design and Initial Findings
In: DTIC (2009)
BASE
Show details
9
Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE
Show details
10
Perturbation and Pitch Normalization as Enhancements to Speaker Recognition
In: DTIC (2009)
BASE
Show details
11
Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE
Show details
12
Automating Convoy Training Assessment to Improve Soldier Performance
In: DTIC (2008)
BASE
Show details
13
Odds of Successful Transfer of Low-level Concepts: A Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA's TRANSTAC Program
In: DTIC (2008)
BASE
Show details
14
Iterated Class-Specific Subspaces for Speaker-Dependent Phoneme Classification
In: DTIC (2008)
BASE
Show details
15
Listener Detection of Talker Stress in Low-Rate Coded Speech
In: DTIC (2008)
BASE
Show details
16
Comparing Evaluation Metrics for Sentence Boundary Detection
In: DTIC (2007)
BASE
Show details
17
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings
In: DTIC (2006)
Abstract: We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art automatic speech recognizer (ASR). We also compare results using the lexical model plus a pause-only prosody model, versus results using additional prosodic features. Results show that (1) information from pauses is important, including pause duration both at the boundary and at the previous and following word boundaries; (2) adding duration, pitch, and energy features yields significant improvement over pause alone; (3) the integrated boosting-based model performs better than the HMM for ASR conditions; (4) training the boosting-based model on recognized words yields further improvement. ; Presented at the International Conference on Text, Speech, and Dialogue (9th) (TSD 2006) held in Brno, Czech Republic on 11-15 Sep 2006. Published in the Proceedings of the International Conference on Text, Speech, and Dialogue (9th), 2006. Sponsored in part by National Science Foundation contract no. IIS-0121396.
Keyword: *BOUNDARIES; *MODELS; *PROSODIC FEATURES; *PROSODY; *SENTENCE SEGMENTATION; *SPEECH RECOGNITION; ALGORITHMS; ASR(AUTOMATIC SPEECH RECOGNIZER); AUTOMATIC; BOOSTING; CLASSIFICATION; CLASSIFIERS; Cybernetics; HMM(HIDDEN MARKOV MODELS); LEXICAL FEATURES; Linguistics; MARKOV PROCESSES; PAUSES; SYMPOSIA; TEAMS(PERSONNEL); Voice Communications; WORDS(LANGUAGE)
URL: http://www.dtic.mil/docs/citations/ADA459015
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA459015
BASE
Hide details
18
A Methodology to Predict Specific Communication Themes from Overall Communication Volume for Individuals and Teams
In: DTIC (2006)
BASE
Show details
19
Toward an Interagency Language Roundtable Based Assessment of Speech-to-Speech Translation Capabilities
In: DTIC (2006)
BASE
Show details
20
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
In: DTIC (2006)
BASE
Show details

Page: 1 2 3 4

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
63
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern