2 |
Machine Recognition vs Human Recognition of Voices
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
Show details
|
|
3 |
Speaker Clustering for a Mixture of Singing and Reading (Preprint)
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
Show details
|
|
4 |
The SRI NIST 2010 Speaker Recognition Evaluation System (PREPRINT)
|
|
|
|
In: DTIC (2011)
|
|
BASE
|
|
Show details
|
|
5 |
Long Term Examination of Intra-Session and Inter-Session Speaker Variability
|
|
|
|
In: DTIC (2009)
|
|
BASE
|
|
Show details
|
|
6 |
Automating Convoy Training Assessment to Improve Soldier Performance
|
|
|
|
In: DTIC (2008)
|
|
BASE
|
|
Show details
|
|
7 |
Iterated Class-Specific Subspaces for Speaker-Dependent Phoneme Classification
|
|
|
|
In: DTIC (2008)
|
|
BASE
|
|
Show details
|
|
8 |
Listener Detection of Talker Stress in Low-Rate Coded Speech
|
|
|
|
In: DTIC (2008)
|
|
Abstract:
We describe an experiment where listeners were asked to detect two specific forms of stress in talkers' recorded voices heard via six different simulated communication systems. Both task-induced stress and dramatized urgency were used. Communication systems included low-rate digital speech coding combined with bit errors, packet loss, and packet loss concealment. Twenty-four listeners participated in a total of 11,520 detection trials. A parallel investigation of word intelligibility in sentence context used 576 trials. Intelligibility results showed wide variance due to communication system and stress detection results showed less variance. More specifically, we found that listener detection of dramatized talker urgency was 4.7 times more robust to communication system degradations than word intelligibility in sentence context. ; See also ADM002091. Presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008) held in Las Vegas, Nevada on 30 March - 4 April 2008. Published in the Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), p4813-4816, 2008. Government or Federal Purpose Rights License. The original document contains color images.
|
|
Keyword:
*CODING; *DETECTION; *SIGNAL PROCESSING; *SPEECH CODING; *SPEECH INTELLIGIBILITY; *SPEECH RECOGNITION; *STRESS DETECTION; *VOICE COMMUNICATIONS; COMMUNICATION AND RADIO SYSTEMS; DEGRADATION; DIGITAL SYSTEMS; DU(DRAMATIZED URGENCY); HUMANS; INTELLIGIBILITY; ITU(INTERNATIONAL TELECOMMUNICATIONS UNION); ITU-T(INTERNATIONAL TELECOMMUNICATIONS UNION TELECOMMUNICATIONS STANDARDS); LOW RATE; Miscellaneous Detection and Detectors; MNRU(MODULATED NOISE REFERENCE UNIT); NTP(NORMALIZED TEST PERFORMANCE); Numerical Mathematics; PCM(PULSE CODE MODULATION); REPRINTS; SIGNAL TO NOISE RATIO; SIMULATION; SNR(SIGNAL TO NOISE RATIO); SPEECH; STRESSES; SUBJECTIVE TESTING; SUSAS(SPEECH UNDER SIMULATED AND ACTUAL STRESS); SYMPOSIA; TALKER STRESS; TIS(TASK INDUCED STRESS); VARIATIONS; Voice Communications; WORDS(LANGUAGE)
|
|
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA510264 http://www.dtic.mil/docs/citations/ADA510264
|
|
BASE
|
|
Hide details
|
|
9 |
Comparing Evaluation Metrics for Sentence Boundary Detection
|
|
|
|
In: DTIC (2007)
|
|
BASE
|
|
Show details
|
|
10 |
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
11 |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
13 |
Measuring Human Readability of Machine Generated Text: Three Case Studies in Speech Recognition and Machine Translation
|
|
|
|
In: DTIC (2005)
|
|
BASE
|
|
Show details
|
|
14 |
Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
15 |
Combining Cross-Stream And Time Dimensions In Phonetic Speaker Recognition
|
|
|
|
In: DTIC (2003)
|
|
BASE
|
|
Show details
|
|
16 |
Natural Language Generation in Dialog Systems
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
18 |
Towards Multilingual Interoperability in Automatic Speech Recognition
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
Show details
|
|
20 |
Clustering of Context Dependent Speech Units for Multilingual Speech Recognition
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
Show details
|
|
|
|