1 |
How Autism Affects Speech Understanding in Multitalker Environments
|
|
|
|
In: DTIC (2014)
|
|
BASE
|
|
2 |
How Autism Affects Speech Understanding in Multitalker Environments
|
|
|
|
In: DTIC (2013)
|
|
BASE
|
|
3 |
Effects of Speech Intensity on the Callsign Acquisition Test (CAT) and Modified Rhyme Test (MRT) Presented in Noise
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
4 |
Strengthening Homeland Security through Improved Foreign Language Capability
|
|
|
|
In: DTIC (2011)
|
|
BASE
|
|
5 |
Special Operations Forces Language and Culture Needs Assessment: Defense Language Proficiency Test (DLPT)
|
|
In: DTIC (2010)
|
|
BASE
|
|
6 |
Relevance Feedback based on Constrained Clustering: FDU at TREC 09
|
|
|
|
In: DTIC (2009)
|
|
BASE
|
|
7 |
An Analysis of Specware and Its Usefulness in the Verification of High Assurance Systems
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
8 |
Minimum Bayes-Risk Decoding for Statistical Machine Translation
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
9 |
The Pragmatics of Taking a Spoken Language System Out of the Laboratory
|
|
|
|
In: DTIC (2003)
|
|
BASE
|
|
10 |
Acoustic-Phonetic Modeling of Non-Native Speech for Language Identification
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
11 |
The Bible, Truth, and Multilingual OCR Evaluation
|
|
|
|
In: DTIC (1998)
|
|
Abstract:
Multilingual OCR has emerged as an important information technology, thanks to the increasing need for cross-language information access. While many research groups and companies have developed OCR algorithms for various languages, it is difficult to compare the performance of these OCR algorithms across languages. This difficulty arises because most evaluation methodologies rely on the use of a document image dataset in each of the languages and it is difficult to find document datasets in different languages that are similar in content and layout. In this paper we propose to use the Bible as a dataset for comparing OCR accuracy across languages. Besides being available in a wide range of languages, Bible translations are closely parallel in content, carefully translated, surprisingly relevant with respect to modern-day language, and quite inexpensive. A project at the University of Maryland is currently implementing this idea. We have created a scanned image dataset with groundtruth from an Arabic Bible. We have also used image degradation models to create synthetically degraded images of a French Bible. We hope to generate similar Bible datasets for other languages, and we are exploring alternative corpora such as the Koran and the Bhagavad Gita that have similar properties. Quantitative OCR evaluation based on the Arabic Bible dataset is currently in progress. ; Sponsored in part by DARPA and Army Research Lab. Report no. CS-TR-3967. Presented at the SPIE Conference on Document Recognition and Retrieval VI held in San Jose, CA on 27-28 Jan 1999. Published in the Proceedings of the SPIE Conference on Document Recognition and Retrieval VI, Proceedings of SPIE, v3651, 1999.
|
|
Keyword:
*BIBLE; *CORPUS; *DATASETS; *GROUNDTRUTH; *OPTICAL CHARACTER RECOGNITION; *TEST SETS; *TRANSLATIONS; ACCURACY; ALGORITHMS; Cybernetics; DOCUMENT IMAGES; DOCUMENTS; IMAGES; Information Science; LANGUAGE; Linguistics; MULTILINGUAL OCR(OPTICAL CHARACTER RECOGNITION); SYMPOSIA; TEST AND EVALUATION
|
|
URL: http://www.dtic.mil/docs/citations/ADA458666 http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA458666
|
|
BASE
|
|
13 |
AMAR: A Computational Model of Autosegmental Phonology
|
|
|
|
In: DTIC AND NTIS (1993)
|
|
BASE
|
|
14 |
A Practical Methodology for the Evaluation of Spoken Language Systems
|
|
|
|
In: DTIC (1992)
|
|
BASE
|
|
15 |
Subject-Based Evaluation Measures for Interactive Spoken Language Systems
|
|
|
|
In: DTIC (1992)
|
|
BASE
|
|
16 |
Development of a Spoken Language System
|
|
|
|
In: DTIC AND NTIS (1992)
|
|
BASE
|
|
17 |
Very High Speed Integrated Circuits (VHSIC) Hardware Description Language (VHDL) Syntax and Semantics Summary
|
|
|
|
In: DTIC AND NTIS (1991)
|
|
BASE
|
|
18 |
Continued Performance Assessment Methodology (PAM) Research (VORPET). Refinement and Implementation of the JWGD3 MILPERF-NAMRL Multidisciplinary Performance Test Battery (NMPTB).
|
|
|
|
In: DTIC AND NTIS (1991)
|
|
BASE
|
|
19 |
Management and Evaluation of Interactive Dialog in the Air Travel Domain
|
|
|
|
In: DTIC (1990)
|
|
BASE
|
|
20 |
Reasoning and Comprehension Processes of Linguistic Minority Persons Learning from Text
|
|
|
|
In: DTIC AND NTIS (1989)
|
|
BASE
|
|
|
|