
Search in the Catalogues and Directories

Hits 1 – 18 of 18

1
Short text authorship attribution via sequence kernels, Markov chains and author unmasking: An investigation
In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing ; http://acl.ldc.upenn.edu/W/W06/#W06-1600 (2015)
BASE
2
Towards optimal choice selection for improved hybrid machine translation
In: The Prague bulletin of mathematical linguistics. - Praha : Univ. (2012) 97, 5-22
BLLDB
OLC Linguistik
3
Toward Determining the Comprehensibility of Machine Translations
In: DTIC (2012)
BASE
4
Novel Topic Impact on Authorship Attribution
In: DTIC (2009)
BASE
5
Syntactic Simplification for Improving Content Selection in Multi-Document Summarization
In: DTIC (2004)
BASE
6
Cross-Document Coreference on a Large Scale Corpus
In: DTIC (2004)
BASE
7
A Similarity-Based Approach and Evaluation Methodology for Reduction of Drug Name Confusion
In: DTIC (2003)
BASE
8
The Bible, Truth, and Multilingual OCR Evaluation
In: DTIC (1998)
Abstract: Multilingual OCR has emerged as an important information technology, thanks to the increasing need for cross-language information access. While many research groups and companies have developed OCR algorithms for various languages, it is difficult to compare the performance of these OCR algorithms across languages. This difficulty arises because most evaluation methodologies rely on the use of a document image dataset in each of the languages and it is difficult to find document datasets in different languages that are similar in content and layout. In this paper we propose to use the Bible as a dataset for comparing OCR accuracy across languages. Besides being available in a wide range of languages, Bible translations are closely parallel in content, carefully translated, surprisingly relevant with respect to modern-day language, and quite inexpensive. A project at the University of Maryland is currently implementing this idea. We have created a scanned image dataset with groundtruth from an Arabic Bible. We have also used image degradation models to create synthetically degraded images of a French Bible. We hope to generate similar Bible datasets for other languages, and we are exploring alternative corpora such as the Koran and the Bhagavad Gita that have similar properties. Quantitative OCR evaluation based on the Arabic Bible dataset is currently in progress. ; Sponsored in part by DARPA and Army Research Lab. Report no. CS-TR-3967. Presented at the SPIE Conference on Document Recognition and Retrieval VI held in San Jose, CA on 27-28 Jan 1999. Published in the Proceedings of the SPIE Conference on Document Recognition and Retrieval VI, Proceedings of SPIE, v3651, 1999.
Keyword: *BIBLE; *CORPUS; *DATASETS; *GROUNDTRUTH; *OPTICAL CHARACTER RECOGNITION; *TEST SETS; *TRANSLATIONS; ACCURACY; ALGORITHMS; Cybernetics; DOCUMENT IMAGES; DOCUMENTS; IMAGES; Information Science; LANGUAGE; Linguistics; MULTILINGUAL OCR(OPTICAL CHARACTER RECOGNITION); SYMPOSIA; TEST AND EVALUATION
URL: http://www.dtic.mil/docs/citations/ADA458666
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA458666
BASE
9
Experiments in Spoken Document Retrieval at CMU
In: DTIC (1997)
BASE
10
Efficient Algorithms for Speech Recognition.
In: DTIC AND NTIS (1996)
BASE
11
Overview of Results of the MUC-6 Evaluation
In: DTIC (1995)
BASE
12
Phonological Parsing for Bi-directional Letter-to-Sound/Sound-to-Letter Generation
In: DTIC (1994)
BASE
13
Signal Processing for Robust Speech Recognition
In: DTIC (1994)
BASE
14
Adaptive Natural Language Processing
In: DTIC AND NTIS (1991)
BASE
15
Global Optimization of Digital Circuits.
In: DTIC AND NTIS (1991)
BASE
16
Integrating Syntax, Semantics, and Discourse DARPA (Defense Advanced Research Projects Agency) Natural Language Understanding Program
In: DTIC AND NTIS (1989)
BASE
17
Integration of Speech and Natural Language
In: DTIC AND NTIS (1989)
BASE
18
PROGRAMMING LANGUAGE FOR AUTOMATIC CHECKOUT EQUIPMENT. VOLUME II. ADAPTED PLACE FOR THE BENDIX AN/GJQ-9.
In: DTIC AND NTIS (1963)
BASE
