DE eng

Search in the Catalogues and Directories

Hits 1 – 6 of 6

1
Automatic Predicate Argument Analysis of the Penn TreeBank
In: DTIC (2001)
BASE
Show details
2
Facilitating Treebank Annotation Using a Statistical Parser
In: DTIC (2001)
Abstract: Corpora of phrase-structure-annotated text, or treebanks, are useful for supervised training of statistical models for natural language processing, as well as for corpus linguistics. Their primary drawback, however, is that they are very time-consuming to produce. To alleviate this problem, the standard approach is to make two passes over the text: first, parse the text automatically, then correct the parser output by hand. In this paper we explore three questions: How much does an automatic first pass speed up annotation? Does this automatic first pass affect the reliability of the final product? What kind of parser is best suited for such an automatic first pass? We investigate these questions by an experiment to augment the Penn Chinese Treebank [15] using a statistical parser developed by Chiang [3] for English. This experiment differs from previous efforts in two ways: first, we quantify the increase in annotation speed provided by the automatic first pass (70 100%); second, we use a parser developed on one language to augment a corpus in an unrelated language. ; Presented at the Human Language Technology Conference (HLT 2001) held in San Diego, CA on 18-21 Mar 2001. Pub. in the Proceedings of the Human Language Technology Conference (HLT 2001), 2001. Sponsored in part by grant NSF SBR-89-20230-15.
Keyword: *AUTOMATA; *INFORMATION PROCESSING; *MATHEMATICAL MODELS; *NATURAL LANGUAGE; *PARSERS; *PHRASE STRUCTURE GRAMMARS; *STATISTICAL PARSERS; *TREEBANK ANNOTATION; CHINA; COMPUTATIONAL LINGUISTICS; Cybernetics; HIERARCHIES; Information Science; LEXICOGRAPHY; Linguistics; RELIABILITY; STATISTICAL MODELS; STOCHASTIC PROCESSES; SYMPOSIA; TEXT PROCESSING
URL: http://www.dtic.mil/docs/citations/ADA460488
http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA460488
BASE
Hide details
3
An Automatic Method of Finding Topic Boundaries
In: DTIC (1994)
BASE
Show details
4
A Simple Rule-Based Part of Speech Tagger
In: DTIC (1992)
BASE
Show details
5
Parsing the Voyager Domain Using Pearl
In: DTIC (1991)
BASE
Show details
6
Elements of a Computational Model of Cooperative Response Generation
In: DTIC (1989)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern