
Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
Syntactic Variation, Change, and Lexical Preference: a Corpus-Based Study ...
Lehmann, Hans Martin. - : University of Zurich, 2020
BASE
2
Enhancing the linguistic discovery potential of historical corpora: a twin-track approach using ARCHER ...
BASE
3
Enhancing the linguistic discovery potential of historical corpora: A twin-track approach using ARCHER
BASE
4
Enhancing the linguistic discovery potential of historical corpora: a twin-track approach using ARCHER
In: Smith, Nick; Schneider, Gerold; Hoffmann, Sebastian; Lehmann, Hans Martin (2019). Enhancing the linguistic discovery potential of historical corpora: a twin-track approach using ARCHER. In: CL 2019 International Corpus Linguistics Conference, Cardiff, Wales, UK, 22 July 2019 - 26 July 2019. (2019)
BASE
5
Parsing early and late modern English corpora
BASE
6
Parsing early and late modern English corpora
In: Schneider, Gerold; Lehmann, Hans Martin; Schneider, Peter (2015). Parsing early and late modern English corpora. Literary and Linguistic Computing, 30(3):423-439. (2015)
Abstract: We describe, evaluate, and improve the automatic annotation of diachronic corpora at the levels of word-class, lemma, chunks, and dependency syntax. As corpora we use the ARCHER corpus (texts from 1600 to 2000) and the ZEN corpus (texts from 1660 to 1800). Performance on Modern English is considerably lower than on Present Day English (PDE). We present several methods that improve performance. First we use the spelling normalization tool VARD to map spelling variants to their PDE equivalents, which improves tagging. We investigate the tagging changes that are due to the normalization and observe improvements, deterioration, and missing mappings. We then implement an optimized version, using VARD rules and preprocessing steps to improve normalization. We evaluate the improvement on parsing performance, comparing original text, standard VARD, and our optimized version. Over 90% of the normalization changes lead to improved parsing, and 17.3% of all 422 manually annotated sentences get a net improved parse. As a next step, we adapt the parser’s grammar and add to the parser a semantic expectation model and a model for prepositional phrase (PP) attachment interaction. These extensions improve parser performance, marginally on PDE and more considerably on earlier texts: 2 to 5% on PP-attachment relations (e.g. from 63.6 to 68.4% and from 70 to 72.9% on 17th century texts). Finally, we briefly outline linguistic applications and give two examples: gerundials and auxiliary verbs in the ZEN corpus, showing that despite high noise levels linguistic signals clearly emerge, opening new possibilities for large-scale research of gradient phenomena in language change.
Keywords: 000 Computer science, knowledge & systems; 410 Linguistics; 820 English & Old English literatures; English Department; Institute of Computational Linguistics; Zurich Center for Linguistics
URL: https://doi.org/10.5167/uzh-108376
https://doi.org/10.1093/llc/fqu001
https://www.zora.uzh.ch/id/eprint/108376/8/fqu001.pdf
https://www.zora.uzh.ch/id/eprint/108376/1/ParsingEmodEv4_LLC_revised2.pdf
https://www.zora.uzh.ch/id/eprint/108376/
BASE
7
Dependency bank
In: Lehmann, Hans Martin; Schneider, Gerold (2012). Dependency bank. In: LREC 2012 Conference Workshop "Challenges in the Management of Large Corpora", Istanbul, Turkey, 22 May 2012 - 22 May 2012, 23-28. (2012)
BASE
8
Parser-based analysis of syntax-lexis interactions
In: Lehmann, Hans Martin; Schneider, Gerold (2009). Parser-based analysis of syntax-lexis interactions. In: Jucker, Andreas H; Schreier, Daniel; Hundt, Marianne. Corpora: Pragmatics and Discourse. Amsterdam, The Netherlands: Rodopi, 477-502. (2009)
BASE
9
Text types and corpora: Studies in honour of Udo Fries
Fischer, Andreas (ed.); Tottie, Gunnel (ed.); Lehmann, Hans Martin (ed.). - Tübingen : Narr, 2002
IDS Bibliografie zur deutschen Grammatik
10
From the COLT's mouth ... and others' : language corpora studies in honour of Anna-Brita Stenström
Kirk, John M. (contrib.); Wichmann, Anne (contrib.); Kjellmer, Göran (contrib.). - Amsterdam [etc.] : Rodopi, 2002
BLLDB
UB Frankfurt Linguistik
11
Towards a history of English directives
Kohnen, Thomas. - : Niemeyer, 2002
BASE
12
The Student as Corpus Linguist.
BASE
13
Corpora Galore : analyses and techniques in describing English ; papers from the nineteenth International Conference on English Language Research on Computerised Corpora (ICAME 1998)
Lindquist, Hans (contrib.); Minugh, David (contrib.); Kirk, John M. (ed.). - Amsterdam [etc.] : Rodopi, 2000
BLLDB
UB Frankfurt Linguistik
14
Grammar and lexis in English corpora
Lysvåg, Per (contrib.); Stenström, Anna-Brita (contrib.); Kjellmer, Göran (contrib.)...
In: Out of corpora. - Amsterdam [etc.] : Rodopi (1999), 47-212
BLLDB
15
Automatic retrieval of zero elements in a computerised corpus
In: Corpus based studies in English. - Amsterdam [etc.] : Rodopi (1997), 179-194
BLLDB
