DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
LeiKo ...
BASE
Show details
2
LeiKo ...
BASE
Show details
3
Manual Topic Annotation of German Novels and Parlament Protocols by multiple Annotators ...
BASE
Show details
4
Manual Topic Annotation of German Novels and Parlament Protocols by multiple Annotators ...
BASE
Show details
5
LeiKo ...
BASE
Show details
6
LeiKo ...
BASE
Show details
7
LeiKo ...
BASE
Show details
8
LeiKo ...
Abstract: LeiKo is a comparable corpus of German easy-to-read news texts. This freely available resource is systematically compiled and linguistically annotated for linguistic and computational linguistic research. LeiKo consists of 216 news and newspaper texts (approx. 56,600 tokens) and their meta data structured in four subcorpora according to the websites they were published on. All texts are tokenized, lemmatized, part-of-speech tagged and dependency parsed and can be queried in ANNIS (Krause/Zeldes 2016). A core corpus of 40 texts is manually corrected. Version 0.9 contains only the core corpus with lemma and pos annotations and can be queried here: https://corpora.uni-hamburg.de/hzsk/de/hzsk_access/annis/leiko Version 1.0 comprises all 216 texts and not only lemma and pos annotations, but also syntactic annotations and metadata. Further versions with additional manual annotation levels will follow. The corpus is provided in the annis format, which can be directly imported into ANNIS Kickstarter. ...
Keyword: ANNIS; annotation; Corpus; corpus linguistics; easy-to-read; einfache Sprache; Leichte Sprache; linguistics; news texts; newspaper; Plain Language; text simplification
URL: https://dx.doi.org/10.5281/zenodo.3626764
https://zenodo.org/record/3626764
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
8
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern