DE eng

Search in the Catalogues and Directories

Hits 1 – 5 of 5

1
Corpus of term-annotated texts RSDO5 1.1
Abstract: The RSDO5 corpus was compiled in order to serve as a training set for automatic term identification. It consists of 12 texts with 250,000 words and almost 38,000 manually annotated terms, each marked to be either in- or out-domain. The corpus texts were published between 2000 and 2019, are either PhD theses (3), a scientific book based on a PhD thesis (1), graduate level text books (4), or journal articles (4) and belong to the fields of biomechanics (3), linguistics (3), chemistry (3), or veterinary science (3). Apart from the manually annotated terms, the corpus was automatically annotated with Universal Dependencies annotations, i.e. tokenisation, sentence segmentation, lemmatisation, morpological features and dependency syntax. As opposed to the previous version, this one adds in- and out-domain marking on terms in the TEI and vertical files.
Keyword: manual annotation; TEI; terminology
URL: http://hdl.handle.net/11356/1470
BASE
Hide details
2
Corpus of Slovenian school texts SBSJ 1.0
BASE
Show details
3
Corpus of term-annotated texts RSDO5 1.0
BASE
Show details
4
Slovene Grammars and Orthographic Dictionaries
BASE
Show details
5
Wüster’s View of Terminology
Trojar, Mitja. - 2017
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
5
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern