
Search in the Catalogues and Directories

Hits 41 – 60 of 174

41
Dialogue act annotated spoken corpus GORDAN 1.0 (audio/video)
Zwitter Vitez, Ana; Zemljarič Miklavčič, Jana; Krek, Simon. - : Centre for Language Resources and Technologies, University of Ljubljana, 2020. : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2020
BASE
42
English-Slovene term candidates KAS-biterm 1.0
Erjavec, Tomaž; Ljubešić, Nikola; Fišer, Darja. - : Jožef Stefan Institute, 2020
BASE
43
MULTEXT-East ...
Erjavec, Tomaž. - : arXiv, 2020
BASE
44
Building English-to-Serbian machine translation system for IMDb movie reviews
In: Way, Andy orcid:0000-0001-5736-5930, Lohar, Pintu and Popović, Maja orcid:0000-0001-8234-8745 (2019) Building English-to-Serbian machine translation system for IMDb movie reviews. In: Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing, 2 Aug 2019, Florence, Italy. ISBN 978-1-950737-41-3 (2019)
BASE
45
Universal Dependencies 2.5
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2019
BASE
46
Universal Dependencies 2.4
Nivre, Joakim; Abrams, Mitchell; Agić, Željko. - : Universal Dependencies Consortium, 2019
BASE
47
Corpus of academic Slovene KAS 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2019. : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
BASE
48
Corpus of "Attacks on the Yugoslav National Army" (1989) VAYNA 1.1
Žagar, Igor; Tancig, Peter; Erjavec, Tomaž. - : Jožef Stefan Institute, 2019
BASE
49
Training corpus ssj500k 2.2
Krek, Simon; Dobrovoljc, Kaja; Erjavec, Tomaž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
50
Morphological lexicon Sloleks 2.0
Dobrovoljc, Kaja; Krek, Simon; Holozan, Peter. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019
BASE
51
Slovenian parliamentary corpus siParl 1.0 (1990-2018)
Pančur, Andrej; Erjavec, Tomaž; Ojsteršek, Mihael. - : Institute of Contemporary History, 2019
BASE
52
Slovenian parliamentary corpus ParlaMeter-sl 1.0
Dobranić, Filip; Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2019
BASE
53
Croatian Twitter training corpus ReLDI-NormTagNER-hr 2.1
Ljubešić, Nikola; Erjavec, Tomaž; Batanović, Vuk. - : Jožef Stefan Institute, 2019
BASE
54
Spoken corpus Gos VideoLectures 4.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
BASE
55
CMC training corpus Janes-Tag 2.1
Erjavec, Tomaž; Fišer, Darja; Čibej, Jaka. - : Jožef Stefan Institute, 2019
BASE
56
Corpus of Academic Slovene (PhD theses) KAS-dr 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola; Ferme, Marko; Borovič, Mladen; Boškovič, Borko; Ojsteršek, Milan; Hrovat, Goran. - : Jožef Stefan Institute, 2019. : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
Abstract: The KAS-dr corpus of Slovene PhD theses consists of almost 1,600 texts (266 thousand pages or 100 million tokens) written between 2000 and 2018 and gathered from the digital libraries of Slovene higher education institutions via the Slovene Open Science portal (http://openscience.si). Each thesis has significant associated metadata, and the corpus contains only its textual body, i.e. without front and back matter. The body is divided into pages, these into paragraphs, and then into sentences. The sentence tokens are morphosyntactically annotated, words are lemmatised, and English-Slovene pairs of term candidates are marked up and linked. Slovene monolingual term candidates are also marked up. The corpus is distributed in the canonical TEI encoding, in the so-called vertical format used by the (no)Sketch Engine and CWB concordancers, and as plain text files. Each format distribution also contains a file with thesis metadata. This repository entry contains the corpus of PhD theses only; separate entries are available that contain MSc/MA theses (KAS-mag: http://hdl.handle.net/11356/1266), BSc/BA theses (KAS-dipl: http://hdl.handle.net/11356/1267) and the complete KAS corpus with all three (KAS: http://hdl.handle.net/11356/1244).
Keyword: academic writing; PhD theses; TEI; terminology
URL: http://hdl.handle.net/11356/1265
BASE
57
Collocation lexicon of Slovene academic discourse Aleks
Logar, Nataša; Kosem, Iztok; Erjavec, Tomaž. - : Faculty of Social Sciences, University of Ljubljana, 2019. : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Jožef Stefan Institute, 2019
BASE
58
Croatian parliamentary corpus ParlaMeter-hr 1.0
Dobranić, Filip; Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2019
BASE
59
Corpus of Academic Slovene (MSc/MA theses) KAS-mag 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2019. : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
BASE
60
Corpus of Informatics DSI 5.0
Erjavec, Tomaž; Puc, Katarina; Kanič, Ivan. - : Jožef Stefan Institute, 2019. : Slovensko društvo INFORMATIKA, 2019
BASE

