DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6...10
Hits 21 – 40 of 191

21
Slovenian Twitter dataset 2018-2020 1.0
Evkoski, Bojan; Pelicon, Andraž; Mozetič, Igor. - : Jožef Stefan Institute, 2021
BASE
Show details
22
Slovene ontology of semantic types for nouns SLONEST-noun 1.0
Abstract: SLONEST stands for Slovene Ontologies of Semantic Types. The first subset – SLONEST-noun 1.0 – represents an ontology developed for nouns. SLONEST-noun contains an XML file with a total of 271 categories of semantic types: 21 top-level categories, which are further divided into up to three levels of hierarchical subcategories. The ontology was developed and evaluated using the data from the Collocations Dictionary of Modern Slovene (Kosem et al. 2018; https://viri.cjvt.si/kolokacije; http://hdl.handle.net/11356/1250) and the Comprehensive Slovene-Hungarian Dictionary (https://www.cjvt.si/en/research/cjvt-projects/slovene-hungarian-dictionary), which are being compiled at the Centre for Language Resources and Technologies, University of Ljubljana. The semantic types in the SLONEST-noun ontology are accompanied with numerical ids (listed in the attribute SEMCODE; e.g. "1.1.1") and full ontology path (attribute SEMFULLNAME; e.g. "HUMAN-ACTIVITY-OTHER"). Every semantic type is provided with a definition (e.g. "Other denominations for humans related to activities."). Where relevant, especially at top-level semantic types, the corresponding semantic type (i.e. lexicographer file) from Wordnet (https://wordnet.princeton.edu/) is listed, along with the level of matching ("full" or "partial"). For most semantic types, examples of Slovene lemmas or multiword units are also provided. As the ontology was also developed for, and tested on, collocation data, a selection of collocations is also provided for most categories. For every collocation, noun headwords and collocates are clearly labelled, and the information on grammatical structure (id and name) is provided, based on the most recent database of Slovene collocations (http://hdl.handle.net/11356/1415). The ontology was developed as part of the KOLOS project. The authors acknowledge that the project titled Collocation as a basis for language description: semantic and temporal perspectives (J6-8255) was financially supported by the Slovenian Research Agency.
Keyword: collocations; concepts; nouns; ontology; semantic types; Slovenian language; wordnet
URL: http://hdl.handle.net/11356/1428
BASE
Hide details
23
Corpus of Serbian Forms of Address 1.0
Lemmenmeier-Batinić, Dolores; Ljubešić, Nikola; Samardžić, Tanja. - : Slavic Seminary, University of Zurich, 2021
BASE
Show details
24
Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.0
Ljubešić, Nikola; Fišer, Darja; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
BASE
Show details
25
Montenegrin web corpus meWaC 1.0
Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
BASE
Show details
26
The Orange workflow for observing collocation clusters ColEmbed 1.0
Kosem, Iztok; Čibej, Jaka; Ljubešić, Nikola. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
BASE
Show details
27
Comparable corpora of South-Slavic Wikipedias CLASSLA-Wikipedia 1.0
Ljubešić, Nikola; Markoski, Filip; Markoska, Elena. - : Jožef Stefan Institute, 2021
BASE
Show details
28
Text collection for training the BERTić transformer model BERTić-data
Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
BASE
Show details
29
Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
BASE
Show details
30
Corpus of Croatian news portals ENGRI (2014-2018)
Bogunović, Irena; Kučić, Mario; Ljubešić, Nikola. - : University of Rijeka, Faculty of Maritime Studies, 2021
BASE
Show details
31
Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
Ljubešić, Nikola; Fišer, Darja; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
BASE
Show details
32
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.2
Ljubešić, Nikola; Krsnik, Luka. - : Jožef Stefan Institute, 2021
BASE
Show details
33
The CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.3
Ljubešić, Nikola; Krsnik, Luka. - : Jožef Stefan Institute, 2021
BASE
Show details
34
Abstracts from the KAS corpus KAS-Abs 1.0
Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute, 2021. : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
BASE
Show details
35
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
BASE
Show details
36
Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0
BASE
Show details
37
Slovenian Twitter hate speech dataset IMSyPP-sl
Kralj Novak, Petra; Mozetič, Igor; Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
BASE
Show details
38
Multilingual comparable corpora of parliamentary debates ParlaMint 2.0
BASE
Show details
39
English YouTube Hate Speech Corpus
Ljubešić, Nikola; Mozetič, Igor; Cinelli, Matteo. - : Jožef Stefan Institute, 2021
BASE
Show details
40
Corpus of Written Standard Slovene Gigafida 2.0
Krek, Simon; Erjavec, Tomaž; Repar, Andraž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
BASE
Show details

Page: 1 2 3 4 5 6...10

Catalogues
0
0
0
0
5
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
1
Open access documents
183
0
2
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern