Hits 21 – 40 of 191

21	Slovenian Twitter dataset 2018-2020 1.0
	Evkoski, Bojan; Pelicon, Andraž; Mozetič, Igor. - : Jožef Stefan Institute, 2021
	BASE

22	Slovene ontology of semantic types for nouns SLONEST-noun 1.0
	Kosem, Iztok; Pori, Eva; Gantar, Polona; Logar, Nataša; Krek, Simon; Laskowski, Cyprian; Arhar Holdt, Špela; Čibej, Jaka; Dobrovoljc, Kaja; Gorjanc, Vojko; Klemenc, Bojan; Ljubešić, Nikola. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
	Abstract: SLONEST stands for Slovene Ontologies of Semantic Types. Its first subset, SLONEST-noun 1.0, is an ontology developed for nouns. SLONEST-noun contains an XML file with a total of 271 categories of semantic types: 21 top-level categories, which are further divided into up to three levels of hierarchical subcategories. The ontology was developed and evaluated using data from the Collocations Dictionary of Modern Slovene (Kosem et al. 2018; https://viri.cjvt.si/kolokacije; http://hdl.handle.net/11356/1250) and the Comprehensive Slovene-Hungarian Dictionary (https://www.cjvt.si/en/research/cjvt-projects/slovene-hungarian-dictionary), both of which are being compiled at the Centre for Language Resources and Technologies, University of Ljubljana. Each semantic type in the SLONEST-noun ontology is accompanied by a numerical ID (attribute SEMCODE; e.g. "1.1.1") and its full ontology path (attribute SEMFULLNAME; e.g. "HUMAN-ACTIVITY-OTHER"). Every semantic type has a definition (e.g. "Other denominations for humans related to activities."). Where relevant, especially for top-level semantic types, the corresponding semantic type (i.e. lexicographer file) from WordNet (https://wordnet.princeton.edu/) is listed, along with the level of matching ("full" or "partial"). For most semantic types, examples of Slovene lemmas or multiword units are also provided. Because the ontology was also developed for, and tested on, collocation data, a selection of collocations is provided for most categories. For every collocation, noun headwords and collocates are clearly labelled, and the grammatical structure (ID and name) is given, based on the most recent database of Slovene collocations (http://hdl.handle.net/11356/1415). The ontology was developed as part of the KOLOS project.
	The authors acknowledge that the project "Collocation as a basis for language description: semantic and temporal perspectives" (J6-8255) was financially supported by the Slovenian Research Agency.
	Keyword: collocations; concepts; nouns; ontology; semantic types; Slovenian language; wordnet
	URL: http://hdl.handle.net/11356/1428
	BASE

23	Corpus of Serbian Forms of Address 1.0
	Lemmenmeier-Batinić, Dolores; Ljubešić, Nikola; Samardžić, Tanja. - : Slavic Seminary, University of Zurich, 2021
	BASE

24	Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.0
	Ljubešić, Nikola; Fišer, Darja; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
	BASE

25	Montenegrin web corpus meWaC 1.0
	Ljubešić, Nikola; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
	BASE

26	The Orange workflow for observing collocation clusters ColEmbed 1.0
	Kosem, Iztok; Čibej, Jaka; Ljubešić, Nikola. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
	BASE

27	Comparable corpora of South-Slavic Wikipedias CLASSLA-Wikipedia 1.0
	Ljubešić, Nikola; Markoski, Filip; Markoska, Elena. - : Jožef Stefan Institute, 2021
	BASE

28	Text collection for training the BERTić transformer model BERTić-data
	Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
	BASE

29	Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
	Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya. - : CLARIN ERIC, 2021
	BASE

30	Corpus of Croatian news portals ENGRI (2014-2018)
	Bogunović, Irena; Kučić, Mario; Ljubešić, Nikola. - : University of Rijeka, Faculty of Maritime Studies, 2021
	BASE

31	Offensive language dataset of Croatian, English and Slovenian comments FRENK 1.1
	Ljubešić, Nikola; Fišer, Darja; Erjavec, Tomaž. - : Jožef Stefan Institute, 2021
	BASE

32	The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.2
	Ljubešić, Nikola; Krsnik, Luka. - : Jožef Stefan Institute, 2021
	BASE

33	The CLASSLA-StanfordNLP model for lemmatisation of standard Slovenian 1.3
	Ljubešić, Nikola; Krsnik, Luka. - : Jožef Stefan Institute, 2021
	BASE

34	Abstracts from the KAS corpus KAS-Abs 1.0
	Erjavec, Tomaž; Fišer, Darja; Ljubešić, Nikola. - : Jožef Stefan Institute; Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
	BASE

35	Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1
	Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya. - : CLARIN ERIC, 2021
	BASE

36	Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0
	Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya. - : CLARIN ERIC, 2021
	BASE

37	Slovenian Twitter hate speech dataset IMSyPP-sl
	Kralj Novak, Petra; Mozetič, Igor; Ljubešić, Nikola. - : Jožef Stefan Institute, 2021
	BASE

38	Multilingual comparable corpora of parliamentary debates ParlaMint 2.0
	Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya. - : CLARIN ERIC, 2021
	BASE

39	English YouTube Hate Speech Corpus
	Ljubešić, Nikola; Mozetič, Igor; Cinelli, Matteo. - : Jožef Stefan Institute, 2021
	BASE

40	Corpus of Written Standard Slovene Gigafida 2.0
	Krek, Simon; Erjavec, Tomaž; Repar, Andraž. - : Centre for Language Resources and Technologies, University of Ljubljana, 2021
	BASE


© 2013 - 2024 Lin|gu|is|tik