Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 1 – 20 of 42

1	EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
	Freienthal, Linda; Pelicon, Andraž; Martinc, Matej. - : Ekspress Meedia Group, 2022. : Styria Media Group, 2022
	BASE
	Show details

2	Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised? ...
	Koloski, Boshko; Pollak, Senja; Škrlj, Blaž. - : arXiv, 2022
	BASE
	Show details

3	Word-embedding based bilingual terminology alignment ...
	Repar, Andraž; Martinc, Matej; Ulčar, Matej. - : Zenodo, 2021
	BASE
	Show details

4	Word-embedding based bilingual terminology alignment ...
	Repar, Andraž; Martinc, Matej; Ulčar, Matej. - : Zenodo, 2021
	BASE
	Show details

5	Ekspress news article archive (in Estonian and Russian) 1.0
	Purver, Matthew; Pollak, Senja; Freienthal, Linda. - : Ekspress Meedia Group, 2021
	BASE
	Show details

6	Latvian user comment dataset 1.0
	Shekhar, Ravi; Purver, Matthew; Pollak, Senja. - : Ekspress Meedia Group, 2021
	BASE
	Show details

7	Ekspress user comment dataset 1.0
	Shekhar, Ravi; Pollak, Senja; Pelicon, Andraž. - : Ekspress Meedia Group, 2021
	BASE
	Show details

8	24sata news comment dataset 1.0
	Shekhar, Ravi; Pranjic, Marko; Pollak, Senja. - : Styria Media Group, 2021
	BASE
	Show details

9	Keyword extraction datasets for Croatian, Estonian, Latvian and Russian 1.0
	Koloski, Boshko; Pollak, Senja; Škrlj, Blaž. - : Ekspress Meedia Group, 2021. : Styria Media Group, 2021
	BASE
	Show details

10	24sata news article archive 1.0
	Purver, Matthew; Shekhar, Ravi; Pranjić, Marko. - : Styria Media Group, 2021
	BASE
	Show details

11	Latvian Delfi article archive (in Latvian and Russian) 1.0
	Pollak, Senja; Purver, Matthew; Shekhar, Ravi. - : Ekspress Meedia Group, 2021
	BASE
	Show details

12	List of single-word male and female occupations in Slovenian
	Supej, Anka; Ulčar, Matej; Robnik-Šikonja, Marko. - : Jožef Stefan Institute, 2021. : Faculty of Computer and Information Science, University of Ljubljana, 2021
	BASE
	Show details

13	SimLex-999 Slovenian translation SimLex-999-sl 1.0
	Pollak, Senja; Vulić, Ivan; Pelicon, Andraž; Repar, Andraž; Armendariz, Carlos; Matthew, Purver; Ljubešić, Nikola. - : University of Ljubljana, 2021
	Abstract: The resource contains English SimLex-999 (Hill et al. 2015) and their Slovene translations. In the translation process, the word pairs were first translated by two translators independently, and next, for the examples where the translations differed, the final translations were chosen in a consensus meeting. The translators had also access to Croatian Simlex-999 translations (Mrkšić et al. 2017) and received translation guidelines (see next sheet) inspired by guidelines of Multi-SimLex (Vulić et al. 2020). The resources was used for building the CoSimLex resource (Armendariz et al. 2020). The list contains English original pair of words (Word1 and Word2), their part-of-speech, followed by Slovene translations (Trans1 and Trans2). The last column Comment relates to special cases: - "multiword_translation" -> translators were asked to opt for single-word equivalents, in some cases the only appropriate translation was a multi-word expression (for example, "birthday" -> "rojstni dan"). - "no_translation" -> pairs without a proper translation, i.e. translation pair contains two identical words. Although the translators were asked to find two different translations for the words, in a few examples that was not possible. For example, for the English pair "taxi" and "cab", only "taksi" was considered a good Slovene equivalent. - "duplicated_translation" -> in cases where a pair of words is repeated for two different English original pairs, both occurrences are marked as duplicate translations. - "duplicated_original" -> in one case, the original word pair was a duplicate, which is also marked. Cite: If you use the dataset, please cite the Clarin handle and the following paper: Armendariz, Carlos Santos, Purver, Matthew, Ulčar, Matej, Pollak, Senja, Ljubešić, Nikola, Granroth-Wilding, Mark, and Vaik, Kristiina (2020). CoSimLex: A Resource for Evaluating Graded Word Similarity in Context. In Proceedings of the 12th Language Resources and Evaluation Conference, p. 5878--5886. https://www.aclweb.org/anthology/2020.lrec-1.720/ References: Armendariz, Carlos Santos, Purver, Matthew, Ulčar, Matej, Pollak, Senja, Ljubešić, Nikola, Granroth-Wilding, Mark, and Vaik, Kristiina (2020). CoSimLex: A Resource for Evaluating Graded Word Similarity in Context. In Proceedings of the 12th Language Resources and Evaluation Conference, p. 5878--5886. https://www.aclweb.org/anthology/2020.lrec-1.720/ Hill, F., Reichart, R., and Korhonen, A. (2015). Simlex-999: Evaluating semantic models with (genuine) similarity estimation. Computational Linguistics, 41(4):665–695. https://www.aclweb.org/anthology/J15-4004/ Mrkšić, Nikola, Ivan Vulić, Diarmuid Ó Séaghdha, Ira Leviant, Roi Reichart, Milica Gašić, Anna Korhonen, and Steve Young. (2017). Semantic specialisation of distributional word vector spaces using monolingual and cross-lingual constraints. Transactions of the ACL, 5:309–324. https://www.mitpressjournals.org/doi/abs/10.1162/tacl_a_00063 Vulić, Ivan, Baker, Simon, Ponti, Edoardo Maria, Petti, Ulla, Leviant, Ira, Wing, Kelly, Majewska, Olga, Bar, Eden, Malone, Matt, Poibeau, Thierry, Reichart, Roi and Anna Korhonen (2020). Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity. Computational Linguistics. https://doi.org/10.1162/coli_a_00391
	Keyword: English language; evaluation; similarity; Slovenian language; word embeddings
	URL: http://hdl.handle.net/11356/1309
	BASE
	Hide details

14	Slav-NER: the 3rd Cross-lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic languages ...
	Piskorski, Jakub; Babych, Bogdan; Kancheva, Zara. - : Zenodo, 2021
	BASE
	Show details

15	Slav-NER: the 3rd Cross-lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic languages ...
	Piskorski, Jakub; Babych, Bogdan; Kancheva, Zara. - : Zenodo, 2021
	BASE
	Show details

16	Evaluation of contextual embeddings on less-resourced languages ...
	Ulčar, Matej; Žagar, Aleš; Armendariz, Carlos S.. - : arXiv, 2021
	BASE
	Show details

17	Simple Discovery of COVID IS WAR Metaphors Using Word Embeddings ...
	Brglez, Mojca; Pollak, Senja; Vintar, Špela. - : Zenodo, 2021
	BASE
	Show details

18	Simple Discovery of COVID IS WAR Metaphors Using Word Embeddings ...
	Brglez, Mojca; Pollak, Senja; Vintar, Špela. - : Zenodo, 2021
	BASE
	Show details

19	Investigating cross-lingual training for offensive language detection
	Pelicon, Andraž; Shekhar, Ravi; Škrlj, Blaž...
	In: PeerJ Comput Sci (2021)
	BASE
	Show details

20	Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech
	Martinc, Matej; Haider, Fasih; Pollak, Senja...
	In: Front Aging Neurosci (2021)
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern