
Search in the Catalogues and Directories

Hits 1 – 20 of 42

1
EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
Freienthal, Linda; Pelicon, Andraž; Martinc, Matej. - : Ekspress Meedia Group, 2022. : Styria Media Group, 2022
BASE
2
Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised? ...
BASE
3
Word-embedding based bilingual terminology alignment ...
Abstract: The ability to accurately align concepts between languages can provide significant benefits in many practical applications. In this paper, we extend a machine learning approach using dictionary and cognate-based features with novel cross-lingual embedding features using pretrained fastText embeddings. We use the tool VecMap to align the embeddings between Slovenian and English and then for every word calculate the top 3 closest word embeddings in the opposite language based on cosine distance. These alignments are then used as features for the machine learning algorithm. With one configuration of the input parameters, we managed to improve the overall F-score compared to previous work, while another configuration yielded improved precision (96%) at a cost of lower recall. Using embedding-based features as a replacement for dictionary-based features provides a significant benefit: while a large bilingual parallel corpus is required to generate the Giza++ word alignment lists, no such data is required for ...
Keyword: embeddings alignment; machine learning; terminology alignment; word embeddings
URL: https://dx.doi.org/10.5281/zenodo.5547981
https://zenodo.org/record/5547981
BASE
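The nearest-neighbour step described in the abstract above (for every word, the top 3 closest embeddings in the opposite language by cosine distance) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the two embedding spaces have already been aligned (the abstract names VecMap for that step), the three-dimensional vectors are invented toy values rather than real fastText embeddings, and the function names `cosine_similarity` and `top_k_neighbours` are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def top_k_neighbours(word, source_vectors, target_vectors, k=3):
    """Return the k target-language words closest to `word` by cosine similarity.

    Both dictionaries are assumed to live in one shared, already aligned space.
    """
    query = source_vectors[word]
    scored = [
        (cosine_similarity(query, vector), target_word)
        for target_word, vector in target_vectors.items()
    ]
    # Highest similarity first, i.e. smallest cosine distance first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [target_word for _, target_word in scored[:k]]


# Toy, invented vectors standing in for aligned Slovenian and English embeddings.
slovenian = {"hiša": [0.9, 0.1, 0.0]}
english = {
    "house": [0.8, 0.2, 0.1],
    "home": [0.7, 0.3, 0.0],
    "building": [0.6, 0.1, 0.4],
    "river": [0.0, 0.1, 0.9],
}

neighbours = top_k_neighbours("hiša", slovenian, english, k=3)
print(neighbours)
```

In a setup like the one the abstract describes, the returned candidates would then be turned into features for the downstream classifier; how that is done is not stated in the truncated abstract and is not shown here.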
4
Word-embedding based bilingual terminology alignment ...
BASE
5
Ekspress news article archive (in Estonian and Russian) 1.0
Purver, Matthew; Pollak, Senja; Freienthal, Linda. - : Ekspress Meedia Group, 2021
BASE
6
Latvian user comment dataset 1.0
Shekhar, Ravi; Purver, Matthew; Pollak, Senja. - : Ekspress Meedia Group, 2021
BASE
7
Ekspress user comment dataset 1.0
Shekhar, Ravi; Pollak, Senja; Pelicon, Andraž. - : Ekspress Meedia Group, 2021
BASE
8
24sata news comment dataset 1.0
Shekhar, Ravi; Pranjić, Marko; Pollak, Senja. - : Styria Media Group, 2021
BASE
9
Keyword extraction datasets for Croatian, Estonian, Latvian and Russian 1.0
Koloski, Boshko; Pollak, Senja; Škrlj, Blaž. - : Ekspress Meedia Group, 2021. : Styria Media Group, 2021
BASE
10
24sata news article archive 1.0
Purver, Matthew; Shekhar, Ravi; Pranjić, Marko. - : Styria Media Group, 2021
BASE
11
Latvian Delfi article archive (in Latvian and Russian) 1.0
Pollak, Senja; Purver, Matthew; Shekhar, Ravi. - : Ekspress Meedia Group, 2021
BASE
12
List of single-word male and female occupations in Slovenian
Supej, Anka; Ulčar, Matej; Robnik-Šikonja, Marko. - : Jožef Stefan Institute, 2021. : Faculty of Computer and Information Science, University of Ljubljana, 2021
BASE
13
SimLex-999 Slovenian translation SimLex-999-sl 1.0
Pollak, Senja; Vulić, Ivan; Pelicon, Andraž. - : University of Ljubljana, 2021
BASE
14
Slav-NER: the 3rd Cross-lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic languages ...
BASE
15
Slav-NER: the 3rd Cross-lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic languages ...
BASE
16
Evaluation of contextual embeddings on less-resourced languages ...
BASE
17
Simple Discovery of COVID IS WAR Metaphors Using Word Embeddings ...
BASE
18
Simple Discovery of COVID IS WAR Metaphors Using Word Embeddings ...
BASE
19
Investigating cross-lingual training for offensive language detection
In: PeerJ Comput Sci (2021)
BASE
20
Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech
In: Front Aging Neurosci (2021)
BASE

