DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 37

1
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
Show details
2
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
Show details
3
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
Abstract: **************** Full Curlie dataset **************** This dataset contains the URL scrapped from curlie.org alongside with their multilingual labels. The label correspond to the sub-category where the URL was referenced in Curlie. We also provide a mapping between english labels and labels from other languages for alignment. The URLs have been filtered to only contain homepages. Each distint URL is indexed with a unique identifier (uid). curlie.csv.gz > [url, uid, label, lang] x 2,275,150 samples mapping.json.gz > [english_label, matchings] x 35,946 labels **************** Processed Curlie dataset **************** You find here the data used to train Homepage2vec. URLs have been further filtered out: websites listed under the Regional top-category where dropped, as well as non-accessible websites. This filtering yields 1,018,207 valid URL. The labels are aligned across languages and reduced to the 14 top-categories (classes). Because a URL can belong to several classes, a binary vector is used. The ...
Keyword: 170203 Knowledge Representation and Machine Learning; 80505 Web Technologies excl. Web Search; 80704 Information Retrieval and Web Search; Applied Computer Science; FOS Computer and information sciences; FOS Media and communications; FOS Psychology
URL: https://figshare.com/articles/dataset/Curlie_Dataset_-_Language-agnostic_Website_Embedding_and_Classification/19406693
https://dx.doi.org/10.6084/m9.figshare.19406693
BASE
Hide details
4
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
Show details
5
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
Show details
6
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
Show details
7
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
Show details
8
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
Show details
9
nkresearch ...
hyun, eileen. - : figshare, 2022
BASE
Show details
10
nkresearch ...
hyun, eileen. - : figshare, 2022
BASE
Show details
11
nkresearch ...
hyun, eileen. - : figshare, 2022
BASE
Show details
12
nkresearch ...
hyun, eileen. - : figshare, 2022
BASE
Show details
13
nkresearch ...
hyun, eileen. - : figshare, 2022
BASE
Show details
14
Keywords Queries Palabras Clave búsquedas en Google sobre libro y lectura en España 2004-2016 - tesis doctoral jorge serrano-cobos ...
Serrano-Cobos, Jorge. - : figshare, 2021
BASE
Show details
15
Keywords Queries Palabras Clave búsquedas en Google sobre libro y lectura en España 2004-2016 - tesis doctoral jorge serrano-cobos ...
Serrano-Cobos, Jorge. - : figshare, 2021
BASE
Show details
16
Keywords Queries Palabras Clave búsquedas en Google sobre libro y lectura en España 2004-2016 - tesis doctoral jorge serrano-cobos ...
Serrano-Cobos, Jorge. - : figshare, 2021
BASE
Show details
17
Quality assessment of Wikipedia and its sources ...
Włodzimierz Lewoniewski. - : figshare, 2020
BASE
Show details
18
Quality assessment of Wikipedia and its sources ...
Włodzimierz Lewoniewski. - : figshare, 2020
BASE
Show details
19
Quality assessment of Wikipedia and its sources ...
Włodzimierz Lewoniewski. - : figshare, 2020
BASE
Show details
20
An Overview of Textual Semantic Similarity Measures Based on Web Intelligence ...
Martinez-Gil, Jorge. - : figshare, 2018
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
37
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern