Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Kelleher, John D. (4)
Klubicka, Filip (4)
Maldonado, Alfredo (4)
Mahalunkar, Abhijit (2)
SFI Research Centres Programme (2)
ADAPT Centre for Dig- ital Content Technology (1)
ADAPT Centre for Digital Content Technology (1)
John D. Kelleher (1)
Kacmajor, Magdalena (1)
SFI Research Centres Pro-gramme (1)
Year
Medium
Type:
Article (3)
Miscellaneous (1)
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 4 of 4
1
Semantic Relatedness and Taxonomic Word Embeddings ...
Kacmajor, Magdalena
;
Kelleher, John D.
;
Klubicka, Filip
. - : arXiv, 2020
BASE
Show details
2
English WordNet Taxonomic Random Walk Pseudo-Corpora
Klubicka, Filip
;
Maldonado, Alfredo
;
Mahalunkar, Abhijit
...
In: Conference papers (2020)
BASE
Show details
3
Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
Klubicka, Filip
;
Mahalunkar, Abhijit
;
Maldonado, Alfredo
;
Kelleher, John D.
In: Conference papers (2019)
Abstract:
Creating word embeddings that reflect semantic relationships encoded in lexical knowledge resources is an open challenge. One approach is to use a random walk over a knowledge graph to generate a pseudo-corpus and use this corpus to train embeddings. However, the effect of the shape of the knowledge graph on the generated pseudo-corpora, and on the resulting word embeddings, has not been studied. To explore this, we use English WordNet, constrained to the taxonomic (tree-like) portion of the graph, as a case study. We investigate the properties of the generated pseudo-corpora, and their impact on the resulting embeddings. We find that the distributions in the psuedo-corpora exhibit properties found in natural corpora, such as Zipf’s and Heaps’ law, and also ob- serve that the proportion of rare words in a pseudo-corpus affects the performance of its embeddings on word similarity.
Keyword:
Artificial Intelligence and Robotics
;
Computational Linguistics
;
corpus
;
evaluation
;
Numerical Analysis and Scientific Computing
;
random walk
;
representations
;
Software Engineering
;
taxonomy
;
word embeddings
;
word similarity
;
WordNet
URL:
https://arrow.tudublin.ie/scschcomcon/271
https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1283&context=scschcomcon
BASE
Hide details
4
Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
Maldonado, Alfredo
;
Klubicka, Filip
;
Kelleher, John D.
In: Articles (2019)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
4
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern