1 |
Semantic Data Set Construction from Human Clustering and Spatial Arrangement ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Context vs Target Word: Quantifying Biases in Lexical Semantic Datasets ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Quantifying lexical usage: vocabulary pertaining to ecosystems and the environment
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis
|
|
Majewska, Olga; Vulic, Ivan; McCarthy, Diana. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.423, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
9 |
Investigating the cross-lingual translatability of VerbNet-style classification. ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Investigating the cross-lingual translatability of VerbNet-style classification.
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Word Sense Clustering and Clusterability
|
|
|
|
In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01838502 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2016, 42, pp.245-275. ⟨10.1162/COLI⟩ (2016)
|
|
Abstract:
International audience ; Word sense disambiguation and the related field of automated word sense induction traditionally assume that the occurrences of a lemma can be partitioned into senses. But this seems to be a much easier task for some lemmas than others. Our work builds on recent work that proposes describing word meaning in a graded fashion rather than through a strict partition into senses; in this article we argue that not all lemmas may need the more complex graded analysis, depending on their partitionability. Although there is plenty of evidence from previous studies and from the linguistics literature that there is a spectrum of partitionability of word meanings, this is the first attempt to measure the phenomenon and to couple the machine learning literature on clusterability with word usage data used in computational linguistics.We propose to operationalize partitionability as clusterability, a measure of how easy the occurrences of a lemma are to cluster. We test two ways of measuring clusterability: (1) existing measures from the machine learning literature that aim to measure the goodness of optimal k-means clusterings, and (2) the idea that if a lemma is more clusterable, two clusterings based on two different “views” of the same data points will be more congruent. The two views that we use are two different sets of manually constructed lexical substitutes for the target lemma, on the one hand monolingual paraphrases, and on the other hand translations. We apply automatic clustering to the manual annotations. We use manual annotations because we want the representations of the instances that we cluster to be as informative and “clean” as possible. We show that when we control for polysemy, our measures of clusterability tend to correlate with partitionability, in particular some of the type-(1) clusterability measures, and that these measures outperform a baseline that relies on the amount of overlap in a soft clustering.
|
|
Keyword:
[INFO]Computer Science [cs]; clusterability; word sense clustering
|
|
URL: https://hal.archives-ouvertes.fr/hal-01838502 https://doi.org/10.1162/COLI https://hal.archives-ouvertes.fr/hal-01838502/document https://hal.archives-ouvertes.fr/hal-01838502/file/COLI_a_00247.pdf
|
|
BASE
|
|
Hide details
|
|
14 |
Integrating character representations into Chinese word embedding
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Semantic clustering of pivot paraphrases
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01838559 ; International Conference on Language Resources and Evaluation, Jan 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
19 |
Quantifying lexical usage: vocabulary pertaining to ecosystems and the environment
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Finding Meaning in Context Using Graph Algorithms in Mono- and Cross-lingual Settings
|
|
|
|
BASE
|
|
Show details
|
|
|
|