Page: 1 2 3 4 5 6 7 8 9 10
81 |
Spatial multi-arrangement for clustering and multi-way similarity dataset construction
|
|
|
|
Abstract:
We present a novel methodology for fast bottom-up creation of large-scale semantic similarity resources to support development and evaluation of NLP systems. Our work targets verb similarity, but the methodology is equally applicable to other parts of speech. Our approach circumvents the bottleneck of slow and expensive manual development of lexical resources by leveraging semantic intuitions of native speakers and adapting a spatial multi-arrangement approach from cognitive neuroscience, used before only with visual stimuli, to lexical stimuli. Our approach critically obtains judgments of word similarity in the context of a set of related words, rather than of word pairs in isolation. We also handle lexical ambiguity as a natural consequence of a two-phase process where verbs are placed in broad semantic classes prior to the fine-grained spatial similarity judgments. Our proposed design produces a large-scale verb resource comprising 17 relatedness-based classes and a verb similarity dataset containing similarity scores for 29,721 unique verb pairs and 825 target verbs, which we release with this paper.
|
|
URL: https://www.repository.cam.ac.uk/handle/1810/306834 https://doi.org/10.17863/CAM.53925
|
|
BASE
|
|
Hide details
|
|
82 |
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis
|
|
Majewska, Olga; Vulic, Ivan; McCarthy, Diana. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.423, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
84 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
|
|
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.2, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
85 |
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction
|
|
|
|
BASE
|
|
Show details
|
|
86 |
Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers
|
|
Glavas, Goran; Agic, Zeljko; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.345, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
87 |
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers
|
|
|
|
BASE
|
|
Show details
|
|
88 |
Emergent Communication Pretraining for Few-Shot Machine Translation
|
|
Vulic, Ivan; Ponti, Edoardo; Korhonen, Anna. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.416, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
90 |
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer
|
|
|
|
BASE
|
|
Show details
|
|
91 |
SemEval-2020 Task 3: Graded Word Similarity in Context
|
|
Santos Armendariz, Carlos; Purver, Matthew; Pollak, Senja. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.3, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
92 |
Multidirectional Associative Optimization of Function-Specific Word Representations
|
|
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Association for Computational Linguistics, 2020. : 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020
|
|
BASE
|
|
Show details
|
|
93 |
AdapterHub: A Framework for Adapting Transformers
|
|
Pfeiffer, Jonas; Ruckle, Andreas; Poth, Clifton. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), 2020
|
|
BASE
|
|
Show details
|
|
95 |
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
|
|
|
|
BASE
|
|
Show details
|
|
96 |
XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages
|
|
Glavas, Goran; Karan, Mladen; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.559, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
97 |
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
|
|
|
|
BASE
|
|
Show details
|
|
98 |
Specializing unsupervised pretraining models for word-level semantic similarity
|
|
|
|
BASE
|
|
Show details
|
|
99 |
Non-linear instance-based cross-lingual mapping for non-isomorphic embedding spaces
|
|
|
|
BASE
|
|
Show details
|
|
100 |
Classification-based self-learning for weakly supervised bilingual lexicon induction
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9 10
|
|