Page: 1 2 3 4 5 6 7 8 9 10
81 |
Spatial multi-arrangement for clustering and multi-way similarity dataset construction
|
|
Majewska, Olga; McCarthy, D; van den Bosch, J. - : European Language Resources Association, 2020. : LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings, 2020
|
|
BASE
|
|
Show details
|
|
82 |
Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis
|
|
Majewska, Olga; Vulic, Ivan; McCarthy, Diana; Korhonen, Anna. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.423, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
Abstract:
We present the first evaluation of the applicability of a spatial arrangement method (SpAM) to a typologically diverse language sample, and its potential to produce semantic evaluation resources to support multilingual NLP, with a focus on verb semantics. We demonstrate SpAM’s utility in allowing for quick bottom-up creation of large-scale evaluation datasets that balance cross-lingual alignment with language specificity. Starting from a shared sample of 825 English verbs, translated into Chinese, Japanese, Finnish, Polish, and Italian, we apply a two-phase annotation process which produces (i) semantic verb classes and (ii) fine-grained similarity scores for nearly 130 thousand verb pairs. We use the two types of verb data to (a) examine cross-lingual similarities and variation, and (b) evaluate the capacity of static and contextualised representation models to accurately reflect verb semantics, contrasting the performance of large language-specific pretraining models with their multilingual equivalent on semantic clustering and lexical similarity, across different domains of verb meaning. We release the data from both phases as a large-scale multilingual resource, comprising 85 verb classes and nearly 130k pairwise similarity scores, offering a wealth of possibilities for further evaluation and research on multilingual verb semantics.
|
|
URL: https://doi.org/10.17863/CAM.62213 https://www.repository.cam.ac.uk/handle/1810/315106
|
|
BASE
|
|
Hide details
|
|
84 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
|
|
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.2, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
85 |
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction
|
|
|
|
BASE
|
|
Show details
|
|
86 |
Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers
|
|
Glavas, Goran; Agic, Zeljko; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.345, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
87 |
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers
|
|
|
|
BASE
|
|
Show details
|
|
88 |
Emergent Communication Pretraining for Few-Shot Machine Translation
|
|
Vulic, Ivan; Ponti, Edoardo; Korhonen, Anna. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.416, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
90 |
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer
|
|
|
|
BASE
|
|
Show details
|
|
91 |
SemEval-2020 Task 3: Graded Word Similarity in Context
|
|
Santos Armendariz, Carlos; Purver, Matthew; Pollak, Senja. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.3, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
92 |
Multidirectional Associative Optimization of Function-Specific Word Representations
|
|
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Association for Computational Linguistics, 2020. : 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020
|
|
BASE
|
|
Show details
|
|
93 |
AdapterHub: A Framework for Adapting Transformers
|
|
Pfeiffer, Jonas; Ruckle, Andreas; Poth, Clifton. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), 2020
|
|
BASE
|
|
Show details
|
|
95 |
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
|
|
|
|
BASE
|
|
Show details
|
|
96 |
XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages
|
|
Glavas, Goran; Karan, Mladen; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.559, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
97 |
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
|
|
|
|
BASE
|
|
Show details
|
|
98 |
Specializing unsupervised pretraining models for word-level semantic similarity
|
|
|
|
BASE
|
|
Show details
|
|
99 |
Non-linear instance-based cross-lingual mapping for non-isomorphic embedding spaces
|
|
|
|
BASE
|
|
Show details
|
|
100 |
Classification-based self-learning for weakly supervised bilingual lexicon induction
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9 10
|
|