2 |
On Cross-Lingual Retrieval with Multilingual Text Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Come hither or go away? Recognising pre-electoral coalition signals in the news ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Evaluating multilingual text encoders for unsupervised cross-lingual retrieval
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Come hither or go away? Recognising pre-electoral coalition signals in the news
|
|
|
|
BASE
|
|
Show details
|
|
7 |
AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings ...
|
|
|
|
Abstract:
Recent work has shown that distributional word vector spaces often encode human biases like sexism or racism. In this work, we conduct an extensive analysis of biases in Arabic word embeddings by applying a range of recently introduced bias tests on a variety of embedding spaces induced from corpora in Arabic. We measure the presence of biases across several dimensions, namely: embedding models (Skip-Gram, CBOW, and FastText) and vector sizes, types of text (encyclopedic text, and news vs. user-generated content), dialects (Egyptian Arabic vs. Modern Standard Arabic), and time (diachronic analyses over corpora from different time periods). Our analysis yields several interesting findings, e.g., that implicit gender bias in embeddings trained on Arabic news corpora steadily increases over time (between 2007 and 2017). We make the Arabic bias specifications (AraWEAT) publicly available. ... : accepted for WANLP 20 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2011.01575 https://arxiv.org/abs/2011.01575
|
|
BASE
|
|
Hide details
|
|
8 |
Word Sense Disambiguation for 158 Languages using Word Embeddings Only ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
|
|
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.2, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
12 |
A Twitter Political Corpus of the 2019 10N Spanish Election
|
|
|
|
BASE
|
|
Show details
|
|
13 |
AraWEAT: Multidimensional analysis of biases in Arabic word embeddings
|
|
|
|
BASE
|
|
Show details
|
|
14 |
SemEval-2020 Task 2: Predicting multilingual and cross-lingual (graded) lexical entailment
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Datasets for Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Datasets for Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
HHMM at SemEval-2019 Task 2: Unsupervised frame induction using contextualized word embeddings
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Policy preference detection in parliamentary debate motions
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Watset: Local-global graph clustering with applications in sense and frame induction
|
|
|
|
BASE
|
|
Show details
|
|
|
|