2 |
A Neighbourhood Framework for Resource-Lean Content Flagging ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models ...
|
|
|
|
Abstract:
While the prevalence of large pre-trained language models has led to significant improvements in the performance of NLP systems, recent research has demonstrated that these models inherit societal biases extant in natural language. In this paper, we explore a simple method to probe pre-trained language models for gender bias, which we use to effect a multi-lingual study of gender bias towards politicians. We construct a dataset of 250k politicians from most countries in the world and quantify adjective and verb usage around those politicians' names as a function of their gender. We conduct our study in 7 languages across 6 different language modeling architectures. Our results demonstrate that stance towards politicians in pre-trained language models is highly dependent on the language used. Finally, contrary to previous findings, our study suggests that larger language models do not tend to be significantly more gender-biased than smaller ones. ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning stat.ML
|
|
URL: https://dx.doi.org/10.48550/arxiv.2104.07505 https://arxiv.org/abs/2104.07505
|
|
BASE
|
|
Hide details
|
|
6 |
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Can Edge Probing Tasks Reveal Linguistic Knowledge in QA Models? ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs? ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Quantifying Gender Biases Towards Politicians on Reddit ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Semi-Supervised Exaggeration Detection of Health Science Press Releases ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Inducing Language-Agnostic Multilingual Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
SIGTYP 2020 Shared Task: Prediction of Typological Features ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|