2 |
A Neighbourhood Framework for Resource-Lean Content Flagging ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension ...
|
|
|
|
Abstract:
Alongside huge volumes of research on deep learning models in NLP in the recent years, there has been also much work on benchmark datasets needed to track modeling progress. Question answering and reading comprehension have been particularly prolific in this regard, with over 80 new datasets appearing in the past two years. This study is the largest survey of the field to date. We provide an overview of the various formats and domains of the current resources, highlighting the current lacunae for future work. We further discuss the current classifications of ``reasoning types" in question answering and propose a new taxonomy. We also discuss the implications of over-focusing on English, and survey the current monolingual resources for other languages and multilingual resources. The study is aimed at both practitioners looking for pointers to the wealth of existing data, and at researchers working on new resources. ... : Under review ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2107.12708 https://dx.doi.org/10.48550/arxiv.2107.12708
|
|
BASE
|
|
Hide details
|
|
5 |
Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Can Edge Probing Tasks Reveal Linguistic Knowledge in QA Models? ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs? ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Quantifying Gender Biases Towards Politicians on Reddit ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Semi-Supervised Exaggeration Detection of Health Science Press Releases ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Inducing Language-Agnostic Multilingual Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
SIGTYP 2020 Shared Task: Prediction of Typological Features ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|