Page: 1 2 3 4 5 6 7 8... 690
61 |
Statistical and Spatio-temporal Hand Gesture Features for Sign Language Recognition using the Leap Motion Sensor ...
|
|
|
|
BASE
|
|
Show details
|
|
64 |
Giant Pigeon and Small Person: Prompting Visually Grounded Models about the Size of Objects ...
|
|
Zhang, Yi. - : Purdue University Graduate School, 2022
|
|
BASE
|
|
Show details
|
|
65 |
Giant Pigeon and Small Person: Prompting Visually Grounded Models about the Size of Objects ...
|
|
Zhang, Yi. - : Purdue University Graduate School, 2022
|
|
BASE
|
|
Show details
|
|
66 |
pNLP-Mixer: an Efficient all-MLP Architecture for Language ...
|
|
|
|
BASE
|
|
Show details
|
|
67 |
Multilingual Abusiveness Identification on Code-Mixed Social Media Text ...
|
|
|
|
BASE
|
|
Show details
|
|
68 |
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
69 |
StableMoE: Stable Routing Strategy for Mixture of Experts ...
|
|
|
|
BASE
|
|
Show details
|
|
70 |
BERTuit: Understanding Spanish language in Twitter through a native transformer ...
|
|
|
|
BASE
|
|
Show details
|
|
71 |
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification ...
|
|
|
|
BASE
|
|
Show details
|
|
73 |
Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
74 |
Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
75 |
Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised? ...
|
|
|
|
BASE
|
|
Show details
|
|
76 |
Assessment of Massively Multilingual Sentiment Classifiers ...
|
|
|
|
Abstract:
Models are increasing in size and complexity in the hunt for SOTA. But what if those 2\% increase in performance does not make a difference in a production use case? Maybe benefits from a smaller, faster model outweigh those slight performance gains. Also, equally good performance across languages in multilingual tasks is more important than SOTA results on a single one. We present the biggest, unified, multilingual collection of sentiment analysis datasets. We use these to assess 11 models and 80 high-quality sentiment datasets (out of 342 raw datasets collected) in 27 languages and included results on the internally annotated datasets. We deeply evaluate multiple setups, including fine-tuning transformer-based models for measuring performance. We compare results in numerous dimensions addressing the imbalance in both languages coverage and dataset sizes. Finally, we present some best practices for working with such a massive collection of datasets and models from a multilingual perspective. ... : Accepted for WASSA at ACL 2022 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://dx.doi.org/10.48550/arxiv.2204.04937 https://arxiv.org/abs/2204.04937
|
|
BASE
|
|
Hide details
|
|
78 |
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
79 |
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8... 690
|
|