DE eng

Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Challenges and Strategies in Cross-Cultural NLP ...
BASE
Show details
2
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ...
BASE
Show details
3
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers ...
BASE
Show details
4
Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs ...
Abstract: Large-scale pretraining and task-specific fine-tuning is now the standard methodology for many tasks in computer vision and natural language processing. Recently, a multitude of methods have been proposed for pretraining vision and language BERTs to tackle challenges at the intersection of these two key areas of AI. These models can be categorised into either single-stream or dual-stream encoders. We study the differences between these two categories, and show how they can be unified under a single theoretical framework. We then conduct controlled experiments to discern the empirical differences between five V&L BERTs. Our experiments show that training data and hyperparameters are responsible for most of the differences between the reported results, but they also reveal that the embedding layer plays a crucial role in these massive models. ...
Keyword: Computational Linguistics; Language Models; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
URL: https://underline.io/lecture/38210-multimodal-pretraining-unmasked-a-meta-analysis-and-a-unified-framework-of-vision-and-language-berts
https://dx.doi.org/10.48448/0f6a-8189
BASE
Hide details
5
Multimodal pretraining unmasked: A meta-analysis and a unified framework of vision-and-language berts ...
BASE
Show details
6
Visually Grounded Reasoning across Languages and Cultures ...
BASE
Show details
7
On Language Models for Creoles ...
BASE
Show details
8
Visually Grounded Reasoning across Languages and Cultures ...
BASE
Show details
9
Multimodal pretraining unmasked: A meta-analysis and a unified framework of vision-and-language berts
In: Transactions of the Association for Computational Linguistics, 9 (2021)
BASE
Show details
10
The Role of Syntactic Planning in Compositional Image Captioning ...
BASE
Show details
11
On Language Models for Creoles ...
BASE
Show details
12
Visually Grounded Reasoning across Languages and Cultures ...
BASE
Show details
13
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information ...
BASE
Show details
14
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
14
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern