1 |
USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Towards Explainable Evaluation Metrics for Natural Language Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
End-to-end style-conditioned poetry generation: What does it take to learn from examples alone? ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Inducing Language-Agnostic Multilingual Representations ...
|
|
|
|
Abstract:
Cross-lingual representations have the potential to make NLP techniques available to the vast majority of languages in the world. However, they currently require large pretraining corpora or access to typologically similar languages. In this work, we address these obstacles by removing language identity signals from multilingual embeddings. We examine three approaches for this: (i) re-aligning the vector spaces of target languages (all together) to a pivot source language; (ii) removing language-specific means and variances, which yields better discriminativeness of embeddings as a by-product; and (iii) increasing input similarity across languages by removing morphological contractions and sentence reordering. We evaluate on XNLI and reference-free MT across 19 typologically diverse languages. Our findings expose the limitations of these approaches -- unlike vector normalization, vector space re-alignment and text normalization do not achieve consistent gains across encoders and languages. Due to the ... : *SEM2021 Camera Ready ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2008.09112 https://arxiv.org/abs/2008.09112
|
|
BASE
|
|
Hide details
|
|
11 |
Probing Multilingual BERT for Genetic and Typological Signals ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
How to Probe Sentence Embeddings in Low-Resource Languages: On Structural Design Choices for Probing Task Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Vec2Sent: Probing Sentence Embeddings With Natural Language Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
On the limitations of cross-lingual encoders as exposed by reference-free machine translation evaluation
|
|
|
|
BASE
|
|
Show details
|
|
17 |
On aligning OpenIE extractions with Knowledge Bases: A case study
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Semantic Change and Emerging Tropes In a Large Corpus of New High German Poetry ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Cross-lingual Argumentation Mining: Machine Translation (and a bit of Projection) is All You Need! ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
What is the Essence of a Claim? Cross-Domain Claim Identification ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|