1 |
C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models ...
|
|
|
|
Abstract:
The advent of transformer-based models such as BERT has led to the rise of neural ranking models. These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25. While monolingual retrieval tasks have benefited from large-scale training collections such as MS MARCO and advances in neural architectures, cross-language retrieval tasks have fallen behind these advancements. This paper introduces ColBERT-X, a generalization of the ColBERT multi-representation dense retrieval model that uses the XLM-RoBERTa (XLM-R) encoder to support cross-language information retrieval (CLIR). ColBERT-X can be trained in two ways. In zero-shot training, the system is trained on the English MS MARCO collection, relying on the XLM-R encoder for cross-language mappings. In translate-train, the system is trained on the MS MARCO English queries coupled with machine translations of the associated MS MARCO passages. Results on ad hoc document ranking tasks in several ... : Accepted at ECIR 2022 (Full paper) ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Information Retrieval cs.IR
|
|
URL: https://arxiv.org/abs/2201.08471 https://dx.doi.org/10.48550/arxiv.2201.08471
|
|
BASE
|
|
Hide details
|
|
3 |
The Multilingual TEDx Corpus for Speech Recognition and Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
An Information Retrieval Test Collection for English SMS Conversations
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Microblogging Temporal Summarization: Filtering Important Twitter Updates for Breaking News
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Frontiers, Challenges, and Opportunities for Information Retrieval – Report from SWIRL 2012, The Second Strategic Workshop on Information Retrieval in Lorne
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Formative Evaluation for Multilingual Multimedia Search and Sense-Making
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Advances in Multilingual and Multimodal Information Retrieval : 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers
|
|
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
11 |
Combining Evidence from Unconstrained Spoken Term Frequency Estimation for Improved Speech Retrieval
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Classifying Attitude by Topic Aspect for English and Chinese Document Collections
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Overview of the CLEF-2006 cross-language speech retrieval track
|
|
|
|
In: Oard, Douglas W., Wang, Jianqiang, Jones, Gareth J.F. orcid:0000-0003-2923-8365 , White, Ryen W., Pecina, Pavel, Soergel, Dagobert, Huang, Xiaoli and Shafran, Izhak (2007) Overview of the CLEF-2006 cross-language speech retrieval track. In: CLEF 2006: Workshop on Cross-Language Information Retrieval and Evaluation, 20-22 Sept. 2006, Alicante, Spain. (2007)
|
|
BASE
|
|
Show details
|
|
14 |
Investigating cross-language speech retrieval for a spontaneous conversational speech collection
|
|
|
|
In: Inkpen, Diana, Alzghool, Muath, Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Oard, Douglas W. (2006) Investigating cross-language speech retrieval for a spontaneous conversational speech collection. In: HLT-NAACL 2006 - The Human Language Technology Conference - North American Chapter of the Association for Computational Linguistics Annual Meeting, 8-9 June 2006, New York, USA. (2006)
|
|
BASE
|
|
Show details
|
|
15 |
The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
16 |
TREC-9 Experiments at Maryland: Interactive CLIR
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
17 |
COMPLEX QUESTION ANSWERING BASED ON A SEMANTIC DOMAIN MODEL OF CLINICAL MEDICINE
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Comparing User-Assisted and Automatic Query Translation
|
|
|
|
In: DTIC AND NTIS (2005)
|
|
BASE
|
|
Show details
|
|
|
|