DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7...52
Hits 41 – 60 of 1.029

41
KOAS: Korean Text Offensiveness Analysis System ...
BASE
Show details
42
Contrastive Code Representation Learning ...
BASE
Show details
43
Does Putting a Linguist in the Loop Improve NLU Data Collection ...
BASE
Show details
44
What are we learning from language? ...
BASE
Show details
45
Machine Translation Decoding beyond Beam Search ...
BASE
Show details
46
Say `YES' to Positivity: Detecting Toxic Language in Workplace Communications ...
BASE
Show details
47
Unsupervised Multi-View Post-OCR Error Correction With Language Models ...
Abstract: Anthology paper link: https://aclanthology.org/2021.emnlp-main.680/ Abstract: We investigate post-OCR correction in a setting where we have access to different OCR views of the same document. The goal of this study is to understand if a pretrained language model (LM) can be used in an unsupervised way to reconcile the different OCR views such that their combination contains fewer errors than each individual view. This approach is motivated by scenarios in which unconstrained text generation for error correction is too risky. We evaluated different pretrained LMs on two datasets and found significant gains in realistic scenarios with up to 15% WER improvement over the best OCR view. We also show the importance of domain adaptation for post-OCR correction on out-of-domain documents. ...
Keyword: Computational Linguistics; Language Models; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
URL: https://dx.doi.org/10.48448/yhad-m366
https://underline.io/lecture/37406-unsupervised-multi-view-post-ocr-error-correction-with-language-models
BASE
Hide details
48
AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions ...
BASE
Show details
49
ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection ...
BASE
Show details
50
Multi-granularity Textual Adversarial Attack with Behavior Cloning ...
BASE
Show details
51
Automatic Fact-Checking with Document-level Annotations using BERT and Multiple Instance Learning ...
BASE
Show details
52
Towards the Early Detection of Child Predators in Chat Rooms: A BERT-based Approach ...
BASE
Show details
53
TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning ...
BASE
Show details
54
WebSRC: A Dataset for Web-Based Structural Reading Comprehension ...
BASE
Show details
55
Improving Math Word Problems with Pre-trained Knowledge and Hierarchical Reasoning ...
BASE
Show details
56
Semantic Categorization of Social Knowledge for Commonsense Question Answering ...
BASE
Show details
57
Adversarial Examples for Evaluating Math Word Problem Solvers ...
BASE
Show details
58
Pre-train or Annotate? Domain Adaptation with a Constrained Budget ...
BASE
Show details
59
Corpus-based Open-Domain Event Type Induction ...
BASE
Show details
60
Learning with Different Amounts of Annotation: From Zero to Many Labels ...
BASE
Show details

Page: 1 2 3 4 5 6 7...52

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1.029
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern