Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Anastasopoulos, Antonios (4)
The 2021 Conference on Empirical Methods in Natural Language Processing 2021 (4)
Neubig, Graham (3)
Chaudhary, Aditi (2)
Rijhwani, Shruti (2)
Alam, Md Mahfuz Ibn (1)
Faisal, Fahim (1)
Keshava, Sharlina (1)
Mortensen, David R. (1)
Pratapa, Adithya (1)
more
Year
Medium
Type:
Article (4)
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 4 of 4
1
SD-QA: Spoken Dialectal Question Answering for the Real World ...
The 2021 Conference on Empirical Methods in Natural Language Processing 2021
;
Alam, Md Mahfuz Ibn
;
Anastasopoulos, Antonios
. - : Underline Science Inc., 2021
BASE
Show details
2
Evaluating the Morphosyntactic Well-formedness of Generated Texts ...
The 2021 Conference on Empirical Methods in Natural Language Processing 2021
;
Anastasopoulos, Antonios
;
Chaudhary, Aditi
. - : Underline Science Inc., 2021
BASE
Show details
3
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection ...
The 2021 Conference on Empirical Methods in Natural Language Processing 2021
;
Anastasopoulos, Antonios
;
Chaudhary, Aditi
. - : Underline Science Inc., 2021
BASE
Show details
4
Lexically-Aware Semi-Supervised Learning for OCR Post-Correction ...
The 2021 Conference on Empirical Methods in Natural Language Processing 2021
;
Anastasopoulos, Antonios
;
Neubig, Graham
;
Rijhwani, Shruti
;
Rosenblum, Daisy
. - : Underline Science Inc., 2021
Abstract:
Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents. Optical character recognition (OCR) can be used to produce digitized text, and previous work has demonstrated the utility of neural post-correction methods that improve the results of general-purpose OCR systems on recognition of less-well-resourced languages. However, these methods rely on manually curated post-correction data, which are relatively scarce compared to the non-annotated raw images that need to be digitized. In this paper, we present a semi-supervised learning method that makes it possible to utilize these raw images to improve performance, specifically through the use of self-training, a technique where a model is iteratively trained on its own outputs. In addition, to enforce consistency in the recognized vocabulary, we introduce a lexically aware decoding method that augments the neural post-correction model with a count-based language model constructed from the ...
Keyword:
Computational Linguistics
;
Machine Learning
;
Machine Learning and Data Mining
;
Natural Language Processing
URL:
https://dx.doi.org/10.48448/fycy-h885
https://underline.io/lecture/38192-lexically-aware-semi-supervised-learning-for-ocr-post-correction
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
4
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern