3 |
Quality and Efficiency of Manual Annotation: Data from the Pre-annotation Bias Experiment (part of the PDT-C 2.0 project)
|
|
|
|
BASE
|
|
Show details
|
|
8 |
HaCzech: Dataset of Handwritten Czech
|
|
Procházka, Štěpán; Straka, Milan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
|
|
Abstract:
The dataset of handwritten Czech text lines, sourced from two chronicles (municipal chronicles 1931-1944, school chronicles 1913-1933). The dataset comprises 25k lines machine-extracted from scanned pages, and provides manual annotation of text contents for a subset of size 2k.
|
|
Keyword:
chronicles; handwriting; htr; manuscripts; ocr
|
|
URL: http://hdl.handle.net/11234/1-3739
|
|
BASE
|
|
Hide details
|
|
16 |
Czech HS Contracts Dataset (CHSC) 1.0
|
|
Szabó, Adam; Straka, Milan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
|
|
BASE
|
|
Show details
|
|
17 |
RobeCzech: Czech RoBERTa, a monolingual contextualized language representation model ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical Normalization by Fine-tuning ByT5 ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Morpho-syntactically annotated corpora provided for the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Slovak MorphoDiTa Models 170914
|
|
Straka, Milan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
|
|
BASE
|
|
Show details
|
|
|
|