DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 26

1
Coreference in Universal Dependencies 1.0 (CorefUD 1.0)
Abstract: CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.0 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 1.0 consists of the same corpora and languages as the previous version 0.2; however, the English GUM dataset has been updated to a newer and larger version, and in the Czech/English PCEDT dataset, the train-dev-test split has been changed to be compatible with OntoNotes. Nevertheless, the main change is in the file format (the MISC attributes have new form and interpretation).
Keyword: bridging relations; coreference; dependency; harmonized annotation; treebank
URL: http://hdl.handle.net/11234/1-4698
BASE
Hide details
2
Quality and Efficiency of Manual Annotation: Data from the Pre-annotation Bias Experiment (part of the PDT-C 2.0 project)
Mikulová, Marie; Straka, Milan; Štěpánek, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2022
BASE
Show details
3
MorfFlex CZ 2.0
Hajič, Jan; Hlaváčová, Jaroslava; Mikulová, Marie. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
4
FAUST cs-en 0.5
Hajič, Jan; Mareček, David; Fučíková, Eva. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
5
PDT-Vallex: Czech Valency lexicon linked to treebanks 4.0 (PDT-Vallex 4.0)
Urešová, Zdeňka; Bémová, Alevtina; Fučíková, Eva. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
6
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
7
Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0)
Mikulová, Marie; Bémová, Alevtina; Hajič, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
8
FAUST 0.5
Hajič, Jan; Mareček, David; Fučíková, Eva. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
9
Prague Dependency Treebank -- Consolidated 1.0 ...
BASE
Show details
10
Prague Dependency Treebank 3.5
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
BASE
Show details
11
Search for the Relation of Form and Function Using the ForFun Database
In: Prague Bulletin of Mathematical Linguistics , Vol 110, Iss 1, Pp 71-84 (2018) (2018)
BASE
Show details
12
Prague DaTabase of Spoken Czech 1.0
Hajič, Jan; Pajas, Petr; Ircing, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2017. : University of West Bohemia, 2017
BASE
Show details
13
ForFun 1.0
Mikulová, Marie; Bejček, Eduard. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2017
BASE
Show details
14
Difference between Written and Spoken Czech: The Case of Verbal Nouns Denoting an Action
In: Prague Bulletin of Mathematical Linguistics , Vol 107, Iss 1, Pp 19-38 (2017) (2017)
BASE
Show details
15
Prague Czech-English Dependency Treebank 2.0 Coref
Nedoluzhko, Anna; Novák, Michal; Cinková, Silvie. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2016
BASE
Show details
16
PDT-Vallex: Czech Valency lexicon linked to treebanks
Urešová, Zdeňka; Štěpánek, Jan; Hajič, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
17
Prague Dependency Treebank 3.0
Bejček, Eduard; Hajičová, Eva; Hajič, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
18
Liší se mluvené a psané texty ve valenci?
In: Korpus, gramatika, axiologie. - Hradec Králové : Univerzita Hradec Králové 8 (2013), 37-46
BLLDB
Show details
19
Prague Czech-English Dependency Treebank 2.0
Hajič, Jan; Hajičová, Eva; Panevová, Jarmila. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2013
BASE
Show details
20
Prague Czech-English Dependency Treebank 2.0
Hajič, Jan; Hajičová, Eva; Panevová, Jarmila. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
BASE
Show details

Page: 1 2

Catalogues
0
0
1
0
0
0
0
Bibliographies
2
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
24
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern