DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...29
Hits 1 – 20 of 566

1
Coreference in Universal Dependencies 1.0 (CorefUD 1.0)
Abstract: CorefUD is a collection of previously existing datasets annotated with coreference, which we converted into a common annotation scheme. In total, CorefUD in its current version 1.0 consists of 17 datasets for 11 languages. The datasets are enriched with automatic morphological and syntactic annotations that are fully compliant with the standards of the Universal Dependencies project. All the datasets are stored in the CoNLL-U format, with coreference- and bridging-specific information captured by attribute-value pairs located in the MISC column. The collection is divided into a public edition and a non-public (ÚFAL-internal) edition. The publicly available edition is distributed via LINDAT-CLARIAH-CZ and contains 13 datasets for 10 languages (1 dataset for Catalan, 2 for Czech, 2 for English, 1 for French, 2 for German, 1 for Hungarian, 1 for Lithuanian, 1 for Polish, 1 for Russian, and 1 for Spanish), excluding the test data. The non-public edition is available internally to ÚFAL members and contains additional 4 datasets for 2 languages (1 dataset for Dutch, and 3 for English), which we are not allowed to distribute due to their original license limitations. It also contains the test data portions for all datasets. When using any of the harmonized datasets, please get acquainted with its license (placed in the same directory as the data) and cite the original data resource too. Version 1.0 consists of the same corpora and languages as the previous version 0.2; however, the English GUM dataset has been updated to a newer and larger version, and in the Czech/English PCEDT dataset, the train-dev-test split has been changed to be compatible with OntoNotes. Nevertheless, the main change is in the file format (the MISC attributes have new form and interpretation).
Keyword: bridging relations; coreference; dependency; harmonized annotation; treebank
URL: http://hdl.handle.net/11234/1-4698
BASE
Hide details
2
A morph-based and a word-based treebank for Beja
In: SyntaxFest ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories ; https://hal.archives-ouvertes.fr/hal-03494462 ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories, Mar 2022, Sofia, Bulgaria (2022)
BASE
Show details
3
Generación de flexión morfológica con UniMorph.: Evaluación con base de datos relacional y pautas de entrenamiento
In: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 68, 2022, pags. 61-70 (2022)
BASE
Show details
4
Corpus linguistics for education : a guide for research
Pérez-Paredes, Pascual. - New York : Routledge, 2021. Abingdon, Oxon : Routledge, Taylor & Francis Group, 2021
UB Frankfurt Linguistik
Show details
5
Corpus linguistics for education : a guide for research
Pérez-Paredes, Pascual. - New York : Routledge, 2021
BLLDB
UB Frankfurt Linguistik
Show details
6
Statistics in corpus linguistics : a new approach
Wallis, Sean. - London : Routledge, 2021
BLLDB
UB Frankfurt Linguistik
Show details
7
Shallow discourse parsing for German
Stede, Manfred (Akademischer Betreuer); Bourgonje, Peter; Kosseim, Leila (Akademischer Betreuer). - Potsdam, 2021
BLLDB
UB Frankfurt Linguistik
Show details
8
Unifying dimensions in coherence relations: how various annotation frameworks are related
In: Corpus linguistics and linguistic theory. - Berlin ; New York : Mouton de Gruyter 17 (2021) 1, 1-71
BLLDB
Show details
9
A morph-based and a word-based treebank for Beja
In: SyntaxFest ; https://hal.archives-ouvertes.fr/hal-03494462 ; SyntaxFest, In press (2021)
BASE
Show details
10
Old Catalan Morphosyntax: developing an annotated corpus
In: EISSN: 2059-481X ; Journal of Open Humanities Data ; https://hal.archives-ouvertes.fr/hal-03617737 ; Journal of Open Humanities Data, Ubiquity Press, 2021, 7, pp.30. ⟨10.5334/johd.54⟩ (2021)
BASE
Show details
11
Das Referenzkorpus Mittelhochdeutsch: Nutzungsmöglichkeiten für morphologische Untersuchungen
In: Historische Wortbildung. - Hildesheim : Georg Olms Verlag (2021), 145-186
BLLDB
Show details
12
The word as a unit of internal predictability
In: Linguistics. - Berlin [u.a.] : Mouton de Gruyter 59 (2021) 6, 1427-1472
BLLDB
Show details
13
Deep Sequoia corpus - PARSEME-FR corpus - FrSemCor
BASE
Show details
14
Coreference in Universal Dependencies 0.2 (CorefUD 0.2)
Nedoluzhko, Anna; Novák, Michal; Popel, Martin. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
15
IWPT 2021 Shared Task Data and System Outputs
Zeman, Daniel; Bouma, Gosse; Seddah, Djamé. - : Universal Dependencies Consortium, 2021
BASE
Show details
16
Universal Dependencies 2.9
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details
17
Universal Dependencies 2.8.1
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details
18
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
19
FAUST 0.5
Hajič, Jan; Mareček, David; Fučíková, Eva. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
20
Universal Dependencies 2.8
Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell. - : Universal Dependencies Consortium, 2021
BASE
Show details

Page: 1 2 3 4 5...29

Catalogues
54
0
72
0
0
2
0
Bibliographies
312
1
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
0
Open access documents
248
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern