4 |
Prague Dependency Treebank 3.5
|
|
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina; Buráňová, Eva; Hajičová, Eva; Havelka, Jiří; Homola, Petr; Kárník, Jiří; Kettnerová, Václava; Klyueva, Natalia; Kolářová, Veronika; Kučová, Lucie; Lopatková, Markéta; Mikulová, Marie; Mírovský, Jiří; Nedoluzhko, Anna; Pajas, Petr; Panevová, Jarmila; Poláková, Lucie; Rysová, Magdaléna; Sgall, Petr; Spoustová, Johanka; Straňák, Pavel; Synková, Pavlína; Ševčíková, Magda; Štěpánek, Jan; Urešová, Zdeňka; Vidová Hladká, Barbora; Zeman, Daniel; Zikánová, Šárka; Žabokrtský, Zdeněk. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
|
|
Abstract:
The Prague Dependency Treebank 3.5 (PDT 3.5) is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made on the original texts at the Institute of Formal and Applied Linguistics under various projects between 1996 and 2018, i.e., all annotation from PDT 1.0, PDT 2.0, PDT 2.5, PDT 3.0, PDiT 1.0 and PDiT 2.0, plus corrections, a new structure of the basic documentation and a new list of authors covering all previous editions. PDT 3.5 contains the same texts as the previous versions since 2.0; there are 49,431 sentences (832,823 words) annotated on all layers, from tectogrammatical annotation to syntax to morphology. Additional sentences are annotated for syntax and morphology only; the totals for the lower layers of annotation are 87,913 sentences with 1,502,976 words at the analytical layer (surface dependency syntax) and 115,844 sentences with 1,956,693 words at the morphological layer (these totals include the sentences that are also annotated at the higher layers). Closely linked to the tectogrammatical layer is the annotation of sentence information structure, multiword expressions, coreference, bridging relations and discourse relations.
|
|
Keywords:
bridging relations; clauses; coreference; dependency; discourse; lemmatization; lexical semantics; lexicon; morphology; multiword expressions; semantic relations; semantics; syntax; tectogrammatics; tokenization; topic-focus articulation; treebank
|
|
URL: http://hdl.handle.net/11234/1-2621
|
|
BASE
|
|
5 |
Enriching VALLEX with Light Verbs: From Theory to Data and Back Again
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics, Vol. 111, Iss. 1, pp. 29-56 (2018)
|
|
BASE
|
|
7 |
Reflexive Verbs in a Valency Lexicon: The Case of Czech Reflexive Morphemes
|
|
|
|
In: Proceedings of the 16th EURALEX International Congress: The User in Focus, Bolzano/Bozen, Italy, 15-19 July 2014 (2014), 1007-1023
|
|
IDS OBELEX meta
|
|
14 |
Valenční slovník českých sloves
|
|
|
|
MPI-SHH Linguistik
|
|
15 |
Studies in formal Slavic linguistics: contributions from Formal Description of Slavic Languages 6.5, held at the University of Nova Gorica, December 1-3, 2006
|
|
|
|
BLLDB
|
|
UB Frankfurt Linguistik
|
|