DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 24

1
ParCzech 3.0
Kopp, Matyáš; Stankov, Vladislav; Bojar, Ondřej. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
2
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
BASE
Show details
3
ParCzech PS7 2.0
Hladká, Barbora; Kopp, Matyáš; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
BASE
Show details
4
ParCzech PS7 1.0
Hladká, Barbora; Kopp, Matyáš; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
BASE
Show details
5
Corpus for training and evaluating diacritics restoration systems
Náplava, Jakub; Straka, Milan; Hajič, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
BASE
Show details
6
Prague Dependency Treebank 3.5
Hajič, Jan; Bejček, Eduard; Bémová, Alevtina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
BASE
Show details
7
Public License Selector
Sedlák, Michal; Straňák, Pavel; Kamocki, Pawel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2015
BASE
Show details
8
Linguistic digital repository based on DSpace 5.2
Mišutka, Jozef; Kamran, Amir; Košarko, Ondřej. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2015
BASE
Show details
9
ROMi 1.0
Šebesta, Karel; Bedřichová, Zuzanna; Šormová, Kateřina. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
10
HindMonoCorp 0.5
Bojar, Ondřej; Diatka, Vojtěch; Rychlý, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
11
Linguistic digital repository based on DSpace
Pajas, Petr; Vandas, Karel; Mišutka, Jozef. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
12
HindEnCorp 0.5
Bojar, Ondřej; Diatka, Vojtěch; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
13
Prague Dependency Treebank 2.5
Bejček, Eduard; Hajič, Jan; Panevová, Jarmila. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2012
BASE
Show details
14
2009 CoNLL Shared Task Part 1
Hajič, Jan; Maria Antonia Martí; Marquez, Lluis; Nivre, Joakim; Štěpánek, Jan; Padó, Sebastian; Straňák, Pavel. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
Abstract: *Introduction* 2009 CoNLL Shared Task Part 1, LDC Catalog Number LDC2012T03 and ISBN 1-58563-610-X, contains the Catalan, Czech, German and Spanish trial corpora, training corpora, development and test data for the 2009 CoNLL (Conference on Computational Natural Language Learning) Shared Task Evaluation. The 2009 Shared Task developed syntactic dependency annotations, including the semantic dependencies model roles of both verbal and nominal predicates. The Conference on Computational Natural Language Learning (CoNLL) is accompanied every year by a shared task intended to promote natural language processing applications and evaluate them in a standard setting. The 2004 and 2005 CoNLL shared tasks were dedicated to semantic role labeling (SRL) in a monolingual setting (English). In 2006 and 2007, the shared tasks were devoted to the parsing of syntactic dependencies and used corpora from up to thirteen languages. In 2008, the shared task focused on English and employed a unified dependency-based formalism and merged the task of syntactic dependency parsing and the task of identifying semantic arguments and labeling them with semantic roles that data has been released by LDC as 2008 CoNLL Shared Task Data. The 2009 task extended the 2008 task to several languages (English plus Catalan, Chinese, Czech, German, Japanese and Spanish). Among the new features were comparison of time and space complexity based on participants input, and learning curve comparison for languages with large datasets. The 2009 shared task was divided into two subtasks: * parsing syntactic dependencies * identification of arguments and assignment of semantic roles for each predicate 2009 CoNLL Shared Task Part 2 (LDC2012T04) contains the English and Chinese task data and is also available through LDC. LDC has also released the following CoNLL Shared Task data sets: * 2006 CoNLL Shared Task - Ten Languages (LDC2015T11) * 2006 CoNLL Shared Task - Arabic & Czech (LDC2015T12) * 2008 CoNLL Shared Task Data (LDC2009T12) * 2015-2016 CoNLL Shared Task (LDC2017T13) *Data* The materials in this release consist of excerpts from the following corpora: * Ancora (Spanish + Catalan): 500,000 words each of annotated news text developed by the University of Barcelona, Polytechnic University of Catalonia, the University of Alacante and the University of the Basque Country * Prague Dependency Treebank 2.0 (Czech): approximately 2 million words of annotated news, journal and magazine text developed by Charles University also available through LDC, LDC2006T01 * TIGER Treebank + SALSA Corpus (German): approximately 900,000 words of annotated news text and FrameNet annotation developed by the University of Potsdam, Saarland University and the University of Stuttgart In addition, an archive of all of the uploaded data from the participants is included in the eval-data folder. Users should note that not all data indicated in the individual READMEs is included in this release and neither are some of the corresponding DTDs for of the XML. Additionally, all data is presented in its uncompressed form for ease of use. Within the user eval-data folder, the two folders marked bad contain references to data from languages included in Part 2 of this release as well as to Japanese data. Japanese data is not included in this release. *Samples* For samples of documents from each language use the links below: * Catalan * Czech * German * Spanish *Updates* None at this time.
URL: https://catalog.ldc.upenn.edu/LDC2012T03
BASE
Hide details
15
2009 CoNLL Shared Task Part 2
Hajič, Jan; Ciaramita, Massimiliano; Johansson, Richard. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
BASE
Show details
16
Hindi Web Texts
Bojar, Ondřej; Straňák, Pavel; Zeman, Daniel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2011
BASE
Show details
17
English-Hindi Parallel Corpus
Bojar, Ondřej; Straňák, Pavel; Zeman, Daniel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2011
BASE
Show details
18
Czech WordNet 1.9 PDT
Pala, Karel; Čapek, Tomáš; Zajíčková, Barbora. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2011
BASE
Show details
19
Lexico-Semantic Annotation of PDT using Czech WordNet
Bejček, Eduard; Hoffmannová, Petra; Holub, Martin. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2011
BASE
Show details
20
CoNLL 2009 Shared Task Czech Trial Set
Hajič, Jan; Straňák, Pavel; Štěpánek, Jan. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2011
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
22
0
1
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern