6 |
Penn Discourse Treebank Version 2.0 - German Translation ...
|
|
|
|
Abstract:
Introduction Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This data was translated into German and annotated for shallow discourse relations in the financial news domain. The aim of the University of Pennsylvania's Penn Discourse Treebank (PDTB) project is to annotate the Wall Street Journal text in Treebank-2 with discourse relations. PDTB 2.0 contains 40,600 tokens of annotation relations. PDTB2-German is based on a subset of PDTB2.0 used in the 2016 CoNLL Shared Task on Multilingual Shallow Discourse Parsing . Data Data is in CoNLL format. Text was automatically translated into German with deepL, and projections of the annotations using word alignments were produced with GIZA++. See the included documentation for more information on the relation annotations. Source text and CoNLL format annotations ...
|
|
URL: https://dx.doi.org/10.35111/x7qb-7h47 https://catalog.ldc.upenn.edu/LDC2021T05
|
|
BASE
|
|
Hide details
|
|
7 |
Shallow discourse parsing for German ; Shallow Discourse Parsing für Deutsch
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Exploiting a lexical resource for discourse connective disambiguation in German ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Curation Technologies for a Cultural Heritage Archive: "Project Tongilbu" ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Curation Technologies for a Cultural Heritage Archive: "Project Tongilbu" ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Towards the Automatic Classification of Offensive Language and Related Phenomena in German Tweets
|
|
|
|
In: http://hw.oeaw.ac.at/8435-5 (2018)
|
|
BASE
|
|
Show details
|
|
|
|