1 |
DiaCollo für GEI-Digital - Ein experimentelles Projekt zur weiteren Erschließung digitalisierter historischer Schulbuchbestände ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
DiaCollo für GEI-Digital - Ein experimentelles Projekt zur weiteren Erschließung digitalisierter historischer Schulbuchbestände ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Who has ears, listen: Citizen Listening Program for disease prevention. ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Who has ears, listen: Citizen Listening Program for disease prevention. ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Pemanfaatan Bank-data Digital Dwibahasa dalam Kajian Terjemahan: Studi kasus padanan bahasa Indonesia untuk verba sinonim bahasa Inggris ROB & STEAL ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Pemanfaatan Bank-data Digital Dwibahasa dalam Kajian Terjemahan: Studi kasus padanan bahasa Indonesia untuk verba sinonim bahasa Inggris ROB & STEAL ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Правила генерации глагольных словоформ для новописьменного варианта ливвиковского наречия ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Правила генерации глагольных словоформ для новописьменного варианта ливвиковского наречия ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Правила генерации глагольных словоформ для новописьменного варианта ливвиковского наречия ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Правила генерации глагольных словоформ для новописьменного варианта ливвиковского наречия ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Lexica corpus (v2.0) ...
|
|
|
|
Abstract:
Second release of the lexica corpus: a corpus for German text simplification, total size now 3270 files. The corpus consists of approximately 3300 texts from three Wiki-based lexica in German language: MiniKlexikon, Klexikon and Wikipedia. The articles in the Wikis are created by volunteers and can be written, discussed, and improved upon collaboratively. Klexikon is aimed specifically at children aged between 6 and 12 and MiniKlexikon is designed for children who are beginner readers, and is therefore an even simpler version of the Klexikon. We make the assumption that the three different sub-corpora represent three different levels of conceptual complexity due to the target groups they are written for: younger children, children and adults. As Wikipedia articles can be extremely long, in comparison to the other two lexica, only the introduction or abstract was taken for this corpus. This repository contains the corpora from the original study (295 texts per sub-corpus in the orig_files folder), extended ...
|
|
Keyword:
automatic text simplification; computational linguistics; text complexity; text simplification
|
|
URL: https://dx.doi.org/10.5281/zenodo.5196029 https://zenodo.org/record/5196029
|
|
BASE
|
|
Hide details
|
|
|
|