1 |
Das ZDL-Regionalkorpus: Ein Korpus für die lexikografische Beschreibung der diatopischen Variation im Standarddeutschen
|
|
|
|
IDS Mannheim
|
|
2 |
A Reproducible IT-Blog Corpus
|
|
|
|
In: Journal of Open Humanities Data, Vol. 7 (2021), 17; ISSN 2059-481X
|
|
BASE
|
|
|
|
3 |
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event) ...
|
|
|
|
BASE
|
|
|
|
4 |
Trafilatura: A Web Scraping Library and Command-Line Tool for Text Discovery and Extraction ...
|
|
|
|
BASE
|
|
|
|
7 |
Proceedings of the LREC 2020 Workshop, Language Resources and Evaluation Conference, 11–16 May 2020, 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8)
|
|
|
|
DNB Subject Category Language
|
|
|
|
9 |
Out-of-the-Box and Into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools
|
|
|
|
In: Language Resources and Evaluation Conference (LREC 2020), 2020, pp. 5-13 ; https://hal.archives-ouvertes.fr/hal-02732851
|
|
BASE
|
|
|
|
10 |
htmldate: A Python package to extract publication dates from web pages ...
|
|
|
|
BASE
|
|
|
|
11 |
Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8)
|
|
In: Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8). Edited by: Bański, Piotr; Barbaresi, Adrien; Clematide, Simon; Kupietz, Marc; Lüngen, Harald; Pisetta, Ines. Marseille, France: European Language Resources Association (2020)
|
|
BASE
|
|
|
|
19 |
Diving Into The Complexities Of The Tech Blog Sphere
|
|
|
|
In: Digital Humanities 2019, ADHO, Jul 2019, Utrecht, Netherlands ; https://hal.archives-ouvertes.fr/hal-02201532 ; https://dev.clariah.nl/files/dh2019/boa/0964.html
|
|
BASE
|
|
|
|
20 |
German Political Speeches Corpus ...
|
|
|
|
Abstract:
This text archive focuses on German political speeches held by top officials, mostly from 1990 onwards, selected according to their political relevance. The currently included speeches come from the following sources: official pages of the German Presidency, Chancellery, Bundestag, and Ministry of Foreign Affairs; personal pages of the Helmut Kohl archive, Wolfgang Thierse, and Norbert Lammert. This resource is available online: online queries on the DWDS website and usage instructions (the text base may be newer than the downloadable archives); http://purl.org/corpus/german-speeches. The files below consist of texts with metadata encoded in XML format. For appropriate tooling, see: a Python tutorial using the speeches, "Natural Language Processing — Einsteigen und Loslegen!"; CorpusExplorer, corpus linguistics and text mining software featuring the speeches; and a list of off-the-shelf NLP tools for German. This is work in progress; updated and extended versions will follow. ...
|
|
Keyword:
corpus linguistics; natural language processing; political science; political speeches
|
|
URL: https://zenodo.org/record/3611245 https://dx.doi.org/10.5281/zenodo.3611245
|
|
BASE
|
|
|
|
|
|