DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
What's New in EuReCo? Interoperability, Comparable Corpora, Licensing
Kupietz, Marc [Verfasser]; Margaretha, Eliza [Verfasser]; Diewald, Nils [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
2
The Vast and the Focused: On the need for domain-focused web corpora
Barbaresi, Adrien [Verfasser]; Bański, Piotr [Herausgeber]; Barbaresi, Adrien [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
3
Types and annotation of reply relations in computer-mediated communication
Lüngen, Harald [Verfasser]; Herzberg, Laura [Verfasser]. - Mannheim : Universitätsbibliothek Mannheim, 2019
DNB Subject Category Language
Show details
4
Asynchronous pipelines for processing huge corpora on medium to low resource infrastructures
Ortiz Suárez, Pedro Javier [Verfasser]; Sagot, Benoît [Verfasser]; Romary, Laurent [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
5
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-7) 2019. Cardiff, 22 July 2019
Bański, Piotr [Herausgeber]; Barbaresi, Adrien [Herausgeber]; Biber, Hanno [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
6
Modelling large parallel corpora. The Zurich Parallel Corpus Collection
Graën, Johannes [Verfasser]; Kew, Tannon [Verfasser]; Shaitarova, Anastassia [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
7
Deduplication in large web corpora
Benko, Vladimír [Verfasser]; Bański, Piotr [Herausgeber]; Barbaresi, Adrien [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
8
cmc-core: a basic schema for encoding CMC corpora in TEI
Lüngen, Harald [Verfasser]; Wigham, Ciara R. [Verfasser]; Marinica, Claudia [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
9
The best of both worlds: Multi-billion word “dynamic” corpora
Lüngen, Harald [Herausgeber]; Breiteneder, Evelyn [Herausgeber]; Barbaresi, Adrien [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
10
Types and annotation of reply relations in computer-mediated communication
Herzberg, Laura [Verfasser]; Lüngen, Harald [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
DNB Subject Category Language
Show details
11
Datenübernahmerichtlinien des Leibniz-Instituts für Deutsche Sprache
Schmidt, Thomas [Verfasser]; Witt, Andreas [Verfasser]; Arnold, Denis [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2019
DNB Subject Category Language
Show details
12
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection
In: Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 July 2019 - 22 July 2019. (2019)
Abstract: Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.
Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
URL: https://doi.org/10.14618/ids-pub-9020
https://www.zora.uzh.ch/id/eprint/175081/
https://doi.org/10.5167/uzh-175081
https://www.zora.uzh.ch/id/eprint/175081/1/Graen_Kew_Shaitarova_Volk_2019.pdf
BASE
Hide details
13
Types and annotation of reply relations in computer-mediated communication
Lüngen, Harald; Herzberg, Laura. - : de Gruyter, 2019
BASE
Show details

Catalogues
0
0
0
0
11
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern