Hits 1 – 20 of 31

1	What's New in EuReCo? Interoperability, Comparable Corpora, Licensing
	Kupietz, Marc [author]; Margaretha, Eliza [author]; Diewald, Nils [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

2	The Vast and the Focused: On the need for domain-focused web corpora
	Barbaresi, Adrien [author]; Bański, Piotr [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

3	Asynchronous pipelines for processing huge corpora on medium to low resource infrastructures
	Ortiz Suárez, Pedro Javier [author]; Sagot, Benoît [author]; Romary, Laurent [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

4	Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-7) 2019. Cardiff, 22 July 2019
	Bański, Piotr [editor]; Barbaresi, Adrien [editor]; Biber, Hanno [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

5	Modelling large parallel corpora. The Zurich Parallel Corpus Collection
	Graën, Johannes [author]; Kew, Tannon [author]; Shaitarova, Anastassia [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

6	Deduplication in large web corpora
	Benko, Vladimír [author]; Bański, Piotr [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

7	The best of both worlds: Multi-billion word “dynamic” corpora
	Lüngen, Harald [editor]; Breiteneder, Evelyn [editor]; Barbaresi, Adrien [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2019
	DNB Subject Category Language

8	Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection
	Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia...
	In: Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 July 2019.
	BASE

9	Proceedings of the LREC 2018 Workshop “Challenges in the Management of Large Corpora (CMLC-6)” 07 May 2018 – Miyazaki, Japan
	Bański, Piotr [editor]; Kupietz, Marc [editor]; Barbaresi, Adrien [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2018
	DNB Subject Category Language

10	How to get the computation near the data: improving data accessibility to, and reusability of analysis functions in corpus query platforms
	Kupietz, Marc [author]; Diewald, Nils [author]; Fankhauser, Peter [author]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2018
	DNB Subject Category Language

11	Increasing Interoperability for Embedding Corpus Annotation Pipelines in Wmatrix and other corpus retrieval tools
	Rayson, Paul Edward. - 2018
	BASE

12	Challenges in the Management of Large Corpora (CMLC-6)
	In: Challenges in the Management of Large Corpora (CMLC-6). Edited by: Banski, Piotr; Kupietz, Marc; Barbaresi, Adrien; Biber, Hanno; Breiteneder, Evelyn; Clematide, Simon; Witt, Andreas (2018). Paris: European Language Resources Association (ELRA). (2018)
	Abstract: Large corpora require careful design, licensing, collecting, cleaning, encoding, annotation, management, storage, retrieval, analysis, and curation to unfold their potential for a wide range of research questions and users, across a number of disciplines. Apart from the usual CMLC topics that fall into these areas, the 6th edition of the CMLC workshop features a special focus on corpus query and analysis systems and specifically on goals concerning their interoperability. In the past 5 years, a whole new generation of corpus query engines that overcome limitations on the number of tokens and annotation layers has started to emerge at several research centers. While there seems to be a consensus that there can be no single corpus tool that fulfills the need of all communities and that a degree of heterogeneity is required, the time seems ripe to discuss whether (further, unrestricted) divergence should be avoided in order to allow for some interoperability and reusability – and how this can be achieved. The two most prominent areas where interoperability seems highly desirable are query languages and software components for corpus analysis. The former issue is already partially addressed by the proposed ISO standard Corpus Query Lingua Franca (CQLF). Components for corpus analysis and further processing of results (e.g. for visualization), on the other hand, should in an ideal world be exchangeable and reusable across different platforms, not only to avoid redundancies, but also to foster replicability and a canonization of methodology in NLP and corpus linguistics. The 6th edition of the workshop is meant to address these issues, notably by including an expert panel discussion with representatives of tool development teams and power users.
	Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
	URL: https://www.zora.uzh.ch/id/eprint/162636/1/BanskiKupietz2018.pdf https://doi.org/10.5167/uzh-162636 http://lrec-conf.org/workshops/lrec2018/W17/index.html https://www.zora.uzh.ch/id/eprint/162636/
	BASE

13	Accelerating corpus search using multiple cores
	Rábara, Radoslav [author]; Rychlý, Pavel [author]; Herman, Ondřej [author]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

14	Are web corpora inferior? The Case of Czech and Slovak
	Benko, Vladimír [author]; Bański, Piotr [editor]; Kupietz, Marc [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

15	Creating CorCenCC (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh)
	Knight, Dawn [author]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

16	CMC Corpora in DeReKo
	Lüngen, Harald [author] [editor]; Kupietz, Marc [author] [editor]; Bański, Piotr [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

17	From ICE to ICC: The new International Comparable Corpus
	Kirk, John [author]; Čermáková, Anna [author]; Bański, Piotr [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

18	Intra-connecting an exemplary literary corpus with semantic web technologies for exploratory literary studies
	Dittrich, Andreas [author]; Bański, Piotr [editor]; Kupietz, Marc [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

19	Keeping Properties with the Data CL-MetaHeaders - An Open Specification
	Vidler, John [author]; Wattam, Stephen [author]; Bański, Piotr [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language

20	Removing spam from web corpora through supervised learning using FastText
	Suchomel, Vít [author]; Bański, Piotr [editor]; Kupietz, Marc [editor]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
	DNB Subject Category Language
