DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7 8...10
Hits 61 – 80 of 189

61
Integrating corpora of computer-mediated communication into the language resources landscape: Initiatives and best practices from French, German, Italian and Slovenian projects
Beißwenger, Michael [Verfasser]; Chanier, Thierry [Verfasser]; Chiari, Isabella [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
62
GOLD and Discourse: Domain- and Community-Specific Extensions
Goecke, Daniela [Verfasser]; Lüngen, Harald [Verfasser]; Sasaki, Felix [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
63
Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN
Lüngen, Harald Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
64
(Best) Practices for Annotating and Representing CMC and Social Media Corpora in CLARIN-D
Lüngen, Harald [Verfasser]; Storrer, Angelika [Verfasser]; Ehrhardt, Eric [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
65
Das Dortmunder Chat-Korpus in CLARIN-D: Modellierung und Mehrwerte
Herold, Axel [Verfasser]; Beißwenger, Michael [Verfasser]; Storrer, Angelika [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
66
Linguistische Annotationen für die Analyse von Gliederungsstrukturen wissenschaftlicher Texte
Lüngen, Harald Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
67
Zur Erstellung und Interpretation der Zeitverlaufsgrafiken
Lüngen, Harald [Verfasser]; Keibel, Holger [Verfasser]; Steffens, Doris [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
68
Zur Erstellung und Interpretation der Zeitverlaufsgrafiken
Lüngen, Harald [Verfasser]; Keibel, Holger [Verfasser]; Steffens, Doris [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
69
Zur Erstellung und Interpretation der Zeitverlaufsgrafiken
Lüngen, Harald [Verfasser]; Keibel, Holger [Verfasser]; Steffens, Doris [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
70
Proceedings of the 4th Workshop on Challenges in the Management of Large Corpora
Bański, Piotr (Hrsg.); Kupietz, Marc (Hrsg.); Lüngen, Harald (Hrsg.). - 2016
IDS Bibliografie zur deutschen Grammatik
Show details
71
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23. (2016)
Abstract: We present an approach for searching and exploring translation variants of multi-word units in large multiparallel corpora based on a relational database management system. Our web-based application Multilingwis, which allows for multilingual lookups of phrases and words in English, French, German, Italian and Spanish, is of interest to anybody who wants to quickly compare expressions across several languages, such as language learners without linguistic knowledge. In this paper, we focus on the technical aspects of how to represent and efficiently retrieve all occurrences that match the user’s query in one of five languages simultaneously with their translations into the other four languages. In order to identify such translations in our corpus of 220 million tokens in total, we use statistical sentence and word alignment. By using materialized views, composite indexes, and pre-planned search functions, our relational database management system handles large result sets with only moderate requirements to the underlying hardware. As our systematic evaluation on 200 search terms per language shows, we can achieve retrieval times below 1 second in 75 % of the cases for multi-word expressions.
Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
URL: https://www.zora.uzh.ch/id/eprint/124373/1/cmlc4.pdf
https://doi.org/10.5167/uzh-124373
https://www.zora.uzh.ch/id/eprint/124373/
http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf
BASE
Hide details
72
Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN
Lüngen, Harald; Beißwenger, Michael; Ehrhard, Eric. - : Ruhr-Universität Bochum, 2016
BASE
Show details
73
Building and Annotating a Corpus of German-Language Newsgroups
Schröck, Jasmin [Verfasser]; Lüngen, Harald [Verfasser]; Beißwenger, Michael [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
74
Adding Value to CMC Corpora: CLARINification and Part-of-speech Annotation of the Dortmund Chat Corpus
Beißwenger, Michael [Verfasser] [Herausgeber]; Ehrhardt, Eric [Verfasser]; Horbach, Andrea [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
75
Challenges in the Alignment, Management and Exploitation of Large and Richly Annotated Multi-Parallel Corpora
Graën, Johannes [Verfasser]; Clematide, Simon [Verfasser]; Piotr, Bański [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
76
Integrated Linguistic Annotation Models and Their Application in the Domain of Antecedent Detection
Witt, Andreas Verfasser] [Herausgeber]; Stührenberg, Maik [Verfasser]; Goecke, Daniela [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
77
The Morphosyntactic Annotation of DeReKo: Interpretation, Opportunities, and Pitfalls
Belica, Cyril Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
78
CoRoLa Starts Blooming – An update on the Reference Corpus of Contemporary Romanian Language
Tufiş, Dan [Verfasser]; Barbu Mititelu, Verginica [Verfasser]; Irimia, Elena [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
79
Valenz und Kookkurrenz
Perkuhn, Rainer [Verfasser]; Belica, Cyril [Verfasser]; Keibel, Holger [Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2015
DNB Subject Category Language
Show details
80
TEI across corpora, languages and genres: Towards a standard for the representation of social media and computer-mediated communication
In: Text Encoding Initiative: connect, animate, innovate. 2015 Annual Conference and Members’ Meeting of the TEI Consortium ; https://halshs.archives-ouvertes.fr/halshs-01222982 ; Text Encoding Initiative: connect, animate, innovate. 2015 Annual Conference and Members’ Meeting of the TEI Consortium, TEI Consortium, Oct 2015, Lyon, France ; http://tei2015.huma-num.fr (2015)
BASE
Show details

Page: 1 2 3 4 5 6 7 8...10

Catalogues
2
8
5
0
74
0
0
Bibliographies
10
0
4
1
0
0
4
0
1
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
23
0
61
1
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern