DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
WIP: LEONIDE PoS training
BASE
Show details
2
Kolipsi-1 Corpus v1.0
Glaznieks, Aivars; Frey, Jennifer-Carmen; Abel, Andrea. - : Institute for Applied Linguistics, Eurac Research, 2021
BASE
Show details
3
Kolipsi-2 Corpus v1.0
Glaznieks, Aivars; Frey, Jennifer-Carmen; Nicolas, Lionel; Abel, Andrea; Vettori, Chiara. - : Institute for Applied Linguistics, Eurac Research, 2021
Abstract: The Kolipsi-2 Corpus is a written learner corpus of German and Italian L2 speakers originating from South Tyrol (Italy). It has been developed as a by-product of the KOLIPSI II project, a replication study of the KOLIPSI project on “South-Tyrolean pupils and the second language: a linguistic and socio-psychological investigation” that was conducted 7 years after the original study. The data collection for this second edition took place in spring 2014 and is based on two standardized tests for written productions, that were aligned with the original tasks for the KOLIPSI study. However, while the first task remained the same for both editions, the second task was slightly adapted. The two tasks consisted of (1) writing an e-mail to a friend retelling a given event at the supermarket based on a picture story (narrative text genre) and (2) writing an e-mail about negative aspects of social-media chats prompted by a letter to the editor in a youth magazine (argumentative text genre). For both tasks a time limit of 25 minutes was fixed and no additional reference material was allowed. CEFR levels have been assigned to all L2 learner texts, providing a holistic score as well as evaluations of coherence, sociolinguistic appropriateness, lexical accuracy, lexical diversity, grammar and orthography. Person-related metadata provides information about: - the writer's language background, including L1(s), the L1(s) of mother and father, and a self-declared language group affiliation as well as the pre-dominant language spoken in the area the writer is residing in - the writer's results from an additional language test in the L2 (dialang test) - the writer's competence in the local German dialect (for students with L1 Italian only) - the writer's age, gender and socio-economic status - whether the writer lives in an urban or rural environment - the language, location and type of school the writer attended - an anonymous identifier for the writer's school class to account for class effects All texts have been transcribed manually adding transcription annotations that reflect surface features of the text, such as the graphical arrangement, and include error annotation on the orthographic level. In addition to that, all texts were automatically annotated, adding tokenisation, sentence splitting, POS-tagging and lemmatization using an orthographically corrected target version of the corpus. Kolipsi-1 L2 belongs to the Kolipsi Corpus Family, a series of related learner corpora collected in South Tyrolean upper secondary schools. The corpora of the Kolipsi Corpus Family contain Italian and German learner texts that were collected in the course of the KOLIPSI project in 2007/2008 (Kolipsi-1) and a follow-up study in 2014/2015 (Kolipsi-2). The aim of both corpus studies was to analyse the second language competences of South-Tyrolean pupils from upper secondary schools (between 16-18 years old), and to contextualize the results of such investigation by commenting on crucial sociolinguistic and psychosocial aspects that influence it. The results of the follow-up study should be compared to the results of the original KOLIPSI project.
Keyword: argumentative essay; L2 corpora; learner corpus; picture story; South Tyrol; student essay
URL: https://hdl.handle.net/20.500.12124/30
BASE
Hide details
4
The FAIR Index of CMC Corpora
In: CMC Corpora through the prism of Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03121698 ; CMC Corpora through the prism of Digital Humanities, 2020 (2020)
BASE
Show details
5
Using data mining to repurpose German language corpora. An evaluation of data-driven analysis methods for corpus linguistics ...
Frey, Jennifer Carmen. - : amsdottorato, 2020
BASE
Show details
6
LEONIDE - Longitudinal Learner Corpus in Italiano, Deutsch and English 1.1
Glaznieks, Aivars; Frey, Jennifer-Carmen; Stopfner, Maria. - : Institute for Applied Linguistics, Eurac Research, 2020
BASE
Show details
7
Using data mining to repurpose German language corpora. An evaluation of data-driven analysis methods for corpus linguistics
Frey, Jennifer Carmen <1988>. - : Alma Mater Studiorum - Università di Bologna, 2020
BASE
Show details
8
Wie misst man Textqualität im digitalen Zeitalter? : (MIT.Qualität)
Abel, Andrea (VerfasserIn); Frey, Jennifer-Carmen (VerfasserIn)
In: Enthalten in: Neues vom heutigen Deutsch (2019)
IDS Mannheim
9
DIDI - The DiDi Corpus of South Tyrolean CMC 1.0.0
Frey, Jennifer-Carmen; Glaznieks, Aivars; Stemle, Egon W.. - : Institute for Applied Linguistics, Eurac Research, 2019
BASE
Show details
10
Collecting language data of non-public social media profiles
Frey, Jennifer-Carmen [Verfasser]; Stemle, Egon W. [Verfasser]; Glaznieks, Aivars [Verfasser]. - Hildesheim : Universität Hildesheim, 2014
DNB Subject Category Language
Show details
11
Collecting language data of non-public social media profiles
BASE
Show details
12
Wie misst man Textqualität im digitalen Zeitalter? (MIT.Qualität) [Online resource]
IDS-Repository
Show details
13
Das DiDi-Korpus: Internetbasierte Kommunikation aus Südtirol [Online resource]
IDS-Repository
Show details

Catalogues
0
1
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
9
0
2
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern