DE eng

Search in the Catalogues and Directories

Hits 1 – 12 of 12

1
Mehrdimensionale Beschreibung erbaulicher Textsorten des 17. Jahrhunderts : ein korpusbasierter Ansatz
In: Jahrbuch für germanistische Sprachgeschichte. - Berlin : de Gruyter 10 (2019), 324-344
BLLDB
Show details
2
Lightweight grammatical annotation in the TEI: new perspectives
Bański, Piotr Verfasser]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2018
DNB Subject Category Language
Show details
3
Recherchieren, Arbeiten und Publizieren im Deutschen Textarchiv: ein Praxisbericht
In: Zeitschrift für germanistische Linguistik. - Berlin [u.a.] : de Gruyter 46 (2018) 1, 147-161
BLLDB
Show details
4
Integration heterogener historischer Textkorpora in das Deutsche Textarchiv: Strategien der Anlagerung und Perspektiven der Nachnutzung
In: Korpuslinguistik. - Duisburg : Universitätsverlag Rhein-Ruhr OHG (2018), 175-192
BLLDB
Show details
5
Die historischen Korpora des Deutschen Textarchivs als Grundlage für sprachgeschichtliche Forschungen.
In: Sprachgeschichte des Deutschen : Positionierungen in Forschung, Studium, Unterricht (2016), S. 217-234
Leibniz-Zentrum Allgemeine Sprachwissenschaft
Show details
6
Die historischen Korpora des Deutschen Textarchivs als Grundlage für sprachgeschichtliche Forschungen
BASE
Show details
7
Deutsches Textarchiv (Dta) Und Clarin-D ...
BASE
Show details
8
Measuring the Correctness of Double-Keying : Error Classification and Quality Control in a Large Corpus of TEI-Annotated Historical Text
Abstract: Among mass digitization methods, double-keying is considered to be the one with the lowest error rate. This method requires two independent transcriptions of a text by two different operators. It is particularly well suited to historical texts, which often exhibit deficiencies like poor master copies or other difficulties such as spelling variation or complex text structures. Providers of data entry services using the double-keying method generally advertise very high accuracy rates (around 99.95% to 99.98%). These advertised percentages are generally estimated on the basis of small samples, and little if anything is said about either the actual amount of text or the text genres which have been proofread, about error types, proofreaders, etc. In order to obtain significant data on this problem it is necessary to analyze a large amount of text representing a balanced sample of different text types, to distinguish the structural XML/TEI level from the typographical level, and to differentiate between various types of errors which may originate from different sources and may not be equally severe. This paper presents an extensive and complex approach to the analysis and correction of double-keying errors which has been applied by the DFG-funded project “Deutsches Textarchiv” (German Text Archive, hereafter DTA) in order to evaluate and preferably to increase the transcription and annotation accuracy of double-keyed DTA texts. Statistical analyses of the results gained from proofreading a large quantity of text are presented, which verify the common accuracy rates for the double-keying method.
Keyword: ddc:400; Digitalisierung; Genauigkeit; Historische Sprachwissenschaft; Korpus; Korrekturlesen; Qualitätssicherung; Transkription
URN: urn:nbn:de:kobv:b4-opus-23547
URL: https://edoc.bbaw.de/files/2058/HaafWiegandGeyken_CorrectnessDoubleKeying_2013_jTEI4.pdf
https://nbn-resolving.org/urn:nbn:de:kobv:b4-opus-23547
https://edoc.bbaw.de/frontdoor/index/index/docId/2058
BASE
Hide details
9
XML-Technologien als Grundlage dynamischer Textpräsentation: die digitale Quellenedition "Der Zürcher Sommer 1968"
In: Jahrbuch für Computerphilologie. - Paderborn : Mentis-Verl. 9 (2009), 87-107
BLLDB
Show details
10
Lightweight grammatical annotation in the TEI: new perspectives [Online resource]
IDS-Repository
Show details
11
Gute Forschungsdaten, bessere Forschung: wie Forschung durch Forschungsdatenmanagement unterstützt wird [Online resource]
IDS-Repository
Show details
12Deutsches Textarchiv (DTA)
http://www.deutschestextarchiv.de/
Topic: Computational linguistics; History of language; Morphology; ...
Language: German, Standard
Forschungstyp: Research projects
Access: free access

Catalogues
0
0
0
0
1
0
1
Bibliographies
4
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
0
Open access documents
3
0
2
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern