
Search in the Catalogues and Directories

Hits 1 – 5 of 5

1
TEICORPO: a conversion tool for spoken language transcription with a pivot file in TEI
In: Journal of the Text Encoding Initiative, TEI Consortium, in press (2020) ; ISSN: 2162-5603 ; EISSN: 2162-5603 ; https://halshs.archives-ouvertes.fr/halshs-03043572
Abstract: CORLI is a consortium of Huma-Num, the French national infrastructure dedicated to the technical support and promotion of digital humanities. The goal of CORLI is to promote and provide tools and information for good and efficient research practices in corpus linguistics, especially for spoken language corpora. Because collecting and transcribing spoken language resources is time-consuming, such resources are few, so corpora need to be interoperable and reusable in order to support research on various themes (phonology, prosody, interaction, syntax, textometry…). To help researchers reach this goal, CORLI has designed a set of tools: TEICORPO, to assist in the conversion and use of spoken language corpora, and TEIMETA, for metadata purposes. TEICORPO is based on the principle of an underlying common format, namely the TEI as described in its specification for spoken language use (ISO/TEI 24624:2016). The tool converts transcriptions created with alignment software such as CLAN, Transcriber, Praat, or ELAN, as well as common file formats (csv, xlsx, txt, or docx), into the TEI format, which serves as a pivot format, without losing information. Backward conversion is possible in many cases, with limitations inherent to the target format. TEICORPO can run the TreeTagger part-of-speech tagger and the Stanford CoreNLP tools on TEI files and can export the resulting files to textometric tools such as TXM, Le Trameur, or Iramuteq, making it a tool dedicated to editing spoken language corpora as well as to various research purposes.
Keyword: [SHS.LANGUE]Humanities and Social Sciences/Linguistics; annotationBlock; conversion; oral corpora; TEI; transcription
URL: https://halshs.archives-ouvertes.fr/halshs-03043572
https://halshs.archives-ouvertes.fr/halshs-03043572/document
https://halshs.archives-ouvertes.fr/halshs-03043572/file/182-Article%20Text-1407-1-15-20201019.pdf
BASE
2
Utilisation d'un format commun pour structurer les métadonnées de corpus oraux : objectifs, enjeux et méthode
In: Données, métadonnées des corpus et catalogage des objets en sciences humaines et sociales, Jun 2016, Poitiers, France (2016) ; https://halshs.archives-ouvertes.fr/halshs-01357271
BASE
3
Utilisation d'un format commun pour structurer les métadonnées de corpus oraux : objectifs, enjeux et méthode
In: Données, métadonnées des corpus et catalogage des objets en sciences humaines et sociales, Jun 2016, Poitiers, France (2016) ; https://halshs.archives-ouvertes.fr/halshs-01357271
BASE
4
Using the TEI as a pivot format for oral and multimodal language corpora
In: Text Encoding Initiative Conference and Member's meeting 2015, Oct 2015, Lyon, France (2015) ; https://halshs.archives-ouvertes.fr/halshs-01345777 ; http://tei2015.huma-num.fr/fr/
BASE
5
Using the TEI as a pivot format for oral and multimodal language corpora
In: Text Encoding Initiative Conference and Member's meeting 2015, Oct 2015, Lyon, France (2015) ; https://halshs.archives-ouvertes.fr/halshs-01345777 ; http://tei2015.huma-num.fr/fr/
BASE
