DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Lemmatiser des textes et corriger l'annotation grâcè a l'apprentissage profond avec Pyrrha
In: Humanistica 2021 ; https://hal.archives-ouvertes.fr/hal-03224112 ; Humanistica 2021, May 2021, Rennes, France (2021)
BASE
Show details
2
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-02591388 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, ⟨10.46298/jdmdh.6485⟩ (2021)
Abstract: International audience ; This paper describes the process of building an annotated corpus and training models for classical French literature, with a focus on theatre, and particularly comedies in verse. It was originally developed as a preliminary step to the stylometric analyses presented in Cafiero and Camps [2019]. The use of a recent lemmatiser based on neural networks and a CRF tagger allows to achieve accuracies beyond the current state-of-the art on the in-domain test, and proves to be robust during out-of-domain tests, i.e.up to 20th c.novels.
Keyword: 17th c. French; 17th century French; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS.LITT]Humanities and Social Sciences/Literature; Classical theatre; Lemmatisation; POS tagging
URL: https://doi.org/10.46298/jdmdh.6485
https://halshs.archives-ouvertes.fr/halshs-02591388
https://halshs.archives-ouvertes.fr/halshs-02591388v2/file/Corpus_models_Classical_French_v2.pdf
https://halshs.archives-ouvertes.fr/halshs-02591388v2/document
BASE
Hide details
3
Expanding the content model of annotationBlock
In: Next Gen TEI, 2021 - TEI Conference and Members’ Meeting ; https://hal.archives-ouvertes.fr/hal-03380805 ; Next Gen TEI, 2021 - TEI Conference and Members’ Meeting, Oct 2021, Virtual, United States (2021)
BASE
Show details
4
SegmOnto ; SegmOnto: Un vocabulaire contrôlé pour décrire la page manuscrite et imprimée
In: https://hal.archives-ouvertes.fr/hal-03481089 ; 2021 (2021)
BASE
Show details
5
Guidelines for linguistic annotation of modern French (16th-18th c.) ; Manuel d'annotation linguistique pour le français moderne (XVIe -XVIIIe siècles)
In: https://hal.archives-ouvertes.fr/hal-02571190 ; 2020 (2020)
BASE
Show details
6
Standardizing linguistic data: method and tools for annotating (pre-orthographic) French ; Standardiser les données linguistiques: méthodes et outils pour l'annotation du français (pré-orthographique)
In: Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20) ; https://hal.archives-ouvertes.fr/hal-03018381 ; Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20), Oct 2020, Hammamet, Tunisia. ⟨10.1145/3423603.3423996⟩ (2020)
BASE
Show details
7
CORPUS17: a philological corpus for 17th c. French ; CORPUS17: un corpus philologique pour le 17ème siècle français
In: Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC ’20) ; https://hal.archives-ouvertes.fr/hal-03041871 ; Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC ’20), Oct 2020, Hammamet, Tunisia. ⟨10.1145/3423603.3424002⟩ (2020)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
7
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern