1 |
Lemmatiser des textes et corriger l'annotation grâcè a l'apprentissage profond avec Pyrrha
|
|
|
|
In: Humanistica 2021 ; https://hal.archives-ouvertes.fr/hal-03224112 ; Humanistica 2021, May 2021, Rennes, France (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Handling Heavily Abbreviated Manuscripts: HTR engines vs text normalisation approaches
|
|
|
|
In: International Conference on Document Analysis and Recognition 2021 ; https://hal-enc.archives-ouvertes.fr/hal-03279602 ; International Conference on Document Analysis and Recognition 2021, 2021, Lausanne, Switzerland. pp.306-316, ⟨10.1007/978-3-030-86159-9_21⟩ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-02591388 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, ⟨10.46298/jdmdh.6485⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Corpus and Models for Lemmatisation and POS-tagging of Old French
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03353125 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
SegmOnto ; SegmOnto: Un vocabulaire contrôlé pour décrire la page manuscrite et imprimée
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03481089 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Corpus and Models for Lemmatisation and POS-tagging of Old French ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Guidelines for linguistic annotation of modern French (16th-18th c.) ; Manuel d'annotation linguistique pour le français moderne (XVIe -XVIIIe siècles)
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02571190 ; 2020 (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Standardizing linguistic data: method and tools for annotating (pre-orthographic) French ; Standardiser les données linguistiques: méthodes et outils pour l'annotation du français (pré-orthographique)
|
|
|
|
In: Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20) ; https://hal.archives-ouvertes.fr/hal-03018381 ; Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20), Oct 2020, Hammamet, Tunisia. ⟨10.1145/3423603.3423996⟩ (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Standardizing linguistic data: method and tools for annotating (pre-orthographic) French ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Standardizing linguistic data: method and tools for annotating(pre-orthographic) French ...
|
|
|
|
Abstract:
With the development of big corpora of various periods, it becomescrucial to standardise linguistic annotation (e.g.lemmas, POS tags,morphological annotation) to increase the interoperability of the dataproduced, despite diachronic variations. In the present paper, wedescribe both methodologically (by proposing annotation principles)and technically (by creating the required training data and therelevant models) the production of a linguistic tagger for (early)modern French (16-18th c.), taking as much as possible into accountalready existing standards for contemporary and, especially, medievalFrench ...
|
|
Keyword:
linguistic annotation, pre-orthographic language, lemmatisation,POS-tagging
|
|
URL: https://dx.doi.org/10.5281/zenodo.4084498 https://zenodo.org/record/4084498
|
|
BASE
|
|
Hide details
|
|
11 |
Standardizing linguistic data: method and tools for annotating(pre-orthographic) French ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Manuscripts in Time and Space: Experiments in Scriptometrics on an Old French Corpus ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|