DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Automatic Normalisation of Early Modern French
In: https://hal.inria.fr/hal-03540226 ; 2022 (2022)
Abstract: Spelling normalisation is a useful step in the study and analysis of historical language texts, whether it is manual analysis by experts or automatic analysis using downstream natural language processing (NLP) tools. Not only does it help to homogenise the variable spelling that often exists in historical texts, but it also facilitates the use of off-the-shelf contemporary NLP tools, if contemporary spelling conventions are used for normalisation. We present FreEMnorm, a new benchmark for the normalisation of Early Modern French (from the 17th century) into contemporary French and provide a thorough comparison of three different normalisation methods: ABA, an alignment-based approach and MT-approaches, (both statistical and neural), including extensive parameter searching, which is often missing in the normalisation literature.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Digital Humanities; Historical; Machine Translation; Modern French; Normalisation; Spelling
URL: https://hal.inria.fr/hal-03540226/document
https://doi.org/10.5281/zenodo.5865428
https://hal.inria.fr/hal-03540226
https://hal.inria.fr/hal-03540226/file/LREC_2022_ModFr_Normalisation-18.pdf
BASE
Hide details
2
Variation graphique dans les documents d'Ancien Régime : Nouvelles approches scriptométriques
In: Journée d’étude : « Pour une histoire de la langue ‘par en bas’: textes privés et variation des langues dans le passé » ; https://hal.inria.fr/hal-03357080 ; Journée d’étude : « Pour une histoire de la langue ‘par en bas’: textes privés et variation des langues dans le passé », Sep 2021, Paris, France (2021)
BASE
Show details
3
Normalisation of 16th and 17th century texts in French and geographical named entity recognition
In: 4th ACM SIGSPATIAL International Workshop on Geospatial Humanities ; ACM SIGSPATIAL GeoHumanities'20 ; https://hal-upec-upem.archives-ouvertes.fr/hal-02955867 ; ACM SIGSPATIAL GeoHumanities'20, ACM, Nov 2020, Seattle (virtual), United States. pp.28-34, ⟨10.1145/3423337.3429437⟩ ; https://ludovicmoncla.github.io/sigspatial-geohumanities-2020/ (2020)
BASE
Show details
4
SMS communication : Natural language processing and information extraction ; Communiquer par SMS : Analyse automatique du langage et extraction de l'information véhiculée
Kogkitsidou, Eleni. - : HAL CCSD, 2018
In: https://tel.archives-ouvertes.fr/tel-01968698 ; Linguistique. Université Grenoble Alpes, 2018. Français. ⟨NNT : 2018GREAL012⟩ (2018)
BASE
Show details
5
Alpes4science project : SMS corpus processing and tokenization problems
Kogkitsidou, Eleni [Verfasser]; Antoniadis, Georges [Verfasser]. - Hildesheim : Universität Hildesheim, 2014
DNB Subject Category Language
Show details
6
Alpes4science project : SMS corpus processing and tokenization problems
BASE
Show details
7
Extraction de citations contenues dans des documents brevet
In: 32ème colloque international sur le lexique et la grammaire ; https://hal-upec-upem.archives-ouvertes.fr/hal-01090581 ; 32ème colloque international sur le lexique et la grammaire, Sep 2013, Faro, Portugal. pp.57-64 (2013)
BASE
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern