DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Automatic Normalisation of Early Modern French
In: https://hal.inria.fr/hal-03540226 ; 2022 (2022)
BASE
Show details
2
Variation graphique dans les documents d'Ancien Régime : Nouvelles approches scriptométriques
In: Journée d’étude : « Pour une histoire de la langue ‘par en bas’: textes privés et variation des langues dans le passé » ; https://hal.inria.fr/hal-03357080 ; Journée d’étude : « Pour une histoire de la langue ‘par en bas’: textes privés et variation des langues dans le passé », Sep 2021, Paris, France (2021)
BASE
Show details
3
Normalisation of 16th and 17th century texts in French and geographical named entity recognition
In: 4th ACM SIGSPATIAL International Workshop on Geospatial Humanities ; ACM SIGSPATIAL GeoHumanities'20 ; https://hal-upec-upem.archives-ouvertes.fr/hal-02955867 ; ACM SIGSPATIAL GeoHumanities'20, ACM, Nov 2020, Seattle (virtual), United States. pp.28-34, ⟨10.1145/3423337.3429437⟩ ; https://ludovicmoncla.github.io/sigspatial-geohumanities-2020/ (2020)
Abstract: International audience ; Both statistical and rule-based methods for named entity recognition are quite sensitive to the type of language used in the analysed texts. Former studies have shown for example that it was harder to detect named entities in SMS or microblog messages where words are abridged or changed to lowercase. In this article, we focus on old French texts to evaluate the impact of manual and automatic normalization before applying five geographical named entity recognition tools, as well as an improved version of one of them, in order to help building maps displaying the locations mentioned in ancient texts. Our results show that manual normalisation leads to better results for all methods and that automatic normalisation performs differently depending on the tool used to extract geographical named entities, but with a significant improvement on most methods.
Keyword: [INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]; CasEN; CoreNLP; digital humanities; geographical named entity recognition; natural language processing; Perdido; SEM; Spacy; text normalization; Unitex
URL: https://hal-upec-upem.archives-ouvertes.fr/hal-02955867/file/KogkitsidouGambette-2020-postprint.pdf
https://doi.org/10.1145/3423337.3429437
https://hal-upec-upem.archives-ouvertes.fr/hal-02955867/document
https://hal-upec-upem.archives-ouvertes.fr/hal-02955867
BASE
Hide details
4
SMS communication : Natural language processing and information extraction ; Communiquer par SMS : Analyse automatique du langage et extraction de l'information véhiculée
Kogkitsidou, Eleni. - : HAL CCSD, 2018
In: https://tel.archives-ouvertes.fr/tel-01968698 ; Linguistique. Université Grenoble Alpes, 2018. Français. ⟨NNT : 2018GREAL012⟩ (2018)
BASE
Show details
5
Alpes4science project : SMS corpus processing and tokenization problems
Kogkitsidou, Eleni [Verfasser]; Antoniadis, Georges [Verfasser]. - Hildesheim : Universität Hildesheim, 2014
DNB Subject Category Language
Show details
6
Alpes4science project : SMS corpus processing and tokenization problems
BASE
Show details
7
Extraction de citations contenues dans des documents brevet
In: 32ème colloque international sur le lexique et la grammaire ; https://hal-upec-upem.archives-ouvertes.fr/hal-01090581 ; 32ème colloque international sur le lexique et la grammaire, Sep 2013, Faro, Portugal. pp.57-64 (2013)
BASE
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern