1 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
BASE
|
|
Show details
|
|
4 |
L’avenir numérique des langues minoritaires : bilan du projet RESTAURE pour l’alsacien, l’occitan et le picard
|
|
|
|
In: ISSN: 2105-0368 ; Les Cahiers du GEPE ; Colloque « Langues minoritaires » : quels acteurs pour quel avenir ? ; https://hal.archives-ouvertes.fr/hal-02378172 ; Les Cahiers du GEPE, Université de Strasbourg, 2020, Langues minoritaires : Quels acteurs pour quel avenir ? ; http://cahiersdugepe.fr/index.php?id=3662 (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Exploiting languages proximity for part-of-speech tagging of three French regional languages
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02358020 ; Language Resources and Evaluation, Springer Verlag, 2019, pp.1-26 (2019)
|
|
BASE
|
|
Show details
|
|
7 |
Language Technologies for Regional Languages of France: The RESTAURE Project
|
|
|
|
In: International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide ; https://hal.archives-ouvertes.fr/hal-02418928 ; International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, Dec 2019, Paris, France. pp.272‑275 ; https://lt4all.elra.info/proceedings/lt4all2019/ (2019)
|
|
BASE
|
|
Show details
|
|
8 |
A Corpus for Hybrid Question Answering Systems
|
|
|
|
In: Proceeding WWW '18 Companion Proceedings of the The Web Conference 2018 ; Workshop on Hybrid Question Answering with Structured and Unstructured Knowledge ; https://hal.archives-ouvertes.fr/hal-02284465 ; Workshop on Hybrid Question Answering with Structured and Unstructured Knowledge, Apr 2018, Lyon - FR, France. pp.1081-1086, ⟨10.1145/3184558.3191540⟩ (2018)
|
|
BASE
|
|
Show details
|
|
9 |
Étiquetage en parties du discours de langues peu dotées par spécialisation des plongements lexicaux
|
|
|
|
In: Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01793092 ; Conférence sur le Traitement Automatique des Langues Naturelles, May 2018, Rennes, France (2018)
|
|
BASE
|
|
Show details
|
|
10 |
Resources and Methods for the Automatic Recognition of Place Names in Alsatian
|
|
|
|
In: Corpus-Based Research in the Humanities ; https://hal.archives-ouvertes.fr/hal-01702656 ; Corpus-Based Research in the Humanities, Jan 2018, Vienna, Austria. pp.35-44 ; https://www.oeaw.ac.at/ac/crh2/proceedings/ (2018)
|
|
BASE
|
|
Show details
|
|
11 |
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT) [<Journal>]
|
|
|
|
DNB Subject Category Language
|
|
Show details
|
|
12 |
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT)
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01631743 ; Language Resources and Evaluation, Springer Verlag, 2017, 52 (2), pp.571-601. ⟨10.1007/s10579-017-9382-y⟩ (2017)
|
|
Abstract:
International audience ; Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Clinical narrative; Inter-annotator agreement; Personal health information; Semantic annotations
|
|
URL: https://doi.org/10.1007/s10579-017-9382-y https://hal.archives-ouvertes.fr/hal-01631743/file/lre.pdf https://hal.archives-ouvertes.fr/hal-01631743 https://hal.archives-ouvertes.fr/hal-01631743/document
|
|
BASE
|
|
Hide details
|
|
13 |
Chaînes de référence et lisibilité des textes : Le projet ALLuSIF
|
|
|
|
In: ISSN: 0023-8368 ; EISSN: 1957-7982 ; Langue française ; https://halshs.archives-ouvertes.fr/halshs-01665316 ; Langue française, Armand Colin, 2017, Les chaînes de référence en corpus (éds. Catherine Schnedecker, Julie Glikman, Frédéric Landragin), 195 (3), pp.35-52 ; http://www.revues.armand-colin.com/lettres-langues/langue-francaise/langue-francaise-ndeg-195-32017 (2017)
|
|
BASE
|
|
Show details
|
|
14 |
Chaînes de référence et lisibilité des textes : le projet ALLuSIF
|
|
|
|
In: Langue française, N 195, 3, 2017-09-25, pp.35-52 (2017)
|
|
BASE
|
|
Show details
|
|
15 |
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
|
|
|
|
In: LREC 2016 proceedings ; Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-02517616 ; Language Resources and Evaluation Conference (LREC), May 2016, Portorož, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
16 |
Modèles adaptatifs pour prédire automatiquement la compétence lexicale d'un apprenant de français langue étrangère
|
|
|
|
In: Actes de la conférence conjointe JEP-TALN-RECITAL 2016 ; JEP-TALN-RECITAL 2016 ; https://hal.archives-ouvertes.fr/hal-01631772 ; JEP-TALN-RECITAL 2016, Jan 2016, Paris, France (2016)
|
|
BASE
|
|
Show details
|
|
17 |
Are Cohesive Features Relevant for Text Readability Evaluation?
|
|
|
|
In: 26th International Conference on Computational Linguistics (COLING 2016) ; https://hal.archives-ouvertes.fr/hal-01430554 ; 26th International Conference on Computational Linguistics (COLING 2016), Dec 2016, Osaka, Japan. pp.987 - 997 ; http://coling2016.anlp.jp/ (2016)
|
|
BASE
|
|
Show details
|
|
18 |
Représentation sémantique de questions pour interroger le Web sémantique.
|
|
|
|
In: CORIA 2015 - Conférence en Recherche d'Informations et Applications - 12th French Information Retrieval Conference, Paris, France, March 18-20, 2015. ; CORIA ; https://hal.archives-ouvertes.fr/hal-02289244 ; CORIA, Mar 2015, Paris, France. pp.453--468, ⟨10.24348/coria.2015.80⟩ (2015)
|
|
BASE
|
|
Show details
|
|
19 |
Représentation sémantique de questions pour interroger le Web sémantique. ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
LIMSI-CNRS@ CLEF 2014: Invalidating Answers for Multiple Choice Question Answering.
|
|
|
|
In: Working Notes for CLEF 2014 Conference, Sheffield, UK, September 15-18, 2014 ; CLEF 2014 ; https://hal.archives-ouvertes.fr/hal-02290008 ; CLEF 2014, Sep 2014, Sheffield, United Kingdom. pp.1386--1394 (2014)
|
|
BASE
|
|
Show details
|
|
|
|