1 |
Annoter et prédire des représentations linguistiques de phrases
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-03544267 ; Informatique et langage [cs.CL]. Université de Paris, 2022 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
A French corpus annotated for multiword expressions and named entities
|
|
|
|
In: ISSN: 2299-856X ; EISSN: 2299-8470 ; Journal of Language Modelling ; https://hal.archives-ouvertes.fr/hal-03016721 ; Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, 2020, 8 (2), pp.415-479. ⟨10.15398/jlm.v8i2.265⟩ (2020)
|
|
Abstract:
International audience ; We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs).1 Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty to draw a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status.In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with thesyntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SCCO.COMP]Cognitive science/Computer science; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing; annotation; corpus; French; multiword expressions
|
|
URL: https://hal.archives-ouvertes.fr/hal-03016721/file/265-Article%20Text%20%E2%80%93%20PDF-2149-2-10-20210224%20%281%29.pdf https://hal.archives-ouvertes.fr/hal-03016721/document https://doi.org/10.15398/jlm.v8i2.265 https://hal.archives-ouvertes.fr/hal-03016721
|
|
BASE
|
|
Hide details
|
|
7 |
Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions
|
|
|
|
In: Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020) ; https://hal.archives-ouvertes.fr/hal-03014927 ; Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 2020, Barcelona, Spain ; https://www.aclweb.org/anthology/volumes/2020.mwe-1/ (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Morpho-syntactically annotated corpora provided for the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Comparing linear and neural models for competitive MWE identification
|
|
|
|
In: Proceedings of the 22nd Nordic Conference on Computational Linguistics ; The 22nd Nordic Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02436433 ; The 22nd Nordic Conference on Computational Linguistics, Sep 2019, Turku, Finland ; https://www.aclweb.org/anthology/W19-6109.pdf (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Linked Open Treebanks. Interlinking Syntactically Annotated Corpora in the LiLa Knowledge Base of Linguistic Resources for Latin
|
|
Mambrini, Francesco (orcid:0000-0003-0834-7562); Passarotti, Marco (orcid:0000-0002-9806-7187). - : Association for Computational Linguistics, 2019. : country:FRA, 2019. : place:Paris, 2019
|
|
BASE
|
|
Show details
|
|
16 |
Edition 1.1 of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions
|
|
|
|
In: Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) ; https://hal.archives-ouvertes.fr/hal-01865575 ; Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), Aug 2018, Santa Fe, United States. pp.222 - 240 (2018)
|
|
BASE
|
|
Show details
|
|
17 |
A transition-based verbal multiword expression analyzer
|
|
|
|
In: Multiword expressions at length and in depth ; https://hal.archives-ouvertes.fr/hal-01930522 ; Stella Markantonatou, Carlos Ramisch, Agata Savary, Veronika Vincze. Multiword expressions at length and in depth, Language Science Press, 2018, 978-3-96110-123-8. ⟨10.5281/zenodo.1469561⟩ ; http://langsci-press.org/catalog/book/204 (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
|
|
|
|
In: Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ; https://hal.inria.fr/hal-01798801 ; Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
Show details
|
|
19 |
Universal Dependencies 2.2
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01930733 ; 2018 (2018)
|
|
BASE
|
|
Show details
|
|
|
|