1 |
Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
|
|
|
|
In: SyntaxFest Quasy 2021 - Quantitative Syntax ; https://hal.inria.fr/hal-03501774 ; SyntaxFest Quasy 2021 - Quantitative Syntax, Mar 2022, Sofia, Bulgaria (2022)
|
|
BASE
|
|
|
|
2 |
Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
|
|
|
|
In: Quasy (Quantitative Syntax), SyntaxFest 2021 ; https://hal.inria.fr/hal-03501774 ; Quasy (Quantitative Syntax), SyntaxFest 2021, Mar 2022, Sofia, Bulgaria (2022)
|
|
|
|
3 |
Starting a new treebank? Go SUD! Theoretical and practical benefits of the Surface-Syntactic distributional approach
|
|
|
|
In: Sixth International Conference on Dependency Linguistics (Depling, SyntaxFest 2021) ; https://hal.inria.fr/hal-03509136 ; Sixth International Conference on Dependency Linguistics (Depling, SyntaxFest 2021), Mar 2022, Sofia, Bulgaria (2022)
|
|
|
|
4 |
A morph-based and a word-based treebank for Beja
|
|
|
|
In: SyntaxFest ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories ; https://hal.archives-ouvertes.fr/hal-03494462 ; TLT 2021 - 20th International Workshop on Treebanks and Linguistic Theories, Mar 2022, Sofia, Bulgaria (2022)
|
|
|
|
5 |
Convertir le Trésor de la Langue Française en Ontolex-Lemon : un zeste de données liées [Converting the Trésor de la Langue Française to Ontolex-Lemon: a zest of linked data]
|
|
|
|
In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03463294 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, Dec 2021, Grenoble, France (2021)
|
|
|
|
6 |
A morph-based and a word-based treebank for Beja
|
|
|
|
In: SyntaxFest ; https://hal.archives-ouvertes.fr/hal-03494462 ; SyntaxFest, In press (2021)
|
|
|
|
7 |
Analyse orientée corpus d'universaux de Greenberg sur Universal Dependencies [Corpus-based analysis of Greenberg's universals on Universal Dependencies]
|
|
|
|
In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03462112 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, GDR LIFT - Linguistique Informatique, Formelle et de Terrain, Dec 2021, Grenoble, France (2021)
|
|
|
|
8 |
Graph Matching and Graph Rewriting: GREW tools for corpus exploration, maintenance and conversion
|
|
|
|
In: EACL 2021 - 16th conference of the European Chapter of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03177701 ; EACL 2021 - 16th conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kiev/Online, Ukraine ; https://2021.eacl.org/ (2021)
|
|
|
|
14 |
Convertir le Trésor de la Langue Française en Ontolex-Lemon : un zeste de données liées ...
|
|
|
|
|
|
16 |
A French corpus annotated for multiword expressions and named entities
|
|
|
|
In: ISSN: 2299-856X ; EISSN: 2299-8470 ; Journal of Language Modelling ; https://hal.archives-ouvertes.fr/hal-03016721 ; Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, 2020, 8 (2), pp.415-479. ⟨10.15398/jlm.v8i2.265⟩ (2020)
|
|
Abstract:
We present the enrichment of a French treebank of various genres with a new annotation layer for multiword expressions (MWEs) and named entities (NEs). Our contribution with respect to previous work on NE and MWE annotation is the particular care taken to use formal criteria, organized into decision flowcharts, shedding some light on the interactions between NEs and MWEs. Moreover, in order to cope with the well-known difficulty of drawing a clear-cut frontier between compositional expressions and MWEs, we chose to use sufficient criteria only. As a result, annotated MWEs satisfy a varying number of sufficient criteria, accounting for the scalar nature of the MWE status. In addition to the span of the elements, annotation includes the subcategory of NEs (e.g., person, location) and one matching sufficient criterion for non-verbal MWEs (e.g., lexical substitution). The 3,099 sentences of the treebank were double-annotated and adjudicated, and we paid attention to cross-type consistency and compatibility with the syntactic layer. Overall inter-annotator agreement on non-verbal MWEs and NEs reached 71.1%. The released corpus contains 3,112 annotated NEs and 3,440 MWEs, and is distributed under an open license.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SCCO.COMP]Cognitive science/Computer science; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing; annotation; corpus; French; multiword expressions
|
|
URL: https://hal.archives-ouvertes.fr/hal-03016721/file/265-Article%20Text%20%E2%80%93%20PDF-2149-2-10-20210224%20%281%29.pdf https://hal.archives-ouvertes.fr/hal-03016721/document https://doi.org/10.15398/jlm.v8i2.265 https://hal.archives-ouvertes.fr/hal-03016721
|
|
|
|
17 |
Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions
|
|
|
|
In: Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020) ; https://hal.archives-ouvertes.fr/hal-03014927 ; Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 2020, Barcelona, Spain ; https://www.aclweb.org/anthology/volumes/2020.mwe-1/ (2020)
|
|
|
|
18 |
When Collaborative Treebank Curation Meets Graph Grammars ; When Collaborative Treebank Curation Meets Graph Grammars: Arborator With a Grew Back-End
|
|
|
|
In: LREC 2020 - 12th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-03021720 ; LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France ; http://www.lrec-conf.org/proceedings/lrec2020/index.html (2020)
|
|
|
|
19 |
A French Version of the FraCaS Test Suite ; Une version française de la ressource FraCaS
|
|
|
|
In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02619239 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France. pp.9 (2020)
|
|
|
|
20 |
Morpho-syntactically annotated corpora provided for the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
|
|
|
|
|
|
|
|