1 |
Strategies to select examples for Active Learning with Conditional Random Fields
In: CICLing 2017 - 18th International Conference on Computational Linguistics and Intelligent Text Processing, Apr 2017, Budapest, Hungary, pp. 1-14 ; https://hal.archives-ouvertes.fr/hal-01621338
BASE
2 |
Direct vs. indirect evaluation of distributional thesauri
In: Proceedings of the International Conference on Computational Linguistics, COLING, Dec 2016, Osaka, Japan ; https://hal.archives-ouvertes.fr/hal-01394739
Abstract:
With the success of word embedding methods in various Natural Language Processing tasks, all fields of distributional semantics have experienced renewed interest. Besides the well-known word2vec, recent studies have presented efficient techniques to build distributional thesauri; in particular, Claveau et al. (2014) have shown that Information Retrieval (IR) tools and concepts can be successfully used to build a thesaurus. In this paper, we address the problem of evaluating such thesauri or embedding models. Several evaluation scenarios are considered: direct evaluation through reference lexicons and specially crafted datasets, and indirect evaluation through third-party tasks, namely lexical substitution and Information Retrieval. For the latter task, we adopt the query expansion framework proposed by Claveau and Kijak (2016). Through several experiments, we first show that the recent techniques for building distributional thesauri outperform the word2vec approach, whatever the evaluation scenario. We also highlight the differences between the evaluation scenarios, which may lead to very different conclusions when comparing distributional models. Finally, we study the effect of some parameters of the distributional models on these evaluation scenarios.
Keywords:
Computer Science / Artificial Intelligence [cs.AI]; Computer Science / Computation and Language [cs.CL]; Computer Science / Information Retrieval [cs.IR]
URL: https://hal.archives-ouvertes.fr/hal-01394739/file/Claveau_Kijak_IR_COLING2016.pdf https://hal.archives-ouvertes.fr/hal-01394739 https://hal.archives-ouvertes.fr/hal-01394739/document
BASE
3 |
Distributional Thesauri for Information Retrieval and vice versa
In: Proceedings of Language and Resource Conference, LREC, May 2016, Portoroz, Slovenia ; https://hal.archives-ouvertes.fr/hal-01394770
BASE
4 |
Thésaurus distributionnels pour la recherche d'information et vice-versa [Distributional thesauri for information retrieval and vice versa]
In: Conférence en Recherche d’Information et Applications, Mar 2015, Paris, France ; https://hal.archives-ouvertes.fr/hal-01226532
BASE
5 |
Thésaurus distributionnels pour la recherche d'information et vice-versa [Distributional thesauri for information retrieval and vice versa]
In: Document Numérique, Lavoisier, 2015, 18 (2-3), ⟨10.3166/DN.18.2-3.101-121⟩ ; ISSN: 1279-5127 ; EISSN: 1963-1014 ; https://hal.archives-ouvertes.fr/hal-01226551
BASE
6 |
Stratégies de sélection des exemples pour l’apprentissage actif avec des champs aléatoires conditionnels [Strategies for selecting examples for active learning with conditional random fields]
In: Actes de la conférence TALN 2015, Jun 2015, Caen, France ; https://hal.archives-ouvertes.fr/hal-01206847
BASE