1 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
L3i at SemEval-2022 Task 11: Straightforward Additional Context for Multilingual Named Entity Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
L3i at SemEval-2022 Task 11: Straightforward Additional Context for Multilingual Named Entity Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Stratégie Multitâche pour la Classification Multiclasse
|
|
|
|
In: à paraître : Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles (TALN 2021) ; https://hal.archives-ouvertes.fr/hal-03265870 ; Traitement Automatique des Langues Naturelles (TALN 2021), 2021, Lille, France. pp.227-236 ; https://talnrecital2021.inria.fr/articles-acceptes/ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Understanding Feature Focus in Multitask Settings for Lexico-semantic Relation Identification
|
|
|
|
In: à paraître ; Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03220236 ; Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), ACL : Association for Computational Linguistics; Asian Federation of Natural Language Processing, Aug 2021, Bangkok (complete virtual format), Thailand ; https://2021.aclweb.org/ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
Show details
|
|
7 |
MELHISSA: a multilingual entity linking architecture for historical press articles ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
MELHISSA: a multilingual entity linking architecture for historical press articles ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
A Multilingual Approach for Unsupervised Search Task Identification
|
|
|
|
In: SIGIR '20: Proceeding of the 43rd International ACM SIGIR conference on research and development in Information Retrieval ; 43rd International ACM SIGIR conference on research and development in Information Retrieval - SIGIR 2020 ; https://hal.archives-ouvertes.fr/hal-03014724 ; 43rd International ACM SIGIR conference on research and development in Information Retrieval - SIGIR 2020, Jul 2020, Virtual Event China, China. pp.2041-2044, ⟨10.1145/3397271.3401258⟩ ; https://dl.acm.org/doi/10.1145/3397271.3401258 (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Entity Linking for Historical Documents: Challenges and Solutions
|
|
Pontes, Elvys Linhares; Cabrera-Diego, Luis Adrián; Moreno, José G.; Boros, Emanuela; Pontes, Elvys,; Hamdi, Ahmed; Sidère, Nicolas; Coustaty, Mickaël; Doucet, Antoine
|
|
In: 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020 ; https://hal.archives-ouvertes.fr/hal-03034492 ; 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020, 12504, Springer, pp.215-231, 2020, Lecture Notes in Computer Science, 978-3-030-64452-9. ⟨10.1007/978-3-030-64452-9_19⟩ (2020)
|
|
Abstract:
International audience ; Named entities (NEs) are among the most relevant type of information that can be used to efficiently index and retrieve digital documents. Furthermore, the use of Entity Linking (EL) to disambiguate and relate NEs to knowledge bases, provides supplementary information which can be useful to differentiate ambiguous elements such as geographical locations and peoples' names. In historical documents, the detection and disambiguation of NEs is a challenge. Most historical documents are converted into plain text using an optical character recognition (OCR) system at the expense of some noise. Documents in digital libraries will, therefore, be indexed with errors that may hinder their accessibility. OCR errors affect not only document indexing but the detection, disambiguation, and linking of NEs. This paper aims at analysing the performance of different EL approaches on two multilingual historical corpora, CLEF HIPE 2020 (English, French, German) and NewsEye (Finnish, French, German, Swedish), while proposes several techniques for alleviating the impact of historical data problems on the EL task. Our findings indicate that the proposed approaches not only outperform the baseline in both corpora but additionally they considerably reduce the impact of historical document issues on different subjects and languages.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; Deep learning; Digital libraries; Entity linking; Historical data
|
|
URL: https://doi.org/10.1007/978-3-030-64452-9_19 https://hal.archives-ouvertes.fr/hal-03034492/document https://hal.archives-ouvertes.fr/hal-03034492/file/ICADL_2020___12_14_pages___references.pdf https://hal.archives-ouvertes.fr/hal-03034492
|
|
BASE
|
|
Hide details
|
|
15 |
Robust Named Entity Recognition and Linking on Historical Multilingual Documents
|
|
|
|
In: Conference and Labs of the Evaluation Forum (CLEF 2020) ; https://hal.archives-ouvertes.fr/hal-03026969 ; Conference and Labs of the Evaluation Forum (CLEF 2020), Sep 2020, Thessaloniki, Greece. pp.1-17, ⟨10.5281/zenodo.4068074⟩ ; http://ceur-ws.org/Vol-2696/paper_171.pdf (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Linking Named Entities across Languages using Multilingual Word Embeddings
|
|
|
|
In: JCDL '20: The ACM/IEEE Joint Conference on Digital Libraries in 2020 ; ACM/IEEE Joint Conference on Digital Libraries - JCDL 2020 ; https://hal.archives-ouvertes.fr/hal-03026933 ; ACM/IEEE Joint Conference on Digital Libraries - JCDL 2020, Aug 2020, Wuhan, Hubei - Virtual event, China. pp.329-332, ⟨10.1145/3383583.3398597⟩ ; https://dl.acm.org/doi/10.1145/3383583.3398597 (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Knowledge Base Embedding By Cooperative Knowledge Distillation
|
|
|
|
In: Proceedings of the 28th International Conference on Computational Linguistics, ; International Conference on Computational Linguistics (COLING 2020) ; https://hal.archives-ouvertes.fr/hal-03172074 ; International Conference on Computational Linguistics (COLING 2020), Dec 2020, Barcelone (on line), Spain. pp.5579-5590, ⟨10.18653/v1/2020.coling-main.489⟩ ; https://www.aclweb.org/anthology/2020.coling-main.489/ (2020)
|
|
BASE
|
|
Show details
|
|
18 |
Robust Named Entity Recognition and Linking on Historical Multilingual Documents ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Robust Named Entity Recognition and Linking on Historical Multilingual Documents ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Linking Named Entities across Languages using Multilingual Word Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|