1 |
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
|
|
|
|
In: ISSN: 1751-5858 ; EISSN: 1751-5866 ; International Journal of Intelligent Information and Database Systems ; https://hal.inrae.fr/hal-03616243 ; International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ⟨10.1504/IJIIDS.2022.120146⟩ (2022)
|
|
BASE
|
|
|
|
2 |
Assessing the impact of OCR noise on multilingual event detection over digitised documents
|
|
|
|
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
|
|
BASE
|
|
|
|
3 |
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
|
|
|
|
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
|
|
BASE
|
|
|
|
8 |
Text Mining from Free Unstructured Text: An Experiment of Time Series Retrieval for Volcano Monitoring
|
|
|
|
In: Applied Sciences; Volume 12; Issue 7; Pages: 3503 (2022)
|
|
BASE
|
|
|
|
9 |
Sentence Boundary Extraction from Scientific Literature of Electric Double Layer Capacitor Domain: Tools and Techniques
|
|
|
|
In: Applied Sciences; Volume 12; Issue 3; Pages: 1352 (2022)
|
|
BASE
|
|
|
|
10 |
Analysis of the Full-Size Russian Corpus of Internet Drug Reviews with Complex NER Labeling Using Deep Learning Neural Networks and Language Models
|
|
|
|
In: Applied Sciences; Volume 12; Issue 1; Pages: 491 (2022)
|
|
BASE
|
|
|
|
11 |
Experiences on the Improvement of Logic-Based Anaphora Resolution in English Texts
|
|
|
|
In: Electronics; Volume 11; Issue 3; Pages: 372 (2022)
|
|
BASE
|
|
|
|
13 |
Topic models do not model topics: epistemological remarks and steps towards best practices
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03261599 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, 2021, ⟨10.46298/jdmdh.7595⟩ (2021)
|
|
BASE
|
|
|
|
14 |
Indirectly Named Entity Recognition ; Reconnaissance d'entités indirectement nommées
|
|
|
|
In: ISSN: 2530-9455 ; Journal of Computer-Assisted Linguistic Research (JCLR) ; https://hal.archives-ouvertes.fr/hal-03476411 ; Journal of Computer-Assisted Linguistic Research (JCLR), Universitat Politècnica de València, 2021, 5 (1), pp.27-46. ⟨10.4995/JCLR.2021.15922⟩ ; https://polipapers.upv.es/index.php/jclr/index (2021)
|
|
BASE
|
|
|
|
15 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
|
|
16 |
WEIR-P: An Information Extraction Pipeline for the Wastewater Domain
|
|
Chahinian, Nanée; Bonnabaud La Bruyère, Thierry; Frontini, Francesca; Delenne, Carole; Julien, Marin; Panckhurst, Rachel; Roche, Mathieu; Deruelle, Laurent; Sautot, Lucile; Teissiere, Maguelonne
|
|
In: RCIS 2021 - 5th International Conference on Research Challenges in Information Science ; https://hal.archives-ouvertes.fr/hal-03211461 ; RCIS 2021 - 5th International Conference on Research Challenges in Information Science, May 2021, Virtual, Cyprus (2021)
|
|
Abstract:
We present the MeDO project, aimed at developing resources for text mining and information extraction in the wastewater domain. We developed a specific Natural Language Processing (NLP) pipeline named WEIR-P (WastewatEr InfoRmation extraction Platform) which identifies the entities and relations to be extracted from texts, pertaining to network information, wastewater treatment, accidents and works, organizations, spatio-temporal information, measures and water quality. We present and evaluate the first version of the NLP system which was developed to automate the extraction of the aforementioned annotation from texts and its integration with existing domain knowledge. The preliminary results obtained on the Montpellier corpus are encouraging and show how a mix of supervised and rule-based techniques can be used to extract useful information and reconstruct the various phases of the extension of a given wastewater network. While the NLP and Information Extraction (IE) methods used are state of the art, the novelty of our work lies in their adaptation to the domain, and in particular in the wastewater management conceptual model, which defines the relations between entities. French resources are less developed in the NLP community than English ones. The datasets obtained in this project are another original aspect of this work.
|
|
Keyword:
[INFO]Computer Science [cs]; [SCCO.LING]Cognitive science/Linguistics; [SDU.STU.HY]Sciences of the Universe [physics]/Earth Sciences/Hydrology; Domain adapted systems; Extraction d'information IE; Fouille de données textuelles; Information extraction; NER; NLP; Reconnaissance d'Entités Nommées (REN); Réseaux d'assainissement; TALN Traitement Automatique des Langues Naturelles; Text mining; Wastewater
|
|
URL: https://hal.archives-ouvertes.fr/hal-03211461/document https://hal.archives-ouvertes.fr/hal-03211461 https://hal.archives-ouvertes.fr/hal-03211461/file/RCIS_MeDO.pdf
|
|
BASE
|
|
|
|
17 |
Mapping the evolution of topics published by Education for Information. Interdisciplinary Journal of Information Studies
|
|
|
|
In: ISSN: 0167-8329 ; Education for Information ; https://hal.archives-ouvertes.fr/hal-03392553 ; Education for Information, IOS Press, 2021 (2021)
|
|
BASE
|
|
|
|
18 |
LILLIE : information extraction and database integration using linguistics and learning-based algorithms ...
|
|
|
|
BASE
|
|
|
|
19 |
Exploring Construction of a Company Domain-Specific Knowledge Graph from Financial Texts Using Hybrid Information Extraction
|
|
Jen, Chun-Heng. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021
|
|
BASE
|
|
|
|
|
|