1 |
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
|
|
|
|
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Cross-media Scientific Research Achievements Query based on Ranking Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
GENOME: A GENeric methodology for Ontological Modelling of Epics ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Quantifying knowledge synchronisation in the 21st century ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
EMAKG: An Enhanced Version Of The Microsoft Academic Knowledge Graph ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Topic models do not model topics: epistemological remarks and steps towards best practices
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03261599 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, 2021, ⟨10.46298/jdmdh.7595⟩ (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Topic models do not model topics: epistemological remarks and steps towards best practices
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03261599 ; 2021 (2021)
|
|
Abstract:
The social sciences and digital humanities have recently adopted the machine learning technique of topic modeling to address research questions in their fields. This is problematic in a number of ways, some of which have not received much attention in the debate yet. This paper adds epistemological concerns centering around the interface between topic modeling and linguistic concepts and the argumentative embedding of evidence obtained through topic modeling. It concludes that topic modeling in its present state of methodological integration does not meet the requirements of an independent research method. It operates from relevantly unrealistic assumptions, is non-deterministic, cannot effectively be validated against a reasonable number of competing models, does not lock into a well-defined linguistic interface, and does not scholarly model topics in the sense of themes or content. These features are intrinsic and make the interpretation of its results prone to apophenia (the human tendency to perceive random sets of elements as meaningful patterns) and confirmation bias (the human tendency to perceptually prefer patterns that are in alignment with pre-existing biases). While partial validation of the statistical model is possible, a conceptual validation would require an extended triangulation with other methods and human ratings, and clarification of whether statistical distinctivity of lexical co-occurrence correlates with conceputal topics in any reliable way.
|
|
Keyword:
[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS]Humanities and Social Sciences
|
|
URL: https://hal.archives-ouvertes.fr/hal-03261599 https://hal.archives-ouvertes.fr/hal-03261599/document https://hal.archives-ouvertes.fr/hal-03261599/file/topic_models_do_not_model_topics_final_draft.pdf
|
|
BASE
|
|
Hide details
|
|
9 |
Topic models do not model topics: epistemological remarks and steps towards best practices
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03261599 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
10 |
MOR Digital: The Advent of a New Lexicographical Portuguese Project
|
|
|
|
In: eLex 2021 - Seventh biennial conference on electronic lexicography ; https://hal.inria.fr/hal-03195362 ; eLex 2021 - Seventh biennial conference on electronic lexicography, Jul 2021, Brno, Czech Republic ; https://elex.link/elex2021/ (2021)
|
|
BASE
|
|
Show details
|
|
11 |
État de l'art du changement sémantique à partir de plongements contextualisés
|
|
|
|
In: COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference ; https://hal.archives-ouvertes.fr/hal-03320337 ; COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtuel), France (2021)
|
|
BASE
|
|
Show details
|
|
12 |
MORDigital: The Advent of a New Lexicographical Portuguese Project
|
|
|
|
In: eLex 2021 - Seventh biennial conference on electronic lexicography ; https://hal.inria.fr/hal-03195362 ; eLex 2021 - Seventh biennial conference on electronic lexicography, Jul 2021, Brno, Czech Republic ; https://elex.link/elex2021/ (2021)
|
|
BASE
|
|
Show details
|
|
13 |
From the Formal Definition of Concept to the Linguistic Definition of Term ; De la définition formelle du concept à la définition en langue du terme
|
|
|
|
In: ISSN: 2299-7164 ; EISSN: 2353-3218 ; Academic Journal of Modern Philology ; https://hal.archives-ouvertes.fr/hal-03549751 ; Academic Journal of Modern Philology, Uniwersytet Wroctawski, 2021, ⟨10.34616/ajmp.2021.13⟩ (2021)
|
|
BASE
|
|
Show details
|
|
14 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Multilingual Epidemic Event Extraction
|
|
|
|
In: Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings ; https://hal.archives-ouvertes.fr/hal-03480551 ; Hao-Ren Ke; Chei Sian Lee; Kazunari Sugiyama. Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings, 13133, Springer, pp.139-156, 2021, Lecture Notes in Computer Science, 978-3-030-91668-8. ⟨10.1007/978-3-030-91669-5_12⟩ (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie
|
|
|
|
In: COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference ; https://hal.archives-ouvertes.fr/hal-03320343 ; COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtuel), France (2021)
|
|
BASE
|
|
Show details
|
|
17 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
|
|
|
|
In: SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ; https://hal.archives-ouvertes.fr/hal-03418387 ; SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2328-2334, ⟨10.1145/3404835.3463255⟩ (2021)
|
|
BASE
|
|
Show details
|
|
18 |
From disparate disciplines to unity in diversity How the PARTHENOS project has brought European humanities Research Infrastructures together
|
|
|
|
In: ISSN: 1753-8548 ; EISSN: 1755-1706 ; International Journal of Humanities and Arts Computing ; https://hal.inria.fr/hal-03402145 ; International Journal of Humanities and Arts Computing, Edinburgh University Press, 2021, 15 (1-2), pp.101-116. ⟨10.3366/ijhac.2021.0264⟩ (2021)
|
|
BASE
|
|
Show details
|
|
19 |
Topic modelling on archive documents from the 1970s: global policies on refugees
|
|
|
|
In: ISSN: 2055-7671 ; EISSN: 2055-768X ; Digital Scholarship in the Humanities ; https://hal.archives-ouvertes.fr/hal-03435806 ; Digital Scholarship in the Humanities, Oxford University Press, 2021, 36 (4), pp.886-904. ⟨10.1093/llc/fqab018⟩ (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Identification et gestion des données personnelles dans les textes ; Identification et gestion des données personnelles dans les textes: modèle sémantique et applications
|
|
|
|
In: CiDE.22 : 22éme édition du Colloque International sur le Document Electronique Données Documents Connaissances : Perspectives de recherche et d’enseignement ; https://hal.archives-ouvertes.fr/hal-03506075 ; CiDE.22 : 22éme édition du Colloque International sur le Document Electronique Données Documents Connaissances : Perspectives de recherche et d’enseignement, Dec 2021, Paris, France (2021)
|
|
BASE
|
|
Show details
|
|
|
|