DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...289
Hits 1 – 20 of 5.761

1
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
BASE
Show details
2
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
In: ISSN: 1751-5858 ; EISSN: 1751-5866 ; International Journal of Intelligent Information and Database Systems ; https://hal.inrae.fr/hal-03616243 ; International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ⟨10.1504/IJIIDS.2022.120146⟩ (2022)
BASE
Show details
3
Obvie: interface web pour la fouille et la comparaison de textes
In: Atelier DigitAl Humanities and cuLtural herItAge: data and knowledge management and analysis durant la conférence francophone sur l'Extraction et la Gestion des Connaissances (egc2022) ; https://hal.archives-ouvertes.fr/hal-03543362 ; Atelier DigitAl Humanities and cuLtural herItAge: data and knowledge management and analysis durant la conférence francophone sur l'Extraction et la Gestion des Connaissances (egc2022), Jan 2022, Blois, France ; https://egc2022.univ-tours.fr/ateliers/ (2022)
BASE
Show details
4
Preprint Citation Praxis in PLOS
In: ISSN: 0138-9130 ; EISSN: 1588-2861 ; Scientometrics ; https://hal.archives-ouvertes.fr/hal-03506094 ; In press (2022)
BASE
Show details
5
Islands and Bridges of Language: Bio-Inspired Structural Analysis of Language Embedding Data
Zhou, Hongwei. - : eScholarship, University of California, 2022
BASE
Show details
6
Assessing the impact of OCR noise on multilingual event detection over digitised documents
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
BASE
Show details
7
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
BASE
Show details
8
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
In: Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021) ; https://hal.inria.fr/hal-03527328 ; Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021), Jan 2022, punta cana, Dominican Republic ; https://aclanthology.org/2021.wnut-1.47/ (2022)
Abstract: International audience ; Recent impressive improvements in NLP, largely based on the success of contextual neural language models, have been mostly demonstrated on at most a couple dozen high-resource languages. Building language models and, more generally, NLP systems for non-standardized and low-resource languages remains a challenging task. In this work, we focus on North-African colloquial dialectal Arabic written using an extension of the Latin script, called NArabizi, found mostly on social media and messaging communication. In this low-resource scenario with data displaying a high level of variability, we compare the downstream performance of a character-based language model on part-of-speech tagging and dependency parsing to that of monolingual and multilingual models. We show that a character-based model trained on only 99k sentences of NArabizi and fined-tuned on a small treebank of this language leads to performance close to those obtained with the same architecture pre-trained on large multilingual and monolingual models. Confirming these results a on much larger data set of noisy French user-generated content, we argue that such character-based language models can be an asset for NLP in low-resource and high language variability set-tings.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
URL: https://hal.inria.fr/hal-03527328
BASE
Hide details
9
Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)
In: ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora ; https://hal.archives-ouvertes.fr/hal-03623351 ; ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora, Jun 2022, Marseille, France ; https://www.clarin.eu/ParlaCLARIN-III (2022)
BASE
Show details
10
Unsupervised quantification of entity consistency between photos and text in real-world news ...
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
BASE
Show details
11
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
In: Journal of Open Humanities Data; Vol 8 (2022); 3 ; 2059-481X (2022)
BASE
Show details
12
Cross-media Scientific Research Achievements Query based on Ranking Learning ...
Wang, Benzhi; Liang, Meiyu; Li, Ang. - : arXiv, 2022
BASE
Show details
13
Reproducibility of the Experimental Result of BERT for Evidence Retrieval and Claim Verification ...
BASE
Show details
14
Reproducibility of the Experimental Result of BERT for Evidence Retrieval and Claim Verification ...
BASE
Show details
15
Exploring Sub-skeleton Trajectories for Interpretable Recognition of Sign Language ...
BASE
Show details
16
Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive Approach Using Transformers ...
Vitiugin, Fedor; Castillo, Carlos. - : arXiv, 2022
BASE
Show details
17
Simplifying Multilingual News Clustering Through Projection From a Shared Space ...
BASE
Show details
18
Towards Best Practices for Training Multilingual Dense Retrieval Models ...
BASE
Show details
19
Addressing Issues of Cross-Linguality in Open-Retrieval Question Answering Systems For Emergent Domains ...
BASE
Show details
20
C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval ...
BASE
Show details

Page: 1 2 3 4 5...289

Catalogues
303
39
157
0
1
5
0
Bibliographies
1.665
1
0
0
0
0
0
1
2
Linked Open Data catalogues
0
Online resources
28
0
0
0
Open access documents
3.925
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern