1 |
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
|
|
|
|
In: https://hal.inria.fr/hal-03536361 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Lexicographic Data Seal of Compliance
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03344267 ; [Research Report] ELEXIS; DARIAH. 2021 (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Building, Encoding, and Annotating a Corpus of Parliamentary Debates in XML-TEI: A Cross-Linguistic Account
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03097333 ; 2020 (2020)
|
|
BASE
|
|
Show details
|
|
4 |
CamemBERT: a Tasty French Language Model
|
|
|
|
In: https://hal.inria.fr/hal-02445946 ; 2019 (2019)
|
|
BASE
|
|
Show details
|
|
5 |
From disparate disciplines to unity in diversity. How the PARTHENOS project brings Humanities Research Infrastructures together ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
From disparate disciplines to unity in diversity. How the PARTHENOS project brings Humanities Research Infrastructures together ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Automatic TEI encoding of manuscripts catalogues with GROBID-Dictionaries ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Automatic TEI encoding of manuscripts catalogues with GROBID-Dictionaries ...
|
|
|
|
Abstract:
Manuscript Sales Catalogues (MSC) are highly important for authenticating documents and studying the reception of authors. Their regular publication throughout Europe since the beginning of the 19th c. has consequently raised the interest around scaling up the means for automatically structuring their contents. Following successful first encoding tests with GROBID-Dictionaries on a single MSC collection, we aim in this paper to present the results of more advanced tests of the system’s capacity to handle a larger corpus with MSC of different dealers, and therefore multiple layouts. Four different types of catalogues published between the middle of the 19th c. and the beginning of the 20th c. have been tested. ... : {"references": ["Mohamed Khemakhem, Laurent Romary, Simon Gabay, Herv\u00e9 Bohbot, Francesca Frontini, et al.. Automatically Encoding Encyclopedic-like Resources in TEI. The annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan.", "Mohamed Khemakhem, Luca Foppiano, Laurent Romary. Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields. electronic lexicography, eLex 2017, Sep 2017, Leiden, Netherlands.", "Mohamed Khemakhem, Axel Herold, Laurent Romary. Enhancing Usability for Automatically Structuring Digitised Dictionaries. GLOBALEX workshop at LREC 2018, May 2018, Miyazaki, Japan. 2018."]} ...
|
|
Keyword:
19th c. France; Machine learning; manuscript sales catalogues
|
|
URL: https://zenodo.org/record/3383658 https://dx.doi.org/10.5281/zenodo.3383658
|
|
BASE
|
|
Hide details
|
|
10 |
Open Access in Japan – a multi-institutional perspective
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01290936 ; [Research Report] Ambassade de France au Japon. 2016 (2016)
|
|
BASE
|
|
Show details
|
|
12 |
IPERION CH Data Management Plan
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02139658 ; [Research Report] D 2.1, Inria. 2015 (2015)
|
|
BASE
|
|
Show details
|
|
15 |
[Tiger2/] Documentation
|
|
|
|
In: https://hal.inria.fr/inria-00593903 ; [Technical Report] 2010 (2010)
|
|
BASE
|
|
Show details
|
|
16 |
HANDLING MULTILINGUAL CONTENT IN DIGITAL MEDIA: A CRITICAL ANALYSIS
|
|
|
|
In: https://hal.inria.fr/inria-00001120 ; [Research Report] 2006, pp.60 (2006)
|
|
BASE
|
|
Show details
|
|
17 |
Unification of multi-lingual scientific terminological resources using the ISO 16642 standard. The TermSciences initiative ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Towards Multimodal Content Representation
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-00323338 ; 2002 (2002)
|
|
BASE
|
|
Show details
|
|
19 |
The ELAN Architecture ; The ELAN Architecture: ELAN Deliverables WP3
|
|
|
|
In: https://hal.inria.fr/hal-01875371 ; [Contract] Deliverables D3.1-1 and D3.2-1, Inria. 1999 (1999)
|
|
BASE
|
|
Show details
|
|
20 |
A cognitive model for the representation of time in a man-machine dialogue.
|
|
|
|
In: https://hal.inria.fr/hal-00721871 ; 1989 (1989)
|
|
BASE
|
|
Show details
|
|
|
|