DE eng

Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Indonesian web corpus
MEDVEĎ, MAREK; Suchomel, Vít. - : Masaryk University, NLP Centre, 2019
BASE
Show details
2
SkELL corpora as a part of the language portal Sõnaveeb: problems and perspectives ...
BASE
Show details
3
Automating Dictionary Production: a Tagalog-English-Korean Dictionary from Scratch ...
Abstract: In this paper we present lexicographic work on a Tagalog-English-Korean dictionary. The dictionary is created entirely from scratch and all of its content (besides audio pronunciation) is initially generated fully automatically from a large web corpus that we built for these purposes, and then post-edited by human editors. The full size of the dictionary is 45,000 entries, out of which 15,000 most frequent entries are manually post-edited, while the remaining 30,000 entries are left only as automated. The project is currently ongoing and will be finished in December 2019. The dictionary will be part of the online platform run by the Naver Corporation1 and freely available. ...
Keyword: strategies, tools, standards for lexicographic resources objective 3; WP1; WP4
URL: https://dx.doi.org/10.5281/zenodo.3691445
https://zenodo.org/record/3691445
BASE
Hide details
4
Automating Dictionary Production: a Tagalog-English-Korean Dictionary from Scratch ...
BASE
Show details
5
SkELL corpora as a part of the language portal Sõnaveeb: problems and perspectives ...
BASE
Show details
6
Somali Web Corpus
Suchomel, Vít; Rychlý, Pavel. - : Masaryk University, NLP Centre, 2018
BASE
Show details
7
Oromo web corpus
Suchomel, Vít; Rychlý, Pavel. - : Masaryk University, NLP Centre, 2018
BASE
Show details
8
Amharic Web Corpus
Suchomel, Vít; Rychlý, Pavel. - : Masaryk University, NLP Centre, 2018
BASE
Show details
9
Tigrinya Web Corpus
Suchomel, Vít; Rychlý, Pavel. - : Masaryk University, NLP Centre, 2018
BASE
Show details
10
Indonesian web corpus (idWac)
Medveď, Marek; Suchomel, Vít. - : Natural Language Processing Centre, Faculty of Informatics, Masaryk University, 2018
BASE
Show details
11
Removing spam from web corpora through supervised learning using FastText
Suchomel, Vít [Verfasser]; Bański, Piotr [Herausgeber]; Kupietz, Marc [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2017
DNB Subject Category Language
Show details
12
The Sketch Engine: ten years on
In: Lexicography. Journal of ASIALEX 1 (2014) 1, 7-36
IDS OBELEX meta
Show details
13
HindMonoCorp 0.5
Bojar, Ondřej; Diatka, Vojtěch; Rychlý, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
14
Removing spam from web corpora through supervised learning using FastText [Online resource]
IDS-Repository
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
1
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
11
0
1
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern