Hits 1 – 6 of 6

1	Training corpus hr500k 1.0
	Ljubešić, Nikola; Agić, Željko; Klubička, Filip. Jožef Stefan Institute, 2018
	BASE

2	Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian ...
	Klubička, Filip; Toral, Antonio; Sánchez-Cartagena, Víctor M. arXiv, 2018
	BASE

3	Is it worth it? Budget-related evaluation metrics for model selection ...
	Klubička, Filip; Salton, Giancarlo D.; Kelleher, John D. arXiv, 2018
	BASE

4	Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
	Sanchez-Cartagena, Victor Manuel; Toral, Antonio; Klubicka, Filip
	In: Articles (2018)
	BASE

5	Is it worth it? Budget-related evaluation metrics for model selection
	Klubicka, Filip; Salton, Giancarlo; Kelleher, John D.
	In: Conference papers (2018)
	BASE

6	hr500k – A Reference Training Corpus of Croatian.
	Erjavec, Tomaž; Ljubešić, Nikola; Klubicka, Filip; Agić, Željko; Batanović, Vuk
	In: Conference papers (2018)
	Abstract: In this paper we present hr500k, a Croatian reference training corpus of 500 thousand tokens, segmented at document, sentence and word level, and annotated for morphosyntax, lemmas, dependency syntax, named entities, and semantic roles. We present each annotation layer via basic label statistics and describe the final encoding of the resource in CoNLL and TEI formats. We also give a description of the rather turbulent history of the resource and give insights into the topic and genre distribution in the corpus. Finally, we discuss further enrichments of the corpus with additional layers, which are already underway.
	Keyword: annotation; computational linguistics; Croatian; Digital Humanities; linguistic resource; machine learning; reference corpus; Slavic Languages and Societies
	URL: https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1254&context=scschcomcon https://arrow.tudublin.ie/scschcomcon/244
	BASE
