
Hits 1 – 12 of 12

1	Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources ...
	Milintsevich, Kirill; Sirts, Kairit. - : arXiv, 2021
	BASE

2	Evaluating Multilingual BERT for Estonian ...
	Kittask, Claudia; Milintsevich, Kirill; Sirts, Kairit. - : arXiv, 2020
	BASE

3	EstBERT: A Pretrained Language-Specific BERT for Estonian ...
	Tanvir, Hasan; Kittask, Claudia; Eiche, Sandra. - : arXiv, 2020
	BASE

4	STransE: a novel embedding model of entities and relationships in knowledge bases ...
	Nguyen, Dat Quoc; Sirts, Kairit; Qu, Lizhen. - : arXiv, 2016
	BASE

5	STransE : a novel embedding model of entities and relationships in knowledge bases
	Nguyen, Dat Quoc; Sirts, Kairit; Qu, Lizhen. - : Red Hook, New York : Association for Computational Linguistics, 2016
	BASE

6	Query-based single document summarization using an Ensemble Noisy Auto-Encoder
	Yousefi Azar, Mahmood; Sirts, Kairit; Molla Aliod, Diego. - : Melbourne, Australia : Association for Computational Linguistics, 2015
	BASE

7	POS induction with distributional and morphological information using a distance-dependent Chinese Restaurant Process
	Sirts, Kairit; Eisenstein, Jacob; Elsner, Micha. - : Stroudsburg, PA : Association for Computational Linguistics, 2014
	BASE

8	Minimally-supervised morphological segmentation using adaptor grammars
	Sirts, Kairit; Goldwater, Sharon. - : Association for Computational Linguistics, 2013
	BASE

9	Noisy-channel spelling correction models for Estonian learner language corpus lemmatisation
	Sirts, Kairit. - : Amsterdam : IOS Press, 2012
	BASE

10	A hierarchical Dirichlet process model for joint part-of-speech and morphology induction
	Alumäe, Tanel; Sirts, Kairit. - : Stroudsburg, PA : Association for Computational Linguistics, 2012
	BASE

11	Korpuste tükeldamine : rakendusi silpide ning allkeeltega ; Cutting the text corpora : applications with syllables and sub-languages
	Sirts, Kairit; Võhandu, Leo. - : Eesti Rakenduslingvistika Ühing = Estonian Association for Applied Linguistics, 2009
	Abstract: In this paper we study different aspects of language by using different cuts of language corpora. There are two particular cuts under observation, which are very different by their nature: mincing the text into syllables for developing a statistical language model and dividing the language into sub-languages for identifying the base vocabulary. Our syllable based statistical language model includes the 500 most frequently observed syllables. It is a three-level model consisting of frequency tables for syllables, syllable pairs and syllable triplets. A frequency table is a matrix with syllables, syllable pairs or syllable triplets in rows and syllables in columns. The numbers in matrix cells show how many times the syllable in the column happened to follow the element in the row. The Estonian pseudo language generator is an application of the syllable based statistical language model. Using the Estonian pseudo language generator it is possible to generate a text which is not fully Estonian, but definitely sounds like one. The purpose of categorizing syllables is to assort the syllables according to their possible locations in a word. We propose an algorithm for automatic syllable grouping using the data in the syllable frequency table. We show experimentally how syllables are grouped into word-initial, word-internal and word-final syllables. Language can be divided into general language using a base vocabulary and different sub-languages, which contain particular terminology. In this paper we discuss the definition of general language. We also propose an automatic algorithm for defining its base vocabulary. ; 16 page(s)
	Keyword: computational linguistics; Estonian; general language; graph representation; language model; sub-languages; syllabification; syllable association; syllable grouping
	URL: http://hdl.handle.net/1959.14/1105040
	BASE
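The three-level frequency model described in the abstract of entry 11 can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `build_tables` and `generate_word`, the sample syllables, and the back-off from triplets to pairs to single syllables are all assumptions made for the example. It also assumes the text has already been split into syllables, and it does not restrict the tables to the 500 most frequent syllables as the paper does.

```python
import random
from collections import Counter, defaultdict


def build_tables(words, max_context=3):
    """Count which syllable follows each context of 1, 2 or 3 syllables.

    `words` is an iterable of words, each given as a list of syllables
    (syllabification itself is assumed to be done elsewhere).
    Returns {n: {context tuple of n syllables: Counter of next syllables}}.
    """
    tables = {n: defaultdict(Counter) for n in range(1, max_context + 1)}
    for syllables in words:
        for i in range(1, len(syllables)):
            for n in range(1, max_context + 1):
                if i - n >= 0:
                    context = tuple(syllables[i - n:i])
                    tables[n][context][syllables[i]] += 1
    return tables


def generate_word(tables, start, length, rng):
    """Generate a pseudo-word of up to `length` syllables from `start`.

    Tries the longest available context first (an assumed back-off
    strategy) and samples the next syllable in proportion to its count.
    Stops early if no table has seen the current context.
    """
    word = [start]
    while len(word) < length:
        for n in sorted(tables, reverse=True):
            followers = tables[n].get(tuple(word[-n:]))
            if followers:
                choices = list(followers)
                weights = list(followers.values())
                word.append(rng.choices(choices, weights=weights)[0])
                break
        else:
            break  # no known continuation for this context
    return "".join(word)


# Hypothetical, hand-syllabified sample data for illustration only.
sample_words = [["ka", "la"], ["ka", "la", "de"], ["ma", "ja"]]
tables = build_tables(sample_words)
print(generate_word(tables, "ka", 3, random.Random(0)))
```

Each entry of `tables[n]` corresponds to one row of the frequency matrix described in the abstract (a syllable, syllable pair or syllable triplet), and each counter key to a column.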

12	Eesti silbisüsteemi struktuurist ; A preliminary structural view of the Estonian syllable system
	Võhandu, Leo; Sirts, Kairit; Aab, Eik. - : Eesti Rakenduslingvistika Ühing, 2008
	BASE
