4 |
Chinese Web 5-gram Version 1 ...
Abstract:
Introduction

Chinese Web 5-gram Version 1, Linguistic Data Consortium (LDC) catalog number LDC2010T06 and ISBN 1-58563-539-1, was created by researchers at Google Inc. It consists of Chinese word n-grams and their observed frequency counts, generated from over 800 billion tokens of text. The n-grams range in length from unigrams (single words) to 5-grams. The data should be useful for statistical language modeling (e.g., segmentation, machine translation) as well as for other purposes. Included with this publication is a simple segmenter, written in Perl, that uses the same algorithm used to generate the data.

Data Collection

N-gram counts were generated from approximately 883 billion word tokens of text from publicly accessible web pages. The data set contains only n-grams that appeared at least 40 times in the processed sentences; less frequent n-grams were discarded. While the aim was to identify and collect only Chinese-language pages, some text from other languages ...
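The counting scheme described above (all n-grams up to length 5, with a minimum-frequency cutoff of 40) can be sketched in a few lines. This is only an illustrative reconstruction, not Google's actual pipeline; the function name `ngram_counts` and its parameters are hypothetical, and a real run over web-scale text would stream counts rather than hold them in memory.

```python
from collections import Counter

def ngram_counts(tokens, max_n=5, min_count=1):
    """Count all n-grams of length 1..max_n over a token list,
    then drop n-grams observed fewer than min_count times.
    (Illustrative sketch; the LDC data used min_count=40.)"""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return {gram: c for gram, c in counts.items() if c >= min_count}
```

For example, over a toy segmented sentence such as `["我", "爱", "你", "我", "爱"]` with `max_n=2`, the unigram `("我",)` and the bigram `("我", "爱")` each get count 2, and setting `min_count=2` would discard the singleton bigram `("爱", "你")`, mirroring the 40-occurrence threshold applied to the real data.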
DOI: https://dx.doi.org/10.35111/647p-yt29
Catalog: https://catalog.ldc.upenn.edu/LDC2010T06
5 |
Large-scale semi-supervised learning for natural language processing
Bergsma, Shane A. - University of Alberta, Department of Computing Science, 2010