8 |
Universal Dependencies 2.2
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01930733 ; 2018 (2018)
|
|
BASE
|
|
Show details
|
|
11 |
Universal Dependencies 2.1
|
|
|
|
In: https://hal.inria.fr/hal-01682188 ; 2017 (2017)
|
|
BASE
|
|
Show details
|
|
14 |
Universal Dependencies 2.0 – CoNLL 2017 Shared Task Development and Test Data
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Maximum Entropy Approach to Sentence Boundary Detection of Vietnamese Texts
|
|
|
|
In: http://hal.inria.fr/docs/00/33/47/62/PDF/rivf2008.pdf (2008)
|
|
BASE
|
|
Show details
|
|
18 |
A hybrid approach to word segmentation of Vietnamese texts
|
|
|
|
In: http://hal.inria.fr/docs/00/33/47/61/PDF/LATA039.pdf (2008)
|
|
Abstract:
Abstract We present in this article a hybrid approach to automatically tokenize Vietnamese text. The approach combines both finite-state automata technique, regular expression parsing and the maximal-matching strategy which is augmented by statistical methods to resolve ambiguities of segmentation. The Vietnamese lexicon in use is compactly represented by a minimal finite-state automaton. A text to be tokenized is first parsed into lexical phrases and other patterns using pre-defined regular expressions. The automaton is then deployed to build linear graphs corresponding to the phrases to be segmented. The application of a maximalmatching strategy on a graph results in all candidate segmentations of a phrase. It is the responsibility of an ambiguity resolver, which uses a smoothed bigram language model, to choose the most probable segmentation of the phrase. The hybrid approach is implemented to create vnTokenizer, a highly accurate tokenizer for Vietnamese texts. 1
|
|
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.233.4431 http://hal.inria.fr/docs/00/33/47/61/PDF/LATA039.pdf
|
|
BASE
|
|
Hide details
|
|
19 |
Word Segmentation of Vietnamese Texts: a comparison of approaches. LREC : 2008
|
|
|
|
In: http://lamda.nju.edu.cn/nguyenct/files/papers/thang493_paper.pdf (2008)
|
|
BASE
|
|
Show details
|
|
20 |
Author manuscript, published in "N/P" Word segmentation of Vietnamese texts: a comparison of approaches
|
|
|
|
In: http://hal.inria.fr/docs/00/33/47/60/PDF/lrec2008final.pdf
|
|
BASE
|
|
Show details
|
|
|
|