1 |
Towards a part-of-speech tagger for Sranan Tongo ...
|
|
Nicolás, C.V.; Viktor, Z.. - : Фонд содействия развитию интернет-медиа, ИТ-образования, человеческого потенциала "Лига интернет-медиа", 2022
|
|
Abstract:
Abstract—This paper is the continuation of a work submitted to the International Conference Corpus Linguistics 2021 [1]. On that occasion, a rule-based stochastic hybrid part-of-speech tagger (POS) was introduced for Sranan Tongo, a Creole language from South America with around half a million speakers. Since Sranan Tongo does not have a written corpus and text annotation is an expensive and time-consuming task, it was proposed to take a first step in training a POS tagger using only 550 hand-annotated sentences with part of speech tags. In this new contribution, the development of the POS tagger for Sranan Tongo goes a step further with the addition of more training data. For this matter, the tagger was used to annotate 2,406 sentences. The tagging results were hand-corrected and employed to retrain the model. A comparison is shown between the performance of the POS tagger on three texts before and after the inclusion of the new training data. ... : International Journal of Open Information Technologies, Выпуск 12 2022 ...
|
|
Keyword:
Hidden Markov Model; lowresource; part-of-speech tagger; Sranan Tongo
|
|
URL: http://injoit.org/index.php/j1/article/view/1235 https://dx.doi.org/10.25559/injoit.2307-8162.09.202112.99-103
|
|
BASE
|
|
Hide details
|
|
2 |
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.3
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Enhancing Communication Reliability from the Semantic Level under Low SNR
|
|
|
|
In: Electronics; Volume 11; Issue 9; Pages: 1358 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Leveraging Part-of-Speech Tagging Features and a Novel Regularization Strategy for Chinese Medical Named Entity Recognition
|
|
|
|
In: Mathematics; Volume 10; Issue 9; Pages: 1386 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Tagged Corpus of Early English Correspondence Extension Sampler (TCEECES) ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Tagged Corpus of Early English Correspondence Extension Sampler (TCEECES) ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Easy-to-use combination of POS and BERT model for domain-specific and misspelled terms
|
|
|
|
In: NL4IA Workshop Proceedings ; https://hal.archives-ouvertes.fr/hal-03474696 ; NL4IA Workshop Proceedings, Nov 2021, Milan, Italy (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Hierarchical-Task Reservoir for Online Semantic Analysis from Continuous Speech
|
|
|
|
In: ISSN: 2162-237X ; IEEE Transactions on Neural Networks and Learning Systems ; https://hal.inria.fr/hal-03031413 ; IEEE Transactions on Neural Networks and Learning Systems, IEEE, 2021, ⟨10.1109/TNNLS.2021.3095140⟩ ; https://ieeexplore.ieee.org/abstract/document/9548713/metrics#metrics (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Annotated Corpus of Pre-Standardized Balkan Slavic Literature 1.1
|
|
Šimko, Ivan. - : Slavic Seminary, University of Zurich, 2021
|
|
BASE
|
|
Show details
|
|
13 |
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.2
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Developing Core Technologies for Resource-Scarce Nguni Languages
|
|
|
|
In: Information; Volume 12; Issue 12; Pages: 520 (2021)
|
|
BASE
|
|
Show details
|
|
20 |
A Comparative Study of Arabic Part of Speech Taggers Using Literary Text Samples from Saudi Novels
|
|
|
|
In: Information; Volume 12; Issue 12; Pages: 523 (2021)
|
|
BASE
|
|
Show details
|
|
|
|