2 |
Joint learning of morphology and syntax with cross-level contextual information flow
In: 2022 ; 1 ; 33 (2022)
BASE
3 |
Developing Core Technologies for Resource-Scarce Nguni Languages
In: Information; Volume 12; Issue 12; Pages: 520 (2021)
Abstract:
The creation of linguistic resources is crucial to the continued growth of research and development efforts in the field of natural language processing, especially for resource-scarce languages. In this paper, we describe the curation and annotation of corpora and the development of multiple linguistic technologies for four official South African languages, namely isiNdebele, Siswati, isiXhosa, and isiZulu. Development efforts included sourcing parallel data for these languages and annotating each at the token, orthographic, morphological, and morphosyntactic levels. These sets were in turn used to create and evaluate three core technologies for each of the languages, viz. a lemmatizer, a part-of-speech tagger, and a morphological analyzer. We report on the quality of these technologies, which improve on the rule-based technologies previously developed as part of a similar initiative in 2013. These resources are made publicly accessible through a local resource agency with the intention of fostering further development of both resources and technologies that may benefit the NLP industry in South Africa.
Keywords:
canonical segmentation; core technologies; lemmatization; morphological analysis; part-of-speech tagging; resource-scarce languages; South African languages
URL: https://doi.org/10.3390/info12120520
4 |
Incorporating word embeddings in unsupervised morphological segmentation
In: 2020 ; 1 ; 21 (2020)
5 |
Extending adaptor grammars to learn phonological alternations
In: Proceedings of the Society for Computation in Linguistics (2020)
6 |
Script Independent Morphological Segmentation for Arabic Maghrebi Dialects: An Application to Machine Translation
In: ISSN: 1405-5546 ; EISSN: 2007-9737 ; Computación y sistemas ; https://hal.archives-ouvertes.fr/hal-02274533 ; Computación y sistemas, Instituto Politécnico Nacional IPN Centro de Investigación en Computación, In press, 23 (3), pp.979-989. ⟨10.13053/cys-23-3-3267⟩ (2019)
7 |
LSTM Ağları ile Türkçe Kök Bulma ; Stemming Turkish Words with LSTM Networks
In: 12 ; 3 ; 183 ; 193 (2019)
8 |
When is a corner like corn? Morpho-orthographic segmenting skills in children who struggle with reading
9 |
Data-Driven Identification of German Phrasal Compounds
In: Text, Speech, and Dialogue ; https://hal.archives-ouvertes.fr/hal-01575651 ; Kamil Ekštein; Václav Matoušek. Text, Speech, and Dialogue, 10415, Springer International Publishing, pp.192-200, 2017, Lecture Notes in Computer Science, 978-3-319-64205-5. ⟨10.1007/978-3-319-64206-2_22⟩ ; https://link.springer.com/bookseries/558 (2017)
10 |
Modeling morpheme triplets with a three-level hierarchical Dirichlet process
In: 366 ; 369 (2017)
11 |
Automatic processing of Tunisian dialect: construction of linguistic resources ; TRAITEMENT AUTOMATIQUE DU DIALECTE TUNISIEN : CONSTRUCTION DE RESSOURCES LINGUISTIQUES
In: https://hal.archives-ouvertes.fr/tel-02869866 ; Informatique et langage [cs.CL]. Université de Sfax (Tunisie), 2016. Français (2016)
12 |
НАЦИОНАЛЬНЫЙ КОРПУС КАЛМЫЦКОГО ЯЗЫКА: ИТОГИ РАБОТЫ И ПЕРСПЕКТИВЫ ; National Corpus of the Kalmyk Language: Results and Prospects
14 |
Processing of Compound Terms: Segmentation, Translation and Variation ; Traitement automatique des termes composés : segmentation, traduction et variation
In: https://hal.archives-ouvertes.fr/tel-01116104 ; Traitement du texte et du document. Université de Nantes, 2014. Français (2014)
15 |
Methods and algorithms for unsupervised learning of morphology
In: 8403 ; 177 ; 205 (2014)
16 |
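Traduction statistique vers une langue à morphologie riche : combinaison d’algorithmes de segmentation morphologique et de modèles statistiques de traduction automatique ; Statistical translation into a morphologically rich language: combining morphological segmentation algorithms and statistical machine translation models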
17 |
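Traduction statistique vers une langue à morphologie riche : combinaison d’algorithmes de segmentation morphologique et de modèles statistiques de traduction automatique ; Statistical translation into a morphologically rich language: combining morphological segmentation algorithms and statistical machine translation models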
18 |
Inter-speaker speech variability assessment using statistical deformable models from 3.0 Tesla magnetic resonance images
19 |
'Fell' primes 'fall', but does 'bell' prime 'ball'? Masked priming with irregularly-inflected primes
In: Journal of Memory and Language, 63 (1) (2010)
20 |
Is morphological decomposition limited to low-frequency words?
In: Quarterly Journal of Experimental Psychology, 62 (9) (2009)