4 |
Resources and Tools for Automated Speech Segmentation of the African Language Naija (Nigerian Pidgin)
|
|
|
|
In: Human Language Technology. Challenges for Computer Science and Linguistics. 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers ; https://halshs.archives-ouvertes.fr/halshs-03097325 ; Vetulani, Z; Paroubek, P. Human Language Technology. Challenges for Computer Science and Linguistics. 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers, 12598, Springer, pp.164-173, 2020, Human Language Technology. Challenges for Computer Science and Linguistics (2020)
|
|
BASE
|
|
Show details
|
|
7 |
A Surface-Syntactic UD Treebank for Naija
|
|
|
|
In: TLT 2019, Treebanks and Linguistic Theories, Syntaxfest ; https://hal.archives-ouvertes.fr/hal-02270530 ; TLT 2019, Treebanks and Linguistic Theories, Syntaxfest, Aug 2019, Paris, France (2019)
|
|
Abstract:
International audience ; This paper presents a syntactic treebank for spoken Naija, an English pidgincreole, which is rapidly spreading across Nigeria. The syntactic annotation is developed in the Surface-Syntactic Universal Dependency annotation scheme (SUD) (Gerdes et al., 2018) and automatically converted into UD. We present the workflow of the treebank development for this under-resourced language. A crucial step in the syntactic analysis of a spoken language consists in manually adding a markup onto the transcription, indicating the segmentation into major syntactic units and their internal structure. We show that this so-called "macrosyntactic" markup improves parsing results. We also study some iconic syntactic phenomena that clearly distinguish Naija from English.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics
|
|
URL: https://hal.archives-ouvertes.fr/hal-02270530 https://hal.archives-ouvertes.fr/hal-02270530/file/syntaxfest.A%20Surface-Syntactic%20UD%20Treebank%20for%20Naija.pdf https://hal.archives-ouvertes.fr/hal-02270530/document
|
|
BASE
|
|
Hide details
|
|
10 |
Chapter 6. Macrosyntactic corpus annotation. The case of Zaar
|
|
|
|
In: Information structure in lesser-described languages: Studies in prosody and syntax ; https://halshs.archives-ouvertes.fr/halshs-01958333 ; Evangelia Adamou; Katharina Haude; Martine Vanhove. Information structure in lesser-described languages: Studies in prosody and syntax, 199, Benjamins, pp.157-192, 2018, Studies in Language Companion Series, ⟨10.1075/slcs.199.06car⟩ ; https://benjamins.com/catalog/slcs.199.06car (2018)
|
|
BASE
|
|
Show details
|
|
11 |
Macrosyntactic corpus annotation. The case of Zaar
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-01701816 ; 2018 (2018)
|
|
BASE
|
|
Show details
|
|
12 |
Establishing a Language by Annotating a Corpus ; Establishing a Language by Annotating a Corpus: The Case of Naija, a Post-creole Spoken in Nigeria
|
|
|
|
In: annDH 2018 Annotation in Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-01958330 ; annDH 2018 Annotation in Digital Humanities, Aug 2018, Sofia, Bulgaria. pp.7-11 ; http://ceur-ws.org/Vol-2155/ (2018)
|
|
BASE
|
|
Show details
|
|
13 |
Universal Dependencies 2.2
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01930733 ; 2018 (2018)
|
|
BASE
|
|
Show details
|
|
|
|