3 |
A large Portuguese corpus on-line: cleaning and preprocessing
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Annotating the Interaction between Focus and Modality : the case of exclusive particles
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Introducing the Reference Corpus of Contemporary Portuguese On-Line
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Proposal for Multi-word Expression annotation in running text
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Discovering the Language of Wine Reviews: A Text Mining Account
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Modality annotation for Portuguese: from manual annotation to automatic labeling
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Manuscripts and machines: the automatic replacement of spelling variants in a Portuguese historical corpus
|
|
|
|
BASE
|
|
Show details
|
|
15 |
The Gulf of Guinea Creole Corpora
|
|
|
|
Abstract:
We present the process of building linguistic corpora of the Portuguese-related Gulf of Guinea creoles, a cluster of four historically related languages: Santome, Angolar, Principense and Fa d’Ambô. We faced the typical difficulties of languages lacking an official status, such as lack of standard spelling, language variation, lack of basic language instruments, and small data sets, which comprise data from the late 19th century to the present. In order to tackle these problems, the compiled written and transcribed spoken data collected during field work trips were adapted to a normalized spelling that was applied to the four languages. For the corpus compilation we followed corpus linguistics standards. We recorded meta data for each file and added morphosyntactic information based on a part-of-speech tag set that was designed to deal with the specificities of these languages. The corpora of three of the four creoles are already available and searchable via an online web interface. ; info:eu-repo/semantics/publishedVersion
|
|
Keyword:
Corpus annotation and management; Gulf of Guinea creoles; Language documentation
|
|
URL: http://hdl.handle.net/10451/30690
|
|
BASE
|
|
Hide details
|
|
16 |
Towards a unified approach to modality annotation in portuguese
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Enhancing access to online education: quality machine translation of MOOC content
|
|
|
|
In: Kordoni, Valia, van den Bosch, Antal orcid:0000-0003-2493-656X , Kermanidis, Katia Lida orcid:0000-0002-3270-5078 , Sosoni, Vilelmini orcid:0000-0002-9583-4651 , Cholakov, Kostadin, Hendrickx, Iris, Huck, Matthias and Way, Andy orcid:0000-0001-5736-5930 (2016) Enhancing access to online education: quality machine translation of MOOC content. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), 23-28 May 2016, Portorož, Slovenia. ISBN 978-2-9517408-9-1 (2016)
|
|
BASE
|
|
Show details
|
|
|
|