1 |
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
|
|
Nicolas, Lionel; Lyding, Verena; Borg, Claudia; Forascu, Corina; Fort, Karën; Zdravkova, Katerina; Kosem, Iztok; Cibej, Jaka; Holdt, Špela; Millour, Alice; König, Alexander; Rodosthenous, Christos; Sangati, Federico; Hassan, Umair ul; Katinskaia, Anisia; Barreiro, Anabela; Aparaschivei, Lavinia; Hacohen-Kerner, Yaakov
|
|
In: LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France ; https://hal.inria.fr/hal-02879883
|
|
Abstract:
We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on ongoing proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of the generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.
|
|
Keywords:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Collaborative Resource Construction; Computer-Assisted Language Learning; COST Action; Crowdsourcing
|
|
URL: https://hal.inria.fr/hal-02879883/document https://hal.inria.fr/hal-02879883 https://hal.inria.fr/hal-02879883/file/EnetCollect___LREC_2020.pdf
|
|
BASE
|
|
2 |
Substituto - A Synchronous Educational Language Game for Simultaneous Teaching and Crowdsourcing
|
|
|
|
In: 9th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2020), Nov 2020, Gothenburg, Sweden, pp. 1-9, ⟨10.3384/ecp201759⟩ ; https://hal.inria.fr/hal-03114898 ; https://www.aclweb.org/anthology/volumes/2020.nlp4call-1/
|
|
BASE
|
|
3 |
The FAIR Index of CMC Corpora
|
|
|
|
In: CMC Corpora through the prism of Digital Humanities, 2020 ; https://hal.archives-ouvertes.fr/hal-03121698
|
|
BASE
|
|
4 |
TRIPLE Deliverable: D2.2 Data Harvesting Best Practices Document for Data Providers ...
|
|
|
|
BASE
|
|
5 |
TRIPLE Deliverable: D2.2 Data Harvesting Best Practices Document for Data Providers ...
|
|
|
|
BASE
|
|
6 |
The CLARIN ERIC deployment infrastructure and its applicability to reproducible research ...
|
|
|
|
BASE
|
|
7 |
The CLARIN ERIC deployment infrastructure and its applicability to reproducible research ...
|
|
|
|
BASE
|
|
9 |
Ergänzung und Aktualisierung der Stichworte zur Individualversicherung [Supplement and Update to the Keywords on Individual Insurance]
|
|
|
|
BASE
|
|