1 |
AUTOLEX: An Automatic Framework for Linguistic Exploration ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
|
|
|
|
In: NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.inria.fr/hal-03251105 ; NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2021, Mexico City, Mexico (2021)
|
|
BASE
|
|
Show details
|
|
3 |
SD-QA: Spoken Dialectal Question Answering for the Real World ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
SD-QA: Spoken Dialectal Question Answering for the Real World ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Machine Translation into Low-resource Language Varieties ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors ...
|
|
|
|
Abstract:
Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions. Most evaluations of such models are conducted using automatic reference-based metrics. However, given the relatively large semantic gap between programming languages and natural language, we argue that this line of research would benefit from a qualitative investigation into the various error modes of current state-of-the-art models. Therefore, in this work, we perform both a quantitative and qualitative comparison of three recently proposed source code summarization models. In our quantitative evaluation, we compare the models based on the smoothed BLEU-4, METEOR, and ROUGE-L machine translation metrics, and in our qualitative evaluation, we perform a manual open-coding of the most common errors committed by the models when compared to ground truth captions. Our investigation reveals new insights into ... : Accepted to the 2021 NLP4Prog Workshop co-located with The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG; Software Engineering cs.SE
|
|
URL: https://dx.doi.org/10.48550/arxiv.2106.08415 https://arxiv.org/abs/2106.08415
|
|
BASE
|
|
Hide details
|
|
8 |
Systematic Inequalities in Language Technology Performance across the World's Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Towards More Equitable Question Answering Systems: How Much More Data Do You Need? ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Cross-Lingual Text Classification of Transliterated Hindi and Malayalam ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Evaluating the Morphosyntactic Well-formedness of Generated Texts ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Evaluating the Morphosyntactic Well-formedness of Generated Texts ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Towards more equitable question answering systems: How much more data do you need? ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Lexically Aware Semi-Supervised Learning for OCR Post-Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
When is Wall a Pared and when a Muro? -- Extracting Rules Governing Lexical Selection ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Lexically-Aware Semi-Supervised Learning for OCR Post-Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
AlloVera: a multilingual allophone database
|
|
|
|
In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
|
|
BASE
|
|
Show details
|
|
|
|