1. Equitable teaching for cultural and linguistic diversity: exploring the possibilities for engaged pedagogy in post-COVID-19 higher education
BASE
2. SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding ...
3. Improving Tokenisation by Alternative Treatment of Spaces ...
Abstract:
Tokenisation is the first step in almost all NLP tasks, and state-of-the-art transformer-based language models all use subword tokenisation algorithms to process input text. Existing algorithms have problems, often producing tokenisations of limited linguistic validity, and representing equivalent strings differently depending on their position within a word. We hypothesise that these problems hinder the ability of transformer-based models to handle complex words, and suggest that these problems are a result of allowing tokens to include spaces. We thus experiment with an alternative tokenisation approach where spaces are always treated as individual tokens. Specifically, we apply this modification to the BPE and Unigram algorithms. We find that our modified algorithms lead to improved performance on downstream NLP tasks that involve handling complex words, whilst having no detrimental effect on performance in general natural language understanding tasks. Intrinsically, we find our modified algorithms give ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2204.04058 https://arxiv.org/abs/2204.04058
4. Changing Perspectives on Pediatric Human Papillomavirus (HPV) Vaccination among Dental Students and Residents Reveals Recent Increase in Vaccine Hesitancy
In: Vaccines; Volume 10; Issue 4; Pages: 570 (2022)
5. Rare Disorders: Diagnosis and Therapeutic Planning for Patients Seeking Orthodontic Treatment
In: Journal of Clinical Medicine; Volume 11; Issue 6; Pages: 1527 (2022)
6. Gender Agreement in a Language Contact Situation
In: Languages; Volume 7; Issue 2; Pages: 81 (2022)
7. NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese ...
8. Korpora als Grundlage für das Lehren und Lernen von Deutsch als Fremdsprache
9. Speech Perception and Dichotic Listening Are Associated With Hearing Thresholds and Cognition, Respectively, in Unaided Presbycusis
In: Front Aging Neurosci (2022)
10. Building global capacity for COVID-19 vaccination through interactive virtual learning
In: Hum Resour Health (2022)
11. Genetic and Epigenetic Control of Puberty
In: Sex Dev (2022)
12. Social+Me: a persuasive application to increase communication between students and their support networks in Southern Chile
In: PeerJ Comput Sci (2022)
13. Expanding horizons of cross-linguistic research on reading: The Multilingual Eye-movement Corpus (MECO)
In: Behav Res Methods (2022)
14. Bolsistas de produtividade em pesquisa do CNPQ da grande área Ciências da Saúde
15. [In Press] Equitable teaching for cultural and linguistic diversity: exploring the possibilities for engaged pedagogy in post-COVID-19 higher education
16. Relatório de estágio para obtenção de grau de mestre em Educação Pré-Escolar e Ensino do 1.º Ciclo do Ensino Básico
17. Why are linguistic features and PTSD symptoms related? An analysis of cognitive reappraisal and rumination
18. Manifestação de templates no desenvolvimento fonológico de gêmeos e não gêmeos
In: Miguilim - Revista Eletrônica do Netlli; v. 10, n. 4 (2021): Special Issue; 1851-1867 (2022)
19. Análisis del discurso de odio en función de la ideología: Efectos emocionales y cognitivos
In: Comunicar: Revista científica iberoamericana de comunicación y educación, ISSN 1134-3478, No. 71, 2022 (Issue dedicated to: Discursos de odio en comunicación: Investigaciones y propuestas), pp. 37-48 (2022)
20. Método de Abordajes Lingüísticos Convergentes para el ACD: una propuesta aplicada al análisis de comentarios digitales
In: Onomázein: Revista de lingüística, filología y traducción de la Pontificia Universidad Católica de Chile, ISSN 0718-5758, No. 55, 2022, pp. 92-114 (2022)