DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...24
Hits 1 – 20 of 470

1
A prospective study of associations between early fearfulness and perceptual sensitivity and later restricted and repetitive behaviours in infants with typical and elevated likelihood of Autism
BASE
Show details
2
Semi-supervised cycle-consistency training for end-to-end ASR using unpaired speech
Wu, Ningkai. - 2022
BASE
Show details
3
Incorporating Temporal Information in Entailment Graph Mining ...
BASE
Show details
4
Blindness to Modality Helps Entailment Graph Mining ...
BASE
Show details
5
Implementing two-stage consent pathway in neonatal trials
BASE
Show details
6
Integrating Lexical Information into Entity Neighbourhood Representations for Relation Prediction ...
NAACL 2021 2021; ., Stephen; Johnson, Mark. - : Underline Science Inc., 2021
BASE
Show details
7
Investigating the Mechanisms Driving Referent Selection and Retention in Toddlers at Typical and Elevated Likelihood for Autism Spectrum Disorder. ...
Gliga, Teodora; Skolnick, Alex; Liersch, Ute. - : Apollo - University of Cambridge Repository, 2021
BASE
Show details
8
Implementing two-stage consent pathway in neonatal trials
BASE
Show details
9
Improving multilingual speech recognition systems
Gao, Heting. - 2021
BASE
Show details
10
Enforcing constraints for multi-lingual and cross-lingual speech-to-text systems
Ni, Junrui. - 2021
BASE
Show details
11
Knowledge base integration in biomedical natural language processing applications
Sakakini, Tarek. - 2021
BASE
Show details
12
Learning speech embeddings for speaker adaptation and speech understanding
Sari, Leda. - 2021
Abstract: In recent years, deep neural network models gained popularity as a modeling approach for many speech processing tasks including automatic speech recognition (ASR) and spoken language understanding (SLU). In this dissertation, there are two main goals. The first goal is to propose modeling approaches in order to learn speaker embeddings for speaker adaptation or to learn semantic speech embeddings. The second goal is to introduce training objectives that achieve fairness for the ASR and SLU problems. In the case of speaker adaptation, we introduce an auxiliary network to an ASR model and learn to simultaneously detect speaker changes and adapt to the speaker in an unsupervised way. We show that this joint model leads to lower error rates as compared to a two-step approach where the signal is segmented into single speaker regions and then fed into an adaptation model. We then reformulate the speaker adaptation problem from a counterfactual fairness point-of-view and introduce objective functions to match the ASR performance of the individuals in the dataset to that of their counterfactual counterparts. We show that we can achieve lower error rate in an ASR system while reducing the performance disparity between protected groups. In the second half of the dissertation, we focus on SLU and tackle two problems associated with SLU datasets. The first SLU problem is the lack of large speech corpora. To handle this issue, we propose to use available non-parallel text data so that we can leverage the information in text to guide learning of the speech embeddings. We show that this technique increases the intent classification accuracy as compared to a speech-only system. The second SLU problem is the label imbalance problem in the datasets, which is also related to fairness since a model trained on skewed data usually leads to biased results. To achieve fair SLU, we propose to maximize the F-measure instead of conventional cross-entropy minimization and show that it is possible to increase the number of classes with nonzero recall. In the last two chapters, we provide additional discussions on the impact of these projects from both technical and social perspectives, propose directions for future research and summarize the findings.
Keyword: automatic speech recognition; fairness in machine learning; Neural networks; speaker adaptation; spoken language understanding
URL: http://hdl.handle.net/2142/110438
BASE
Hide details
13
Modeling phones, keywords, topics and intents in spoken languages
Chen, Wenda. - 2021
BASE
Show details
14
Investigating the Mechanisms Driving Referent Selection and Retention in Toddlers at Typical and Elevated Likelihood for Autism Spectrum Disorder.
BASE
Show details
15
Infant EEG theta modulation predicts childhood intelligence
Jones, Emily J.H.; Goodwin, A.; Orekhova, E.. - : Nature Publishing Group, 2020
BASE
Show details
16
Neural and behavioural indices of face processing in siblings of children with autism spectrum disorder (ASD): a longitudinal study from infancy to mid-childhood
BASE
Show details
17
Speech technology for unwritten languages
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
BASE
Show details
18
Incorporating Temporal Information in Entailment Graph Mining ...
BASE
Show details
19
How Phonotactics Affect Multilingual and Zero-shot ASR Performance ...
BASE
Show details
20
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages ...
BASE
Show details

Page: 1 2 3 4 5...24

Catalogues
25
7
56
0
5
0
2
Bibliographies
93
0
2
5
0
0
0
0
44
Linked Open Data catalogues
0
Online resources
1
0
0
0
Open access documents
272
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern