DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4
Hits 1 – 20 of 73

1
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition ...
Abstract: Most of the research on data-driven speech representation learning has focused on raw audios in an end-to-end manner, paying little attention to their internal phonological or gestural structure. This work, investigating the speech representations derived from articulatory kinematics signals, uses a neural implementation of convolutive sparse matrix factorization to decompose the articulatory data into interpretable gestures and gestural scores. By applying sparse constraints, the gestural scores leverage the discrete combinatorial properties of phonological gestures. Phoneme recognition experiments were additionally performed to show that gestural scores indeed code phonological information successfully. The proposed work thus makes a bridge between articulatory phonology and deep neural networks to leverage informative, intelligible, interpretable,and efficient speech representations. ... : Submitted to 2022 Interspeech ...
Keyword: Artificial Intelligence cs.AI; Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Signal Processing eess.SP
URL: https://arxiv.org/abs/2204.00465
https://dx.doi.org/10.48550/arxiv.2204.00465
BASE
Hide details
2
Focused Attention Improves Document-Grounded Generation ...
BASE
Show details
3
Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching ...
BASE
Show details
4
CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Mixing ...
BASE
Show details
5
Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data ...
BASE
Show details
6
Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems ...
BASE
Show details
7
Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units ...
Katakkar, Anurag; Black, Alan W. - : arXiv, 2021
BASE
Show details
8
AlloVera: A Multilingual Allophone Database ...
BASE
Show details
9
Nonlinear ISA with Auxiliary Variables for Learning Speech Representations ...
BASE
Show details
10
Towards Minimal Supervision BERT-based Grammar Error Correction ...
BASE
Show details
11
Towards Zero-shot Learning for Automatic Phonemic Transcription ...
BASE
Show details
12
Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios ...
Wu, Peter; Zhong, Yifan; Black, Alan W. - : arXiv, 2020
BASE
Show details
13
Case Study: Deontological Ethics in NLP ...
BASE
Show details
14
Topological Sort for Sentence Ordering ...
BASE
Show details
15
A Corpus for Large-Scale Phonetic Typology ...
BASE
Show details
16
Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages ...
BASE
Show details
17
A Corpus for Large-Scale Phonetic Typology ...
BASE
Show details
18
Style Variation as a Vantage Point for Code-Switching ...
BASE
Show details
19
Mere account mein kitna balance hai? -- On building voice enabled Banking Services for Multilingual Communities ...
BASE
Show details
20
Universal Phone Recognition with a Multilingual Allophone System ...
BASE
Show details

Page: 1 2 3 4

Catalogues
1
0
10
0
0
1
0
Bibliographies
14
0
0
0
0
0
0
0
1
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
53
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern