1 |
Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no access to speech data ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Interpreting intermediate convolutional layers of CNNs trained on raw speech ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Interpreting intermediate convolutional layers in unsupervised acoustic word classification ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Generative Adversarial Phonology: Modeling unsupervised phonetic and phonological learning with neural networks ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Local and non-local dependency learning and emergence of rule-like representations in speech data by Deep Convolutional Generative Adversarial Networks ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Deep Sound Change: Deep and Iterative Learning, Convolutional Neural Networks, and Language Change ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Modeling unsupervised phonetic and phonological learning in Generative Adversarial Phonology ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks ...
|
|
|
|
Abstract:
How can deep neural networks encode information that corresponds to words in human speech into raw acoustic data? This paper proposes two neural network architectures for modeling unsupervised lexical learning from raw acoustic inputs, ciwGAN (Categorical InfoWaveGAN) and fiwGAN (Featural InfoWaveGAN), that combine a Deep Convolutional GAN architecture for audio data (WaveGAN; arXiv:1705.07904) with an information theoretic extension of GAN -- InfoGAN (arXiv:1606.03657), and propose a new latent space structure that can model featural learning simultaneously with a higher level classification and allows for a very low-dimension vector representation of lexical items. Lexical learning is modeled as emergent from an architecture that forces a deep neural network to output data such that unique information is retrievable from its acoustic outputs. The networks trained on lexical items from TIMIT learn to encode unique information corresponding to lexical items in the form of categorical variables in their ... : Published in Neural Networks ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://arxiv.org/abs/2006.02951 https://dx.doi.org/10.48550/arxiv.2006.02951
|
|
BASE
|
|
Hide details
|
|
14 |
Generative Adversarial Phonology: Modeling Unsupervised Phonetic and Phonological Learning With Neural Networks
|
|
|
|
In: Front Artif Intell (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Modeling unsupervised phonetic and phonological learning in Generative Adversarial Phonology
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Unnatural Phonology: A Synchrony-Diachrony Interface Approach
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Relativna kronologija naglasnih pojavov govora Žirovske kotline poljanskega narečja ; The Relative Chronology of Word-Prosodic Phenomena in the Local Dialect of the Žiri Basin (Poljana Dialect)
|
|
Beguš, Gašper. - : ZRC SAZU and Hall Center for the Humanities, 2011
|
|
BASE
|
|
Show details
|
|
|
|