Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5

Hits 1 – 20 of 99

1	How2Sign: A large-scale multimodal dataset for continuous American sign language
	Duarte, Amanda; Palaskar, Shruti; Ventura, Lucas. - : Institute of Electrical and Electronics Engineers, 2021
	BASE
	Show details

2	Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models ...
	Huang, Po-Yao; Patrick, Mandela; Hu, Junjie. - : arXiv, 2021
	BASE
	Show details

3	Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models ...
	NAACL 2021 2021; Hauptmann, Alexander; Hu, Junjie. - : Underline Science Inc., 2021
	BASE
	Show details

4	Differentiable Allophone Graphs for Language-Universal Speech Recognition ...
	Yan, Brian; Dalmia, Siddharth; Mortensen, David R.; Metze, Florian; Watanabe, Shinji. - : arXiv, 2021
	Abstract: Building language-universal speech recognition systems entails producing phonological units of spoken sound that can be shared across languages. While speech annotations at the language-specific phoneme or surface levels are readily available, annotations at a universal phone level are relatively rare and difficult to produce. In this work, we present a general framework to derive phone-level supervision from only phonemic transcriptions and phone-to-phoneme mappings with learnable weights represented using weighted finite-state transducers, which we call differentiable allophone graphs. By training multilingually, we build a universal phone-based speech recognition model with interpretable probabilistic phone-to-phoneme mappings for each language. These phone-based systems with learned allophone graphs can be used by linguists to document new languages, build phone-based lexicons that capture rich pronunciation variations, and re-evaluate the allophone mappings of seen language. We demonstrate the ... : INTERSPEECH 2021. Contains additional studies on phone recognition for unseen languages ...
	Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://arxiv.org/abs/2107.11628 https://dx.doi.org/10.48550/arxiv.2107.11628
	BASE
	Hide details

5	Speech technology for unwritten languages
	Scharenborg, Odette; Besacier, Laurent; Black, Alan...
	In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
	BASE
	Show details

6	AlloVera: a multilingual allophone database
	Mortensen, David,; Li, Xinjian; Littell, Patrick...
	In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
	BASE
	Show details

7	AlloVera: A Multilingual Allophone Database ...
	Mortensen, David R.; Li, Xinjian; Littell, Patrick. - : arXiv, 2020
	BASE
	Show details

8	Towards Zero-shot Learning for Automatic Phonemic Transcription ...
	Li, Xinjian; Dalmia, Siddharth; Mortensen, David R.. - : arXiv, 2020
	BASE
	Show details

9	How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language ...
	Duarte, Amanda; Palaskar, Shruti; Ventura, Lucas. - : arXiv, 2020
	BASE
	Show details

10	Universal Phone Recognition with a Multilingual Allophone System ...
	Li, Xinjian; Dalmia, Siddharth; Li, Juncheng. - : arXiv, 2020
	BASE
	Show details

11	AlloVera: a multilingual allophone database
	Mortensen, David,; Li, Xinjian; Littell, Patrick...
	In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
	BASE
	Show details

12	Phoneme Level Language Models for Sequence Based Low Resource ASR ...
	Dalmia, Siddharth; Li, Xinjian; Black, Alan W. - : arXiv, 2019
	BASE
	Show details

13	Multilingual Speech Recognition with Corpus Relatedness Sampling ...
	Li, Xinjian; Dalmia, Siddharth; Black, Alan W.. - : arXiv, 2019
	BASE
	Show details

14	On Leveraging the Visual Modality for Neural Machine Translation ...
	Raunak, Vikas; Choe, Sang Keun; Lu, Quanyang. - : arXiv, 2019
	BASE
	Show details

15	Acoustic-to-Word Models with Conversational Context Information ...
	Kim, Suyoun; Metze, Florian. - : arXiv, 2019
	BASE
	Show details

16	Learned In Speech Recognition: Contextual Acoustic Word Embeddings ...
	Palaskar, Shruti; Raunak, Vikas; Metze, Florian. - : arXiv, 2019
	BASE
	Show details

17	On Dimensional Linguistic Properties of the Word Embedding Space ...
	Raunak, Vikas; Kumar, Vaibhav; Gupta, Vivek. - : arXiv, 2019
	BASE
	Show details

18	Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
	Scharenborg, Odette; Besacier, Laurent; Black, Alan...
	In: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01709578 ; ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada (2018)
	BASE
	Show details

19	Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote ...
	Schuller, Bjorn; Metze, Florian; Steidl, Stefan. - : Figshare, 2018
	BASE
	Show details

20	Sequence-based Multi-lingual Low Resource Speech Recognition ...
	Dalmia, Siddharth; Sanabria, Ramon; Metze, Florian. - : arXiv, 2018
	BASE
	Show details

Page: 1 2 3 4 5

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern