Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2022 (2)
  - 2021 (3)
  - 2020 (2)
  - 2019 (2)
  - 2016 (1)
  - 2015 (2)
  - 2014 (4)
  - 2012 (3)
  - 2006 (3)
  - 2005 (2)
  - more
- Medium
- Type
- BLLDB-Access

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 1 – 20 of 51

1	A Comparison of Hybrid and End-to-End ASR Systems for the IberSpeech-RTVE 2020 Speech-to-Text Transcription Challenge
	Juan M. Perero-Codosero; Fernando M. Espinoza-Cuadros; Luis A. Hernández-Gómez
	In: Applied Sciences; Volume 12; Issue 2; Pages: 903 (2022)
	BASE
	Show details

2	What Do We Know about Hybrid Regimes after Two Decades of Scholarship?
	Mufti, Mariam
	In: Politics and Governance ; 6 ; 2 ; 112-119 ; Authoritarianism in the 21st Century (2022)
	BASE
	Show details

3	Automatic Speech Recognition : from hybrid to end-to-end approach ; Reconnaissance automatique de la parole à large vocabulaire : des approches hybrides aux approches End-to-End
	Heba, Abdelwahab. - : HAL CCSD, 2021
	In: https://tel.archives-ouvertes.fr/tel-03616588 ; Intelligence artificielle [cs.AI]. Université Paul Sabatier - Toulouse III, 2021. Français. ⟨NNT : 2021TOU30116⟩ (2021)
	Abstract: At Linagora, the OpenPaasNG project was launched in 2015 for a period of 4 years with the purpose of building a new generation of collaborative platform which provides a videoconferencing tool using artificial intelligence technologies such as speech transcription, keyword extraction and automatic meeting summary. In this context, my industrial thesis tackled Automatic Speech Recognition (ASR) for virtual meetings in real time and offiine. This thesis deals with the study of acoustic modeling methods for ASR. The upheaval of sequential neural architectures has nowadays allowed major advances in the improvement of modeling and learning of acoustic models. I explored in this work the two main families of RAP systems : hybrid and End-to-End systems. A first part of this thesis concerns traditional approaches and is dedicated to the implementation of a large vocabulary RAP system in French for spontaneous speech, which has been deployed in several industrial use cases. A large work of data collection, processing and standardization was carried out in a first step to reach the goal of 1000 hours of annotated speech. An evaluation of the acoustic, lexical and linguistic components is proposed to refine the choice and orientation of the hybrid DNN-HMM modeling for the French language. I proposed for this part an industrial platform for the adaptation of hybrid components called "LinSTT Model Factory" allowing an adaptation of the models to the conditions of use, namely : a particular acoustic context, a vocabulary specific to a target domain. In a second part, I approached the problem of automatically transcribing speech directly from acoustic observations. To do so, I conducted an in-depth study of the End-to-End RAP approaches : how can we learn sequential alignments between audio and text ? What type of architecture to use ? And especially, what type of output units to choose (character, word piece, word) ? I tried to answer to these questions with a set of experiments on the TIMIT and LibriSpeech datasets. A large part of this work has been conducted during a scientific stay at the Mila laboratory in Canada, where I actively contributed to the development of the open-source tool "SpeechBrain". In a third part, I report multi-task learning experiments using end-to-end systems in order to exploit several output representations, in our case characters and consonant / vowel categories. I proposed a new technique of combining thses representations for improving recognition performance. ; Ma thèse s'inscrit dans le cadre du projet de recherche OpenPaasNG 1 qui a été lancé en 2015 pour une durée de 4 ans avec l'objectif de développer une nouvelle génération de plateforme collaborative proposant un outil de visio-conférence utilisant des technologies d'intelligence artificielle pour la transcription de la parole, l'extraction des mots clés et le résumé automatique des réunions. Dans ce contexte, l'objectif de cette thèse est d'étudier, d'approfondir, de construire et d'enrichir la brique de reconnaissance automatique de la parole (RAP) pour aider à l'exploitation de l'information verbale durant les réunions en temps réel et en off-line à des fins d'archivage. Cette thèse porte sur l'étude des méthodes de modélisation acoustique pour la RAP. Le boule- versement des architectures neuronales séquentielles ont permis de nos jours des avancées majeures dans l'amélioration de la modélisation et l'apprentissage du modèle acoustique. Nous explorons dans ce travail les deux grandes familles des systèmes RAP à savoir : les approches traditionnelles hybrides et End-to-End. Une première partie de cette thèse concerne les approches traditionnelles et est dédiée à la mise en place d'un système RAP large vocabulaire en français pour la parole spontanée, déployé dans le contexte industriel. Un grand travail de collecte de données, de traitement et de normalisation a été effectué dans un premier temps pour atteindre l'objectif des 1000 heures de parole annotée. Une évaluation des composants acoustique, lexical et linguistique est proposée pour affiner au mieux le choix et l'orientation de la modélisation hybride DNN-HMM pour la langue française. Nous décri- vons dans cette partie le développement de la plateforme industrielle d'adaptation des composants hybrides appelée "LinSTT Model Factory" permettant une adaptation des modèles aux conditions d'utilisations, à savoir : un contexte acoustique particulier, un vocabulaire spécifique à un domaine cible. Dans une deuxième partie, nous abordons la problématique de prédiction de représentation textuelle directement à partir des observations acoustiques. Pour cela, nous effectuons une étude approfondie des approches RAP End-to-End : comment pouvons nous apprendre des alignements séquentiels entre l'audio et le texte ? Quel type d'architecture utiliser ? Et surtout, quel type d'unités en sortie choisir (caractère, pièce de mot, mot) ? Nous répondons à ces questions avec un ensemble d'expérience sur les corpus TIMIT et LibriSpeech. Ces travaux ont été, dans une grande partie, menés au cours d'un séjour scientifique au laboratoire du Mila au Canada dans le cadre du développement de l'outil open-source "SpeechBrain". Dans une troisième partie, nous explorons des approches multi-tâche sur les systèmes RAP End-to-End afin d'exploiter plusieurs représentations textuelles, dans notre cas, des sorties caractères et des sorties consonnes/voyelles. Nous avons proposé une nouvelle technique de combinaisons des représentations textuelles pour l'amélioration des performances de reconnaissance.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; Approche multi-tâche; ASR Hybrid systems; Automatique Speech Recognition; Deep Learning; End-to-End ASR systems; Multi-Task learning approach for ASR; Reconnaissance automatique de la parole; Système RAP hybride; Systèmes RAP End-to-End
	URL: https://tel.archives-ouvertes.fr/tel-03616588/file/2021TOU30116a.pdf https://tel.archives-ouvertes.fr/tel-03616588/document https://tel.archives-ouvertes.fr/tel-03616588
	BASE
	Hide details

4	Large vocabulary automatic speech recognition: from hybrid to end-to-end approaches ; Reconnaissance automatique de la parole à large vocabulaire : des approches hybrides aux approches End-to-End
	Heba, Abdelwahab. - : HAL CCSD, 2021
	In: https://hal.archives-ouvertes.fr/tel-03269807 ; Son [cs.SD]. Université toulouse 3 Paul Sabatier, 2021. Français (2021)
	BASE
	Show details

5	Implementing an Intelligent Collaborative Agent as Teammate in Collaborative Writing: toward a Synergy of Humans and AI
	Wiethof, Christina; Tavanapour, Navid; Bittner, Eva. - 2021
	BASE
	Show details

6	Script optimization for TTS voice corpus design in audio-book generation ; Optimisation de script pour la conception de corpus vocaux de TTS dans la génération de livres audio
	Shamsi, Meysam. - : HAL CCSD, 2020
	In: https://tel.archives-ouvertes.fr/tel-03270968 ; Computation and Language [cs.CL]. Université Rennes 1, 2020. English. ⟨NNT : 2020REN1S107⟩ (2020)
	BASE
	Show details

7	'n Model vir 'n aanlyn GIS-vakwoordeboek
	Lize-Mari Theron
	In: Lexikos, Vol 30, Pp 499-518 (2020) (2020)
	BASE
	Show details

8	When humans and machines collaborate: Cross-lingual Label Editing in Wikidata ...
	Kaffee, Lucie-Aimée; Endris, Kemele M.; Simperl, Elena. - : New York, NY : Association for Computing Machinery, Inc., 2019
	BASE
	Show details

9	When humans and machines collaborate: Cross-lingual Label Editing in Wikidata ...
	Kaffee, L.-A.; Endris, K.M.; Simperl, E.. - : New York City : Association for Computing Machinery, 2019
	BASE
	Show details

10	An Ontological Driven Approach of HAD Specific Language Designing
	Nkenlifack, Marcellin; Mboubé - Etouké, Serge
	In: https://hal.archives-ouvertes.fr/hal-01309151 ; 2016 (2016)
	BASE
	Show details

11	Evaluating the impact of using a domain-specific bilingual lexicon on the performance of a hybrid machine translation approach
	Semmar, N.; Zennaki, O.; Laib, M.
	In: 10th International Conference on Recent Advances in Natural Language Processing, RANLP 201 ; https://hal-cea.archives-ouvertes.fr/cea-01844051 ; 10th International Conference on Recent Advances in Natural Language Processing, RANLP 201, Sep 2015, Hissar, Bulgaria. pp.579-587 (2015)
	BASE
	Show details

12	Projetos sobre tradução automática do português no laboratório de sistemas de língua falada do INESC-ID
	Barreiro, A.; Coheur, L.; Batista, F.. - : Universidade do Minho; Universidade de Vigo, 2015
	BASE
	Show details

13	Interaction between Linguists and Machine Learning ; L'interaction entre linguistes et apprentissage automatique
	Laporte, Éric
	In: https://hal.archives-ouvertes.fr/hal-01118870 ; 2014 (2014)
	BASE
	Show details

14	Modelo de controlo difuso de um sistema de produção de energia com base em recursos renováveis
	Moreira, Miguel Luís de Babo. - 2014
	BASE
	Show details

15	Enhancing a Rule-Based MT System with Cross-Lingual WSD
	Rios, Annette; Gasser, Michael; Rudnick, Alex
	In: Rudnick, Alex; Rios, Annette; Gasser, Michael (2014). Enhancing a Rule-Based MT System with Cross-Lingual WSD. In: SaLTMiL Workshop on free/open-source language resources for the machine translation of less-resourced languages (LREC'14), Reykjavik, Iceland, 22 May 2014. SALTMIL, 31-36. (2014)
	BASE
	Show details

16	Hibridación de sistemas borrosos para el modelado y control ; Hybridization of fuzzy systems for modeling and control
	Andújar, José Manuel; Barragán, Antonio Javier. - : Elsevier, 2014
	BASE
	Show details

17	A HYBRID FUZZY/GENETIC ALGORITHM FOR INTRUSION DETECTION IN RFID SYSTEMS
	Geta, Gemechu. - 2012
	BASE
	Show details

18	A HYBRID FUZZY/GENETIC ALGORITHM FOR INTRUSION DETECTION IN RFID SYSTEMS
	Geta, Gemechu. - 2012
	BASE
	Show details

19	A random forest system combination approach for error detection in digital dictionaries
	Rodrigues, Paul; Zajic, David; Doermann, David. - : Association for Computational Linguistics, 2012
	BASE
	Show details

20	A Hybrid Approach for QA Track Definitional Questions
	Blair-Goldensohn, Sasha; McKeown, Kathleen R.; Schlaikjer, Andrew H.
	In: DTIC (2006)
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern