2 |
Confirmation detection in human-agent interaction using non-lexical speech cues ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Confirmation detection in human-agent interaction using non-lexical speech cues
|
|
|
|
BASE
|
|
Show details
|
|
4 |
An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames
|
|
|
|
In: ISSN: 1664-1078 ; Frontiers in Psychology ; https://hal.inria.fr/hal-01404385 ; Frontiers in Psychology, Frontiers, 2016, 7, pp.18. ⟨10.3389/fpsyg.2016.00470⟩ ; http://journal.frontiersin.org/article/10.3389/fpsyg.2016.00470/full (2016)
|
|
BASE
|
|
Show details
|
|
5 |
Pragmatic Frames for Teaching and Learning in Human–Robot Interaction: Review and Challenges
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Embodied language learning and cognitive bootstrapping: methods and design principles
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Embodied Language Learning and Cognitive Bootstrapping: Methods and Design Principles
|
|
|
|
BASE
|
|
Show details
|
|
8 |
An Alternative to Mapping a Word onto a Concept in Language Acquisition: Pragmatic Frames
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Alignment to the Actions of a Robot
|
|
|
|
In: ISSN: 1875-4791 ; EISSN: 1875-4805 ; International Journal of Social Robotics ; https://hal.inria.fr/hal-01249226 ; International Journal of Social Robotics, Springer, 2015, ⟨10.1007/s12369-014-0252-0⟩ (2015)
|
|
BASE
|
|
Show details
|
|
10 |
The ITALK project : A developmental robotics approach to the study of individual, social, and linguistic learning
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Towards robots with teleological action and language understanding
|
|
|
|
In: Humanoids 2012 Workshop on Developmental Robotics: Can developmental robotics yield human-like cognitive abilities? ; https://hal.inria.fr/hal-00788627 ; Humanoids 2012 Workshop on Developmental Robotics: Can developmental robotics yield human-like cognitive abilities?, Nov 2012, Osaka, Japan (2012)
|
|
BASE
|
|
Show details
|
|
20 |
Modelling the effects of speech rate variation for automatic speech recognition
|
|
|
|
Abstract:
Wrede B. Modelling the effects of speech rate variation for automatic speech recognition . Bielefeld (Germany): Bielefeld University; 2002. ; In automatic speech recognition it is a widely observed phenomenon that variations in speech rate cause severe degradations of the speech recognition performance. This is due to the fact that standard stochastic based speech recognition systems specialise on average speech rate. Although many approaches to modelling speech rate variation have been made, an integrated approach in a substantial system still has be to developed. General approaches to rate modelling are based on rate dependent models which are trained with rate specific subsets of the training data. During decoding a signal based rate estimation is performed according to which the set of rate dependent models is selected. While such approaches are able to reduce the word error rate significantly, they suffer from shortcomings such as the reduction of training data and the expensive training and decoding procedure. However, phonetic investigations show that there is a systematic relationship between speech rate and the acoustic characteristics of speech. In fast speech a tendency of reduction can be observed which can be described in more detail as a centralisation effect and an increase in coarticulation. Centralisation means that the formant frequencies of vowels tend to shift towards the vowel space center while increased coarticulation denotes the tendency of the spectral features of a vowel to shift towards those of its phonemic neighbour. The goal of this work is to investigate the possibility to incorporate the knowledge of the systematic nature of the influence of speech rate variation on the acoustic features in speech rate modelling. In an acoustic-phonetic analysis of a large corpus of spontaneous speech it was shown that an increased degree of the two effects of centralisation and coarticulation can be found in fast speech. Several measures for these effects were developed and used in speech recognition experiments with rate dependent models. A thorough investigation of rate dependent models showed that with duration and coarticulation based measures significant increases of the performance could be achieved. It was shown that by the use of different measures the models were adapted either to centralisation or coarticulation. Further experiments showed that by a more detailed modelling with more rate classes a further improvement can be achieved. It was also observed that a general basis for the models is needed before rate adaptation can be performed. In a comparison to other sources of acoustic variation it was shown that the effects of speech rate are as severe as those of speaker variation and environmental noise. All these results show that for a more substantial system that models rate variations accurately it is necessary to focus on both, durational and spectral effects. The systematic nature of the effects indicates that a continuous modelling is possible.
|
|
Keyword:
Abtastratenumsetzung; Automatische Spracherkennung; ddc:620; Gesprochene Sprache; Koartikulation; Korpus (Linguistik); Mensch-Maschine-Kommunikation; Phonetik; Sprachsignal; Sprechgeschwindigkeit
|
|
URL: https://pub.uni-bielefeld.de/download/2301772/2301778 https://pub.uni-bielefeld.de/download/2301772/2301775 https://nbn-resolving.org/urn:nbn:de:hbz:361-3733 https://pub.uni-bielefeld.de/record/2301772 https://pub.uni-bielefeld.de/download/2301772/2301776 https://pub.uni-bielefeld.de/download/2301772/2301777
|
|
BASE
|
|
Hide details
|
|
|
|