Vystadial 2013 – scripts
Abstract:
Vystadial 2013 is a dataset of telephone conversations in English and Czech, developed for training acoustic models for automatic speech recognition in spoken dialogue systems. It ships in three parts: Czech data, English data, and scripts. The data comprise over 41 hours of speech in English and over 15 hours in Czech, plus orthographic transcriptions. The scripts implement data pre-processing and the building of acoustic models using the HTK and Kaldi toolkits. This is the scripts part of the dataset. This research was funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221.
Keywords:
acoustic model; ASR; HTK; Kaldi
URL: http://hdl.handle.net/11858/00-097C-0000-0023-466F-C