Abstract
The echo state network (ESN) is a recurrent neural network proposed by Herbert Jaeger with a simplified training routine. Previously, we have demonstrated the noise-robust performance of the predictive ESN classifier in automatic speech recognition experiments in which the network was trained to predict the next frame of speech features. Classification performance was limited because the predictive models lacked discriminability, so we changed the model output to a one-of-many output encodings scheme and trained the model discriminatively. Performance was compared to a hidden Markov model (HMM) in small-vocabulary ASR experiments with additive noise. Accuracy of 50% was achieved by a discriminative ESN classifier at -8.5 dB SNR, compared to 0.4 dB SNR for a predictive ESN classifier and 6.6 dB SNR for an HMM. With discriminative training, a larger reservoir was employed for the discriminative ESN classifier compared to the predictive ESN classifier which resulted a larger memory depth and more noise-robust performance. © 2007 IEEE.
Recommended Citation
M. D. Skowronski and J. G. Harris, "Noise-robust Automatic Speech Recognition Using A Discriminative Echo State Network," Proceedings IEEE International Symposium on Circuits and Systems, pp. 1771 - 1774, article no. 4253002, Institute of Electrical and Electronics Engineers, Jan 2007.
The definitive version is available at https://doi.org/10.1109/iscas.2007.378015
Department(s)
Electrical and Computer Engineering
International Standard Serial Number (ISSN)
0271-4310
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2025 Institute of Electrical and Electronics Engineers, All rights reserved.
Publication Date
01 Jan 2007
