Improving performance of spectral subtraction in speech recognition using a model for additive noise

Yoma, N.B.; McInnes, F.R.; Jack, M.A.


Addresses the problem of speech recognition with signals corrupted by additive noise at moderate signal-to-noise ratio (SNR). A model for additive noise is presented and used to compute the uncertainty about the hidden clean signal so as to weight the estimation provided by spectral subtraction. Weighted dynamic time warping (DTW) and Viterbi (HMM) algorithms are tested, and the results show that weighting the information along the signal can substantially increase the performance of spectral subtraction, an easily implemented technique, even with a poor estimation for noise and without using any information about the speaker. It is also shown that the weighting procedure can reduce the error rate when cepstral mean normalization is also used to cancel the convolutional noise.

Más información

Título de la Revista: IEEE Transactions on Speech and Audio Processing
Volumen: 6
Número: 6
Editorial: Society of Laparoendoscopic Surgeons
Fecha de publicación: 1998
Página de inicio: 579
Página final: 582