Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition

Yoma, NB; Molina C.; Silva J.; Busso C.

Abstract

A solution to the problem of speech recognition with signals distorted by low-bit rate coders is presented in this paper. A model for the coding-decoding distortion, a HMM compensation method to include this model, and an EM-based adaptation algorithm to estimate this distortion are proposed here. Medium vocabulary continuous-speech speaker-independent recognition experiments with 8 kbps G.729(CS-CELP), 13 kbps RPE-LTP (GSM), 5.3 kbps G723.1, 4.8 kbps FS-1016 and 32 kbps G.726(ADPCM) coders show that the approach described in this paper is able to dramatically reduce the effect of the coding distortion and, in some cases, gives a word accuracy higher than the baseline system with uncoded speech. Finally, the EM estimation algorithm requires only one adapting utterance and the approach described is certainly suitable for dialogue systems where just a few adapting utterances are available.

Más información

Título según WOS: Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
Título de la Revista: IEEE Transactions on audio speech and language processing
Volumen: 14
Número: 1
Editorial: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Fecha de publicación: 2006
Página de inicio: 246
Página final: 255
Idioma: English
URL: http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1561281
DOI:

10.1109/TSA.2005.852994

Notas: ISI