Robust speaker verification with state duration modeling

Yoma, NB; Pegoraro, TF

Abstract

This paper addresses the problem of state duration modeling in the Viterbi algorithm in a text-dependent speaker verification task. The results presented in this paper suggest that temporal constraints can lead to reductions of 10% and 20% in the error rates with signals corrupted by noise at SNR equal to 6 and 0 dB, respectively, and that the accurate statistical modeling of state duration (e.g. with gamma probability distribution) does not seem to be very relevant if maximal and minimal state duration restrictions are imposed. In contrast, temporal restrictions do not seem to give any improvement in a speaker verification task with clean speech or high SNR. It is also shown that state duration constraints can easily be applied with the likelihood normalization metrics based on speaker-dependent temporal parameters. Finally, the results here presented show that word position-dependent state duration parameters give no significant improvement when compared with the word position-independent approach if the coarticulation effect between contiguous words is low. © 2002 Elsevier Science B.V. All rights reserved.

Más información

Título según WOS: Robust speaker verification with state duration modeling
Título según SCOPUS: Robust speaker verification with state duration modeling
Título de la Revista: SPEECH COMMUNICATION
Volumen: 38
Número: 01-feb
Editorial: ELSEVIER SCIENCE BV, PO BOX 211, 1000 AE AMSTERDAM, NETHERLANDS
Fecha de publicación: 2002
Página de inicio: 77
Página final: 88
Idioma: English
URL: http://linkinghub.elsevier.com/retrieve/pii/S0167639301000449
DOI:

10.1016/S0167-6393(01)00044-9

Notas: ISI, SCOPUS