Robust speaker verification with state duration modeling
Abstract
This paper addresses the problem of state duration modeling in the Viterbi algorithm in a text-dependent speaker verification task. The results presented in this paper suggest that temporal constraints can lead to reductions of 10% and 20% in the error rates with signals corrupted by noise at SNR equal to 6 and 0 dB, respectively, and that the accurate statistical modeling of state duration (e.g. with gamma probability distribution) does not seem to be very relevant if maximal and minimal state duration restrictions are imposed. In contrast, temporal restrictions do not seem to give any improvement in a speaker verification task with clean speech or high SNR. It is also shown that state duration constraints can easily be applied with the likelihood normalization metrics based on speaker-dependent temporal parameters. Finally, the results here presented show that word position-dependent state duration parameters give no significant improvement when compared with the word position-independent approach if the coarticulation effect between contiguous words is low. © 2002 Elsevier Science B.V. All rights reserved.
Más información
Título según WOS: | Robust speaker verification with state duration modeling |
Título según SCOPUS: | Robust speaker verification with state duration modeling |
Título de la Revista: | SPEECH COMMUNICATION |
Volumen: | 38 |
Número: | 01-feb |
Editorial: | ELSEVIER SCIENCE BV, PO BOX 211, 1000 AE AMSTERDAM, NETHERLANDS |
Fecha de publicación: | 2002 |
Página de inicio: | 77 |
Página final: | 88 |
Idioma: | English |
URL: | http://linkinghub.elsevier.com/retrieve/pii/S0167639301000449 |
DOI: |
10.1016/S0167-6393(01)00044-9 |
Notas: | ISI, SCOPUS |