Speakers counting by proposed nested microphone array in combination with limited space SRP

Dehghan Firoozabadi, Ali; Irarrazaval, Pablo; Adasme, Pablo; Zabala-Blanco, David; Palacios-Jativa, Pablo; Durney, Hugo; Sanhueza, Miguel; Azurdia-Meza, Cesar; EURASIP

Abstract

In this paper, a novel method is presented for estimating the number of speakers based on the microphone arrays. Firstly, a 3D snowflake nested microphone array (SNMA) is proposed for recording the speech signals. In the following, the steered response power (SRP) algorithm is implemented on subbands in limited spaces conditions for all microphone pairs related to the subarrays. Therefore, a weighted averaging method is implemented on subband limited spaces SRPs (LSRP), and the final energy map is compared with the histogram of the maximums of the SRP function on different subbands for various time frames. The passed candidate points are categorized by unsupervised K-means clustering and the number of speakers is estimated by the silhouette criteria. The accuracy of the proposed method is compared with PENS, i-vector PLDA, and wavelet-GEVD algorithms. The results show the superiority of the proposed method in comparison with other previous research.

Más información

Título según WOS: ID WOS:000764066600055 Not found in local WOS DB
Título de la Revista: 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021)
Editorial: EUROPEAN ASSOC SIGNAL SPEECH & IMAGE PROCESSING-EURASIP
Fecha de publicación: 2021
Página de inicio: 271
Página final: 275
DOI:

10.23919/EUSIPCO54536.2021.9616309

Notas: ISI