Generalized black hole clustering algorithm

Saltos, Ramiro; Weber, R.

Abstract

The Black Hole Clustering (BHC) algorithm is a density-based partitional clustering method inspired by the Density-based Spatial Clustering of Applications with Noise (DBSCAN). It does not require the number of clusters nor the computation of the pair-wise distance matrix between the data points, making it faster than DBSCAN. Also, it only needs one parameter that is intuitively easier to set than the epsilon parameter of DBSCAN. However, BHC needs the allocation of the so-called black holes that have to be linearly independent, making the algorithm in its current version suitable only for two or three-dimensional data sets. In this paper, we propose a generalized version of the black hole clustering algorithm (GBHC) by introducing a novel black hole allocation procedure for higher-dimensional data spaces. Furthermore, the proposed method is data-independent, so we have to run it once to obtain the black hole positions for all finite-dimensional metric spaces. We performed extensive computational experiments to compare GBHC with DBSCAN. The results show that both algorithms obtain comparable clustering solutions. GBHC, however, outperforms DBSCAN in computational complexity and explainability.

Más información

Título según WOS: Generalized black hole clustering algorithm
Título según SCOPUS: ID SCOPUS_ID:85178951558 Not found in local SCOPUS DB
Título de la Revista: PATTERN RECOGNITION LETTERS
Volumen: 176
Editorial: Elsevier
Fecha de publicación: 2023
Página de inicio: 196
Página final: 201
DOI:

10.1016/J.PATREC.2023.11.006

Notas: ISI, SCOPUS