BitClust: Fast Geometrical Clustering of Long Molecular Dynamics Simulations

González-Alemán, Roy; Hernández-Castillo, David; Rodríguez-Serradet, Alejandro; Caballero, Julio; Hernández-Rodríguez, Erix W.; Montero-Cabrera, Luis

Abstract

The growing computational capacity allows the investigation of large biomolecular systems by increasingly extensive molecular dynamics simulations. The resulting huge trajectories demand efficient partition methods to discern relevant structural dissimilarity. Clustering algorithms are available to address this task, but their implementations still need to be improved to gain in computational speed and to reduce the consumption of random access memory. We propose the BitClust code which, based on a combination of Python and C programming languages, performs fast structural clustering of long molecular trajectories. BitClust takes advantage of bitwise operations applied to a bit-encoded pairwise similarity matrix. Our approach allowed us to process a half-million frame trajectory in 6 h using less than 35 GB, a task that is not affordable with any of the similar alternatives.

Más información

Título según WOS: BitClust: Fast Geometrical Clustering of Long Molecular Dynamics Simulations
Título según SCOPUS: BitClust: Fast Geometrical Clustering of Long Molecular Dynamics Simulations
Título de la Revista: JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volumen: 60
Número: 2
Editorial: AMER CHEMICAL SOC
Fecha de publicación: 2020
Página de inicio: 444
Página final: 448
Idioma: English
DOI:

10.1021/acs.jcim.9b00828

Notas: ISI, SCOPUS