Improving the discovery and clustering of three-dimensional protein patterns with OpenMP

Valdes-Jimenez, Alejandro; Reyes-Parada, Miguel; Nunez-Vivanco, Gabriel; Duran-Verdugo, Fabio; Jimenez-Gonzalez, Daniel; IEEE

Abstract

The discovery of conserved three-dimensional (3D) amino-acid patterns among a set of protein structures can be useful, for instance, to predict the functions of unknown proteins or for the rational design of multi-target drugs. There are several applications that perform a three-dimensional search of patterns in the structures of proteins. However, discovering conserved 3D patterns in a set of proteins with no other baseline patterns is a challenge. In this paper, we analyze and improve a state-of-the-art algorithm, 3D-PP, that implements this discovery. In this algorithm, the 3D patterns are detected and clustered using the root mean square deviation value, measured among each pair of 3D patterns (topological variability indicator). Even when 3D-PP deals with this task, the simultaneous processing of high amounts of proteins becomes a computational challenge with the size and the number of proteins to be evaluated. In this work, we present and analyze different shared memory parallel strategies of 3D-PP, using OpenMP. Those strategies improve the overall performance of the original implementation by reducing parallel load unbalance among threads and overall increasing parallelism. The results show significant performance improvements compared to the original version, achieving up to 13x speedup for a small number of proteins and 17.7x for a larger set.

Más información

Título según WOS: Improving the discovery and clustering of three-dimensional protein patterns with OpenMP
Título de la Revista: 2023 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, SBAC-PAD
Editorial: IEEE
Fecha de publicación: 2023
Página de inicio: 202
Página final: 208
DOI:

10.1109/SBAC-PAD59825.2023.00029

Notas: ISI