An optimized relational database for querying structural patterns in proteins

Angles, Renzo; Arenas-Salinas, Mauricio; Garcia, Roberto; Ingram, Ben

Abstract

A database is an essential component in almost any software system, and its creation involves more than just data modeling and schema design. It also includes query optimization and tuning. This paper focuses on a web system called GSP4PDB, which is used for searching structural patterns in proteins. The system utilizes a normalized relational database, which has proven to be inefficient even for simple queries. This article discusses the optimization of the GSP4PDB database by implementing two techniques: denormalization and indexing. The empirical evaluation described in the article shows that combining these techniques enhances the efficiency of the database when querying both real and artificial graph-based structural patterns.

Más información

Título según WOS: An optimized relational database for querying structural patterns in proteins
Título de la Revista: DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION
Volumen: 2024
Editorial: OXFORD UNIV PRESS
Fecha de publicación: 2024
DOI:

10.1093/database/baad093

Notas: ISI