Big Data Trends in Bioinformatics

da Silva, Dennis Savio M.; da Silva, Waldeyr M. C.; RuiZhe, Guo; Bernardi, Ana Paula; Mariano, Ari Melo; Holanda, Maristela; Yoo, IH; Bi, JB; Hu, X

Abstract

The amount of biological data available for both the academic community and industry has increased due to the rise of high throughput omics technologies, biotechnology, and health monitoring. This scenario demands efficient storage and analysis of the massive amount of data involved. This work aimed to investigate the scientific literature to map topics related to the use of Big Data in Bioinformatics. Due to the large number of relevant papers to inspect, we employed a three-step data-driven systematic approach. We performed a literature search and selected works related to the theme in three search bases (Scopus, ACM, and Web of Science). Then, we proceeded to a text mining step to analyze terms frequently used in the documents and prepare the documents to be inspected. Afterward, we performed a topic modeling using the LDA (Latent Dirichlet Allocation) algorithm. Twenty groups of topics were obtained and drawn in twenty trend topics, which show a revealing state-of-the-art of the Big Data application in Bioinformatics.

Más información

Título según WOS: ID WOS:000555804900336 Not found in local WOS DB
Título de la Revista: 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)
Editorial: IEEE
Fecha de publicación: 2019
Página de inicio: 1862
Página final: 1867
Notas: ISI