Dataset and analysis of genetic variability of Severe Acute Respiratory Syndrome Coronavirus 2 from different areas of the respiratory tract

Rojas-Perez, Victor Eduardo; Villegas, Eduardo Parra

Abstract

Shannon entropy data were obtained from whole genome sequences of SARS-CoV-2 virus from COVID-19 positive patients from nasopharyngeal and tracheal aspirate samples, these sequences were downloaded from the NCBI public database. The study included 57 genomic sequences of the SARS-CoV-2 virus. A total of 57 samples (n = 30 oronasopharyngeal and n = 27 tracheal aspirates) underwent bioinformatics-based analysis with the objective of determining the extent of viral variability. These sequences were used to perform a genetic variability analysis of the virus with the H(x) function of the BioEdit program and using the Wuhan-Hu-1 reference genome. The generated Shannon entropy data were filtered to comparatively determine the magnitudes of genetic variability between sequences from nasopharyngeal and tracheal aspirate samples using an R script. • This paper proposes a workflow for the bionformatic analysis of SARS-CoV-2 genetic variation from different respiratory system samples. • This bioinformatics workflow could help to differentiate changes in the evolution of the viral population in different compartments of the respiratory system. © 2025

Más información

Título según WOS: Dataset and analysis of genetic variability of Severe Acute Respiratory Syndrome Coronavirus 2 from different areas of the respiratory tract
Título según SCOPUS: Dataset and analysis of genetic variability of Severe Acute Respiratory Syndrome Coronavirus 2 from different areas of the respiratory tract
Título de la Revista: MethodsX
Volumen: 14
Editorial: Elsevier B.V.
Fecha de publicación: 2025
Idioma: English
DOI:

10.1016/j.mex.2025.103369

Notas: ISI, SCOPUS