GalaxyTrakr: a distributed analysis tool for public health whole genome sequence data accessible to non-bioinformaticians

Gangiredla, Jayanthi; Rand, Hugh; Benisatto, Daniel; Payne, Justin; Strittmatter, Charles; Sanders, Jimmy; Wolfgang, William J.; Libuit, Kevin; Herrick, James B.; Prarat, Melanie; Toro, Magaly; Farrell, Thomas; Strain, Errol

Abstract

Background: Processing and analyzing whole genome sequencing (WGS) is computationally intense: a single Illumina MiSeq WGS run produces similar to 1 million 250-base-pair reads for each of 24 samples. This poses significant obstacles for smaller laboratories, or laboratories not affiliated with larger projects, which may not have dedicated bioinformatics staff or computing power to effectively use genomic data to protect public health. Building on the success of the cloud-based Galaxy bioinformatics platform (http://galaxyproject.org), already known for its user-friendliness and powerful WGS analytical tools, the Center for Food Safety and Applied Nutrition (CFSAN) at the U.S. Food and Drug Administration (FDA) created a customized 'instance' of the Galaxy environment, called GalaxyTrakr (https://www.galaxytrakr.org), for use by laboratory scientists performing food-safety regulatory research. The goal was to enable laboratories outside of the FDA internal network to (1) perform quality assessments of sequence data, (2) identify links between clinical isolates and positive food/environmental samples, including those at the National Center for Biotechnology Information sequence read archive (https://www.ncbi.nlm.nih.gov/sra/), and (3) explore new methodologies such as metagenomics. GalaxyTrakr hosts a variety of free and adaptable tools and provides the data storage and computing power to run the tools. These tools support coordinated analytic methods and consistent interpretation of results across laboratories. Users can create and share tools for their specific needs and use sequence data generated locally and elsewhere.

Más información

Título según WOS: GalaxyTrakr: a distributed analysis tool for public health whole genome sequence data accessible to non-bioinformaticians
Título de la Revista: BMC GENOMICS
Volumen: 22
Número: 1
Editorial: BMC
Fecha de publicación: 2021
DOI:

10.1186/S12864-021-07405-8

Notas: ISI