Dataset and analysis of genetic variability of severe acute Respiratory Syndrome Coronavirus 2 from different areas of the respiratory tract
Abstract
Shannon entropy data were obtained from whole genome sequences of SARS-CoV-2 virus from COVID-19 positive patients from nasopharyngeal and tracheal aspirate samples, these sequences were downloaded from the NCBI public database. The study included 57 genomic sequences of the SARS-CoV-2 virus. A total of 57 samples (n = 30 oronasopharyngeal and n = 27 tracheal aspirates) underwent bioinformatics-based analysis with the objective of determining the extent of viral variability. These sequences were used to perform a genetic variability analysis of the virus with the H(x) function of the BioEdit program and using the Wuhan-Hu-1 reference genome. The generated Shannon entropy data were filtered to comparatively determine the magnitudes of genetic variability between sequences from nasopharyngeal and tracheal aspirate samples using an R script.
This paper proposes a workflow for the bionformatic analysis of SARS-CoV-2 genetic variation from different respiratory system samples.
This bioinformatics workflow could help to differentiate changes in the evolution of the viral population in different compartments of the respiratory system.
Fuente
MethodsX, 14, 103369Identificador DOI
doi.org/10.1016/j.mex.2025.103369Collections
The following license files are associated with this item:


