Abstract
Background In the US, community circulation of the SARS-CoV-2 virus likely began in February 2020 after mostly travel-related cases. Children’s Hospital of Philadelphia began testing on 3/9/2020 for pediatric and adult patients, and for all admitted patients on 4/1/2020, allowing an early glimpse into the local molecular epidemiology of the virus.
Methods We obtained 169 SARS-CoV-2 samples (83 from patients <21 years old) from March through May and produced whole genome sequences. We used genotyping tools to track variants over time and to test for possible genotype associated clinical presentations and outcomes in children.
Results Our analysis uncovered 13 major lineages that changed in relative abundance as cases peaked in mid-April in Philadelphia. We detected at least 6 introductions of distinct viral variants into the population. As a group, children had more diverse virus genotypes than the adults tested. No strong differences in clinical variables were associated with genotypes.
Conclusions Whole genome analysis revealed unexpected diversity, and distinct circulating viral variants within the initial peak of cases in Philadelphia. Most introductions appeared to be local from nearby states. Although limited by sample size, we found no evidence that different genotypes had different clinical impacts in children in this study.
Summary Using sequencing and a novel technique for quantifying SARS-CoV-2 diversity, we investigated 169 SARS-CoV-2 genomes (83 <21 years old). This analysis revealed unexpected diversity especially in children. No clear differences in clinical presentation were associated with the different virus lineages.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by the National Institute of Health [1R01AI137526-01 and 1R21AI144561-01A1] to PJP and AMM and [R01NR015639] to PJP. This work was supported by the Cystic Fibrosis Foundation [PLANET19G0] to PJP and AMM.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Samples were obtained under CHOP IRB protocol 17-014648
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The 169 genomes from our dataset will be available from the corresponding author and available online for download through a permanent Zenodo DOI[13]. Other de-identified clinical data used in the manuscript are available upon request from the corresponding author. The GNUVID compressed database and GNUVID source code can be found in its most up-to-date version here, https://github.com/ahmedmagds/GNUVID, under the GNU General Public License.
DOI:10.5281/zenodo.4282048
List of abbreviations
- GNUVID
- Gene Novelty Unit-based Virus Identification
- ST
- Sequence Type
- CC
- Clonal Complex
- SARS-CoV-2
- Severe Acute Respiratory Syndrome Corona Virus 2
- COVID-19
- Corona Virus Disease 2019
- wgMLST
- whole genome Multilocus Sequence Typing