RT Journal Article SR Electronic T1 Genomic Diversity of SARS-CoV-2 During Early Introduction into the United States National Capital Region JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.08.13.20174136 DO 10.1101/2020.08.13.20174136 A1 Thielen, Peter M. A1 Wohl, Shirlee A1 Mehoke, Thomas A1 Ramakrishnan, Srividya A1 Kirsche, Melanie A1 Falade-Nwulia, Oluwaseun A1 Trovão, Nídia S. A1 Ernlund, Amanda A1 Howser, Craig A1 Sadowski, Norah A1 Morris, Paul A1 Hopkins, Mark A1 Schwartz, Matthew A1 Fan, Yunfan A1 Gniazdowski, Victoria A1 Lessler, Justin A1 Sauer, Lauren A1 Schatz, Michael C. A1 Evans, Jared D. A1 Ray, Stuart C. A1 Timp, Winston A1 Mostafa, Heba H. YR 2020 UL http://medrxiv.org/content/early/2020/08/15/2020.08.13.20174136.abstract AB Background The early COVID-19 pandemic has been characterized by rapid global spread. In the United States National Capital Region, over 2,000 cases were reported within three weeks of its first detection in March 2020. We aimed to use genomic sequencing to understand the initial spread of SARS-CoV-2, the virus that causes COVID-19, in the region. By correlating genetic information to disease phenotype, we also aimed to gain insight into any correlation between viral genotype and case severity or transmissibility.Methods We performed whole genome sequencing of clinical SARS-CoV-2 samples collected in March 2020 by the Johns Hopkins Health System. We analyzed these regional SARS-CoV-2 genomes alongside detailed clinical metadata and the global phylogeny to understand early establishment of the virus within the region.Results We analyzed 620 samples from the Johns Hopkins Health System collected between March 11–31, 2020, comprising 37.3% of the total cases in Maryland during this period. We selected 143 of these samples for sequencing, generating 114 complete viral genomes. These genomes belong to all five major Nextstrain-defined clades, suggesting multiple introductions into the region and underscoring the diversity of the regional epidemic. We also found that clinically severe cases had genomes belonging to all of these clades.Conclusions We established a pipeline for SARS-CoV-2 sequencing within the Johns Hopkins Health system, which enabled us to capture the significant viral diversity present in the region as early as March 2020. Efforts to control local spread of the virus were likely confounded by the number of introductions into the region early in the epidemic and interconnectedness of the region as a whole.Competing Interest StatementWT has two patents (8,748,091 and 8,394,584) licensed to Oxford Nanopore Technologies. Funding StatementThis work was supported by the Johns Hopkins University President's Fund Research Response (W.T., S.C.R., P.M.T), the Johns Hopkins University Applied Physics Laboratory Internal Research and Development (P.M.T), The Johns Hopkins Hospital (S.W. and J.L.), the Johns Hopkins Department of Pathology, NIH awards U41HG006620 (M.C.S.) and R01AI134384 (M.C.S.), and NSF award DBI-1627442 (M.C.S.).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was approved by the Johns Hopkins University School of Medicine Institutional Review Board.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesRaw nanopore and Illumina data are deposited at SRA (Bioproject PRJNA629390). Consensus sequences are deposited at GISAID and Genbank (MT509452-MT509493, MT646048-MT646120)