PT - JOURNAL ARTICLE AU - Vaughan, Timothy G. AU - Sciré, Jérémie AU - Nadeau, Sarah A. AU - Stadler, Tanja TI - Estimates of outbreak-specific SARS-CoV-2 epidemiological parameters from genomic data AID - 10.1101/2020.09.12.20193284 DP - 2020 Jan 01 TA - medRxiv PG - 2020.09.12.20193284 4099 - http://medrxiv.org/content/early/2020/09/14/2020.09.12.20193284.short 4100 - http://medrxiv.org/content/early/2020/09/14/2020.09.12.20193284.full AB - We estimate the basic reproductive number and case counts for 15 distinct SARS-CoV-2 outbreaks, distributed across 10 countries and one cruise ship, based solely on phylodynamic analyses of genomic data. Our results indicate that, prior to significant public health interventions, the reproductive numbers for a majority (10) of these outbreaks are similar, with median posterior estimates ranging between 1.4 and 2.8. These estimates provide a view which is complementary to that provided by those based on traditional line listing data. The genomic-based view is arguably less susceptible to biases resulting from differences in testing protocols, testing intensity, and import of cases into the community of interest. In the analyses reported here, the genomic data primarily provides information regarding which samples belong to a particular outbreak. We observe that once these outbreaks are identified, the sampling dates carry the majority of the information regarding the reproductive number. Finally, we provide genome-based estimates of the cumulative case counts for each outbreak, which allow us to speculate on the amount of unreported infections within the populations housing each outbreak. These results indicate that for the majority (7) of the populations studied, the number of recorded cases is much bigger than the estimated cumulative case counts, suggesting the presence of unsequenced pathogen diversity in these populations.Significance Statement Since the beginning of the COVID-19 outbreak in late 2019, researchers around the globe have sought to estimate the rate at which the disease spread through populations prior to public health intervention, as quantified by the parameter R0. This is often estimated based on case count data and may be biased due to the presence of import cases. To overcome this, we estimate R0 by applying Bayesian phylodynamic methods to SARS-CoV-2 genomes which have been made available by laboratories worldwide. We provide R0 and absolute infection count estimates for 15 distinct outbreaks. These estimates contribute to our understanding of the baseline transmission dynamics of the disease, which will be critical in guiding future public health responses to the pandemic.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNo external funding was used to support this research.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This research is based on publicly available data only.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll of the files necessary to replicate the analyses presented in this manuscript are made available, as detailed in the manuscript, including accession numbers for genomic data from the GISAID sequence database. https://github.com/tgvaughan/R0-manuscript-materials/