Abstract
Background Global spread of COVID-19 created an unprecedented infectious disease crisis that progressed to a pandemic with >180,000 cases in >100 countries. Reproductive number (R) is an outbreak metric estimating the transmission of a pathogen. Initial R values were published based on the early outbreak in China with limited number of cases with whole genome sequencing. Initial comparisons failed to show a direct relationship viral genomic diversity and epidemic severity was not established for SARS-Cov-2.
Methods Each country’s COVID-19 outbreak status was classified according to epicurve stage (index, takeoff, exponential, decline). Instantaneous R estimates (Wallinga and Teunis method) with a short and standard serial interval examined asymptomatic spread. Whole genome sequences were used to quantify the pathogen genome identity score that were used to estimate transmission time and epicurve stage. Transmission time was estimated based on evolutionary rate of 2 mutations/month.
Findings The country-specific R revealed variable infection dynamics between and within outbreak stages. Outside China, R estimates revealed propagating epidemics poised to move into the takeoff and exponential stages. Population density and local temperatures had variable relationship to the outbreaks. GENI scores differentiated countries in index stage with cryptic transmission. Integration of incidence data with genome variation directly increases in cases with increased genome variation.
Interpretation R was dynamic for each country and during the outbreak stage. Integrating the outbreak dynamic, dynamic R, and genome variation found a direct association between cases and genome variation. Synergistically, GENI provides an evidence-based transmission metric that can be determined by sequencing the virus from each case. We calculated an instantaneous country-specific R at different stages of outbreaks and formulated a novel metric for infection dynamics using viral genome sequences to capture gaps in untraceable transmission. Integrating epidemiology with genome sequencing allows evidence-based dynamic disease outbreak tracking with predictive evidence.
Funding Philippine California Advanced Research Institute (Quezon City, Philippines) and the Weimer laboratory.
Research in context Reproductive number is (R) an epidemiological parameter that defines outbreak transmission dynamics. While early estimates of R exist for COVID-19, the sample size is relatively small (<2000 individuals) taken during the early stages of the disease in China. The outbreak is now a pandemic and a more comprehensive assessment is needed to guide public health efforts in making informed decisions to control regional outbreaks. Commonly, R is computed using a sliding window approach, hence assessment of impact of intervention is more difficult to estimate and often underestimates the dynamic nature of R as the outbreak progresses and expands to different regions of the world. Parallel to epidemiological metrics, pathogen whole genome sequencing is being used to infer transmission dynamics. Viral genome analysis requires expert knowledge in understanding viral genomics that can be integrated with the rapid responses needed for public health to advance outbreak mitigation. This study establishes integrative approaches of genome sequencing with established epidemiological outbreak metrics to provide an easily understandable estimate of transmission dynamics aimed at public health response using evidence-based estimates.
Added value of this study Estimates of R are dynamic within the progression of the epidemic curve. Using the framework defined in this study with dynamic estimates of R specific to each epicurve stage combined with whole genome sequencing led to creation of a novel metric called GENI (pathogen genome identity) that provides genomic evolution and variation in the context of the outbreak dynamics. The GENI scores were directly linked and proportional to outbreak changes when using disease incidence from epicurve stages (index, takeoff, exponential, and decline). By simulating short and standard (2 day and 7 day, respectively) serial intervals, we calculated instantaneous R followed by a global comparison that was associated with changes in GENI. This approach quantified R values that are impacted by public health intervention to change the outbreak trajectory and were linked to case incidence (i.e. exponential expansion or decelerating) by country. Integrating viral whole genome sequences to estimate GENI we were able to infer circulation time, local transmission, and index case introduction. Systematic integration of viral whole genome sequences with epidemiological parameters resulted in a simplified approach in assessing the status of outbreak that facilitates decisions using evidence from genomics and epidemiology in combination.
Implications of all the available evidence This study created a framework of evidence-based intervention by integrating whole genome sequencing and epidemiology during the COVID-19 pandemic. Calculating instantaneous R at different stages of the epicurve for different countries provided an evidence-based assessment of control measures as well as the underlying genomic variation globally that changed the outbreak trajectory for all countries examined. Use of the GENI score translates sequencing data into a public health metric that can be directly integrated in epidemiology for outbreak intervention and global preparedness systems.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Philippine California Advanced Research Institute (Quezon City, Philippines) and the Weimer laboratory.
Author Declarations
All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
the data are public and a list of the sequences used is included