Reconciling early-outbreak estimates of the basic reproductive number and its uncertainty: framework and applications to the novel coronavirus (SARS-CoV-2) outbreak

Sang Woo Park; Benjamin M. Bolker; David Champredon; David J. D. Earn; Michael Li; Joshua S. Weitz; Bryan T. Grenfell; Jonathan Dushoff

doi:10.1101/2020.01.30.20019877

Abstract

A novel coronavirus (SARS-CoV-2) has recently emerged as a global threat. As the epidemic progresses, many disease modelers have focused on estimating the basic reproductive number ℛ₀ (the average number of secondary cases caused by a primary case in an otherwise susceptible population). The modeling approaches and resulting estimates of ℛ₀ vary widely, despite relying on similar data sources. Here, we present a novel statistical framework for comparing and combining different estimates of ℛ₀ across a wide range of models by decomposing the basic reproductive number into three key quantities: the exponential growth rate r, the mean generation interval Ḡ, and the generation-interval dispersion κ. We then apply our framework to early estimates of ℛ₀ for the SARS-CoV-2 outbreak. We show that many early ℛ₀ estimates are overly confident. Our results emphasize the importance of propagating uncertainties in all components of ℛ₀, including the shape of the generation-interval distribution, in efforts to estimate ℛ₀ at the outset of an epidemic.

SARS-CoV-2
COVID-19
novel coronavirus
basic reproductive number
generation interval
Bayesian multilevel model

1 Introduction

Since December 2019, a novel coronavirus (SARS-CoV-2) has been spreading in China and other parts of the world (World Health Organization, 2020d). Although the virus is believed to have originated from animal reservoirs (Centers for Disease Control and Prevention, 2020), the ability of SARS-CoV-2 to transmit directly between humans has posed a greater threat for its spread (Huang et al., 2020; World Health Organization, 2020c). As of February 27, 2020, the World Health Organization (WHO) has confirmed 82,294 cases of the coronavirus disease (COVID-19), including 3,664 confirmed cases in 46 countries outside China (World Health Organization, 2020a).

As the disease continues to spread, many researchers have already published their analyses of the outbreak as pre-prints (e.g., Bedford et al. (2020); Imai et al. (2020); Liu et al. (2020); Majumder and Mandl (2020); Read et al. (2020a); Zhao et al. (2020)) and in peer-reviewed journals (e.g., Li et al. (2020); Riou and Althaus (2020b); Wu et al. (2020); Zhao et al. (2020)), focusing in particular on estimates of the basic reproductive number ℛ₀ (i.e., the average number of secondary cases generated by a primary case in a fully susceptible population (Anderson and May, 1991; Diekmann et al., 1990)). Estimates of the basic reproductive number are of interest during an outbreak because they provide information about the level of intervention required to interrupt transmission (Anderson and May, 1991), and about the potential final size of the outbreak (Anderson and May, 1991; Ma and Earn, 2006). We commend these researchers for their timely contribution and those who made the data publicly available. However, it can be difficult to compare a disparate set of estimates of ℛ₀ from different research groups (as well as the associated degrees of uncertainty) when the estimation methods and their underlying assumptions vary widely.

Here, we show that a wide range of approaches to estimating ℛ₀ can be understood and compared in terms of estimates of three quantities: the exponential growth rate r, the mean generation interval Ḡ, and the generation-interval dispersion κ. The generation interval, defined as the interval between the time when an individual becomes infected and the time when that individual infects another individual (Svensson, 2007), plays a key role in shaping the relationship between r and ℛ₀ (Wearing et al., 2005; Roberts and Heesterbeek, 2007; Wallinga and Lipsitch, 2007; Park et al., 2019); therefore, estimates of ℛ₀ from different models directly depend on their implicit assumptions about the generation-interval distribution and the exponential growth rate. Early in an epidemic, information is scarce and there is inevitably a great deal of uncertainty surrounding both case reports (affecting the estimates of the exponential growth rate) and contact tracing (affecting the estimates of the generation-interval distribution). We suggest that disease modelers should make sure their assumptions about these three quantities are clear and reasonable, and that estimates of uncertainty in ℛ₀ should propagate error from all three sources (Elderd et al., 2006).

We compare seven disparate models published online between January 23–26, 2020 that estimated ℛ₀ for the SARS-CoV-2 outbreak (Bedford et al., 2020; Imai et al., 2020; Liu et al., 2020; Majumder and Mandl, 2020; Read et al., 2020a; Riou and Althaus, 2020a; Zhao et al., 2020). We use a Bayesian multilevel model to construct pooled estimates for the three key quantities: r, Ḡ, and κ; the pooled estimates reflect the uncertainties present in modeling approaches and their underlying assumptions. We use these pooled estimates to illustrate the importance of propagating different sources of error, particularly uncertainty in both the growth rate and the generation interval. We also use our framework to tease apart which assumptions of these different models led to their different estimates and confidence intervals. Despite the availability of more recent and/or updated estimates of ℛ₀, we restrict ourselves to the estimates above in order to focus on the resolution of uncertainty in the earliest stages of an epidemic.

2 Methods

2.1 Description of the studies

We gathered information on estimates of ℛ₀ and their assumptions about the underlying generation-interval distributions from 7 articles that were published online between January 23–26, 2020 (Table 1). Five studies (Liu et al., 2020; Majumder and Mandl, 2020; Read et al., 2020a; Riou and Althaus, 2020a; Zhao et al., 2020) were uploaded to pre-print servers (bioRxiv, medRxiv, and SSRN); one report was posted on the web site of Imperial College London (Imai et al., 2020); and one report was posted on nextstrain.org (Bedford et al., 2020). Their modeling approaches vary widely: a branching process model (Bedford et al., 2020; Imai et al., 2020; Riou and Althaus, 2020a), a deterministic Susceptible-Exposed-Infected-Recovered (SEIR) model (Read et al., 2020a), an exponential growth model (Zhao et al., 2020), a Poisson offspring distribution model (Liu et al., 2020), and the Incidence Decay and Exponential Adjustment (IDEA) model (Majumder and Mandl, 2020). Four studies estimated ℛ₀ by directly fitting their models to incidence data (Read et al., 2020a; Zhao et al., 2020; Liu et al., 2020; Majumder and Mandl, 2020). The remaining three studies estimated ℛ₀ by comparing the predicted number of cases from their models with the estimated number of total cases by January 18 (between 1,000 and 9,700 (Imai et al., 2020)). Some of these studies have now been published in peer-reviewed journals (Riou and Althaus, 2020b; Zhao et al., 2020) or have been updated with better uncertainty quantification (Read et al., 2020b).

Table 1: Reported estimates of the basic reproductive number and the assumptions about the generation-interval distributions.

Estimates of ℛ₀ and their assumptions about the shape of the generation interval distributions were collected from 7 studies. ^∗We treat these intervals as a 95% confidence interval in our analysis. ^†We assume κ = 0.5 in our analysis. ^‡The authors presented ℛ₀ estimates under different assumptions regarding the reporting rate; we use their baseline scenario in our analysis to remain consistent with other studies, which do not account for changes in the reporting rate.

2.2 Gamma approximation framework for linking r and ℛ₀

Early in an outbreak, ℛ₀ is difficult to estimate directly; instead, ℛ₀ is often inferred from the exponential growth rate r, which can be estimated reliably from incidence data (Ma et al., 2014). Given an estimate of the exponential growth rate r and an intrinsic generation-interval distribution g(τ) (Champredon and Dushoff, 2015), the basic reproductive number can be estimated via the Euler-Lotka equation (Wallinga and Lipsitch, 2007):

$$\frac{1}{\mathcal{R}_0} = \int_0^\infty \exp(-r\tau)\, g(\tau)\, d\tau. \qquad (1)$$

In other words, estimates of ℛ₀ must depend on the assumptions about the exponential growth rate r and the shape of the generation-interval distribution g(τ).

Here, we use the gamma approximation framework (Park et al., 2019) to (i) characterize the amount of uncertainty present in the exponential growth rates and the shape of the generation-interval distribution and (ii) assess the degree to which these uncertainties affect the estimate of ℛ₀. Assuming that generation intervals follow a gamma distribution with the mean Ḡ and the squared coefficient of variation κ, we have

$$\mathcal{R}_0 = \left(1 + \kappa r \bar{G}\right)^{1/\kappa}. \qquad (2)$$

This equation demonstrates that a generation-interval distribution that has a larger mean (higher Ḡ) or is less variable (lower κ) will give a higher estimate of ℛ₀ for the same value of r (Wallinga and Lipsitch, 2007).
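As a minimal sketch of this relationship, the Python snippet below evaluates the gamma-approximation formula ℛ₀ = (1 + κrḠ)^(1/κ) and checks it against a direct numerical evaluation of the Euler-Lotka integral for a gamma-distributed generation interval. The function names, the midpoint integration rule, and the example values (r = 0.1 per day, Ḡ = 8 days) are illustrative assumptions, not taken from the studies reviewed here.

```python
import math


def r0_gamma(r, gbar, kappa):
    """R0 under the gamma approximation: (1 + kappa * r * gbar) ** (1 / kappa).

    kappa = 0 is treated as the fixed-generation-interval limit, exp(r * gbar).
    """
    if kappa < 0:
        raise ValueError("kappa must be non-negative")
    if kappa < 1e-12:
        return math.exp(r * gbar)
    return (1.0 + kappa * r * gbar) ** (1.0 / kappa)


def r0_euler_lotka(r, gbar, kappa, upper=400.0, steps=50_000):
    """R0 = 1 / integral of exp(-r t) g(t) dt, with g a gamma density.

    The gamma density has mean gbar and squared coefficient of variation
    kappa, i.e. shape = 1 / kappa and scale = gbar * kappa. A midpoint rule
    on [0, upper] is used; this is adequate for 0 < kappa <= 1.
    """
    shape = 1.0 / kappa
    scale = gbar * kappa
    log_norm = math.lgamma(shape) + shape * math.log(scale)
    h = upper / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        log_g = (shape - 1.0) * math.log(t) - t / scale - log_norm
        total += math.exp(log_g - r * t) * h
    return 1.0 / total


if __name__ == "__main__":
    r, gbar = 0.1, 8.0  # illustrative values only
    for kappa in (0.0, 0.25, 0.5, 1.0):
        print(kappa, round(r0_gamma(r, gbar, kappa), 3))
```

For fixed r and Ḡ, the printed values decrease as κ increases, in line with the statement that a less variable generation interval gives a higher ℛ₀.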

2.3 Statistical framework

As most studies do not report their estimates of the exponential growth rate, we first recalculate the exponential growth rates that correspond to their model assumptions. We do so by modeling reported distributions of the reproductive number ℛ₀, the mean generation interval Ḡ, and the generation-interval dispersion parameter κ with appropriate probability distributions; we used gamma distributions to model values reported with confidence intervals and uniform distributions to model values reported with ranges. For example, Study 3 estimated ℛ₀ = 2.92 (95% CI: 2.28–3.67); we model this estimate as a gamma distribution with a mean of 2.92 and a shape parameter of 67, which has a 95% probability of containing a value between 2.28 and 3.67 (see Table 2 for a complete description). For each study i, we construct a family of parameter sets by drawing 100,000 random samples from the probability distributions (Table 2) that represent the estimates of ℛ_0i and the assumed values of Ḡ_i and κ_i and calculate the exponential growth rate r_i via the inverse of Eq. 2:

$$r_i = \frac{\mathcal{R}_{0i}^{\kappa_i} - 1}{\kappa_i \bar{G}_i}. \qquad (3)$$

Table 2: Probability distributions for ℛ₀, Ḡ, and κ.

We use these probability distributions to obtain a probability distribution for the exponential growth rate r. The gamma distribution is parameterized by its mean and shape α. Constant values are fixed according to Table 1. ^∗We do not account for this uncertainty during our recalculation of the exponential growth rate r because the reported estimate of ℛ₀ and its uncertainty assumes a fixed value of Ḡ; the original article reports three ℛ₀ estimates (with 95% CIs) using three different values of Ḡ: 7.6 (MERS-like), 8 (average), and 8.4 (SARS-like). We still account for this uncertainty in our pooled estimates (µ_G). ^†Study 6 uses the IDEA model (Fisman et al., 2013), through which the authors effectively fit an exponential curve to the cumulative number of confirmed cases without propagating any statistical uncertainty. Instead of modeling ℛ₀ with a probability distribution and recalculating r, we use r = 0.114 days⁻¹, which explains all uncertainty in the reported ℛ₀, when combined with the considered range of Ḡ.

This allows us to approximate the probability distributions of the estimated exponential growth rates by each study; uncertainties in the probability distributions that we calculate for the estimated exponential growth rates will reflect the methods and assumptions that the studies rely on.
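The recalculation step can be sketched as follows, using the Study 3 example from the text (ℛ₀ modeled as a gamma distribution with mean 2.92 and shape 67). The values of Ḡ and κ used here are hypothetical placeholders, since the per-study assumptions are listed in Table 1 rather than in the text, and the helper names are likewise illustrative.

```python
import math
import random


def draw_gamma(rng, mean, shape):
    """Draw from a gamma distribution parameterized by its mean and shape."""
    return rng.gammavariate(shape, mean / shape)  # arguments: (shape, scale)


def growth_rate(r0, gbar, kappa):
    """Invert R0 = (1 + kappa * r * gbar) ** (1 / kappa) to recover r."""
    if kappa < 1e-12:  # fixed generation interval: R0 = exp(r * gbar)
        return math.log(r0) / gbar
    return (r0 ** kappa - 1.0) / (kappa * gbar)


def interval95(values):
    """Empirical 2.5% and 97.5% quantiles (simple order-statistic rule)."""
    s = sorted(values)
    return s[int(0.025 * (len(s) - 1))], s[int(0.975 * (len(s) - 1))]


def sample_study(n=100_000, seed=1, gbar=8.4, kappa=0.5):
    """Draw R0 for one study and convert each draw to a growth rate.

    gbar and kappa are hypothetical fixed values used for illustration only.
    """
    rng = random.Random(seed)
    r0_draws = [draw_gamma(rng, 2.92, 67.0) for _ in range(n)]
    r_draws = [growth_rate(x, gbar, kappa) for x in r0_draws]
    return r0_draws, r_draws


if __name__ == "__main__":
    r0_draws, r_draws = sample_study()
    print("R0 95% interval:", interval95(r0_draws))
    print("r  95% interval:", interval95(r_draws))
```

When a study reports Ḡ or κ with uncertainty, the same pattern applies with those values drawn per sample instead of held fixed, so that each parameter set (ℛ₀, Ḡ, κ, r) stays internally consistent.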

We construct pooled estimates for each parameter (r, Ḡ, and κ) using a Bayesian multilevel modeling approach, which assumes that the parameters across different studies come from the same gamma distribution. The pooled estimates, which are represented as probability distributions rather than point estimates, allow us to average across different modeling approaches, while accounting for the uncertainties in the assumptions they make:

$$\begin{aligned}
r_i &\sim \mathrm{Gamma}\left(\text{mean} = \mu_r,\ \text{shape} = \mu_r^2/\sigma_r^2\right),\\
\bar{G}_i &\sim \mathrm{Gamma}\left(\text{mean} = \mu_G,\ \text{shape} = \mu_G^2/\sigma_G^2\right),\\
\kappa_i &\sim \mathrm{Gamma}\left(\text{mean} = \mu_\kappa,\ \text{shape} = \mu_\kappa^2/\sigma_\kappa^2\right),
\end{aligned}$$

where µ_r, µ_G, and µ_κ represent the pooled estimates, and σ_r, σ_G, and σ_κ represent between-study standard deviations. We account for uncertainties associated with r_i, Ḡ_i, and κ_i (and their correlations) by drawing a random set from the family of parameter sets for each study at each Metropolis-Hastings step. Since the gamma distribution does not allow zeros, we use κ = 0.02 instead for Study 7. We note that this approach does not account for nonindependence between the parameter estimates made by different modelers. As we add more models, the pooled estimates can become sharper even when the models no longer add more information. Thus, the pooled estimator should be interpreted with care.

We use weakly informative priors on the hyperparameters.

We followed recommendations outlined in Gelman et al. (2006), parameterizing the top-level gamma distributions in terms of their means and standard deviations and imposing weakly informative prior distributions on between-study standard deviations, i.e., half-normal(0, 10). We had initially used gamma priors with small shape parameters (< 1) on between-study shape parameters (= µ²/σ²) but found this put too much prior probability on large between-study variances. This phenomenon is a known problem (Gelman et al., 2006). Alternative choices of prior for the between-study shape parameters are also suboptimal: imposing strong priors (e.g., half-t(µ = 0, σ = 1, ν = 4)) assumes a priori that between-study variance is large, while weak priors (e.g., half-Cauchy(0, 5)) can lead to poor mixing.
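The mean and standard deviation parameterization can be sketched in Python as below. This is an illustrative reading of the model structure for a single parameter (for example r), not the authors' implementation: the function names are hypothetical, the second argument of half-normal(0, 10) is assumed to be a scale (standard deviation), and the prior on the pooled mean is left as a caller-supplied function with a flat placeholder default because its form is not stated in the text.

```python
import math
import random


def gamma_logpdf(x, mean, sd):
    """Log density of a gamma distribution with the given mean and sd."""
    if x <= 0 or mean <= 0 or sd <= 0:
        return -math.inf
    shape = (mean / sd) ** 2   # shape = mu^2 / sigma^2
    scale = sd ** 2 / mean     # scale = sigma^2 / mu
    return ((shape - 1.0) * math.log(x) - x / scale
            - math.lgamma(shape) - shape * math.log(scale))


def half_normal_logpdf(sigma, scale=10.0):
    """Log density of a half-normal prior (assumed scale of 10)."""
    if sigma < 0:
        return -math.inf
    return (0.5 * math.log(2.0 / math.pi) - math.log(scale)
            - 0.5 * (sigma / scale) ** 2)


def log_posterior(study_values, mu, sigma, log_prior_mu=lambda mu: 0.0):
    """Unnormalized log posterior of (mu, sigma) for one parameter.

    study_values holds one value per study; log_prior_mu defaults to a flat
    placeholder, which is an assumption of this sketch.
    """
    if mu <= 0 or sigma <= 0:
        return -math.inf
    total = log_prior_mu(mu) + half_normal_logpdf(sigma)
    return total + sum(gamma_logpdf(x, mu, sigma) for x in study_values)


def draw_study_values(families, rng):
    """Pick one random draw per study from its family of sampled values."""
    return [rng.choice(family) for family in families]
```

In a Metropolis-Hastings loop, `draw_study_values` would be called at every step before evaluating `log_posterior`, which mirrors the way study-level uncertainty is described as being propagated into the pooled estimates.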

We run 4 independent Markov Chain Monte Carlo chains, each consisting of 500,000 burn-in steps and 500,000 sampling steps. Posterior samples are thinned every 1000 steps. Convergence is assessed by ensuring that the Gelman-Rubin statistic is below 1.01 for all hyperparameters (Gelman et al., 1992); trace plots and marginal posterior distribution plots are presented in the Appendix. We calculate 95% confidence intervals by taking the 2.5% and 97.5% quantiles of the marginal posterior distribution for each parameter.
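For readers unfamiliar with these diagnostics, the sketch below computes a basic Gelman-Rubin statistic and a quantile-based 95% interval from a set of chains. It uses the simple between/within variance form without the degrees-of-freedom correction, which may differ from the exact variant used in the analysis; the function names and the simulated chains are illustrative assumptions.

```python
import math
import random
import statistics


def gelman_rubin(chains):
    """Basic potential scale reduction factor for equal-length chains."""
    m = len(chains)
    n = len(chains[0])
    if m < 2 or n < 2 or any(len(c) != n for c in chains):
        raise ValueError("need at least 2 chains of equal length >= 2")
    chain_means = [statistics.fmean(c) for c in chains]
    within = statistics.fmean([statistics.variance(c) for c in chains])
    between = n * statistics.variance(chain_means)
    pooled = (n - 1) / n * within + between / n
    return math.sqrt(pooled / within)


def interval95(samples):
    """2.5% and 97.5% quantiles of pooled posterior samples."""
    cuts = statistics.quantiles(samples, n=40)  # cut points every 2.5%
    return cuts[0], cuts[-1]


if __name__ == "__main__":
    rng = random.Random(1)
    # Four simulated chains of 500 retained draws each, standing in for
    # thinned posterior samples of one hyperparameter.
    chains = [[rng.gauss(0.0, 1.0) for _ in range(500)] for _ in range(4)]
    pooled_samples = [x for c in chains for x in c]
    print("R-hat:", gelman_rubin(chains))
    print("95% interval:", interval95(pooled_samples))
```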

3 Results

Fig. 1 compares the reported values of the exponential growth rate r, mean generation interval Ḡ, and the generation-interval dispersion κ from different studies with the pooled estimates that we calculate from our multilevel model. We find that there is a large uncertainty associated with the underlying parameters; many models rely on strong assumptions that ignore these uncertainties. Surprisingly, no studies take into account how the variation in generation intervals affects their estimates of ℛ₀: all studies assumed fixed values for κ, ranging from 0 to 1. Assuming fixed parameter values can lead to overly strong conclusions (Elderd et al., 2006).

Figure 1: Comparisons of the reported parameter values with our pooled estimate.

We inferred point estimates (black), uniform distributions (orange) or confidence intervals (purple) for each parameter from each study, and combined them into pooled estimates (red; see text). Open triangle: we assumed κ = 0.5 for Study 2 which does not report generation-interval dispersion.

Fig. 2 shows how propagating uncertainty in different combinations would affect estimates and CIs for ℛ₀. For illustrative purposes, we use our pooled estimates, which may represent a reasonable proxy for the state of knowledge as of January 23–26 (Fig. 2A). Comparing the models that include only some sources of uncertainty to the “all” model, we see that propagating error from the growth rate (which all but one of the studies reviewed did) is absolutely crucial: the middle bar (“GI mean”), which lacks growth-rate uncertainty, is relatively narrow. In this case, propagating error from the mean generation interval has negligible effect compared to propagating the uncertainty in r. Uncertainty in the generation-interval dispersion also has important effects as it determines the functional form of the relationship between r and ℛ₀ (compare “growth rate + GI mean” with “all”). For example, reducing the dispersion parameter κ from 1 (assuming exponentially distributed generation intervals) to 0 (assuming fixed generation intervals) changes the r–ℛ₀ relationship from linear to exponential, therefore increasing the sensitivity of ℛ₀ estimates to r and Ḡ.

Figure 2: Effects of r, Ḡ, and κ on the estimates of ℛ₀.

We compare estimates of ℛ₀ under five scenarios that propagate different combinations of uncertainties (A) based on our pooled estimates (µ_r, µ_G, and µ_κ) and (B) assuming a 4-fold reduction in uncertainty of our pooled estimate of the exponential growth rate (using (µ_r + 3 × median(µ_r))/4, instead). base: ℛ₀ estimates based on the median estimates of µ_r, µ_G, and µ_κ. growth rate: ℛ₀ estimates based on the posterior distribution of µ_r while using median estimates of µ_G and µ_κ. GI mean: ℛ₀ estimates based on the posterior distribution of µ_G while using median estimates of µ_r and µ_κ. growth rate + GI mean: ℛ₀ estimates based on the joint posterior distributions of µ_r and µ_G while using a median estimate of µ_κ. all: ℛ₀ estimates based on the joint posterior distributions of µ_r, µ_G, and µ_κ. Vertical lines represent the 95% confidence intervals.

As uncertainty associated with the exponential growth rate decreases, accounting for uncertainties in generation intervals becomes even more important (Fig. 2B). Propagating error only from the growth rate gives very narrow confidence intervals in this case. Likewise, propagating errors from the growth rate and the mean generation interval gives wider but still too narrow confidence intervals. We expect this hypothetical example to better reflect more recent scenarios, as increased data availability will allow researchers to estimate r with more certainty.
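The five scenarios can be mimicked with a small simulation. The sketch below is only an illustration of the propagation logic: the three input distributions are hypothetical stand-ins for the pooled posteriors (chosen so that r is the most uncertain quantity), the draws are independent rather than joint, and all names are assumptions of this example.

```python
import random
import statistics


def r0_gamma(r, gbar, kappa):
    """R0 under the gamma approximation (kappa > 0)."""
    return (1.0 + kappa * r * gbar) ** (1.0 / kappa)


def draw_gamma(rng, mean, shape):
    """Draw from a gamma distribution parameterized by mean and shape."""
    return rng.gammavariate(shape, mean / shape)


def interval95(values):
    """Empirical 2.5% and 97.5% quantiles (simple order-statistic rule)."""
    s = sorted(values)
    return s[int(0.025 * (len(s) - 1))], s[int(0.975 * (len(s) - 1))]


def scenario_intervals(n=20_000, seed=1):
    """95% intervals for R0 under five uncertainty-propagation scenarios."""
    rng = random.Random(seed)
    # Hypothetical stand-ins for the posteriors of mu_r, mu_G, and mu_kappa.
    r = [draw_gamma(rng, 0.15, 10.0) for _ in range(n)]
    g = [draw_gamma(rng, 8.0, 100.0) for _ in range(n)]
    k = [draw_gamma(rng, 0.5, 4.0) for _ in range(n)]
    r_med, g_med, k_med = (statistics.median(x) for x in (r, g, k))
    scenarios = {
        "base": [r0_gamma(r_med, g_med, k_med)],
        "growth rate": [r0_gamma(ri, g_med, k_med) for ri in r],
        "GI mean": [r0_gamma(r_med, gi, k_med) for gi in g],
        "growth rate + GI mean": [r0_gamma(ri, gi, k_med)
                                  for ri, gi in zip(r, g)],
        "all": [r0_gamma(ri, gi, ki) for ri, gi, ki in zip(r, g, k)],
    }
    return {name: interval95(vals) for name, vals in scenarios.items()}


if __name__ == "__main__":
    for name, (lo, hi) in scenario_intervals().items():
        print(f"{name:>22}: {lo:.2f} to {hi:.2f}")
```

Shrinking the spread of the r draws toward their median, as in panel B, leaves the "growth rate" interval narrow while the "all" interval stays comparatively wide, which is the pattern described in the text.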

We also compare the estimates of ℛ₀ across different studies by replacing their values of r, Ḡ, and κ with our pooled estimates (µ_r, µ_G, and µ_κ, respectively) one at a time and recalculating the basic reproductive number ℛ₀ (Fig. 3). This procedure allows us to assess the sensitivity of the estimates of ℛ₀ across appropriate ranges of uncertainties. We find that incorporating uncertainties one at a time increases the width of the confidence intervals in all but 7 cases. We estimate narrower confidence intervals for Study 3, Study 6, and Study 7 when we account for proper uncertainties in the generation-interval dispersion because they assume a narrow generation-interval distribution (compare “base” with “GI variation”); when higher values of κ are used, their estimates of ℛ₀ become less sensitive to the values of r and Ḡ, giving narrower confidence intervals. We estimate narrower confidence intervals for Study 5 and Study 7 when we account for proper uncertainties in the mean generation interval (compare “base” with “GI mean”) because the range of uncertainty in the mean generation interval they consider is much wider than the pooled range (Fig. 1). Substituting the reported r or Ḡ from Study 1 with our pooled estimates gives narrower confidence intervals for similar reasons.

Figure 3: Sensitivity of the reported ℛ₀ estimates with respect to our pooled estimates of the underlying parameters.

We replace the reported parameter values (growth rate r, GI mean Ḡ, and GI variation κ) with our corresponding pooled estimates (µ_r, µ_G, and µ_κ) one at a time and recalculate ℛ₀ (growth rate, GI mean, and GI variation). The pooled estimate of ℛ₀ is calculated from the joint posterior distribution of µ_r, µ_G, and µ_κ (all); this corresponds to replacing all reported parameter values with our pooled estimates, which gives identical results across all studies. Horizontal dashed lines represent the 95% confidence intervals of our pooled estimate of ℛ₀. The reported ℛ₀ estimates (base) have been adjusted to show the approximate 95% confidence interval using the probability distributions that we defined if they had relied on different measures for parameter uncertainties.

We find that accounting for uncertainties in the estimate of r has the largest effect on the estimates of ℛ₀ in most cases (Fig. 3). For example, recalculating ℛ₀ for Study 7 by using our pooled estimate of r gives ℛ₀ = 3.9 (95% CI: 2.3–8.6), which is much wider than the uncertainty range they reported (2.0–3.1). There are two explanations for this result. First, even though the exponential growth rate r and the mean generation interval Ḡ have identical mathematical effects on ℛ₀ in our framework (Eq. 2 in Methods), r is more influential in this case because it is associated with more uncertainty (Fig. 1). Second, assuming a fixed generation interval (κ = 0) makes the estimate of ℛ₀ too sensitive to r and Ḡ. One exception is Study 1: we find this estimate of ℛ₀ is most sensitive to generation-interval dispersion κ. This is because Study 1 assumes an exponentially distributed generation interval (κ = 1): estimates that rely on this assumption make ℛ₀ relatively insensitive to r and Ḡ and thus tend to have particularly narrow confidence intervals.
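One way to make this sensitivity argument concrete, assuming the gamma approximation ℛ₀ = (1 + κrḠ)^(1/κ), is to compute the proportional change in ℛ₀ per proportional change in r or Ḡ. The short derivation below is an illustrative sketch rather than a result quoted from the studies reviewed here.

```latex
\begin{align*}
\log \mathcal{R}_0 &= \frac{1}{\kappa}\,\log\!\left(1 + \kappa r \bar{G}\right), \\
\frac{\partial \log \mathcal{R}_0}{\partial \log r}
  &= \frac{\partial \log \mathcal{R}_0}{\partial \log \bar{G}}
   = \frac{r\bar{G}}{1 + \kappa r \bar{G}}, \\
\kappa \to 0:&\quad \frac{\partial \log \mathcal{R}_0}{\partial \log r} \to r\bar{G},
\qquad
\kappa = 1:\quad \frac{\partial \log \mathcal{R}_0}{\partial \log r}
   = \frac{r\bar{G}}{1 + r\bar{G}} < 1.
\end{align*}
```

The sensitivity is the same for r and Ḡ and decreases as κ grows, so the same uncertainty in r produces a wider interval for ℛ₀ under a fixed generation interval (κ = 0) than under an exponentially distributed one (κ = 1).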

Finally, we incorporate all uncertainties by using posterior samples for µ_r, µ_G, and µ_κ to recalculate ℛ₀ and compare it with the reported ℛ₀ estimates. Our estimated ℛ₀ from the pooled distribution has a median of 2.9 (95% CI: 2.1–4.5). While the point estimate of ℛ₀ is similar to other reported values from this date range, the confidence intervals are wider than those reported by all but one study. This result does not imply that assumptions based on the pooled estimate are too weak; we believe that this confidence interval more accurately reflects the level of uncertainties present in the information that was available when these models were fitted. In fact, because the pooled estimate does not account for overlap in data sources used by the models, we feel that it is more likely to be over-confident than under-confident. Our median estimate averages over the various studies, and therefore particular studies have higher or lower median estimates. We note in particular that, while the baseline example we used from Study 6 may appear to be an outlier, the authors of this study also explore different scenarios involving changes in reporting rate over time, under which their estimates of ℛ₀ are similar to other reported estimates. Here, our focus is on estimating uncertainty, not on identifying potential explanations for these discrepancies.

4 Discussion

Estimating the basic reproductive number ℛ₀ is crucial for predicting the course of an outbreak and planning intervention strategies. Here, we use a gamma approximation (Park et al., 2019) to decompose ℛ₀ estimates into three key quantities (r, Ḡ, and κ) and apply a multilevel Bayesian framework to compare estimates of ℛ₀ for the novel coronavirus outbreak. Our results demonstrate the importance of accounting for uncertainties associated with the underlying generation-interval distributions, including uncertainties in the amount of dispersion in the generation intervals. Our analysis of individual studies shows that many early estimates of ℛ₀ rely on strong assumptions.

Of the seven studies that we reviewed, two directly fit their models to the cumulative number of confirmed cases. This approach can be appealing because of its simplicity and apparent robustness, but fitting a model to cumulative incidence instead of raw incidence can both bias parameters and give overly narrow confidence intervals if the resulting non-independent error structures are not taken into account (Ma et al., 2014; King et al., 2015). Naive fits to cumulative incidence data should therefore be avoided.

Many sources of noise affect real-world incidence data, including both dynamical, or “process”, noise (randomness that directly or indirectly affects disease transmission); and observation noise (randomness underlying how many of the true cases are reported). Disease modelers face the choice of incorporating one or both of these in their data-fitting and modeling steps. This is not always a serious problem, particularly if the goal is inferring parameters rather than directly making forecasts (Ma et al., 2014). Modelers should however be aware of the possibility that ignoring one kind of error can give overly narrow confidence intervals (King et al., 2015; Taylor et al., 2016).

There are other important phenomena not covered by our simple framework. Examples that seem relevant to this outbreak include: changing reporting rates, reporting delays (including the effects of weekends and holidays), and changing generation intervals. For emerging pathogens such as SARS-CoV-2, there may be an early period of time when the reporting rate is very low due to limited awareness or diagnostic resources; for example, Zhao et al. (2020) (Study 6) demonstrated that estimates of ℛ₀ can change from 5.47 (95% CI: 4.16–7.10) to 3.30 (95% CI: 2.73–3.96) when they assume 2-fold changes in the reporting rate between January 17, when the official diagnostic guidelines were released (World Health Organization, 2020b), and January 20. Delays between key epidemiological timings (e.g., infection, symptom onset, and detection) can also shift the shape of an observed epidemic curve and, therefore, affect parameter estimates as well as predictions of the course of an outbreak (Tariq et al., 2019). Even though a constant delay between infection and detection may not affect the estimate of the growth rate, it can still affect the associated confidence intervals. Finally, generation intervals can become shorter throughout an epidemic as intervention strategies, such as quarantine, can reduce the infectious period (Hethcote et al., 2002). Accounting for these factors is crucial for making accurate inferences.

Here, we focused on the estimates of ℛ₀ that were published within a very short time frame (January 23–26). During early phases of an outbreak, it is reasonable to assume that the epidemic grows exponentially (Anderson and May, 1991). However, as the number of susceptible individuals decreases, the epidemic will saturate, and estimates of r used for ℛ₀ should account for the possibility that r is decreasing through time. Although our analysis only reflects a snapshot of a fast-moving epidemic, we expect certain lessons to hold: confidence intervals must combine different sources of uncertainty. In fact, as epidemics progress and more data becomes available, it is likely that inferences about exponential growth rate (and other epidemiological parameters) will become more precise; thus the risk of over-confidence when uncertainty about the generation-interval distribution is neglected will become greater.

We strongly emphasize the value of attention to accurate characterization of the transmission chains via contact tracing and better statistical frameworks for inferring generation-interval distributions from such data (Britton and Scalia Tomba, 2019). A combined effort between public-health workers and modelers in this direction will be crucial for predicting the course of an epidemic and controlling it. We also emphasize the value of transparency from modelers. Model estimates during an outbreak, even in pre-prints, should include code links and complete explanations. We suggest that using methods based on open-source tools allows for maximal reproducibility.

In summary, we have provided a basis for comparing exponential-growth-based estimates of ℛ₀ and its associated uncertainty in terms of three components: the exponential growth rate, the mean generation interval, and the generation-interval dispersion. We hope this framework will help researchers understand and reconcile disparate estimates of disease transmission early in an epidemic.

Funding

BMB and DJDE were supported by Natural Sciences and Engineering Research Council (NSERC). ML was supported by Canadian Institutes of Health Research (CIHR). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests

We declare no competing interests.

Contribution

SWP and JD developed the statistical framework. SWP reviewed the published literature. SWP performed the analysis. SWP, BMB, and JD created the figures. SWP and JD wrote the first draft. All authors contributed to the writing and approval of the final report.

Data availability

R code is available in GitHub (https://github.com/parksw3/nCoV_framework).

Acknowledgements

We thank Daihai He for providing helpful comments on the manuscript.

Appendix

Figure A1: Trace plots of the multilevel model.

Each chain is represented by a different color.

Figure A2: Marginal posterior distributions of the multilevel model.

Each chain is represented by a different color.

References

↵
Anderson, R. M. and R. M. May (1991). Infectious diseases of humans: dynamics and control. Oxford university press.
Google Scholar
↵
Bedford, T., R. Neher, J. Hadfield, E. Hodcroft, M. Ilcisin, and N. Müller (2020). Genomic analysis of nCoV spread. Situation report 2020-01-23. https://nextstrain.org/narratives/ncov/sit-rep/2020-01-23. Accessed 24, January, 2020.
Google Scholar
Britton, T. and G. Scalia Tomba (2019). Estimation in emerging epidemics: Biases and remedies. J R Soc Interface 16 (150), 20180670.
OpenUrl PubMed Google Scholar
↵
Centers for Disease Control and Prevention (2020). 2019 Novel Coronavirus (2019-nCoV), Wuhan, China. https://www.cdc.gov/coronavirus/2019-ncov/summary.html. Accessed 29, January, 2020.
Google Scholar
↵
Champredon, D. and J. Dushoff (2015). Intrinsic and realized generation intervals in infectious-disease transmission. Proc R Soc Lond B Biol Sci 282 (1821), 20152026.
OpenUrl CrossRef PubMed Google Scholar
↵
Diekmann, O., J. A. P. Heesterbeek, and J. A. Metz (1990). On the definition and the compu-tation of the basic reproduction ratio R _>0 in models for infectious diseases in heterogeneous populations. J Math Biol 28 (4), 365–382.
OpenUrl CrossRef PubMed Web of Science Google Scholar
↵
Elderd, B. D., V. M. Dukic, and G. Dwyer (2006, October). Uncertainty in predictions of disease spread and public health responses to bioterrorism and emerging diseases. Proc Natl Acad Sci USA 103 (42), 15693 –15697.
OpenUrl Abstract/FREE Full Text Google Scholar
↵
Fisman, D. N., T. S. Hauck, A. R. Tuite, and A. L. Greer (2013). An IDEA for short term outbreak projection: nearcasting using the basic reproduction number. PloS One 8 (12).
Google Scholar
↵
Gelman, A. et al. (2006). Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian analysis 1 (3), 515–534.
OpenUrl Google Scholar
↵
Gelman, A., D. B. Rubin, et al. (1992). Inference from iterative simulation using multiple sequences. Stat Sci 7 (4), 457–472.
OpenUrl CrossRef PubMed Google Scholar
↵
Hethcote, H., M. Zhien, and L. Shengbing (2002). Effects of quarantine in six endemic models for infectious diseases. Math Biosci 180 (1-2), 141–160.
OpenUrl CrossRef PubMed Google Scholar
↵
Huang, C., Y. Wang, X. Li, L. Ren, J. Zhao, Y. Hu, L. Zhang, G. Fan, J. Xu, X. Gu, et al. (2020). Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet.
Google Scholar
↵
Imai, N., A. Cori, I. Dorigatti, M. Baguelin, C. A. Donelly, S. Riley, and N. M. Ferguson (2020). Report 3: Transmissibility of 2019-nCoV. https://www.imperial.ac.uk/media/imperial-college/medicine/sph/ide/gida-fellowships/Imperial-2019-nCoV-transmissibility.pdf. Accessed 26, January, 2020.
Google Scholar
Imai, N., I. Dorigatti, A. Cori, C. A. Donelly, S. Riley, and N. M. Ferguson (2020). Report 2: Estimating the potential total number of novel Coronavirus cases in Wuhan City, China. https://www.imperial.ac.uk/media/imperial-college/medicine/sph/ide/gida-fellowships/2019-nCoV-outbreak-report-22-01-2020.pdf. Accessed 3, February, 2020.
Google Scholar
↵
King, A. A., M. Domenech de Cellès, F. M. Magpantay, and P. Rohani (2015). Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola. Proc R Soc Lond B Biol Sci 282 (1806), 20150347.
Li, Q., X. Guan, P. Wu, X. Wang, L. Zhou, Y. Tong, R. Ren, K. S. Leung, E. H. Lau, J. Y. Wong, et al. (2020). Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. N Engl J Med.
Liu, T., J. Hu, M. Kang, L. Lin, H. Zhong, J. Xiao, G. He, T. Song, Q. Huang, Z. Rong, A. Deng, W. Zeng, X. Tan, S. Zeng, Z. Zhu, J. Li, D. Wan, J. Lu, H. Deng, J. He, and W. Ma (2020). Transmission dynamics of 2019 novel coronavirus (2019-nCoV). https://www.biorxiv.org/content/10.1101/2020.01.25.919787v1. Accessed 27, January, 2020.
Ma, J., J. Dushoff, B. M. Bolker, and D. J. Earn (2014). Estimating initial epidemic growth rates. Bull Math Biol 76 (1), 245–260.
Ma, J. and D. J. Earn (2006). Generality of the final size formula for an epidemic of a newly invading infectious disease. Bull Math Biol 68 (3), 679–702.
Majumder, M. and K. D. Mandl (2020). Early transmissibility assessment of a novel coronavirus in Wuhan, China. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3524675. Accessed 27, January, 2020.
Park, S. W., D. Champredon, J. S. Weitz, and J. Dushoff (2019). A practical generation-interval-based approach to inferring the strength of epidemics from their speed. Epidemics 27, 12–18.
Read, J. M., J. R. Bridgen, D. A. Cummings, A. Ho, and C. P. Jewell (2020a). Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions. https://www.medrxiv.org/content/10.1101/2020.01.23.20018549v1. Accessed 26, January, 2020.
Read, J. M., J. R. Bridgen, D. A. Cummings, A. Ho, and C. P. Jewell (2020b). Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions. https://www.medrxiv.org/content/10.1101/2020.01.23.20018549v2. Accessed 5, February, 2020.
Riou, J. and C. L. Althaus (2020a). Pattern of early human-to-human transmission of Wuhan 2019-nCoV. https://www.biorxiv.org/content/10.1101/2020.01.23.917351v1. Accessed 26, January, 2020.
Riou, J. and C. L. Althaus (2020b). Pattern of early human-to-human transmission of Wuhan 2019 novel coronavirus (2019-nCoV), December 2019 to January 2020. Euro Surveill 25 (4), 2000058.
Roberts, M. and J. Heesterbeek (2007). Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection. J Math Biol 55 (5-6), 803.
Svensson, Å. (2007). A note on generation times in epidemic models. Math Biosci 208 (1), 300–311.
Tariq, A., K. Roosa, K. Mizumoto, and G. Chowell (2019). Assessing reporting delays and the effective reproduction number: The Ebola epidemic in DRC, May 2018–January 2019. Epidemics 26, 128–133.
Taylor, B. P., J. Dushoff, and J. S. Weitz (2016). Stochasticity and the limits to confidence when estimating R₀ of Ebola and other emerging infectious diseases. J Theor Biol 408, 145–154.
Wallinga, J. and M. Lipsitch (2007). How generation intervals shape the relationship between growth rates and reproductive numbers. Proc R Soc Lond B Biol Sci 274 (1609), 599–604.
Wearing, H. J., P. Rohani, and M. J. Keeling (2005). Appropriate models for the management of infectious diseases. PLoS Med 2 (7).
World Health Organization (2020a). Coronavirus disease 2019 (COVID-19) Situation Report - 38. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200227-sitrep-38-covid-19.pdf?sfvrsn=9f98940c_2. Accessed February 27, 2020.
World Health Organization (2020b). Laboratory testing for 2019 novel coronavirus (2019-nCoV) in suspected human cases. https://www.who.int/publications-detail/laboratory-testing-for-2019-novel-coronavirus-in-suspected-human-cases-20200117. Accessed February 4, 2020.
World Health Organization (2020c). Novel Coronavirus (2019-nCoV) Situation Report - 6. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200126-sitrep-6-2019-ncov.pdf?sfvrsn=beaeee0c_4. Accessed January 26, 2020.
World Health Organization (2020d). Pneumonia of unknown cause – China. https://www.who.int/csr/don/05-january-2020-pneumonia-of-unkown-cause-china/en/. Accessed January 30, 2020.
Wu, J. T., K. Leung, and G. M. Leung (2020). Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet.
Zhao, S., Q. Lin, J. Ran, S. S. Musa, G. Yang, W. Wang, Y. Lou, D. Gao, L. Yang, D. He, et al. (2020). Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. Int J Infect Dis.
Zhao, S., J. Ran, S. S. Musa, G. Yang, Y. Lou, D. Gao, L. Yang, and D. He (2020). Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. https://www.biorxiv.org/content/10.1101/2020.01.23.916395v1. Accessed 26, January, 2020.

[1] ↵
Anderson, R. M. and R. M. May (1991). Infectious diseases of humans: dynamics and control. Oxford university press.
Google Scholar

[2] ↵
Bedford, T., R. Neher, J. Hadfield, E. Hodcroft, M. Ilcisin, and N. Müller (2020). Genomic analysis of nCoV spread. Situation report 2020-01-23. https://nextstrain.org/narratives/ncov/sit-rep/2020-01-23. Accessed 24, January, 2020.
Google Scholar

[3] Britton, T. and G. Scalia Tomba (2019). Estimation in emerging epidemics: Biases and remedies. J R Soc Interface 16 (150), 20180670.
OpenUrl PubMed Google Scholar

[4] ↵
Centers for Disease Control and Prevention (2020). 2019 Novel Coronavirus (2019-nCoV), Wuhan, China. https://www.cdc.gov/coronavirus/2019-ncov/summary.html. Accessed 29, January, 2020.
Google Scholar

[5] ↵
Champredon, D. and J. Dushoff (2015). Intrinsic and realized generation intervals in infectious-disease transmission. Proc R Soc Lond B Biol Sci 282 (1821), 20152026.
OpenUrl CrossRef PubMed Google Scholar

[6] ↵
Diekmann, O., J. A. P. Heesterbeek, and J. A. Metz (1990). On the definition and the compu-tation of the basic reproduction ratio R _>0 in models for infectious diseases in heterogeneous populations. J Math Biol 28 (4), 365–382.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[7] ↵
Elderd, B. D., V. M. Dukic, and G. Dwyer (2006, October). Uncertainty in predictions of disease spread and public health responses to bioterrorism and emerging diseases. Proc Natl Acad Sci USA 103 (42), 15693 –15697.
OpenUrl Abstract/FREE Full Text Google Scholar

[8] ↵
Fisman, D. N., T. S. Hauck, A. R. Tuite, and A. L. Greer (2013). An IDEA for short term outbreak projection: nearcasting using the basic reproduction number. PloS One 8 (12).
Google Scholar

[9] ↵
Gelman, A. et al. (2006). Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian analysis 1 (3), 515–534.
OpenUrl Google Scholar

[10] ↵
Gelman, A., D. B. Rubin, et al. (1992). Inference from iterative simulation using multiple sequences. Stat Sci 7 (4), 457–472.
OpenUrl CrossRef PubMed Google Scholar

[11] ↵
Hethcote, H., M. Zhien, and L. Shengbing (2002). Effects of quarantine in six endemic models for infectious diseases. Math Biosci 180 (1-2), 141–160.
OpenUrl CrossRef PubMed Google Scholar

[12] ↵
Huang, C., Y. Wang, X. Li, L. Ren, J. Zhao, Y. Hu, L. Zhang, G. Fan, J. Xu, X. Gu, et al. (2020). Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet.
Google Scholar

[13] ↵
Imai, N., A. Cori, I. Dorigatti, M. Baguelin, C. A. Donelly, S. Riley, and N. M. Ferguson (2020). Report 3: Transmissibility of 2019-nCoV. https://www.imperial.ac.uk/media/imperial-college/medicine/sph/ide/gida-fellowships/Imperial-2019-nCoV-transmissibility.pdf. Accessed 26, January, 2020.
Google Scholar

[14] Imai, N., I. Dorigatti, A. Cori, C. A. Donelly, S. Riley, and N. M. Ferguson (2020). Report 2: Estimating the potential total number of novel Coronavirus cases in Wuhan City, China. https://www.imperial.ac.uk/media/imperial-college/medicine/sph/ide/gida-fellowships/2019-nCoV-outbreak-report-22-01-2020.pdf. Accessed 3, February, 2020.
Google Scholar

[15] ↵
King, A. A., M. Domenech de Cellès, F. M. Magpantay, and P. Rohani (2015). Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola. Proc R Soc Lond B Biol Sci 282 (1806), 20150347.
OpenUrl CrossRef PubMed Google Scholar

[16] ↵
Li, Q., X. Guan, P. Wu, X. Wang, L. Zhou, Y. Tong, R. Ren, K. S. Leung, E. H. Lau, J. Y. Wong, et al. (2020). Early transmission dynamics in Wuhan, China, of novel coronavirus– infected pneumonia. N Engl J Med.
Google Scholar

[17] ↵
Liu, T., J. Hu, M. Kang, L. Lin, H. Zhong, J. Xiao, G. He, T. Song, Q. Huang, Z. Rong, A. Deng, W. Zeng, X. Tan, S. Zeng, Z. Zhu, J. Li, D. Wan, J. Lu, H. Deng, J. He, and W. Ma (2020). Transmission dynamics of 2019 novel coronavirus (2019-nCoV). https://www.biorxiv.org/content/10.1101/2020.01.25.919787v1. Accessed 27, January, 2020.
Google Scholar

[18] ↵
Ma, J., J. Dushoff, B. M. Bolker, and D. J. Earn (2014). Estimating initial epidemic growth rates. Bull Math Biol 76 (1), 245–260.
OpenUrl CrossRef PubMed Google Scholar

[19] ↵
Ma, J. and D. J. Earn (2006). Generality of the final size formula for an epidemic of a newly invading infectious disease. Bull Math Biol 68 (3), 679–702.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[20] ↵
Majumder, M. and K. D. Mandl (2020). Early transmissibility assessment of a novel coronavirus in Wuhan, China. https://papers.ssrn.com/sol3/papers.cfm?abstract id=3524675. Accessed 27, January, 2020.
Google Scholar

[21] ↵
Park, S. W., D. Champredon, J. S. Weitz, and J. Dushoff (2019). A practical generation-interval-based approach to inferring the strength of epidemics from their speed. Epidemics 27, 12–18.
OpenUrl Google Scholar

[22] ↵
Read, J. M., J. R. Bridgen, D. A. Cummings, A. Ho, and C. P. Jewell (2020a). Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions. https://www.medrxiv.org/content/10.1101/2020.01.23.20018549v1. Accessed 26, January, 2020.
Google Scholar

[23] ↵
Read, J. M., J. R. Bridgen, D. A. Cummings, A. Ho, and C. P. Jewell (2020b). Novel coronavirus 2019-nCoV: early estimation of epidemiological parameters and epidemic predictions. https://www.medrxiv.org/content/10.1101/2020.01.23.20018549v2. Accessed 5, February, 2020.
Google Scholar

[24] ↵
Riou, J. and C. L. Althaus (2020a). Pattern of early human-to-human transmission of wuhan 2019-nCoV. https://www.biorxiv.org/content/10.1101/2020.01.23.917351v1. Accessed 26, January, 2020.
Google Scholar

[25] ↵
Riou, J. and C. L. Althaus (2020b). Pattern of early human-to-human transmission of Wuhan 2019 novel coronavirus (2019-nCoV), December 2019 to January 2020. Euro Surveill 25 (4), 2000058.
OpenUrl CrossRef PubMed Google Scholar

[26] ↵
Roberts, M. and J. Heesterbeek (2007). Model-consistent estimation of the basic reproduction number from the incidence of an emerging infection. J Math Biol 55 (5-6), 803.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[27] ↵
Svensson, Å. (2007). A note on generation times in epidemic models. Math Biosci 208 (1), 300–311.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[28] ↵
Tariq, A., K. Roosa, K. Mizumoto, and G. Chowell (2019). Assessing reporting delays and the effective reproduction number: The Ebola epidemic in DRC, May 2018–January 2019. Epidemics 26, 128–133.
OpenUrl PubMed Google Scholar

[29] ↵
Taylor, B. P., J. Dushoff, and J. S. Weitz (2016). Stochasticity and the limits to confidence when estimating R _>0 of Ebola and other emerging infectious diseases. J Theor Biol 408, 145–154.
OpenUrl Google Scholar

[30] ↵
Wallinga, J. and M. Lipsitch (2007). How generation intervals shape the relationship between growth rates and reproductive numbers. Proc R Soc Lond B Biol Sci 274 (1609), 599–604.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[31] ↵
Wearing, H. J., P. Rohani, and M. J. Keeling (2005). Appropriate models for the management of infectious diseases. PLoS Med 2 (7).
Google Scholar

[32] ↵
World Health Organization (2020a). Coronavirus disease 2019 (COVID-19) Situation Report - 38. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200227-sitrep-38-covid-19.pdf?sfvrsn=9f98940c 2. Accessed February 27, 2020.
Google Scholar

[33] ↵
World Health Organization (2020b). Laboratory testing for 2019 novel coronavirus (2019-nCoV) in suspected human cases. https://www.who.int/publications-detail/laboratory-testing-for-2019-novel-coronavirus-in-suspected-human-cases-20200117. Accessed February 4, 2020.
Google Scholar

[34] ↵
World Health Organization (2020c). Novel Coronavirus (2019-nCoV) Situation Report −6. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200126-sitrep-6-2019-ncov.pdf?sfvrsn=beaeee0c 4. Accessed January 26, 2020.
Google Scholar

[35] ↵
World Health Organization (2020d). Pneumonia of unknown cause – China. https://www.who.int/csr/don/05-january-2020-pneumonia-of-unkown-cause-china/en/. Accessed January 30, 2020.
Google Scholar

[36] ↵
Wu, J. T., K. Leung, and G. M. Leung (2020). Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet.
Google Scholar

[37] ↵
Zhao, S., Q. Lin, J. Ran, S. S. Musa, G. Yang, W. Wang, Y. Lou, D. Gao, L. Yang, D. He, et al. (2020). Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. Int J Infect Dis.
Google Scholar

[38] Zhao, S., J. Ran, S. S. Musa, G. Yang, Y. Lou, D. Gao, L. Yang, and D. He (2020). Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. https://www.biorxiv.org/content/10.1101/2020.01.23.916395v1. Accessed 26, January, 2020.
Google Scholar

2.2 Gamma approximation framework for linking r and R₀