SARS-CoV-2 phylodynamics differentiates the effectiveness of non-pharmaceutical interventions

Jean-Philippe Rasigade; Anaïs Barray; Julie Teresa Shapiro; Charlène Coquisart; Yoann Vigouroux; Antonin Bal; Grégory Destras; Philippe Vanhems; Bruno Lina; Laurence Josset; Thierry Wirth

doi:10.1101/2020.08.24.20180927

Abstract

Quantifying the effectiveness of large-scale non-pharmaceutical interventions against COVID-19 is critical to adapting responses against future waves of the pandemic. By combining phylogenetic data of 5,198 SARS-CoV-2 genomes with the chronology of non-pharmaceutical interventions in 57 countries, we examine how interventions and combinations thereof alter the divergence rate of viral lineages, which is directly related to the epidemic reproduction number. Home containment and education lockdown had the largest independent impacts and were predicted to reduce the reproduction number by 35% and 26%, respectively. However, we find that in contexts with a reproduction number >2, no individual intervention is sufficient to stop the epidemic and increasingly stringent intervention combinations may be required. Our phylodynamic approach can complement epidemiological models to inform public health strategies against COVID-19.

Coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in China in late 2019 (1-3). Facing or anticipating the pandemic, the governments of most countries implemented a wide range of large-scale non- pharmaceutical interventions in order to reduce COVID-19 transmission (4). These included closing schools, universities, or workplaces, cancelling public events, or restricting personal movements. Concerns have been raised regarding the impact of such interventions on the economy, education, and, indirectly, the healthcare system (5).

Understanding the effectiveness of each non-pharmaceutical intervention against COVID- 19 is critical to implementing appropriate responses against current or future waves of the pandemic. Comparative studies of interventions published so far yielded conflicting results (6-9). Epidemiological studies of interventions against an epidemic face several challenges. Mathematical models informed by counts of confirmed cases or deaths ignore the relationships and transmission patterns between cases. Counts themselves can vary in accuracy and timeliness depending on countries’ health facilities, surveillance systems, and the changing definitions of cases. Even when an intervention immediately reduces the transmission rate, a detectable reduction of disease incidence can be much delayed, especially when testing and diagnoses are restricted to specific patient populations. This delay from intervention to incidence reduction, combined with the variety and simultaneous implementation of interventions (4, 9), complicates the identification of their individual effects.

Unlike epidemiological case counts, viral genomes bear phylogenetic information relevant to disease transmission. Extracting this information is the goal of phylodynamics, which relies on evolutionary theory and bioinformatics to model the dynamics of an epidemic (10). The dates of viral transmission events can also be inferred from genome sequences to alleviate, at least in part, the problems of delayed detection of an intervention’s effect. Here, we conducted a phylodynamic analysis of 5,198 SARS-CoV-2 genomes from 57 countries to estimate the independent effects of 9 large-scale non-pharmaceutical interventions on the transmission rate of COVID-19 during the early dissemination phase of the pandemic. We adapted an established phylogenetic method (11, 12) to model variations of the divergence rate of SARS-CoV-2 in response to interventions and combinations thereof. Building on known relationships between the viral divergence rate and the effective reproduction number R_t (13), we quantify the reduction of R_t independently attributable to each intervention, exploiting heterogeneities in their nature and timing across countries in multivariate models. In turn, these results allow us to estimate the probability of stopping the epidemic (R_t < 1) when implementing selected combinations of interventions.

Survival modelling of viral transmission

The dissemination and detection of a virus in a population can be described as a transmission tree (Fig. 1A) whose shape reflects that of the dated phylogeny of the sampled pathogens (Fig. 1B). In a phylodynamic context, it is assumed that each lineage, represented by a branch in the phylogenetic tree, belongs to a single patient and that lineage divergence events, represented by tree nodes, coincide with transmission events (10). Thus, branches in a dated phylogeny represent intervals of time between divergence events interpreted as transmission events. This situation can be translated in terms of survival analysis, which models rates of event occurrence, by considering divergence as the event of interest and by treating branch lengths as time-to-event intervals (Fig. 1C-D). Phylogenetic survival analysis was devised by E. Paradis and applied to detecting temporal variations in the divergence rate of tanagers (11) or fishes (14), but it has not been applied to pathogens so far (12, 15, 16).

Fig. 1. Conceptual overview of phylodynamic survival analysis.

Under the assumption that each viral lineage in a phylogeny belongs to an infected patient, the dates of viral transmission and sampling events in a transmission tree (A) coincide with the dates of divergence events (nodes) and tips, respectively, of the dated phylogeny reconstructed from the viral genomes (B). Treating viral transmission as the event of interest for survival modelling, internal branches connecting two divergence events are interpreted as time-to-event intervals while terminal branches, that do not end with a transmission event, are interpreted as censored intervals (C). Translating the dated phylogeny in terms of survival events enables visualizing the probability of transmission through time as a Kaplan-Meier curve (D) and modelling the transmission rate using Cox proportional hazards regression.

To quantify the effect of non-pharmaceutical interventions on the transmission rate of COVID-19, we adapted the original model in (11) to account for the specific setting of viral phylodynamics (see Methods). Hereafter, we refer to the modified model as phylodynamic survival analysis. In survival analysis terms, we interpret internal branches of the phylogeny (those that end with a transmission event) as time-to-event intervals and terminal branches (those that end with a sampling event) as censored intervals (Fig. 1C; see Methods). The time-to-event intervals are loosely related to the so-called clinical onset serial interval, which is the delay between the onset of symptoms in the source and infected patients in a transmission pair (but see (17)).

The predictors of interest in our setting, namely, the non-pharmaceutical interventions, vary both through time and across lineages depending on their geographic location. To model this, we assigned each divergence event (and subsequent branch) to a country using maximum- likelihood ancestral state reconstruction (18). Each assigned branch was then associated with the set of non-pharmaceutical interventions that were active or not in the country during the interval spanned by the branch. Intervals containing a change of intervention were split into subintervals (19). These (sub)intervals were the final observations (statistical units) used in the survival models. Models were adjusted for the hierarchical dependency structure introduced by interval splits and country assignations (18).

Phylodynamic survival models estimate variations of the reproduction number

The evolution of lineages in a dated viral phylogeny can be described as a birth-death process with a divergence (or birth) rate λ and an extinction (or death) rate μ (20). In a phylodynamic context, the effective reproduction number R_t equals the ratio of the divergence and extinction rates (20). Coefficients of phylodynamic survival models (the so-called hazard ratios; see Methods) act as multiplicative factors of the divergence rate λ, independent of the true value of λ which needs not be specified nor evaluated. As R_t = λ/μ, multiplying λ by a coefficient also multiplies R_t, independent of the true value of μ. Thus, coefficients of phylodynamic survival models estimate variations of R_t in response to predictor variables without requiring external knowledge of, or making assumptions about λ and μ.

Variations in COVID-19 transmission rates across countries

We assembled a composite dataset by combining a dated phylogeny of SARS-CoV-2 (Fig. 2A), publicly available from Nextstrain (21) and built from the GISAID initiative data (22), with a detailed timeline of non-pharmaceutical interventions available from the Oxford COVID-19 Government Response Tracker (OxCGRT) (18, 23). Figure S1 shows a flowchart outlining the data sources, sample sizes and selection steps of the study. Phylogenetic and intervention data covered the early phase of the epidemic up to May 4, 2020.

Figure 2. Comparison of the timing and reproduction numbers of the COVID-19 epidemic in 74 countries based on a dated phylogeny.

A: dated phylogeny of 5,198 SARS-CoV-2 genomes where internal (time-to-event) and terminal (time-to-censoring) branches are colored red and blue, respectively. B: histogram of internal and terminal branch lengths. C: box-and-whisker plots of the distribution over time of the inferred transmission events in each country, where boxes denote interquartile range (IQR) and median, whiskers extend to dates at most 1.5x the IQR away from the median date, and circle marks denote dates farther than 1.5 IQR from the median date. D: point estimates and 95% confidence intervals of the relative effective reproduction number, expressed as percent changes relative to China, in 27 countries with ≥10 assigned transmission events. Countries with <10 assigned transmission events (n=32) were pooled into the ‘Others’ category. E, F: representative Kaplan-Meier survival curves of the probability of transmission through time in countries with comparable (E) or highly different (F) transmission rates. ‘+’ marks denote censoring events. Numbers denote counts of internal branches and, in brackets, terminal branches. G, H: scatter plots of the reported numbers of COVID-19 cases and deaths per country, in absolute values (G) and per million inhabitants (H).

The 5,198 SARS-CoV-2 genomes used to reconstruct the dated phylogeny were collected from 74 countries. Detailed per-country data including sample sizes are shown in Data S1. Among the 10,394 branches in the phylogeny, 2,162 branches (20.8%) could not be assigned to a country with >95% confidence and were excluded, also reducing the number of represented countries from 74 to 59 (Fig. S1; a comparison of included and excluded branches is shown in Fig. S2). The remaining 4,025 internal branches had a mean time-to-event (delay between transmission events) of 4.4 days (Fig. 2B). These data were congruent with previous estimates of the mean serial interval of COVID-19 ranging from 3.1 days to 7.5 days (24). The 4,207 terminal branches had a mean time-to-censoring (delay from infection to detection) of 10.6 days (Figure 2A-B). This pattern of longer terminal vs. internal branches is typical of a viral population in fast expansion (10).

We compared the timing and dynamics of COVID-19 spread in countries represented in our dataset (Figure 2C-D), pooling countries with <10 assigned transmission events into an ‘others’ category. The estimated date of the first local transmission event in each country showed good concordance with the reported dates of the epidemic onset (Pearson correlation = 0.84; Fig. S3). The relative effective reproduction number R_t per country, taking China as reference, ranged from -55.6% (95%CI, -71.4% to -29.9%) in Luxembourg to +11.7% (95% CI, -6.7% to +33.8%) in Spain (Fig. 2C). Notice that these estimates are averages over variations of R_t through time in each country, up to May 4, 2020. Exemplary survival curves of transmission events are shown in Fig. 2E-F. Relative R_t’s are not expected to necessarily correlate with the reported counts of COVID-19 cases across all countries due to the confounding effects of population sizes, case detection policies and the number of genomes included. Nevertheless, the relative R_t’s across countries substantially correlated with the reported cumulative counts up to May 12 (Fig. 2G-H), including COVID-19 cases (Pearson correlation with log-transformed counts, 0.46, 95% CI, 0.07 to 0.73), deaths (correlation 0.59, 95% CI, 0.24 to 0.80), cases per million inhabitants (correlation 0.39, 95% CI, -0.01 to 0.69) and deaths per million inhabitants (correlation 0.56, 95% CI, 0.21 to 0.79).

Disentangling the individual effectiveness of non-pharmaceutical interventions

The implementation and release dates of large-scale non-pharmaceutical interventions against COVID-19 were available for 57 countries out of the 59 represented in the dated phylogeny. Definitions of the selected interventions are shown in Table 1 (18). Branches assigned to countries with missing intervention data, namely, Latvia and Senegal, were excluded from further analysis (n=22/8,262 (0.3%); see Fig. S1). Up to May 4, 2020, the interventions most universally implemented were information campaigns, restrictions on international travel and education lockdown (>95% of countries) (Fig. S4). The least frequent were the closure of public transportation (35%) and home containment (72%). Public information campaigns came first and home containment came last (median delay across countries, 5 days before and 24 days after the first local transmission event, respectively; Fig. 3A). Survival curves for each intervention are shown in Fig. S5. Most interventions were implemented in combination and accumulated over time rather than replacing each other (Fig. S4; median delays between interventions are shown in Fig. S6; correlations in Fig. S7; and a detailed timeline of interventions in Data S2). However, we observed a substantial heterogeneity of intervention timing across the 57 countries (Fig. 3A), suggesting that individual intervention effects can be discriminated by multivariate analysis given the large sample size (n=8,210 subintervals).

Figure 3. Non-pharmaceutical interventions variably reduce the reproduction number of COVID-19.

Data derive from the phylogenetic survival analysis of 4,191 internal and 4,019 terminal branches of a dated phylogeny of SARS-CoV-2 genomes, combined with a chronology of interventions in 57 countries. A: box-and-whisker plots of the delay between the 1st SARS-CoV-2 divergence event and the intervention. Plot interpretation is similar with Fig. 2C. B: point estimates and 95% confidence intervals of the independent % change of the effective reproduction number predicted by each intervention in a multivariate, mixed-effect phylogenetic survival model adjusted for between-country variations. C: matrix of pairwise interactions between the interventions (in rows) estimated using 9 multivariate models (in columns), where each model ignores exactly one intervention. Negative (positive) differences in blue (red) denote a stronger (lesser) predicted effect of the intervention in row when ignoring the intervention in column. D, E: simulated impact of interventions implemented independently (D) or in sequential combination (E) on the count of simultaneous cases in an idealized population of 1 million susceptible individuals using compartmental SIR models with a basic reproduction number R₀ = 3 (black lines) and a mean infectious period of 2 weeks. Shaded areas in (D) denote 95% confidence bands.

View this table:

Table 1. Selected large-scale non-pharmaceutical interventions against COVID-19.

A multivariate phylogenetic survival model, including the 9 interventions and controlling for between-country R_t variations (see Methods), showed a strong fit to the data (likelihood-ratio test compared to the null model, P < 10^-196). In this model, the interventions most strongly and independently associated with a reduction of the effective reproduction number R_t of SARS-CoV-2 were home containment (R_t percent change, -34.6%, 95%CI, -43.2 to -24.7%), education lockdown (−25.6%, 95%CI, -33.4 to -16.9%), restricting gatherings (−22.3%, 95%CI, -33.4 to - 9.4%) and international travel (−16.9%, 95%CI, -27.5 to -4.8%). We failed to detect a substantial impact of other interventions, namely information campaigns, cancelling public events, closing workplaces, restricting internal movement, and closing public transportation (Fig. 3B). Based on coefficient estimates, all interventions were independently predicted to reduce R_t (even by a negligible amount), in line with the intuition that no intervention should accelerate the epidemic. Contrasting with previous approaches that constrained coefficients (9), this intuition was notenforced a priori in our multivariate model, in which positive coefficients (increasing R_t) might have arisen due to noise or collinearity between interventions. The absence of unexpectedly positive coefficients suggests that our model correctly captured the epidemic slowdown that accompanied the accumulation of interventions.

Estimated intervention effects are robust to time-dependent confounders and collinearity

A reduction of R_t through time, independent of the implementation of interventions, might lead to overestimate their effect in our model. Several potential confounders might reduce R_t through time but cannot be precisely estimated and included as control covariates. These included the progressive acquisition of herd immunity, the so-called artificial diversification slowdown possibly caused by incomplete sampling, and time-dependent variations of the sampling effort (see Methods). To quantify this potential time-dependent bias, we constructed an additional model including the age of each branch as a covariate (Table S1). The coefficients in this time-adjusted model only differed by small amounts compared to the base model. Moreover, the ranking by effectiveness of the major interventions remained unchanged, indicating that our estimates were robust to time-dependent confounders.

We also quantified the sensitivity of the estimated intervention effects to the inclusion of other interventions (collinearity) by excluding interventions one by one in 9 additional models (Fig. 3C). This pairwise interaction analysis confirmed that most of the estimated effects were strongly independent. Residual interferences were found for gathering restrictions, whose full-model effect of -22.3% was reinforced to -33.5% when ignoring home containment; and for cancelling public events, whose full-model effect of -0.97% was reinforced to -15.1% when ignoring gathering restrictions. These residual interferences make epidemiological sense because home containment prevents gatherings and gathering restrictions also prohibit public events. Overall, the absence of strong interferences indicated that our multivariate model reasonably captured the independent, cumulative effect of interventions, enabling ranking their impact on COVID-19 spread.

Simulating intervention effectiveness in an idealized population

To facilitate the interpretation of our estimates of the effectiveness of interventions against COVID-19, we simulated each intervention’s impact on the peak number of cases, whose reduction is critical to prevent overwhelming the healthcare system (Fig. 3D and Fig. S8). We used compartmental Susceptible-Infected-Recovered (SIR) models with a basic reproduction number R₀ = 3 and a mean infectious period of 2 weeks based on previous estimates (25, 26), in an idealized population of 1 million susceptible individuals (see Methods). In each model, we simulated the implementation of a single intervention at a date chosen to reflect the median delay across countries (Fig. 3A) relative to the epidemic onset (see Methods). On implementation date, the effective reproduction number was immediately reduced according to the estimated intervention’s effect shown in Fig. 3B.

In this idealized setting, home containment, independent of all other restrictions, only halved the peak number of cases from 3.0×10⁵ to 1.5×10⁵ (95% CI, 1.0×10⁵ to 2.0×10⁵) (Fig. 3D). However, a realistic implementation of home containment also implies other restrictions including, at least, restrictions on movements, gatherings, and public events. This combination resulted in a relative R_t of -50.8% (95% CI, -59.4% to -40.2%) and a 5-fold reduction of the peak number of cases to 6.0×10⁴ (95% CI, 1.9×10⁴ to 1.2×10⁵). Nevertheless, if R₀ = 3 then a 50% reduction is still insufficient to reduce R_t below 1 and stop the epidemic. This suggests that even when considering the most stringent interventions, combinations may be required. To further examine this issue, we estimated the effect of accumulating interventions by their average chronological order shown in Fig. 3A, from information campaigns alone to all interventions combined including home containment (Fig. 3E). Strikingly, only the combination of all interventions completely stopped the epidemic under our assumed value of R₀. To estimate the effectiveness of combined interventions in other epidemic settings, we computed the probabilities of reducing R_t below 1 for values of R₀ ranging from 1.5 to 3.5 (Table 2; see Methods). The same probabilities for individual interventions are presented in Table S2, showing that no single intervention would stop the epidemic if R₀ ≥ 2. These results may help inform decisions on the appropriate stringency of the restrictions required to control the epidemic under varying transmission regimes.

View this table:

Table 2. Predicted reduction of the COVID-19 effective reproduction number under increasingly stringent combinations of non-pharmaceutical interventions.

Discussion

We present a phylodynamic analysis of how the divergence rate and reproduction number of SARS-CoV-2 varies in response to large-scale non-pharmaceutical interventions in 57 countries. Our results suggest that no single intervention, including home containment, is sufficient on its own to stop the epidemic (R_t < 1). Increasingly stringent combinations of interventions may be required depending on the effective reproduction number.

Home containment was repeatedly estimated to be the most effective response in epidemiological studies from China (27), France (28), the UK (29), and Europe (9). Other studies modelled the additional (or residual) reduction of R_t by an intervention after taking into account those previously implemented (4, 8). Possibly because home containment was the last implemented intervention in many countries, these studies reported a weaker or even negligible additional effect compared to earlier interventions. In our study, home containment, even when implemented last, had the strongest independent impact on epidemic spread (R_t percent change, - 34.6%), which was further amplified (−50.8%) when taking into account implicit restrictions on movements, gatherings and public events.

We found that education lockdown substantially decreased COVID-19 spread (R_t percent change, -25.6%). Contrasting with home containment, the effectiveness of education lockdown has been more hotly debated. This intervention ranked among the most effective ones in a study of 41 countries (4) but had virtually no effect on transmission in other reports from Europe (8, 9). Children have been estimated to be poor spreaders of COVID-19 and less susceptible than adults to develop disease after an infectious contact, counteracting the effect of their higher contact rate (6, 30). However, the relative susceptibility to infection was shown to increase sharply between 15 and 25 years, from 0.40 to 0.79 (30). Importantly, we could not differentiate the effect of closing schools and universities because both closures coincided in all countries. Thus, our finding that education lockdowns reduce COVID-19 transmission might be driven by contact rate reductions in older students rather than in children, as hypothesized elsewhere (4), and, in addition, by parents staying at home with their children.

Restrictions on gatherings of >100 persons appeared more effective than cancelling public events (R_t percent changes, -22.3% vs. -1.0%, respectively) in our phylodynamic model, in line with previous results from epidemiological models (4). Notwithstanding that gathering restrictions prohibit public events, possibly causing interferences between estimates (Fig. 3C), this finding is intriguing. Indeed, several public events resulted in large case clusters, the so-called superspreading events, that triggered epidemic bursts in France (31), South Korea (32) or the U.S. (33). A plausible explanation for not detecting the effectiveness of cancelling public events is that data-driven models, including ours, better capture the cumulative effect of more frequent events such as gatherings than the massive effect of much rarer events such as superspreading public events. This bias towards ignoring the so-called ‘Black Swan’ exceptional events (34) suggests that our findings (and others’ (4)) regarding restrictions on public events should not be interpreted as an encouragement to relax these restrictions but as a potential limit of modelling approaches (but see (35)).

There are other limitations to our study, including its retrospective design. We could not consider important non-pharmaceutical interventions that are difficult to date and quantify, such as contact tracing or case isolation policies. Data were analyzed at the national level, although much virus transmission was often concentrated in specific areas and some non-pharmaceutical interventions were implemented at the sub-national level (36). From a statistical standpoint, the interval lengths in the dated phylogeny were treated as fixed quantities in the survival models. Ignoring the uncertainty of the estimated lengths might underestimate the width of confidence intervals, although this is unlikely to have biased the pointwise estimates and the ranking of interventions’ effects. The number of genomes included by country did not necessarily reflect the true number of cases, which might have influenced country comparison results in Fig. 2, but not intervention effectiveness models in Fig. 3 which were adjusted for between-country variations of R_t. Finally, our estimates represent averages over many countries with different epidemiological contexts, healthcare systems, cultural behaviors and nuances in intervention implementation details and population compliance. This global approach facilitates unifying the interpretation of intervention effectiveness, but this interpretation still needs to be adjusted to local contexts by policy makers.

Beyond the insights gained into the impact of interventions against COVID-19, our findings highlight how phylodynamic survival analysis can help leverage pathogen sequence data to estimate epidemiological parameters. Contrasting with the Bayesian approaches adopted by most, if not all, previous assessments of intervention effectiveness (4, 7, 9), phylodynamic survival analysis does not require any quantitative prior assumption or constraint on model parameters. The method should also be simple to implement and extend by leveraging the extensive software arsenal of survival modelling. Phylodynamic survival analysis may complement epidemiological models as pathogen sequences accumulate, allowing to address increasingly complex questions relevant to public health strategies.

Materials and Methods

Definitions and chronology of non-pharmaceutical interventions

The nature, stringency and timing of non-pharmaceutical interventions against COVID-19 have been collected and aggregated daily since January 1, 2020 by the Oxford COVID-19 Government Response Tracker initiative of the Blavatnik School of Government, UK (4, 23). As of May 12, 2020, the interventions are grouped into three categories, namely: closures and containment (8 indicators), economic measures (4 indicators) and health measures (5 indicators). Indicators use 2- to 4-level ordinal scales to represent each intervention’s stringency, and an additional flag indicating whether the intervention is localized or general. Details of the coding methods for indicators can be found in (37). We focused on large-scale interventions against transmission that did not target specific patients (for instance, we did not consider contact tracing) and we excluded economic and health interventions except for information campaigns. This rationale led to the selection of the 9 indicators shown in Table 1. To facilitate interpretation while constraining model complexity, the ordinal-scale indicators in OxCGRT data were recoded as binary variables in which we only considered government requirements (as opposed to recommendations) where applicable. We did not distinguish between localized and nation-wide interventions because localized interventions, especially in larger countries, targeted the identified epidemic hotspots. As the data did not allow to differentiate closures of schools and universities, we use the term ‘education lockdown’ (as opposed to ‘school closure’ in (23)) to avoid misinterpretation regarding the education levels concerned.

Phylodynamic survival analysis in measurably evolving populations

The original phylogenetic survival model in (11) and its later extensions (38) considered intervals backward in time, from the tips to the root of the tree, and were restricted to trees with all tips sampled at the same date relative to the root (ultrametric trees). Censored intervals (intervals that do not end with an event) in (11) were used to represent lineages with known sampling date but unknown age. In contrast, viral samples in ongoing epidemics such as COVID-19 are typically collected through time. A significant evolution of the viruses during the sampling period violates the ultrametric assumption. To handle phylogenies of these so-called measurably evolving populations (39), we propose a different interpretation of censoring compared to (11). Going forward in time, the internal branches of a tree connect two divergence events while terminal branches, those that end with a tip, connect a divergence event and a sampling event (Fig. 1B). Thus, we considered internal branches as time-to-event intervals and terminal branches as censored intervals representing the minimal duration during which no divergence occurred (Fig. 1C).

SARS-CoV-2 phylogenetic data

SARS-CoV-2 genome sequences have been continuously submitted to the Global Initiative on Sharing All Influenza Data (GISAID) by laboratories worldwide (22). To circumvent the computational limits of phylogeny reconstruction and time calibration techniques, the sequences of the GISAID database are subsampled before analysis by the Nextstrain initiative, using a balanced subsampling scheme through time and space (21, 40). Phylogenetic reconstruction uses maximum-likelihood phylogenetic inference based on IQ-TREE (41) and time-calibration uses TreeTime (42). See (43) for further details on the Nextstrain bioinformatics pipeline. A dated phylogeny of 5,211 SARS-CoV-2 genomes, along with sampling dates and locations, was retrieved from nextstrain.org/ncov on May 12, 2020. Genomes of non-human origin (n = 13) were discarded from analysis. Polytomies (unresolved divergences represented as a node with >2 descendants) were resolved as branches with an arbitrarily small length of 1 hour, as recommended for adjustment of zero-length risk intervals in Cox regression (44). Of note, excluding these zero-length branches would bias the analysis by underestimating the number of divergence events in specific regions of the phylogeny. Maximum-likelihood ancestral state reconstruction was used to assign internal nodes of the phylogeny to countries in a probabilistic fashion, taking the tree shape and sampling locations as input data (45). To prepare data for survival analysis, we decomposed the branches of the dated phylogeny into a set of time-to-event and time-to-censoring intervals (Fig. 1C). Intervals were assigned to the most likely country at the origin of the branch when this country’s likelihood was >0.95. Intervals in which no country reached a likelihood of 0.95 were excluded from further analysis (Figs. S1-S2). Finally, intervals during which a change of intervention occurred were split into sub-intervals, such that all covariates, including the country and interventions, were held constant within each sub-interval and only the last subinterval of an internal branch was treated as a time-to-event interval. This interval-splitting approach is consistent with an interpretation of interventions as external time-dependent covariates (19), which are not dependent on the event under study (the viral divergence).

Mixed-effect Cox proportional hazard models

Variations of the divergence rate λ in response to non-pharmaceutical interventions were modelled using mixed-effect Cox proportional hazard regression (reviewed in (46)). Models treated the country and phylogenetic branch as random effects to account for non-independence between sub-intervals of the same branch and between branches assigned to the same country. The predictors of interest were not heritable traits of SARS-CoV-2, thus, phylogenetic autocorrelation between intervals was not corrected for. Time-to-event data were visualized using Kaplan-Meier curves with 95% confidence intervals. The regression models had the form where λ_i(t) is the hazard function (here, the divergence rate) at time t for the ith observation, λ₀(t) is the baseline hazard function, which is neither specified or explicitly evaluated, X_i is the set of predictors of the ith observation (the binary vector of active non-pharmaceutical interventions), β is the vector of fixed-effect coefficients, α_j is the random intercept associated with the jth phylogenetic branch and γ_k is the random intercept associated with the kth country. Country comparison models (Fig. 2D), in which the country was the only predictor and branches were not divided into subintervals, did not include random intercepts. Raw model coefficients (the log-hazard ratios) additively shift the logarithm of the divergence rate λ. Exponentiated coefficients exp β (the hazard ratios) are multiplicative factors (fold-changes) of the divergence rate. To ease interpretation, hazard ratios were reported as percentage changes of the divergence rate or, equivalently, of the effective reproduction number R_t, equal to (exp β − 1) × 100. Analyses were conducted using R 3.6.1 (the R Foundation for Statistical Computing, Vienna, Austria) with additional packages ape, survival and coxme.

Estimating the effect of combined interventions

Pointwise estimates and confidence intervals of combined interventions were estimated by adding individual coefficients and their variance-covariances. Cox regression coefficients have approximately normal distribution with mean vector m and variance-covariance matrix V, estimated from the inverse Hessian matrix of the likelihood function evaluated at m. From well-known properties of the normal distribution, the distribution of a sum of normal deviates is normal with mean equal to the sum of the means and variance equal to the sum of the variance-covariance matrix of the deviates. Thus, the coefficient corresponding to a sum of coefficients with mean m and variance V has mean Σm and variance ΣV, from which we derive the point estimates and confidence intervals of a combination of predictors. Importantly, summing over the covariances captures the correlation between coefficients when estimating the uncertainty of the combined coefficient.

Probability of stopping an epidemic

A central question regarding the effectiveness of interventions or combinations thereof is whether their implementation can stop an epidemic by reducing R_t below 1 (Table 2). Suppose that some intervention has an estimated log-hazard ratio . has approximately normal distribution with mean β and variance σ², written . For some fixed value of R₀, the estimated post-intervention reproduction number . The probability p that is where d denotes the probability density function. To solve the integral, remark that , hence, . Using a change of variables in the integral and noting that log 1 = 0, we obtain the closed-form solution where Φ is the cumulative density function of the normal distribution with mean and variance σ². By integrating over the coefficient distribution, this method explicitly considers the estimation uncertainty of when estimating p.

Potential time-dependent confounders

Time-dependent phylodynamic survival analysis assumes that variations of branch lengths though time directly reflect variations of the divergence rate, which implies that branch lengths are conditionally independent of time given the divergence rate. When the phylogeny is reconstructed from a fraction of the individuals, as is the case in virtually all phylodynamic studies including ours, this conditional independence assumption can be violated. This is because incomplete sampling increases the length of more recent branches relative to older branches (47), an effect called the diversification slowdown (48, 49). Noteworthy, this effect can be counteracted by a high extinction rate (16, 47), which is expected in our setting and mimicks an acceleration of diversification. Moreover, whether the diversification slowdown should be interpreted as a pure artifact has been controversial (49, 50). Notwithstanding, we considered incomplete sampling as a potential source of bias in our analyses because a diversification slowdown might lead to an overestimation of the effect of non-pharmaceutical interventions. Additionally, the selection procedure used by Nextstrain to collect genomes included in the dated phylogeny possibly amplified the diversification slowdown by using a higher sampling fraction in earlier phases of the epidemic (40). To verify whether the conclusions of our models were robust to this potential bias, we built an additional multivariate model including the estimated date of each divergence event (the origin of the branch) as a covariate. The possible relation between time and the divergence rate is expectedly non-linear (47) and coefficient variations resulting from controlling for time were moderate (Table S1), thus, we refrained from including a time covariate in the reported regression models as this might lead to overcontrol. Further research is warranted to identify an optimal function of time that might be included as a covariate in phylodynamic survival models to control for sources of diversification slowdown.

Compartmental epidemiological models

Epidemic dynamics can be described by partitioning a population of size N into three compartments, the susceptible hosts S, the infected hosts I, and the recovered hosts R. The infection rate b governs the transitions from S to I and the recovery rate g governs the transitions from I to R (we avoid the standard notation β and γ for infection and recovery rates to prevent confusion with Cox model parameters). The SIR model describes the transition rates between compartments as a set of differential equations with respect to time t,

The transition rates of the SIR model define the basic reproduction number of the epidemic, R₀ = b/g. From a phylodynamic standpoint, if the population dynamics of a pathogen is described as a birth-death model with divergence rate λ and extinction rate μ, then R_t = λ/μ or, alternatively, (51). We simulated the epidemiological impact of each individual intervention in SIR models with R₀ = 3 and g⁻¹ = 2 weeks based on previous estimates (25, 26), yielding a baseline infection rate b = gR₀ = 6. In each model, the effective infection rate changed from b to b · exp β on the implementation date of an intervention with log-hazard ratio β. To determine realistic implementation delays, the starting time of the simulation was set at the date of the first local divergence event in each country and the implementation date was set to the observed median delay across countries (see Fig. 3A). All models started with 100 infected individuals at t = 0, a value assumed to reflect the number of unobserved cases at the date of the first divergence event, based on the temporality between the divergence events and the reported cases (Fig. S3) and on a previous estimate from the U.S. suggesting that the total number of cases might be two orders of magnitude larger than the reported count (52). Evaluation of the SIR models used the R package deSolve.

Data and software availability

All data and software code used to generate the results are available at github.com/rasigadel ab/covid-npi.

Data Availability

Both data and analysis code are available online at https://github.com/rasigadelab/covid-npi.

https://github.com/rasigadelab/covid-npi

Funding

JPR received support from the FINOVI Foundation (grant R18037CC).

Author contributions

JPR, LJ, TW designed research. JPR, ABarray, JTS, CQ, YV, GD, LJ conducted research. JPR, TW analyzed the data. JPR created figures. JPR, Abal, GD, LJ, PV, BL, TW interpreted the data. All authors wrote the paper.

Competing interests

BL is currently active in groups advising the French government for which BL is not receiving payment.

Data and material availability

Both data and analysis code are available online at https://github.com/rasigadelab/covid-npi.

Supplementary Materials

Figures S1-S8

Tables S1-S2

External Databases S1-S3

References (37-53)

Acknowledgements

We thank Philip Supply, François Vandenesch, Jean-Sébastien Casalegno, Vanessa Escuret, and Christophe Ramière for fruitful discussions and reviews of our work. We thank the GISAID, Nextstrain and OxCGRT teams for making their high-quality datasets available to the community. A list of authors and laboratories contributing SARS-CoV-2 genome sequences is shown in Data S3.

References and Notes

1.↵
F. Wu, S. Zhao, B. Yu, Y.-M. Chen, W. Wang, Z.-G. Song, Y. Hu, Z.-W. Tao, J.-H. Tian, Y.-Y. Pei, M.-L. Yuan, Y.-L. Zhang, F.-H. Dai, Y. Liu, Q.-M. Wang, J.-J. Zheng, L. Xu, E. C. Holmes, Y.-Z. Zhang, A new coronavirus associated with human respiratory disease in China. Nature. 579, 265–269 (2020).
OpenUrl CrossRef PubMed
2.
P. Zhou, X.-L. Yang, X.-G. Wang, B. Hu, L. Zhang, W. Zhang, H.-R. Si, Y. Zhu, B. Li, C-L. Huang, H.-D. Chen, J. Chen, Y. Luo, H. Guo, R.-D. Jiang, M.-Q. Liu, Y. Chen, X.-R. Shen, X. Wang, X.-S. Zheng, K. Zhao, Q.-J. Chen, F. Deng, L.-L. Liu, B. Yan, F.-X. Zhan, Y.-Y. Wang, G.-F. Xiao, Z.-L. Shi, A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 579, 270–273 (2020).
OpenUrl CrossRef PubMed
3.↵
N. Zhu, D. Zhang, W. Wang, X. Li, B. Yang, J. Song, X. Zhao, B. Huang, W. Shi, R. Lu, P. Niu, F. Zhan, X. Ma, D. Wang, W. Xu, G. Wu, G. F. Gao, W. Tan, A Novel Coronavirus from Patients with Pneumonia in China, 2019. New England Journal of Medicine. 382, 727–733 (2020).
OpenUrl CrossRef PubMed
4.↵
J. M. Brauner, S. Mindermann, M. Sharma, A. B. Stephenson, T. Gavenčiak, D. Johnston, G. Leech, J. Salvatier, G. Altman, A. J. Norman, J. T. Monrad, T. Besiroglu, H. Ge, V. Mikulik, M. A. Hartwick, Y. W. Teh, L. Chindelevitch, Y. Gal, J. Kulveit. The effectiveness of eight nonpharmaceutical interventions against COVID-19 in 41 countries. medRxiv, in press, doi:10.1101/2020.05.28.20116129.
OpenUrl Abstract/FREE Full Text
5.↵
J. Bayham, E. P. Fenichel, Impact of school closures for COVID-19 on the US health-care workforce and net mortality: a modelling study. The Lancet Public Health. 0 (2020), doi:10.1016/S2468-2667(20)30082-7.
OpenUrl CrossRef PubMed
6.↵
L. Heavey, G. Casey, C. Kelly, D. Kelly, G. McDarby, No evidence of secondary transmission of COVID-19 from children attending school in Ireland, 2020. Eurosurveillance. 25, 2000903 (2020).
OpenUrl
7.↵
J. Dehning, J. Zierenberg, F. P. Spitzner, M. Wibral, J. P. Neto, M. Wilczek, V. Priesemann, Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science (2020), doi:10.1126/science.abb9789.
OpenUrl Abstract/FREE Full Text
8.↵
N. Banholzer, E. van Weenen, B. Kratzwald, A. Seeliger, D. Tschernutter, P. Bottrighi, A. Cenedese, J. P. Salles, W. Vach, S. Feuerriegel. Impact of non-pharmaceutical interventions on documented cases of COVID-19. medRxiv, in press, doi:10.1101/2020.04.16.20062141.
OpenUrl Abstract/FREE Full Text
9.↵
S. Flaxman, S. Mishra, A. Gandy, H. J. T. Unwin, T. A. Mellan, H. Coupland, C. Whittaker, H. Zhu, T. Berah, J. W. Eaton, M. Monod, A. C. Ghani, C. A. Donnelly, S. M. Riley, M. A. C. Vollmer, N. M. Ferguson, L. C. Okell, S. Bhatt, Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature, 1-8 (2020).
10.↵
E. M. Volz, K. Koelle, T. Bedford, Viral phylodynamics. PLoS Comput. Biol. 9, e1002947 (2013).
OpenUrl CrossRef PubMed
11.↵
E. Paradis, Assessing temporal variations in diversification rates from phylogenies: estimation and hypothesis testing. Proceedings of the Royal Society of London. Series B: Biological Sciences. 264, 1141–1147 (1997).
OpenUrl CrossRef GeoRef Web of Science
12.↵
D. L. Rabosky, Likelihood Methods for Detecting Temporal Shifts in Diversification Rates. Evolution. 60, 1152–1164 (2006).
OpenUrl CrossRef PubMed Web of Science
13.↵
The effective reproduction number Rt of an epidemic can be interpreted as the average number of new infections directly caused by a single infected patient. The effective reproduction number equals the basic reproduction number R0 in a fully susceptible population when no mitigation strategy is active.
14.↵
N. Tj, K. Tw, K. Jb, D. Cb, P. Dp, Speciation in North American black basses, Micropterus (Actinopterygii: Centrarchidae). Evolution. 57, 1610–1621 (2003).
OpenUrl CrossRef PubMed
15.↵
Z. Ma, A. W. Krings, in 2008 International Conference on BioMedical Engineering and Informatics (2008), vol. 1, pp. 178-185.
OpenUrl
16.↵
S. Nee, Inferring Speciation Rates from Phylogenies. Evolution. 55, 661–668 (2001).
OpenUrl CrossRef PubMed Web of Science
17.↵
Between-divergence intervals are only approximately equal to serial intervals (up to variations in the incubation period) when there is a single infectee and the phylogenetic branches does not contain unobserved divergence events. When there are several infectees, between-divergence intervals, which begin with each divergence event, are shorter than serial intervals which begin with the infector’s illness. In contrast, unobserved divergence events result in inflated branche lengths that span several serial intervals.
18.↵
Materials and methods are available as supplementary materials at the Science Website.
19.↵
L. D. Fisher, D. Y. Lin, Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health. 20, 145–157 (1999).
OpenUrl CrossRef PubMed Web of Science
20.↵
T. Stadler, D. Kühnert, S. Bonhoeffer, A. J. Drummond, Birth-death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV). Proc. Natl. Acad. Sci. U.S.A. 110, 228–233 (2013).
OpenUrl Abstract/FREE Full Text
21.↵
J. Hadfield, C. Megill, S. M. Bell, J. Huddleston, B. Potter, C. Callender, P. Sagulenko, T. Bedford, R. A. Neher, Nextstrain: real-time tracking of pathogen evolution. Bioin/ormatics. 34, 4121–4123 (2018).
OpenUrl
22.↵
Y. Shu, J. McCauley, GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. 22 (2017), doi:10.2807/1560-7917.ES.2017.22.13.30494.
OpenUrl CrossRef PubMed
23.↵
T. Hale, S. Webster, A. Petherick, T. Philips, B. Kira, Coronavirus Government Response Tracker. Oxford COVID-19 Government Response Tracker, Blavatnik School of Government. Data use policy: Creative Commons Attribution CC BY standard. (2020), (available at https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker).
24.↵
S. T. Ali, L. Wang, E. H. Y. Lau, X.-K. Xu, Z. Du, Y. Wu, G. M. Leung, B. J. Cowling, Serial interval of SARS-CoV-2 was shortened over time by nonpharmaceutical interventions. Science (2020), doi:10.1126/science.abc9004.
OpenUrl Abstract/FREE Full Text
25.↵
K. Xu, Y. Chen, J. Yuan, P. Yi, C. Ding, W. Wu, Y. Li, Q. Ni, R. Zou, X. Li, M. Xu, Y. Zhang, H. Zhao, X. Zhang, L. Yu, J. Su, G. Lang, J. Liu, X. Wu, Y. Guo, J. Tao, D. Shi, L. Yu, Q. Cao, B. Ruan, L. Liu, Z. Wang, Y. Xu, Y. Liu, J. Sheng, L. Li, Factors Associated With Prolonged Viral RNA Shedding in Patients with Coronavirus Disease 2019 (COVID-19). Clin Infect Dis, doi:10.1093/cid/ciaa351.
OpenUrl CrossRef
26.↵
Q. Bi, Y. Wu, S. Mei, C. Ye, X. Zou, Z. Zhang, X. Liu, L. Wei, S. A. Truelove, T. Zhang, W. Gao, C. Cheng, X. Tang, X. Wu, Y. Wu, B. Sun, S. Huang, Y. Sun, J. Zhang, T. Ma, J. Lessler, T. Feng, Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. The Lancet Infectious Diseases. 0 (2020), doi:10.1016/S1473-3099(20)30287-5.
OpenUrl CrossRef PubMed
27.↵
J. Zhang, M. Litvinova, Y. Liang, Y. Wang, W. Wang, S. Zhao, Q. Wu, S. Merler, C. Viboud, A. Vespignani, M. Ajelli, H. Yu, Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science. 368, 1481–1486 (2020).
OpenUrl Abstract/FREE Full Text
28.↵
H. Salje, C. Tran Kiem, N. Lefrancq, N. Courtejoie, P. Bosetti, J. Paireau, A. Andronico, N. Hozé, J. Richet, C.-L. Dubost, Y. Le Strat, J. Lessler, D. Levy-Bruhl, A. Fontanet, L. Opatowski, P.-Y. Boelle, S. Cauchemez, Estimating the burden of SARS-CoV-2 in France. Science (2020), doi:10.1126/science.abc3517.
OpenUrl Abstract/FREE Full Text
29.↵
N. G. Davies, A. J. Kucharski, R. M. Eggo, A. Gimma, W. J. Edmunds, Centre for the Mathematical Modelling of Infectious Diseases COVID-19 working group, Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study. Lancet Public Health. 5, e375-e385 (2020).
OpenUrl PubMed
30.↵
N. G. Davies, P. Klepac, Y. Liu, K. Prem, M. Jit, R. M. Eggo, Age-dependent effects in the transmission and control of COVID-19 epidemics. Nature Medicine, 1-7 (2020).
31.↵
S. Fafi-Kremer, T. Bruel, Y. Madec, R. Grant, L. Tondeur, L. Grzelak, I. Staropoli, F. Anna, P. Souque, C. Mutter, N. Collongues, A. Bolle, A. Velay, N. Lefebvre, M. Mielcarek, N. Meyer, D. Rey, P. Charneau, B. Hoen, J. D. Seze, O. Schwartz, A. Fontanet. Serologic responses to SARS-CoV-2 infection among hospital staff with mild disease in eastern France. medRxiv, in press, doi:10.1101/2020.05.19.20101832.
OpenUrl Abstract/FREE Full Text
32.↵
E. Shim, A. Tariq, W. Choi, Y. Lee, G. Chowell, Transmission potential and severity of COVID-19 in South Korea. Int. J. Infect. Dis. 93, 339–344 (2020).
OpenUrl CrossRef PubMed
33.↵
C. Aschwanden, How Superspreading Events Drive Most COVID-19 Spread. Scientific American, (available at https://www.scientificamerican.com/article/how-superspreading-events-drive-most-covid-19-spread1/).
34.↵
S. R. Bray, B. Wang, Forecasting unprecedented ecological fluctuations. PLoS Comput. Biol. 16, e1008021 (2020).
OpenUrl
35.↵
M. P. Kain, M. L. Childs, A. D. Becker, E. A. Mordecai. Chopping the tail: how preventing superspreading can help to maintain COVID-19 control. medRxiv, in press, doi:10.1101/2020.06.30.20143115.
OpenUrl Abstract/FREE Full Text
36.↵
Rapid Risk Assessment: Coronavirus disease 2019 (COVID-19) in the EU/EEA and the UK - tenth update. European Centre for Disease Prevention and Control (2020), (available at https://www.ecdc.europa.eu/en/publications-data/rapid-risk-assessment-coronavirus-disease-2019-covid-19-pandemic-tenth-update).
37.↵
Variation in government responses to COVID-19, (available at https://www.bsg.ox.ac.uk/research/publications/variation-government-responses-covid-19).
38.↵
E. Paradis, Statistical Analysis of Diversification with Species Traits. Evolution. 59, 1–12 (2005).
OpenUrl CrossRef PubMed Web of Science
39.↵
R. Biek, O. G. Pybus, J. O. Lloyd-Smith, X. Didelot, Measurably evolving pathogens in the genomic era. Trends Ecol. Evol. (Amst.). 30, 306–313 (2015).
OpenUrl
40.↵
nextstrain/ncov (Nextstrain, 2020; https://github.com/nextstrain/ncov).
41.↵
B. Q. Minh, H. A. Schmidt, O. Chernomor, D. Schrempf, M. D. Woodhams, A. von Haeseler, R. Lanfear, IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 37, 1530–1534 (2020).
OpenUrl
42.↵
P. Sagulenko, V. Puller, R. A. Neher, TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 4 (2018), doi:10.1093/ve/vex042.
OpenUrl CrossRef PubMed
43.↵
Orientation: so, what does Nextstrain do? Tutorial: Using Nextstrain for SARS-CoV-2, (available at https://nextstrain.github.io/ncov/orientation-workflow.html).
44.↵
T. M. Therneau, P. M. Grambsch, Modeling Survival Data: Extending the Cox Model (Springer Science & Business Media, 2013).
45.↵
M. Pagel, The Maximum Likelihood Approach to Reconstructing Ancestral Character States of Discrete Characters on Phylogenies. Syst Biol. 48, 612–622 (1999).
OpenUrl CrossRef Web of Science
46.↵
P. C. Austin, A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications. International Statistical Review. 85, 185–203 (2017).
OpenUrl
47.↵
S. Nee, R. M. May, P. H. Harvey, The reconstructed evolutionary process. Philos. Trans. R. Soc. Lond., B, Biol. Sci. 344, 305–311 (1994).
OpenUrl GeoRef PubMed Web of Science
48.↵
S. B. Hedges, J. Marin, M. Suleski, M. Paymer, S. Kumar, Tree of Life Reveals Clock-Like Speciation and Diversification. Mol Biol Evol. 32, 835–845 (2015).
OpenUrl CrossRef PubMed
49.↵
R. S. Etienne, J. Rosindell, Prolonging the past counteracts the pull of the present: protracted speciation can explain observed slowdowns in diversification. Syst. Biol. 61, 204–213 (2012).
OpenUrl CrossRef PubMed
50.↵
A. R. Nabhan, I. N. Sarkar, The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy. Brief Bioinform. 13, 122–134 (2012).
OpenUrl CrossRef PubMed
51.↵
O. G. Pybus, M. A. Charleston, S. Gupta, A. Rambaut, E. C. Holmes, P. H. Harvey, The epidemic behavior of the hepatitis C virus. Science. 292, 2323–2325 (2001).
OpenUrl Abstract/FREE Full Text
52.↵
J. D. Silverman, N. Hupert, A. D. Washburne, Using influenza surveillance networks to estimate state-specific prevalence of SARS-CoV-2 in the United States. Science Translational Medicine (2020), doi:10.1126/scitranslmed.abc1126.
OpenUrl FREE Full Text
53.
Coronavirus Pandemic Data Explorer. Our World in Data, (available at https://ourworldindata.org/coronavirus-data-explorer).

View the discussion thread.

Posted August 26, 2020.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (399)
Allergy and Immunology (708)
Anesthesia (201)
Cardiovascular Medicine (2936)
Dentistry and Oral Medicine (333)
Dermatology (249)
Emergency Medicine (439)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1035)
Epidemiology (12741)
Forensic Medicine (12)
Gastroenterology (828)
Genetic and Genomic Medicine (4581)
Geriatric Medicine (417)
Health Economics (729)
Health Informatics (2914)
Health Policy (1069)
Health Systems and Quality Improvement (1077)
Hematology (388)
HIV/AIDS (924)
Infectious Diseases (except HIV/AIDS) (14097)
Intensive Care and Critical Care Medicine (846)
Medical Education (425)
Medical Ethics (115)
Nephrology (468)
Neurology (4348)
Nursing (236)
Nutrition (638)
Obstetrics and Gynecology (803)
Occupational and Environmental Health (734)
Oncology (2268)
Ophthalmology (646)
Orthopedics (258)
Otolaryngology (324)
Pain Medicine (279)
Palliative Medicine (83)
Pathology (501)
Pediatrics (1196)
Pharmacology and Therapeutics (504)
Primary Care Research (496)
Psychiatry and Clinical Psychology (3752)
Public and Global Health (6934)
Radiology and Imaging (1527)
Rehabilitation Medicine and Physical Therapy (903)
Respiratory Medicine (915)
Rheumatology (437)
Sexual and Reproductive Health (443)
Sports Medicine (385)
Surgery (488)
Toxicology (60)
Transplantation (211)
Urology (180)

[1] 1.↵
F. Wu, S. Zhao, B. Yu, Y.-M. Chen, W. Wang, Z.-G. Song, Y. Hu, Z.-W. Tao, J.-H. Tian, Y.-Y. Pei, M.-L. Yuan, Y.-L. Zhang, F.-H. Dai, Y. Liu, Q.-M. Wang, J.-J. Zheng, L. Xu, E. C. Holmes, Y.-Z. Zhang, A new coronavirus associated with human respiratory disease in China. Nature. 579, 265–269 (2020).
OpenUrl CrossRef PubMed

[2] 2.
P. Zhou, X.-L. Yang, X.-G. Wang, B. Hu, L. Zhang, W. Zhang, H.-R. Si, Y. Zhu, B. Li, C-L. Huang, H.-D. Chen, J. Chen, Y. Luo, H. Guo, R.-D. Jiang, M.-Q. Liu, Y. Chen, X.-R. Shen, X. Wang, X.-S. Zheng, K. Zhao, Q.-J. Chen, F. Deng, L.-L. Liu, B. Yan, F.-X. Zhan, Y.-Y. Wang, G.-F. Xiao, Z.-L. Shi, A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 579, 270–273 (2020).
OpenUrl CrossRef PubMed

[3] 3.↵
N. Zhu, D. Zhang, W. Wang, X. Li, B. Yang, J. Song, X. Zhao, B. Huang, W. Shi, R. Lu, P. Niu, F. Zhan, X. Ma, D. Wang, W. Xu, G. Wu, G. F. Gao, W. Tan, A Novel Coronavirus from Patients with Pneumonia in China, 2019. New England Journal of Medicine. 382, 727–733 (2020).
OpenUrl CrossRef PubMed

[4] 4.↵
J. M. Brauner, S. Mindermann, M. Sharma, A. B. Stephenson, T. Gavenčiak, D. Johnston, G. Leech, J. Salvatier, G. Altman, A. J. Norman, J. T. Monrad, T. Besiroglu, H. Ge, V. Mikulik, M. A. Hartwick, Y. W. Teh, L. Chindelevitch, Y. Gal, J. Kulveit. The effectiveness of eight nonpharmaceutical interventions against COVID-19 in 41 countries. medRxiv, in press, doi:10.1101/2020.05.28.20116129.
OpenUrl Abstract/FREE Full Text

[5] 5.↵
J. Bayham, E. P. Fenichel, Impact of school closures for COVID-19 on the US health-care workforce and net mortality: a modelling study. The Lancet Public Health. 0 (2020), doi:10.1016/S2468-2667(20)30082-7.
OpenUrl CrossRef PubMed

[6] 6.↵
L. Heavey, G. Casey, C. Kelly, D. Kelly, G. McDarby, No evidence of secondary transmission of COVID-19 from children attending school in Ireland, 2020. Eurosurveillance. 25, 2000903 (2020).
OpenUrl

[7] 7.↵
J. Dehning, J. Zierenberg, F. P. Spitzner, M. Wibral, J. P. Neto, M. Wilczek, V. Priesemann, Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science (2020), doi:10.1126/science.abb9789.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
N. Banholzer, E. van Weenen, B. Kratzwald, A. Seeliger, D. Tschernutter, P. Bottrighi, A. Cenedese, J. P. Salles, W. Vach, S. Feuerriegel. Impact of non-pharmaceutical interventions on documented cases of COVID-19. medRxiv, in press, doi:10.1101/2020.04.16.20062141.
OpenUrl Abstract/FREE Full Text

[9] 9.↵
S. Flaxman, S. Mishra, A. Gandy, H. J. T. Unwin, T. A. Mellan, H. Coupland, C. Whittaker, H. Zhu, T. Berah, J. W. Eaton, M. Monod, A. C. Ghani, C. A. Donnelly, S. M. Riley, M. A. C. Vollmer, N. M. Ferguson, L. C. Okell, S. Bhatt, Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature, 1-8 (2020).

[10] 10.↵
E. M. Volz, K. Koelle, T. Bedford, Viral phylodynamics. PLoS Comput. Biol. 9, e1002947 (2013).
OpenUrl CrossRef PubMed

[11] 11.↵
E. Paradis, Assessing temporal variations in diversification rates from phylogenies: estimation and hypothesis testing. Proceedings of the Royal Society of London. Series B: Biological Sciences. 264, 1141–1147 (1997).
OpenUrl CrossRef GeoRef Web of Science

[12] 12.↵
D. L. Rabosky, Likelihood Methods for Detecting Temporal Shifts in Diversification Rates. Evolution. 60, 1152–1164 (2006).
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
The effective reproduction number Rt of an epidemic can be interpreted as the average number of new infections directly caused by a single infected patient. The effective reproduction number equals the basic reproduction number R0 in a fully susceptible population when no mitigation strategy is active.

[14] 14.↵
N. Tj, K. Tw, K. Jb, D. Cb, P. Dp, Speciation in North American black basses, Micropterus (Actinopterygii: Centrarchidae). Evolution. 57, 1610–1621 (2003).
OpenUrl CrossRef PubMed

[15] 15.↵
Z. Ma, A. W. Krings, in 2008 International Conference on BioMedical Engineering and Informatics (2008), vol. 1, pp. 178-185.
OpenUrl

[16] 16.↵
S. Nee, Inferring Speciation Rates from Phylogenies. Evolution. 55, 661–668 (2001).
OpenUrl CrossRef PubMed Web of Science

[17] 17.↵
Between-divergence intervals are only approximately equal to serial intervals (up to variations in the incubation period) when there is a single infectee and the phylogenetic branches does not contain unobserved divergence events. When there are several infectees, between-divergence intervals, which begin with each divergence event, are shorter than serial intervals which begin with the infector’s illness. In contrast, unobserved divergence events result in inflated branche lengths that span several serial intervals.

[18] 18.↵
Materials and methods are available as supplementary materials at the Science Website.

[19] 19.↵
L. D. Fisher, D. Y. Lin, Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health. 20, 145–157 (1999).
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
T. Stadler, D. Kühnert, S. Bonhoeffer, A. J. Drummond, Birth-death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV). Proc. Natl. Acad. Sci. U.S.A. 110, 228–233 (2013).
OpenUrl Abstract/FREE Full Text

[21] 21.↵
J. Hadfield, C. Megill, S. M. Bell, J. Huddleston, B. Potter, C. Callender, P. Sagulenko, T. Bedford, R. A. Neher, Nextstrain: real-time tracking of pathogen evolution. Bioin/ormatics. 34, 4121–4123 (2018).
OpenUrl

[22] 22.↵
Y. Shu, J. McCauley, GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. 22 (2017), doi:10.2807/1560-7917.ES.2017.22.13.30494.
OpenUrl CrossRef PubMed

[23] 23.↵
T. Hale, S. Webster, A. Petherick, T. Philips, B. Kira, Coronavirus Government Response Tracker. Oxford COVID-19 Government Response Tracker, Blavatnik School of Government. Data use policy: Creative Commons Attribution CC BY standard. (2020), (available at https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker).

[24] 24.↵
S. T. Ali, L. Wang, E. H. Y. Lau, X.-K. Xu, Z. Du, Y. Wu, G. M. Leung, B. J. Cowling, Serial interval of SARS-CoV-2 was shortened over time by nonpharmaceutical interventions. Science (2020), doi:10.1126/science.abc9004.
OpenUrl Abstract/FREE Full Text

[25] 25.↵
K. Xu, Y. Chen, J. Yuan, P. Yi, C. Ding, W. Wu, Y. Li, Q. Ni, R. Zou, X. Li, M. Xu, Y. Zhang, H. Zhao, X. Zhang, L. Yu, J. Su, G. Lang, J. Liu, X. Wu, Y. Guo, J. Tao, D. Shi, L. Yu, Q. Cao, B. Ruan, L. Liu, Z. Wang, Y. Xu, Y. Liu, J. Sheng, L. Li, Factors Associated With Prolonged Viral RNA Shedding in Patients with Coronavirus Disease 2019 (COVID-19). Clin Infect Dis, doi:10.1093/cid/ciaa351.
OpenUrl CrossRef

[26] 26.↵
Q. Bi, Y. Wu, S. Mei, C. Ye, X. Zou, Z. Zhang, X. Liu, L. Wei, S. A. Truelove, T. Zhang, W. Gao, C. Cheng, X. Tang, X. Wu, Y. Wu, B. Sun, S. Huang, Y. Sun, J. Zhang, T. Ma, J. Lessler, T. Feng, Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. The Lancet Infectious Diseases. 0 (2020), doi:10.1016/S1473-3099(20)30287-5.
OpenUrl CrossRef PubMed

[27] 27.↵
J. Zhang, M. Litvinova, Y. Liang, Y. Wang, W. Wang, S. Zhao, Q. Wu, S. Merler, C. Viboud, A. Vespignani, M. Ajelli, H. Yu, Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science. 368, 1481–1486 (2020).
OpenUrl Abstract/FREE Full Text

[28] 28.↵
H. Salje, C. Tran Kiem, N. Lefrancq, N. Courtejoie, P. Bosetti, J. Paireau, A. Andronico, N. Hozé, J. Richet, C.-L. Dubost, Y. Le Strat, J. Lessler, D. Levy-Bruhl, A. Fontanet, L. Opatowski, P.-Y. Boelle, S. Cauchemez, Estimating the burden of SARS-CoV-2 in France. Science (2020), doi:10.1126/science.abc3517.
OpenUrl Abstract/FREE Full Text

[29] 29.↵
N. G. Davies, A. J. Kucharski, R. M. Eggo, A. Gimma, W. J. Edmunds, Centre for the Mathematical Modelling of Infectious Diseases COVID-19 working group, Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study. Lancet Public Health. 5, e375-e385 (2020).
OpenUrl PubMed

[30] 30.↵
N. G. Davies, P. Klepac, Y. Liu, K. Prem, M. Jit, R. M. Eggo, Age-dependent effects in the transmission and control of COVID-19 epidemics. Nature Medicine, 1-7 (2020).

[31] 31.↵
S. Fafi-Kremer, T. Bruel, Y. Madec, R. Grant, L. Tondeur, L. Grzelak, I. Staropoli, F. Anna, P. Souque, C. Mutter, N. Collongues, A. Bolle, A. Velay, N. Lefebvre, M. Mielcarek, N. Meyer, D. Rey, P. Charneau, B. Hoen, J. D. Seze, O. Schwartz, A. Fontanet. Serologic responses to SARS-CoV-2 infection among hospital staff with mild disease in eastern France. medRxiv, in press, doi:10.1101/2020.05.19.20101832.
OpenUrl Abstract/FREE Full Text

[32] 32.↵
E. Shim, A. Tariq, W. Choi, Y. Lee, G. Chowell, Transmission potential and severity of COVID-19 in South Korea. Int. J. Infect. Dis. 93, 339–344 (2020).
OpenUrl CrossRef PubMed

[33] 33.↵
C. Aschwanden, How Superspreading Events Drive Most COVID-19 Spread. Scientific American, (available at https://www.scientificamerican.com/article/how-superspreading-events-drive-most-covid-19-spread1/).

[34] 34.↵
S. R. Bray, B. Wang, Forecasting unprecedented ecological fluctuations. PLoS Comput. Biol. 16, e1008021 (2020).
OpenUrl

[35] 35.↵
M. P. Kain, M. L. Childs, A. D. Becker, E. A. Mordecai. Chopping the tail: how preventing superspreading can help to maintain COVID-19 control. medRxiv, in press, doi:10.1101/2020.06.30.20143115.
OpenUrl Abstract/FREE Full Text

[36] 36.↵
Rapid Risk Assessment: Coronavirus disease 2019 (COVID-19) in the EU/EEA and the UK - tenth update. European Centre for Disease Prevention and Control (2020), (available at https://www.ecdc.europa.eu/en/publications-data/rapid-risk-assessment-coronavirus-disease-2019-covid-19-pandemic-tenth-update).

[37] 37.↵
Variation in government responses to COVID-19, (available at https://www.bsg.ox.ac.uk/research/publications/variation-government-responses-covid-19).

[38] 38.↵
E. Paradis, Statistical Analysis of Diversification with Species Traits. Evolution. 59, 1–12 (2005).
OpenUrl CrossRef PubMed Web of Science

[39] 39.↵
R. Biek, O. G. Pybus, J. O. Lloyd-Smith, X. Didelot, Measurably evolving pathogens in the genomic era. Trends Ecol. Evol. (Amst.). 30, 306–313 (2015).
OpenUrl

[40] 40.↵
nextstrain/ncov (Nextstrain, 2020; https://github.com/nextstrain/ncov).

[41] 41.↵
B. Q. Minh, H. A. Schmidt, O. Chernomor, D. Schrempf, M. D. Woodhams, A. von Haeseler, R. Lanfear, IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 37, 1530–1534 (2020).
OpenUrl

[42] 42.↵
P. Sagulenko, V. Puller, R. A. Neher, TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 4 (2018), doi:10.1093/ve/vex042.
OpenUrl CrossRef PubMed

[43] 43.↵
Orientation: so, what does Nextstrain do? Tutorial: Using Nextstrain for SARS-CoV-2, (available at https://nextstrain.github.io/ncov/orientation-workflow.html).

[44] 44.↵
T. M. Therneau, P. M. Grambsch, Modeling Survival Data: Extending the Cox Model (Springer Science & Business Media, 2013).

[45] 45.↵
M. Pagel, The Maximum Likelihood Approach to Reconstructing Ancestral Character States of Discrete Characters on Phylogenies. Syst Biol. 48, 612–622 (1999).
OpenUrl CrossRef Web of Science

[46] 46.↵
P. C. Austin, A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications. International Statistical Review. 85, 185–203 (2017).
OpenUrl

[47] 47.↵
S. Nee, R. M. May, P. H. Harvey, The reconstructed evolutionary process. Philos. Trans. R. Soc. Lond., B, Biol. Sci. 344, 305–311 (1994).
OpenUrl GeoRef PubMed Web of Science

[48] 48.↵
S. B. Hedges, J. Marin, M. Suleski, M. Paymer, S. Kumar, Tree of Life Reveals Clock-Like Speciation and Diversification. Mol Biol Evol. 32, 835–845 (2015).
OpenUrl CrossRef PubMed

[49] 49.↵
R. S. Etienne, J. Rosindell, Prolonging the past counteracts the pull of the present: protracted speciation can explain observed slowdowns in diversification. Syst. Biol. 61, 204–213 (2012).
OpenUrl CrossRef PubMed

[50] 50.↵
A. R. Nabhan, I. N. Sarkar, The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy. Brief Bioinform. 13, 122–134 (2012).
OpenUrl CrossRef PubMed

[51] 51.↵
O. G. Pybus, M. A. Charleston, S. Gupta, A. Rambaut, E. C. Holmes, P. H. Harvey, The epidemic behavior of the hepatitis C virus. Science. 292, 2323–2325 (2001).
OpenUrl Abstract/FREE Full Text

[52] 52.↵
J. D. Silverman, N. Hupert, A. D. Washburne, Using influenza surveillance networks to estimate state-specific prevalence of SARS-CoV-2 in the United States. Science Translational Medicine (2020), doi:10.1126/scitranslmed.abc1126.
OpenUrl FREE Full Text

[53] 53.
Coronavirus Pandemic Data Explorer. Our World in Data, (available at https://ourworldindata.org/coronavirus-data-explorer).