Summary
Background Modelling and projections of COVID-19 using a single set of transmission parameters can be an elaborated because the application of different levels of containment measures at different stages of the worldwide COVID-19 outbreak.
Methods We developed a piecewise fitting SEIR methodology to fit the progress of the COVID-19 that can be applied on any of the 185 countries listed in John Hopkins Coronavirus Resource Center. The contagious contact rate, the rate of removal and the initially exposed population were obtained at three different stages of the pandemic for a set of 18 countries, and globally for the total number of cases worldwide. The active number of infections and the removed populations were fitted simultaneously to validate the SEIR model against the available time series reports on the number of confirmed infections, recoveries and deaths. We evaluate the effect of a reduction of contagious contact rate on the level of burden put on local healthcare infrastructure considering different levels of intervention. As a guideline for future public health interventions, we also estimated the maximum number of future cases and its potential peak date.
Findings We project that the peak in the number of infections worldwide will take place after the third quarter of 2020 with a decline rate that might extend beyond 2020. For 12 out of the 18 countries analyzed, we observe that, following the trend at the date of this study, the number of severe infections will surpass their healthcare capacity. For a 90% reduction scenario of the contagious contact rate, four out of the 18 countries analyzed will undergo a significant delay in the peak of infection, extending the course of the epidemic further than our simulation window (365 days).
Interpretation We identify three stages for the COVID-19 transmission dynamics, which suggest that it is highly heterogeneous between countries and its contagious contact rate, is currently affected by both local responses of the public health interventions and to the population’s adherence to the measures.
Funding No funding received.
Introduction
The SARS-CoV-2 virus responsible for the coronavirus disease 2019 (COVID-19) emerged in China at the end of 2019. First reports from the World Health Organization (WHO) indicated that most of the regions with more than a thousand cases of SARS-CoV-2 infection exhibited an exponential progression in the number of cases within three weeks of the first confirmed case.1 A mortality rate of 3% was suggested, which has reached an alarming value of 15·2% outside of China.2 Notably, the mortality rate and the number of confirmed cases vary from country to country, and as of April 2020, the COVID-19 pandemic has globally reached more than 1,840,000 confirmed cases and 117,000 deaths.3 Initial studies have provided valuable information at the early stages of the epidemic,4 and preliminarily estimates showed that the different effects of governmental actions can decrease the spread to at least some extents.5 Social distancing and isolation of the infected population have been widely adopted measures to control the propagation of SARS-CoV-2. Such non-pharmaceutical interventions have proven to be effective in managing the epidemic in a few countries; nevertheless, limitations associated to the implementation of strict containment policies have led to difficulties in controlling the fast increase in the number of cases.6,7 Currently, Europe and the US report a large number of fatalities despite social distancing measures. As SARS-CoV-2 continues to spread, healthcare systems are facing a multitude of challenges at all stages of the pandemic.8
Analysis on the effectiveness of the interventions aimed at controlling the transmission requires adequate mathematical models to describe the SARS-CoV-2 spreading dynamics. Despite the early efforts to model the spread of COVID-19 in mainland China, the dynamics of transmission does not extrapolate seamlessly to other countries. Based on the time series of two other well-known coronavirus diseases (MERS and SARS), the estimated basic reproduction number of COVID-19, was reported between 2·24 and 3·58, which indicates that almost the totality of the population could be infected unless effective control measures are implemented.9 Compartmental epidemic models such as susceptible-infected-removed (SIR)10 and susceptible-exposed-infected-removed (SEIR) have been widely used to describe several infectious diseases.5,9,10–14 In particular the SEIR model has been recently used to model the progression of COVID-19 in China.4 While, models of a higher complexity have been recently proposed to describe the effects of social distancing and isolation,15 their large number of unknown parameters renders them inadequate to extract accurate information using mathematical fits to the time series of the infections. In this work, we use the susceptible-exposed-infected-removed (SEIR) compartmental model to simultaneously fit the progression of active infections and removed populations. While it is a common practice to fit only the infected compartment, the lack of consideration of the removed population might result in a fit that does not describe the actual progression of recoveries and deaths. We present a piecewise methodology that allows us to compare the progression of the epidemic at different stages of the epidemic. We identify the contagious contact rate, the removal rate and the initially exposed population at different stages of the epidemic. We use the results of the fits to project the number infections for SARS-CoV-2 within a 365-day simulation window. Using our methodology, we make comparisons on the evolution of the infectiousness for 18 countries at different stages of the epidemic. We use the extracted pandemic parameters to make relevant projections on the dynamics of the disease for the oncoming 12 months. We consider that this information can be useful to anticipate the impact of interventions to project healthcare supply needs in the short and the long terms.16
Methods
Data sources
We used the time series of the COVID-19 reported by the Johns Hopkins Coronavirus Resource Center (hereafter JHCRC), starting from January 21 to April 14, 2020.17,18 We selected 18 countries corresponding to ∼10% of the total number of countries currently reported in the JHCRC database. Such a particular choice was made based on the consistency of the data with the SEIR model as evaluated by R2, which is used as a metric of the goodness of fit. We only take into account countries with a number of confirmed cases larger than one per 100k and a number of deaths exceeding 200. Country-level data on population size was extracted from the Department of Economic and Social Affairs of the United Nations.19 To evaluate the burden that the pandemic lays on the country-level healthcare system, we used the number of hospital beds reported by the World Bank Database.20
The SEIR model
The total population N is considered to be constant and is partitioned into four disjoint groups, namely: susceptible, S(t), exposed, E(t), infected, I(t), and recovered, R(t), which are all functions of time. Individuals who have had contact with an infected person are not themselves instantaneously infectious, but rather move to the exposed compartment E(t) where they develop infectious symptoms at an incubation rate σ.21 A set of coupled ordinary differential equations (ODEs) describing the spreading of the disease and the evolution of the partitions can define the SEIR model as follows: subject to the restriction where s(t) = S(t)/N, e(t) = E(t)/N, i(t) = I(t)/N, r(t) = R(t)/N, the coefficients to be determined β ≥ 0, γ ≥ 0 and σ ≥ 0 are the effective contagious contact rate of spread, the removal (including deaths and recovered patients) rate, and the incubation rate, respectively. The time t is taken in days. A local epidemic scenario occurs when the number of the first infected patients increases, namely,. The basic reproductive number, R = β/γ, commonly gives the number of secondary infections generated by the first infected individual.
We solve the coupled system using SciPy’s implementation of the explicit Runge-Kutta method of order 8 (DOP853).22 This methodology enables us to estimate β and γ, as well as the initially exposed population E0 ≡ E(t = 0) by fitting the progression of R(t) and I(t). We use the trust-reflective non-linear least-squares method to minimize the residuals between the SEIR model and the data from the time series. The values R(t) and I(t) were taken as linear combinations of the number of confirmed cases C(t), confirmed recoveries Rr(t) and confirmed deaths Rd(t) with R(t) = Rr(t) + Rd(t)and I(t) = C(t) − R(t). For purposes of following the progression at different stages of the epidemic, the values In = I(t = tn), Rn = R(t = tn) and En = E(t = tn) can be used as inputs of the SEIR model to determine the evolution of the system.
Fitting SEIR parameters
We estimate the contagious contact rate (β) by fitting the data at three different stages of the COVID-19 spread. The parameter β is difficult to estimate accurately because the transmission rate is susceptible to changes related to government intervention. Hence, β can range several orders of magnitude, depending on the government response to contain the outbreak locally. Early reports from Wuhan estimated a value of 0·064 at the initial stage of the spread,23 but after clinical support was adopted by the Chinese government and considered in the models, it increased to 0·322. On the other hand, the removal and incubation rates (γ and σ, respectively) are not directed by population controls; according to empirical data, it was shown that these rates could range in the order of days (from 2 to 14 days).24 Because of this, we impose the following bounds on the possible values that these fitted parameters can take: 1/β ∈ [0,10], 1/γ ∈ [1 day, 1 year], E(t = 0) ∈ [0, N], and finally 1/σ= 5·1 days which is a robust estimation of the average incubation time.25
Piecewise SEIR fitting and projection
We fitted I(t) and R(t) piecewise and optimized the interval ranges until the value of R2was higher than 0·9 (0·8 for the particular case of Mexico which is discussed in the following sections of this work). We defined the ratio of transmission rates from subsequent intervals (Qn = βn+1/ βn), for n = 1,2,3 being the interval number. This estimate is useful to identify the changes in the COVID-19 spread dynamics upon changes in public health policies. Functional prediction bands pb(t) = f(x) ± δ(x) were obtained using δ(x) =(variance)×(inverse of the t-student cumulative distribution).
Model projections on healthcare burden
A fundamental question for decision-makers is whether the healthcare system will be able to cope with the increasing number of patients infected with SARS-CoV-2.26–28 To add this practical perspective into our analysis, we considered the number of hospital beds per capita.20 We split the projected number of COVID-19 into three categories according to their severity: (a) mild cases that do not require hospitalization accounting for 80% of the projected infections, (b) severe disease cases which do require hospitalization, accounting for 15% of the projected infections and (c) acute disease cases, which require admission to the intensive care unit, accounting for 5% of the number of projected infections.29 We have performed projections for four different scenarios: one assuming the best fit of the SEIR model at the latest stage, and three scenarios where we assume a 10%, 50%, and 90% decrease in β. As a metric of the level of healthcare burden under each scenario, we define the healthcare stress as the ratio between the number of infections that require hospitalization at the peak of the disease and the number of available beds.20 The lethality rates were used to assess the status of the COVID-19 outbreak.
Results
Confirmed cases and lethality rates in different countries
The current officially reported lethality rate for COVID-19 is approximately 3·4%.1 Our fitted data indicates a global lethality rate of about 6·87%. Since this estimation considers only confirmed cases, we argue that the former value could be an overestimation introduced by underreporting the actual number of infections. A cluster of European countries (UK, Italy, Spain and France) whose calculated lethality rate range from 10·3 to 12·7% and their number of confirmed cases exceeds 100 cases per 100k encompasses the most affected countries (figure 1). Germany, Austria, Switzerland and the US reported >100 cases per 100k and their lethality rate is similar or less than that of China (4·02%). South Korea, Singapore, Australia and Canada also can be clustered together, being countries with a lower lethality rate (< 3%) and confirmed cases less than 100 per 100k. Brazil, Ecuador and Iran have lethality rates higher than China and their number of confirmed cases is currently increasing (>10 per 100k). Finally, Japan and Mexico show a number of confirmed cases less than 10 per 100k. The lethality rate for Mexico (6·47%) is higher than that of China, however, in this study, Mexico shows the lowest number of confirmed cases per 100k habitants. Such a comparison is only a geographical indication that the epidemic has different dynamics of transmissibility.
Differences in SEIR model fitting across countries
In figure 2b, the time series of the active infections and removed individuals shows an effective change in both contagious contact and removal rates, leading to a crossover in February 15 with a transitory decline in the number of active infections. A second cross-point around March 21 is observed, when the number of active infections takes over the number of recoveries. These features can be interpreted to be due to a lag between the pandemic’s onsets across different regions of the world. Such timing differences introduce an additional heterogeneity due to local reporting and containment policies. To address the heterogeneity of the COVID-19 outbreak, the SEIR model is used to model the progression of COVID-19 in 18 countries (see Supplementary Material). The crossing point between the infected and removed curves is currently missing on all of the countries studied, with a clear exception for China, South Korea and Austria. Because of the inherently high effectiveness of the transmission of SARS-CoV-2, it is expected that the epidemic will extend for several months in some countries. According to the current trend, we project that the peak in the number of infections worldwide will take place after the third quarter of 2020 (figure 2c). We estimate that a 90% reduction in the contagious contact rate (0.1β) will be required to bring the peak of on the number of infections to a date within the second quarter of 2020 (figure 2d).
So far, European countries still lead in the number of confirmed cases and COVID-19 deaths per capita, followed by the United States (see Table 2). With the exception of Japan and Mexico, it seems that other countries studied have managed to slow down the spreading of the epidemic as revealed by lower β3 and Q1,2 < 1 values (figure 3). In the case of Mexico, the goodness of fit is limited (R2 > 0.8) due to a different sampling methodology. Namely, as a response to AH1N1 2009 outbreak in Mexico, the healthcare system implemented a proper sentinel surveillance model, which samples only a few of all SARS-CoV-2 infections;30 therefore, the effect of public policy measures for this country might not be accurately estimated using our predictions. Substantial declines in spreading rates were observed for Austria, China and South Korea. Notably, Austria was an early responder of the epidemic spread by enforcing more stringent restrictions, and thus it seems that is preparing to lift lockdown.31 Removal rates (associated to both: deaths and recoveries in figure 2c) appear to be increasing in all countries, which goes along with a rise in the estimations of lethality rates, and it might also be related with an effective increase in the recovery rates as available data on the progress of the epidemic becomes available.
Two distinct values of Q are defined per country to reflect changes in their contagious contact rate for the three fitted intervals (figure 3d). Values of Q above the dashed line indicate an increase of β from interval n to interval n + 1, whereas values below that threshold indicate a reduction in the transmission rate. Q1 gives a metric of how susceptible the contagious contact rate is to updates in the reported number of cases, as more information becomes available since the early stages of COVID-19 spread for each country. Except for Switzerland, Austria, Iran China and Japan, all other studied countries show Q1 > 1. Overall, the projections show that Q2 < 1 with the exception for Japan that recently reported a surge in new cases; and Mexico, which has recently entered the exponential phase of the epidemic.
Projections on the healthcare burden and effects of containment measures
Assuming that treatment and prevention of the disease do not have an immediate impact on the parameters of the model, our projections indicate that tighter measures should be taken to effectively reduce the spread of COVID-19. Our projections also suggest that stringent measures are required in most countries; with the only exceptions of Australia, Austria, China, Iran and South Korea, which could either continue with their ongoing policies or, up to some safe extent, relax their local interventions (figure 4). As expected, a monotonic decrease in the number of cases at the peak of the infection Ipeak is observed with a decrease in the value of β (see figure 4b). For 12 out of the 18 countries analyzed (figure 4c), the current trend indicates that the number of severe infections will surpass the healthcare capacity (see details in Table 2). According to our projections, Germany, South Korea, Spain and Switzerland would avoid a saturation of the healthcare capacity if they will apply only with a 10% reduction in their current transmission rate (0.9βlatest). We observe that for four out of the 18 countries analyzed, a 50% reduction (0.5βlatest) is enough to avoid the saturation of the available health care infrastructure. We also see that a reduction of 90% (0.1βlatest) will be a failsafe intervention for all the analyzed countries. For such a level of intervention, it is expected that 11 out of 18 countries (not including Australia, Austria, China and South Korea, which seem to have already passed their infection peaks) will reach their peak before the third quarter of 2020. Additionally, it is observed that five out of the 18 analyzed countries will significantly delay the peak of the infection even at 90% reduction. For such cases, the course of the epidemic will be extended further beyond the 365-day simulation window. The countries that follow the former trend are Brazil, Ecuador, Japan, the United Kingdom and the United States.
Discussion
Our multinational modeling of the COVID-19 spread shows a significant heterogeneity of the transmission rates across different 18 countries (10% of those reported in JHCRC). Given that the peak in the number of infections is projected to occur at different times across countries (Table 2), it is not feasible to instrument a single policy recommendation that will effectively curb the spread of the epidemic for all countries. For most cases, it is expected that the course of the pandemic will be extended way past the third quarter of 2020, when the number of new cases is projected to decline. Our projections suggest that local measures imposed in most countries so far have not been enough to contain the spread; at least at the time they were implemented. In some scenarios, the likelihood of significantly reducing the number of cases and mortality rates would require tighter measures to reinforce social distancing.16 Countries that acted earlier benefited from the social lockdown and those countries facing a more challenging COVID-19 spread should enforce more stringent measures to avoid overwhelming the capacities of their healthcare systems. Should the course of the pandemic continue as it has so far, our model projects that the global peak of confirmed cases will be reached after the third quarter of 2020. It should be bear in mind that, although the date of the peak on the number of infections is projected to occur around the fourth quarter of 2020, the number of active infections and hence the likelihood of becoming infected remains high until the number of contagious individuals drops to a negligible number. This means that containment measures should be extended well beyond the peak of infections, for at least the same period required to reach the peak. Our approach to fit different stages of the COVID-19 spread is useful to evaluate changes in both, government response and adherence to the public health policy.32 Notably, our proposed QT metric evidences changes in the contagious contact rates and elucidates country-specific changes in the rate of spread after implementation of measures to contain the virus. So far, social distancing has been implemented differently across countries, perhaps based on local considerations aimed at reducing the socio-economic impact on vulnerable populations.33 Preliminary data from China demonstrates that public health interventions including traffic restrictions, social distancing, home quarantine, and universal symptom survey can be temporally associated with a reduced effective reproduction number of SARS-CoV-2.34 This fact is in agreement with our data, which shows that China experienced significant reductions in β values; indicating a containment of the COVID-19 spread. Austria and South Korea are additional examples of appropriate public health measures, which have effectively curbed the total number of cases.35,36 Additional modelling efforts are required to retrospectively assess the effect of social distancing in curbing the pandemic worldwide. The latter likely indicates that countries at early phase of the spread (e.g. Africa) would experience long-term benefits if the governmental policies are adopted before the onset of the exponential progression. Notably, our analysis shows that a larger decrease in the transmission of SARS-CoV-2 is needed to mitigate the healthcare stress (see figure 4c) and decrease the number of total cases. A side result of flattening the peak of infections is that, depending on the local progress of the epidemic, even a stringent reduction in β could have the effect of prolonging the time to reach the peak in the number of infections. The relevance of decreasing the contagious contact rate stems from the fact that the lethality rate could be exacerbated once the number of severe and acute cases exceeds the healthcare capacity.37 With respect to mobility, the citywide quarantine at Wuhan since January 23, 2020, had negligible effect on the epidemic trajectories for the rest of the country, demonstrating that a 50% reduction in inter-city mobility leads to a negligible effect on epidemic dynamics.4 Therefore, we concluded that accounting for inter-city and inter-country mobility in modeling countries where the epidemic is already present is not of paramount importance and could mostly be neglected in the model. Given the heterogeneity of the COVID-19 outbreak, our projections should be helpful in understanding the effect of reducing β under different levels of intervention.
We fitted the time series of both confirmed and removed populations, which provides higher confidence in our estimations (see Table 1). However, some limitations to our analysis can significantly influence the model predictions. For instance, our analysis does not consider the fact that the number of confirmed cases could be underestimated (accordingly to official reports). Some countries such as South Korea, Germany and to some extent the US have enacted massive testing which has shown benefits in reporting the spread of the disease,38 whilst others countries such as Mexico have enacted a proper sentinel surveillance model because of effectiveness in tracking outbreaks in other respiratory virus infections.39 It is noteworthy to mention that using only the official reported numbers our model lessens the effect of methodological biases introduced by curating local reporting approaches. An underestimation in the number of active infections arising from the use of potentially under-sampled data40 could lead to offsets in our forecasted peak date and in the expected number of peak infections. A second limitation of our approach is that, in contrast with Bayesian models, no prior distribution for constraining the parameter values to a credible interval is used when fitting the models; this might decrease the precision of the estimations in countries with limited data due to a lag in the pandemic’s spread. Because the model parameters were loosely bounded (see Methods), the basic reproduction number obtained for some countries do not precisely agree with other reports for the early onset (see Supplementary Information). In our study we emphasize the comparison of the contagious contact rate at different stages of the epidemic, based on breaking down the logarithmic progression of the time series using a piecewise fitting methodology, with R2 used as a metric of the goodness-of-fit. This allows us to confidently make comparisons of the progress of the pandemic at different stages and across different countries.
Ongoing data collection and epidemiological analysis are essential parts of assessing the impacts of mitigation strategies against COVID-19. In this study, a multinational SEIR modeling considering three different epidemiological stages was used to compare the COVID-19 spreading dynamics in both global and local scenarios. This allowed us to identify the effectiveness of the containment measures across 18 countries and determine the potential burden to the healthcare system assuming three different levels of reduction of the contagious contact rate. Our estimations for the global progress of COVID-19 indicate that the maximum number of active infections will take place after the third quarter of 2020. However, a few countries will experience a decline within the coming weeks. As the contagious contact rate decreases, we observe a monotonic decrease in the peak number of infections. Nevertheless, it should be noted that, depending on the local state of the spread, a side effect of flattening the peak of infections is that some countries will experience a considerable delay of the evolution of COVID-19. For all studied countries, a 90% reduction in the contagious contact rate is required to avoid overwhelming the healthcare capacity. COVID-19 exhibits a heterogeneous spread, a fact that can be related to several variations in both timing and efficacy of the containment measures implemented in each country. Our results provide informative guidelines to avoid healthcare overcapacity and weigh the long-term effects of more lenient policies.
Data Availability
The authors confirm that the data supporting the findings of this study are available within the article and/or its supplementary materials.
Contributors
EML and FFCT conceived the study; EML and FFCT designed the model; EML programmed the SEIR model; EML, FFCT, JJN and OYBC analyzed the data; JJN, OYBC, EML and FFCT interpreted the projections; EML, FFCT, JJN and OYBC wrote original paper. All authors made data curation and resources, and approved the final version for submission.
Source code
Python sources for this work are available at https://github.com/erickmartinez/covid19
Funding Sources
No funding received.
Supplementary results
S1. China
S2. Europe
S3. Asia and Australia
S4. Middle East
S5. North and South America
S6. Basic reproductive number estimations