Abstract
The role of lockdown measures in mitigating COVID-19 in Mexico is investigated using a comprehensive nonlinear ODE model. The model includes both asymptomatic and presymptomatic populations with the latter leading to sickness (with recovery, hospitalization and death possibilities). We consider the situation involving the imposed application of partial social distancing measures in the time series of interest and find optimal parametric fits to the time series of deaths (only), as well as to that of deaths and cumulative infections. We discuss the merits and disadvantages of each approach, we interpret the parameters of the model and assess the realistic nature of the parameters resulting from the optimization procedure. Importantly, we explore a model involving two sub-populations (younger and older than a specific age), to more accurately reflect the observed impact as concerns symptoms and behavior in different age groups. For definitiveness and to separate people that are (typically) in the active workforce, our partition of population is with respect to members younger vs. older than the age of 65. The basic reproductive number of the model is computed for both the single- and the two-population variant. Finally, we consider what would be the impact on the number of deaths and cumulative infections upon imposition of partial lockdown (involving only the older population) and full lockdown (involving the entire population).
I. INTRODUCTION
COVID-19, the disease caused by the novel coronavirus SARS-CoV-2 has, as of August 2020, affected 216 countries and changed the daily lives of billions of people [1]. It has at the same time been the focus of numerous studies, both clinical and mathematical in nature. The study of compartmental models that address the spreading of such epidemics has a time-honored history since the seminal contribution of [2], which by now has been summarized in various reviews [3] and books [4-6]. In recent years, variations to such models focusing on the particularities of coronaviruses have been incorporated, such as the role of asymptomatic carriers of the virus, both as regards earlier CoV examples, such as MERS (see for a related example the work of [7]), and lately in the case of COVID-19 (see for a related example the work of [8]).
Following on some of these more recent developments, the present study focuses on a comprehensive compartmental epidemiological model that takes into account some of the intricacies of COVID-19, while considering its applicability to an urgent and important case example, namely the country of Mexico. More specifically, the model is an extension of the standard SEIR (Susceptible, Exposed, Infectious, Recovered) model that includes a presymptomatic stage, during which a person experiences no symptoms, but is nevertheless infectious [9]. The proposed mathematical setup also accounts for both the asymptomatic infectious and symptomatic infectious individuals. Asymptomatic infectious cases have been found in numerous studies and reports argue that they may be significantly under-reported [8, 10], which may complicate mitigation efforts such as contact tracing and self-isolation. Those with severe disease symptoms may require lengthy hospitalization, which has strained the health system of many countries [11]. In light of that, the model also includes a compartment describing the hospitalizations. Another distinctive feature of the disease is the nonhomogeneity with which it manifests in different age groups, especially as it pertains to symptom severity and mortality risk [12, 13]. Other factors, such as preexisting conditions and intergenerational contacts may also play a role [14]. While population age-structure may be often averaged out and deemphasized in numerous modeling attempts [15], in our model we choose to consider both a single age-group and a two age-group version of the model. The rationale behind this choice is the multifold inhomogeneity in the population of various countries (including our example of interest). Firstly, as mentioned above, the severity in younger people (especially children [16]) is smaller than that in adults. Secondly, older and more vulnerable people may shed more viral particles, thus being more infectious [17]. Thirdly, contacts per day [18] and the contact network itself of older people is different from those of younger people. The partition especially between professionally active (i.e., non-retired) individuals and retirees is important in connection to the above two points, both as regards the differential in average number of contacts of these two groups, and as regards the potential vulnerabilities thereof.
As a case study, we focus on the COVID-19 outbreak in Mexico. While studies for Mexico based on mathematical models exist, they differ from ours in several significant ways. Some ignore social-distancing and other mitigation measures [19], others focus on the estimation of R0 and infections, using a Bayesian hierarchical model [20], and yet others have since become outdated [21]. Mexico faces a unique challenge, due to the prevalence of COVID-19 risk factors, such as obesity, diabetes and hypertension, among its population [22]. This is being reflected in the reported data and our predictive results, which show almost as many fatalities in the < 65 years old group as in the > 65 years old group. We feel that these particular features of the Mexican population in conjunction with the large number of infections and especially of deaths in the country warrant an examination through the prism of different age-structured models (e.g., single-age vs. two-age models; in future studies, possibly further partitioning may be of interest) and an assessment of the potential impact of lockdown measures in the immediate future.
Following the formulation of the single-population model and the results obtained through it in section II, we continue with the two age-group model in section III. In each case, we obtain the optimal parameters of the model in matching the available data regarding deaths, which are considered to be the single most reliable piece of available information. We do discuss the advantages and disadvantages of potentially matching the number of cumulative infections (and the number of deaths). Once the optimal fitting parameters are obtained we assess the impact in both deaths (but also cumulative infections) of immediate lockdown measures in either the case of the entire population or in that of just the older age group. The prediction of the model is that thousands of lives may be saved in just the following month alone, should such measures be imposed effective immediately. In section IV we summarize our conclusions and present ideas for future investigation.
II. SINGLE-POPULATION MODEL
A. Equations
In the model presented herein, we modify somewhat the setup of the earlier work of a subset of the present authors [23], by incorporating the effect of presymptomatic individuals. More concretely, we start with a susceptible (S) population that can become exposed (E) to the SARS-CoV-2 virus upon interaction with three categories of already infected individuals: (a) the presymptomatic (P), individuals who are infected, infectious, and eventually will develop symptoms; (b) the asymptomatic (A), individuals who are infected, infectious, and will not develop (clinical) symptoms; and (c) the symptomatically infected/sick (I) population members carrying the virus (infectious). Upon such interaction, the susceptible become exposed to the virus.
Notice that in the susceptible population, there is the potential for “removal” (i.e., death) by other causes, via the last term of the corresponding ODE proportional to µ, however this term is practically irrelevant for our purposes. For this reason, we set µ =0.
Once a member of the population becomes exposed (E), a latent period (τl =1/σ1) of the virus follows (expected to be in the vicinity of 3 days [24]), during which the exposed population is infected but not infectious. After this period, we assume that the host can naturally be partitioned to either asymptomatic (A) or presymptomatic (P). The fraction of the former is ϕ, while of the latter 1 − ϕ. While both A and P play a role (along with the infected I) in further transmitting the virus, and indeed A have been argued to play a crucial role [9, 10], it is only P that will present symptoms after an additional time scale, the preclinical period τp =1/σ2. The incubation period, i.e., the period from infection to the development of (clinical) symptoms, for this partition τinc = τl + τp =1/σ1 +1/σ2 is of the order of 5 days [24], and represents the time till the onset of symptoms.
From there on, the asymptomatics A continue as if nothing happened, given that they have minimal or no symptoms. Their path is only towards recovery with a characteristic rate MAR, or with a time scale representing the asymptomatic infectious period that is typically expected to be on the order of 7 days. This is why in most countries quarantine is expected to last around 14 days (7 days of infectiousness till recovery and another 7 till recovery of any other person that may be infected among the ones in close contact with the person of interest). In order to distinguish those recovered from asymptomatics (which cannot be directly monitored, unless extensive testing is performed in a community) from those coming from a path of symptoms/sickness (which can be –at least partially– monitored), we denote those recovered from A as AR.
The path of the presymptomatics P is more complicated. Indeed, these may still recover (R) without the need for hospitalization and without severe manifestation of symptoms (going through class I). However, in a fraction γ of the cases hospitalization is needed. Recovery or hospitalization in the model is associated with a time scale 1/M, the symptomatic infectious period . Subsequent steps involve a fraction ω of the hospitalized that die, over a time scale 1/ψ and a fraction 1 − ω that recover over a time scale 1/χ. The above offers an, in principle, complete description of the modeled quantities within our system. We should note that β refer to the coefficients of interaction between A (or P) and S, as well as I and S, leading to new infections; these are, respectively, βSA and βSI. We note here, that we somewhat abuse notation as far as β’s are concerned. Namely, it is relevant consider the transmission rate
, where N is the total population. What we report in the tables that follow is actually
. In terms of the relevant time scales, the infectious period associated with symptomatically infected is
, with asymptomatic individuals
, whereas the infectious period associated with presymptomatic individuals is
.
The transcription of the above steps in equations leads to the following ODEs:

A schematic diagram of the model is shown in figure 1.
In the practical aspects of what follows, we will consider the data for Mexico, with a total population of 127,575,528 people (in 2019) and more than 430,000 confirmed cases and 47,000 deaths by the beginning of August 2020. Data were taken from “Dirección General de Epidemiología (DGE)” of “Gobierno de México” [28]. These data have the particularity of including each clinical case, from which we retrieve three basic pieces of information: the date when symptoms start, the death date (if applicable) and the age. The data are updated on a daily basis. As the way of measuring always implies an underestimation of the number of cases and deaths in the days close the report’s (especially because of the delay in death communications to the DGE), we have performed fits up to dates about 20 days from the report date (i.e. the report date is July 29 and fits are performed until July 10).
Schematic diagram of the single-population model (left panel) and a diagram of the two disease progression pathways: the asymptomatic and the symptomatic one (right panel, based on a relevant variation of [8] adapted to the specific compartments and time scales of the present model).
Our principal diagnostic quantities in order to obtain the optimal parameters for the model will be the time series for deaths D(t), but we will also monitor the cumulative infections. The latter in the realm of the present model amounts to C(t)= I(t)+ H(t)+ R(t)+ D(t), i.e., the sum of the individuals going through the part of the network involving the symptomatically infected. We take t =0 as March 22, and fit data to July 10. This reflects our effort to be (in terms of the total numbers of both diagnostics) well within the “well mixed” regime from the beginning of the infection where the ODEs of interest are expected to be relevant.
We are particularly interested in the effect of non-pharmaceutical intervention strategies on the spreading and development of the disease. Such strategies render the transmission rates time dependent. On April 21 (tq = 30) a light form of social distancing was enforced in Mexico. In addition, we will consider a number of scenarios according to which a more severe lockdown may be imposed on August 10 (tL = 141).
We are particularly interested in how non-pharmaceutical intervention strategies modify the spreading and development of the disease. Such strategies render the transmission rates time dependent. On April 21 (tq = 30) a light form of social distancing was enforced in Mexico. In addition, we will consider a number of scenarios according to which a more severe lockdown may have been imposed on August 10 (tL = 141).
The effect of interventions on the overall transmission rate β may be estimated by considering biological and physical properties of expelled respiratory droplets, which are the carriers of the pathogens and specifically of SARS-CoV-2. The transmission rate is usually written as β = cp, with c the number of contacts per day a susceptible individual has and p the transmission probability. For pathogen transmission via infectious respiratory droplets (of diameter d), be they airborne (airborne transmission) or settled (contact transmission), the transmission rate has been expressed as, cf. Refs. [25] and [26] (for airborne transmission)
where κd is the respiratory droplet emission rate (viral shedding) by e.g. breathing, speaking, coughing, sneezing, αd is the droplet effective removal rate, by e.g. gravitational settling, ambient airflow, pathogen inactivation, and βd the transmission rate per deposited respiratory droplet. The subscript “d” refers to a specific droplet size. As we will use it Eq. (1) to estimate how the overall transmission rate changes we will neglect its complex dependence on d. Lastly, the droplet transmission rate βd depends on the number of effective contacts a susceptible has with other individuals, the number of pathogens contained in an infectious droplet (its pathogen load), and the virus transmission probability per inhaled/deposited droplet.
Social distancing, or other lockdown measures that restrict human mobility, decreases the average number of daily contacts, and possibly the average duration of contact, thereby decreasing βd (and consequently β). In what follows we denote this fractional decrease as ηSA and ηSI. Another common intervention measure is the use of surgical face masks. Milton et al. (2013) [27] argued that their use produced a 3.4-fold reduction in viral aerosol shedding. Face masks also render physical contact of infected hands with susceptible areas on an individual’s face (mouth, eyes, nose) more difficult. They, also, modify the expelled air flow, with consequential effects on droplet transport and dispersion in the environment and their airborne lifetime (thus their removal rate). We suggest that the combined effect of wearing surgical face masks is primarily conditioned by the decrease in viral shedding. We, thus, estimate that their use (if the whole population used them continuously and correctly fitted) would decrease the overall transmission rate to ~ 0.2β.
The effect of lockdown measures is modelled herein by the parameter ζ, Eq. (2): we set the after lockdown transmission rate to be ζβ. This parameter incorporates the effect of all lockdown measures, including the requirement that face mask be worn. As the previous estimate leads to a considerable decrease in the transmission rate, we decided to be conservative and we varied ζ from 1.0 to 0.5.
These features should be kept in mind, as we aim not only to capture the current trend of the pandemic, but also to suggest mitigation strategies that may reduce the cumulative infections, as well as the fatalities as a result of COVID-19 in the time series of interest, namely in the case of Mexico. In what follows, we suggest possible scenarios of social distancing, and we accordingly evaluate their impact as regards the potential reduction induced in the number of deaths and cumulative infections for the time series of interest that is obtained from the DGE data.
To explore the impact of further lockdown effects to mitigate the spread of the infection, we will consider the following time dependence for the β’s:
with 0 <ζ ≤ 1, as argued, and tq, tL previously specified. For ζ =1, Eqs. (2) reduce to the equations modelling the decrease in the number of personal contacts when a light form of social distancing is imposed (no lockdown, no requirement to wear face masks). Figure 2 shows the time-dependence of the β’s. The first jump in the value of βSI and βSA (i.e., the transmission rates to infected and presymptomatic-asymptomatic virus carriers) occurs at tq. The next jump occurs at tL and is assumed to take place only for the presymptomatic or asymptomatic carriers of the virus through the imposition of lockdown restrictions at t = tL.
As our data will show below, the growth of fatalities as well as infections in the country is considerable and thus our model suggests the relevance of the application of significant lockdown restrictions so that the progress of the infection be substantially curbed.
In the parameters of the single-population model, we assume the following constraints:
ηSI ≤ 1
ηSA ≤ 1
5 ≤ 1/σ1 +1/σ2 ≤ 6
The first two are rather trivial (and without loss of generality) implying that we go from a value of the transmission rate β, to a lower value β ×η in both the interactions of S with A (or P) and in those of S with I. The third constraint is based on observations associated with SARS-CoV-2 [24] positioning the incubation period associated with this virus between roughly 5 and 6 days from the start of exposure.
Single-population model. Transmission-rate time dependence (βSI and βSA). For the asymptomatic carriers of the virus the transmission rate, which initially decreases due to social distancing (reflected in ηSA) is further assumed to decrease by a factor ζ (shown for different cases), reflecting the effect of obligatory wearing of (effective) face masks or other lockdown-based decrease of contacts. For the infected population instead, we assume that βSI decreases only once due to social distancing, (ηSI), but it does not further decrease, given the self-isolation of such individuals due to the presence of symptoms.
B. Results
We now present the results determined by an optimization procedure that identifies under the above minimal constraints the optimal parameters of the model in comparison to the time series of data obtained from the Government of Mexico [28]. Table I shows the optimal parameters found from minimizing the norm
with α1 =0, α2 =1 as displayed in Fig. 3. We will compare these results shortly with the case of α1 = α2 =0.5 shown in Fig. 4. The rationalization of these two choices is as follows. In the former case, we take the view that the only “ground truth” data is that of the deaths; in fact, even those can be under-estimated (deaths with “suspicion” of COVID-19, but no definitive test) or –perhaps less likely– over-estimated (deaths attributed to COVID-19 without explicit testing), but here we will assume that this is the most well-defined piece of data, as is generally expected to be the case. On the other hand, infections are broadly expected to be under-reported. This is because many of the cases with symptoms do not get to be serious enough to lead to hospitalization or to be reported. In that light, it is expected that assuming the deaths as ground truth, we should expect to find a significant over-estimation of the number of infections (we return to this point below). If, on the other hand, we “force” the model to match the current reporting of infections, then we will end up with a better approximation “on average” to both curves but with a potential adverse by-product in the resulting number of deaths that we will discuss below. This will be due, in addition, to issues of measurement errors in the setting of cumulative incidences, as discussed, e.g., in [29].
Our observation in Fig. 3 is that the model can provide an excellent fit to the total number of deaths, but in that case, there is a considerable over-prediction of the number of infections, presumably because numerous of the incurred infections are not reported in the official data. The results can be compared with Fig. 4 where both deaths and cumulative infections are attempted to be fitted, i.e., α1 = α2 in the minimization procedure above. Here, we see that while the model is capable of doing a very adequate job in following C(t), it also does a reasonable job of capturing D(t). Nevertheless, there is a caveat to the latter. A closer observation of the semilogarithmic scale of the graph will lead the astute reader to observe that as the model is trying to juggle the optimization of both time-series, it slightly over-predicts deaths early on, slightly under-predicts them at the middle of the time series and eventually slightly overpredicts again at longer times, likely predicting a much more catastrophic scenario (with multiple hundreds of thousands of deaths at the end of the examined evolution) than is warranted by the data trends. For this reason, we will stick to the consideration of the former case of Fig. 3 hereafter.
Before we discuss the implications of mitigation strategies, let us briefly comment on the optimal parameter values identified by the model, as illustrated in Table I. The latent period 1/σ1 is indeed found to be in the vicinity of 3 days (2.8765), while the total incubation period is comfortably within the prescribed interval of 5-6 days (1/σ1 +1/σ2 ≈ 5.3836). The time scale of recovery for asymptomatics is a little under 7 days as expected (1/MAR =6.0983), while the time scale of going from symptomatic infected to hospitalization is close to 4 days which is also fairly reasonable (1/M =3.6221). The model predicts a large fraction of asymptomatics (ϕ =0.8134) which is in line with discussions such as that of [10]. However, such results should be taken with a grain of salt. This is because of issues associated with the notion of identifiability [30]. In particular, a systematic analysis of the model in connection to identifiability (the formal mathematical details of which are outside the scope of the present work) suggest that ϕ itself will not end up being an identifiable parameter, but rather the product of ϕ with β’s will be on such. As a result, we do not significantly focus on the larger value obtained for ϕ, but we do note it. Interestingly, the model predicts that roughly 30% of those symptomatically infected need hospitalization, while the rest recover. Of those hospitalized, nearly 30% result in fatalities, leading to a death percentage of about 10% among those that present symptoms. Indeed, this is a rather significant fatality percentage accounting for the large number of deaths in the population. The average time scale of recovery upon hospitalization is about 10.7 days (1/χ), while that of death (1/ψ) is found to optimally be near 11.5 days. However, here too we should highlight that ω and ψ are not independently identifiable (only their product is) and neither is (1 − ω) and χ (again only their product is). Hence, the relevant optimization parameter values should be considered as coming together with the corresponding caveats (and the relevant products as being the genuine model derived quantities), despite their realistic individual values. It is relevant to note that at the initial time of the model, it is not known how many members of the population are exposed, asymptomatic or presymptomatic. We thus optimize those parameters too, obtaining the last 3 entries of the Table.
Our aim is now to explore the impact of mitigating measures reducing the obtained optimal case by a factor of ζ< 1 (ζ =1 is the current optimization result without additional measures). Table II summarizes the results of Fig. 5, in which the predictions for ζ =0.9, 0.7, 0.5 are displayed. It can be seen that a decrease of β by a factor between 0.9 and 0.5 could have a catalytic result as concerns the predictions of the model both for the deaths and as regards the cumulative infections. In particular, a decrease of ζ by a factor of even as little as 0.9 will reduce the number of deaths by more than 800, only within the time frame between August 10 and September 10, while a decrease by a factor of 1/2 is predicted within this single population model to save close to 4000 lives in this interval alone. The corresponding effect to the number of infections is perhaps even more striking (also accounting for the fact that many of these infections could lead to fatalities at a later stage). In particular, a decrease of ζ to 0.9 leads to about 24000 less infections, while to a factor of 1/2 would lead to about 100000 less infections again just within this 31-day period. One can clearly see the significant potential impact of further lockdown measures, a feature that may be worthwhile to factor into further public health considerations.
Optimal parameters for the single population model. For a discussion thereof, see the text.
Single-population model. Number of cases C(t) (left) and of deaths D(t) (right) found by minimizing norm (3) with α1 =0 and α2 =1. To produce the fit we have used the data until July 10. The prediction of the optimized model is shown by the solid line, while the official time series [28] is given by dots.
As in the previous figure, but for α1 = α2. The number of cases C(t) is captured significantly better, but the lower accuracy in capturing D(t) and its implications are further discussed in the text.
Single-population model. Predictions for C and D at September 10 if lockdown measures had been applied on August 10.
Single-population model. Evolution of the number of cases (left) and of deaths (right) for different ζ if lockdown had been applied on August 10 (t = 141). Notice the significant curbing of the pandemic as a result of such intervention measures, especially so at the level of long term effects in the case of ζ =0.7 and ζ =0.5.
As our final comment about the single population variant of the model, we note that an important quantity in epidemiological models of this general type is the notion of the basic reproductive number R0; see, e.g., [3]. The number effectively represents the expected new infections (so-called secondary infections) from a single infection in a population where all subjects are susceptible. Using the so-called next-generation approach [31], we can determine the basic reproductive number on the basis of the parameters of the model and the susceptible population [see the Appendix for details], according to the expression:
with S* the initial susceptible population, namely S* =1 as parameters β are normalized by N, as discussed above. For the parameters in Tab. I, one obtains a value of R0 =2.0755. The effective reproductive number at the beginning of social distancing measures (t = tq) is Re =1.5745 which is unable to mitigate the pandemic effects. In order that Re < 1, it is needed that ζ is smaller than 0.8506. That is why among the case examples that we considered, those with ζ =0.7 and 0.5 present a significant decrease in the number of deaths not only imminently (i.e., within the interval of August 10 to September 10) but also over the longer scale prediction of Fig. 3.
III. TWO-POPULATION MODEL
A. Equations
We now turn to the two-population variant of the model. Recall that due to the different structural characteristics of the two populations as considered herein, namely below and above 65 years, we expect that this model will be more adequate in capturing both the deaths and the cumulative infections of the full population. This is because on the one hand, the younger population in our considerations is more active (belonging typically in the workforce), hence carries a different number of contacts. On the other hand, the older population has its own vulnerabilities to the impact of the virus SARS-CoV-2 and the associated disease, namely COVID-19. On the other hand, as explained in the Introduction, the prevalence of various risk factors within the Mexican population [22] render this partition even more relevant to consider towards capturing the detailed data trends.
Firstly, we discuss the mathematical structure of the model. Here, we basically assume that each of the populations has its own set of parameters. Thus, the superscript y will denote the parameters associated with the younger population, while the superscript o will be connected to the older population. Naturally, the number of variables (S, E, A, P, I, H, R, D) now doubles with each part having a younger and an older component. It is worthwhile that for some quantities that are associated with the virus, such as σ1 and σ2, characterizing the latent and incubation time thereof, we assume these to be independent of age. Lastly, we explore a mildly anisotropic variant of the model where in terms of the interactions βo ≡ βoy = βyo = βoo and βy ≡ βyy. That is to say, we assume that the older population has a different interaction within itself and with the younger individuals, than the younger members of the population between themselves [18]. This is a reasonable assumption under the present conditions where the more sensitive older members of the population are advised to reduce their interactions. While, in principle, we could have used a fully anisotropic variant of the model, we find it relevant to attempt to reduce the overall number of model parameters, hence the above choice.

In a natural extension of what we assumed for the fitting of the single population model, we assume that as a result of the partial measures already in place, the value of each of the β’s has a temporal variation of the form:

This implies that each of the β’s (associated with younger or older populations) has been reduced by a respective factor of η (with the corresponding superscript).
Again, extending the respective constraints (without loss of generality for the η’s) to each population, as well as the incubation time expectation, we impose the following conditions on the proposed optimization.
5 ≤ 1/σ1 +1/σ2 ≤ 6
Recall that part of our motivation involves the intention to consider the impact of potential measures in the form of social distancing and the associated reduction of contacts (as well as the additional use of protective measures such as face masks etc.). In that light, for each of the two populations, we assume again that the infected members maintain (in their quarantine) their β’s, while the circulating asymptomatic individuals would be affected by these measures through a further reduction of their β’s by factors ζy and ζo. The associated temporal dependence of the transmission rate factors then reads:

Figure 6 shows the time-dependence of the β’s. This Figure is dedicated to the scenario in which only the old population is required to adhere to the additional restrictive measures (suggested to be imposed as of August 10), i.e. ζy =1 and ζo ≡ ζ. This is intended to explore the mitigation of contacts within the older population alone. This approach has been argued in other cases to be a reasonably safe approach towards attempting to return to economic and social stability [32]. Later, we will explore the possibility of lockdown measures imposed on both older and younger populations. The latter will be more relevant to compare with the single population results of the previous section.
Once again, the optimal parameters can be found through our optimization procedure stemming from minimizing the norm
with α1 =0, and α2 =1 as displayed in Fig. 7. The optimal parameters in this case for the two population model are given in Table III. We have also performed the optimization for α1 = α2, finding (results not shown here for brevity) that it bears similar advantages (such as capturing more adequately C(t) and similar pathologies such as the overestimation of deaths, most notably in the younger population, but also in the older population).
We can observe by comparing the optimal coefficients of the model that the nature of the data for the two populations (older and younger) leads to a corresponding selection of the optimization coefficients so that some suitable inequalities arise. In particular, it is evident for the data of Table III that the transmission rates are considerably higher both for asymptomatic (presymptomatic) and also for infected individuals in the older population, in comparison to the younger one. The model predicts a comparable fraction of asymptomatics within the two populations, although again given the issues of identifiability, we do not assign any special weight to this observation. We can see that also within the symptomatic component of the respective generations, the younger population has a far smaller γ signifying a far smaller fraction of individuals that are symptomatically infected and need hospitalization, in comparison to the older population. Similarly, the fraction among those hospitalized that are led towards fatalities is more than triple in the older population in comparison to the younger one (i.e., comparing the respective values of ω). Once again though, we do highlight that one should not really interpret ω and ψ separately, but only the product thereof (which is still significantly larger for the older population) and similarly not consider (1 − ω) and χ separately but once again their product which suggests a significantly larger recovery rate for the younger population.
Optimal parameters for the two populations model.
Two-population model: Time dependence of βSI and βSA under the existing policy (ζ =1) and under the effect of additional (suggested) lockdown measures when they are applied to the older (than 65) population, i.e. ζo =1 and ζy ≡ ζ.
We now turn to the comparison of the implications of maintaining ζ =1 (i.e., the current policy) vs. the projection upon a reduction of ζ by a factor of 0.9 to 0.5 as before. The relevant results can be found in Table IV and Fig. 8. It is evident from these data that a corresponding reduction of the ζ factor by means of social distancing or lockdown measures is predicted by the model to reduce the number of infections by anywhere between nearly 16 to about 75 thousand in the two respective cases. The respective projected implications in the loss of human life range from about 400 (in the former case) to over 1800 people (in the latter case) in this interval of a month. Clearly, the projection of the model, both for the specific interval of August 10 to September 10 that is specifically monitored, but also over the longer time scale presented in Fig. 8 seem to warrant the consideration of such measures, to the degree possible.
Let us now touch upon the case where the relevant measures are applied to both populations. The corresponding variation of the β’s involving an equal restrictive factor of ζ in the contacts (or related social distancing measures) for both young and old populations is shown in Fig. 6. In turn, the TableV shows the effect on both deaths and cumulative infections of these restrictive measures applied equally to both populations (this can also be compared to Table II where this was applied to the single-age, full population model). This is for the period from August 10 to September 10, i.e., for an interval of one month. Figure 10 presents a longer time scale perspective of these mitigation effects, extending well past September 10. Finally, a complementary view of the potential effect of measures can be observed in Fig. 11. Here the number of potentially fewer deaths as predicted by the model is presented both in the case of the single age model, as well as in that of the two-age model, both for restriction of just the older population (red curve) and for that of older and younger populations (blue curve). It can generally be seen that the structured form of the two population models predicts fewer avoided deaths in comparison to the less structured one-age mode. More concretely, from the Table V, we notice that within the month of interest (August 10 to September 10), the decrease in infections is by around 22000 for ζ =0.9 or by close to 100000 for ζ =0.5. These numbers are comparable to the ones of the single age model. However, as concerns the number of deaths the situation is less promising as the decrease in number of deaths over the same time frame is by less than 500 for ζ =0.9, while for ζ =0.5, the decrease in deaths is by 2174, i.e., well below the single population prediction. Nevertheless, these predictions of the model and comparisons between the restrictive measures for a single (older) population group vs. ones for the entire population, we hope may be informative towards imminent public health considerations.
Two-population model: number of cases (left) and of deaths (right) found by minimizing norm (8). The younger (than 65 years of age) population is shown in the top panels and the older (than 65) one in the bottom panels. While the corresponding fatalities are somewhat comparable, recall that there is a far more significant susceptible population in the former category in comparison to the latter one.
We also calculated R0 for the two-population version of our model. In this case,
where the explicit expressions of
are the given in the Appendix. We note, however, that
,
are identical to (4) in the case when the old and young groups, respectively, are isolated.
correspond to the case when the young (old) age group comes into contact with only old (young) infectious hosts. This form of community R0 has been found in other contexts, such in HIV spread using a multi-group model [33]. Based on the parameter values found through our optimization procedure, it holds
and R0 =2.0615. We note that this is remarkably close to the R0 of the one-population version of our model. The effective reproductive number at the beginning of social distancing measures (t = tq) is Re =1.5653 with

Upon imposition of the relevant measures, e.g., for ζ =0.7 or ζ =0.5, we have confirmed that both in the case of constraints only for the older population or in that of restrictive measures both the older and the younger population, it is true that Re < 1.
Two-population model. Predictions for C and D at September 10 for parameters found by fitting the norm (8) and measures are applied to the older (than 65) population, i.e. ζo =1 and ζy ≡ ζ
Same as Fig. 5 but for the two-population model when measures are applied to the older (than 65) population, i.e. ζy =1 and ζo ≡ ζ. Notice the significant deviation of both the number of infections (left) and of the number of deaths (right), both for the younger (than 65) population and for the older (than 65) one between the current policy (ζ =1) and the suggested social-distanced (by a factor of 0.5 to 0.9) in the cases shown.
Two-population model: Time dependence of βSI and βSA under the existing policy (ζ =1) and under the effect of additional (suggested) lockdown measures when applied to both populations (i.e. ζo = ζy ≡ ζ).
Two-population model. Predictions for C and D at September 10 for parameters found by fitting the norm (8) when the measures are applied to both populations (i.e., ζo = ζy ≡ ζ)
IV. CONCLUSIONS AND FUTURE CHALLENGES
In the present work, we have proposed a model for the examination and fitting of the progression of the pandemic of COVID19, with an aim towards applications to data given for different countries. The model has been developed to include susceptible individuals, turning to exposed upon an infectious interaction. Subsequently, these are split to presymptomatic or asymptomatic after a latent period. The asymptomatics can only recover, while the presymptomatics are led to an onset of symptoms. Subsequently, this can lead to either recovery or the need for hospitalization, and the latter again can either lead to recovery or to fatality. The model naturally involves assumptions that are not always met. E.g., in some cases people may die before getting a chance to be hospitalized. In other settings where there is heavy testing involved, asymptomatics may be counted in the reported infections: while this is not an explicit assumption of the model, we did not count any fraction of asymptomatics in the cumulative infections when presenting the relevant comparison. Additionally, also, some of the hospitalized individuals may transmit the virus (e.g. to medical personnel) despite the much more substantial health and safety protocols applicable within hospitals. In any event, we consider these features to be the exception rather than the rule and hence have excluded them from our more mainstream considerations.
Same as Fig. 8 but now with the restrictive measures applied from August 10 onward to both populations (i.e. ζo = ζy ≡ ζ).
Number of deaths that could be saved in the period Aug. 10 – Sep. 10 if the different measures discussed herein are applied.
Within the realm of the model, we have exposed the meaning of the relevant parameters (e.g., transmission rates, latent and incubation times, fractions of asymptomatics vs. presymptomatics, of hospitalized vs. directly recovered, and of recovered vs. dying individuals at the hospital; also the time scales of the latter partitions were considered). I.e., we have attempted to assign an epidemiological meaning to our different parameters and to examine the associated results of the optimal fit of these parameters to the data from a specific time series to illustrate the “reasonable” nature of the findings. Notice that in addition to explaining the fitting process (to either deaths or deaths and cumulative infections), we have taken the approach of using a minimal number of assumptions to avoid constraining the system to the degree possible. We have also illustrated how the model can be partitioned to different age groups, based on the data that may be available for the country or region of interest. Here, we have opted to consider the simplest partition to 2-age models, yet while tedious, it is structurally straightforward (and of some interest in its own right) to generalize considerations to many age group models.
As our prototypical illustration of choice, we have used the case of data from Mexico which have been available through [28]. This is a case where a significant number of cases has arisen and the consideration of potential further lockdown measures is an important topic of ongoing debate. Indeed, our findings suggest that the present measures appear not to be sufficient to mitigate the catastrophic consequences of the pandemic, since the current value of the basic reproductive number of the epidemic is R0 > 1 and hence the situation appears to need further mitigation measures and strategies to avoid significant loss of life. In that vein, we have discussed in the realm of the model what consequences different measures would have at the level of deaths and of cumulative infections. It was found, e.g., that a reduction of transmission rates by a factor of around 1/2 via social distancing or related measures would lead to a nontrivial curbing of the rampant growth of both C(t) and D(t). The relevant reductions could of the order of 100000 in terms of infections and of more than 2000 in terms of deaths in the interval of the next 30 days alone. While our results are only suggestive (and relevant with the context of the model), we hope that they may be of some value towards policy considerations in the near future.
Naturally, there are numerous directions that are worthwhile to consider towards extensions of the present work. A natural feature of many of the models (e.g., associated with the US [15], but also elsewhere) is the incorporation of uncertainty. We are currently in the process of buidling into the formulation an uncertainty quantification framework on the basis of polynomial chaos considerations [34] and the use of suitable distributions for quantities such as the transmission rates. Constructing a robust such framework would be of considerable value to models such as the one proposed herein. In addition, as indicated in numerous cases, there are data broken down by age groups (e.g., not only for Mexico, but also for other countries such as Portugal [35], etc.). Clearly, a generalization of the model that considers the data by decade would offer a more complete and systematic picture of the impact of COVID-19 to different sub-populations and hence their potential risk. This would offer, in turn, a clearer picture of which age groups to attempt to protect and would be worthwhile (even if somewhat cumbersome). Lastly, as different countries are delving into a re-opening exercise, the formulation of a meta-population model with different hubs and a quantification of the transport coefficients between these [36] would be central towards going beyond the well-mixed assumption and factoring in a spatial structure and transportation features within the model. Such directions are currently under active investigation and will be reported in future work.
Disclaimer
The views expressed in this manuscript are purely those of the authors and may not, under any circumstances, be regarded as an official position of the European Commission.
Acknowledgments
This material is based upon work supported by the US National Science Foundation under Grants No. DMS-1815764 (ZR), PHY-1602994, and DMS-1809074 (PGK). PGK also acknowledges support from the Leverhulme Trust via a Visiting Fellowship and thanks the Mathematical Institute of the University of Oxford for its hospitality during this work.
Appendix A: R0 calculations
1. One-population model
We use the next generation matrix approach to find R0 [31]. We define the relevant vectors:

We then focus on the 5 infectious/infected compartments (E, P, A, I, H) and ignore the rest (S, AR, R, D). We find the Jacobians of ,
with respect to E, P, A, I, H in the order in which they appear. This will yield two 5 × 5 matrices:


The basic reproductive number is the spectral radius of FV−1 which in our case is

The first term is due to the presymptomatic hosts P, the second due to the asymptomatic hosts A, and the last one due to the symptomatic infectious hosts I. In each term, the numerator yields the rate of new infections βSAS*, βSI S* and this is then multiplied with the average duration of the stay in that infectious stage ,
,
. Each term is multiplied with the fraction of hosts in that stage/state ϕ for asymptomatics and 1 − ϕ for symptomatics.
2. Two-population model
Using again the next generation matrix approach results in the following two matrices:


Since V is a block matrix it holds
from which is readily follows that the spectral radius of FV−1 is given by

and
are the basic reproductive numbers in the young and old age groups, respectively, if they were completely isolated of each other.
is the basic reproductive number if susceptible young hosts come into contact with only old infectious hosts, and
is the basic reproductive number if old hosts come into contact with only young infectious hosts.