Frailty variation models for susceptibility and exposure to SARS-CoV-2

M. Gabriela M. Gomes; Marcelo U. Ferreira; Maria Chikina; Wesley Pegden; Ricardo Aguas

doi:10.1101/2021.05.25.21257766

Summary

Individual variation in susceptibility and exposure is subject to selection by force of infection, accelerating the natural acquisition of immunity, and reducing herd immunity thresholds and epidemic final sizes. This is a manifestation of a wider population phenomenon known as “frailty variation” in demography. Despite this theoretical understanding, public health policies continue to be guided by mathematical models that leave out most of the relevant variation and as a result inflate projected infection burdens. Here we focus on the trajectories of the coronavirus disease (COVID-19) pandemic in England and Scotland. We fit models to series of daily deaths and estimate relevant epidemiological parameters, including coefficients of variation which we find in agreement with direct measurements based on published contact surveys. Our estimates are robust to whether the data series encompass one or two pandemic waves. We conclude that herd immunity thresholds are being reached with a larger contribution of vaccination in Scotland than in England, where naturally acquired immunity is higher. These results are relevant to global vaccination policies.

1. Introduction

Almost 100 years ago, Kermack and McKendrick (Kermack and McKendrick 1927) and McKendrick (McKendrick 1939) fitted susceptible-infected-recovered (SIR) models to observed epidemics and alerted for the simplifying assumption “that all infected persons are equivalent, and that all susceptible persons are equally liable to acquire infection” (McKendrick 1939). In their fittings they adjust not only transmission parameters but also the size of the susceptible and exposed population at epidemic onset. Susceptible and exposed population sizes needed to be adjusted so their homogeneous models could fit the data.

Thirty years later, Gart (Gart 1968) admited that “it is difficult to define exactly the size of the population of susceptible hosts. In this instance the difficulty is associated with the heterogeneous nature of the population”. The author divided the population in two groups, depending on their history of infection, and allowed much greater susceptibility in the group with no history. This did not seem sufficient to provide good fit to observed epidemics as the author adds “we assume that the first group is a homogeneous group of susceptibles, while the second is actually a mixture of immune and susceptible individuals”. In (Gart 1971) the author extend the model to several susceptibility groups, and, more than a decade later decade later, (Ball 1985) compared a model with several susceptibility groups with the homogeneous version and described how homogeneity increases epidemic size. (Coutinho et al. 1999) expand the formalisms but conclude that “at present practical applications might be difficult”. (Pastor-Satorras and Vespignani 2001) developed related formalisms to describe epidemics on contact networks.

Meanwhile, frailty variation had been formalised in demography (Vaupel et al. 1979) and introduced in practice in survival analysis (Aalen 1988) and non-communicable disease epidemiology (Aalen et al. 2015) to improve model fits and interpretation.

On the experimental front, (Dwyer et al. 1997) measured nonlinear relationships between transmission and densities of susceptible hosts, implying that the bilinear term in the classical SIR model may not be appropriate. The authors attributed this non-linearity in transmission to heterogeneity in host susceptibility to infection which they estimated from the shapes of dose-response curves.

(Finkenstadt and Grenfell 2000) fitted a model with nonlinear relationships between transmission and density of susceptible hosts to an observed epidemic and estimated the exponent which they interpreting as representing heterogeneity in mixing. (Novozhilov 2008) derived the expressions for the exponents from explicit distributions of susceptibility.

Here we build on this history and analyse the coronavirus disease (COVID-19) with frailty variation models. The study is focused on England and Scotland, where infection was first detected in early 2020.

We use susceptible-exposed-infected-recovered (SEIR) models (Diekmann et al. 2013) incorporating distributions of individual susceptibility or exposure to infection (Ball 1985; Coutinho et al. 1999; Gomes et al. 2020; Katriel 2012; Novozhilov 2008). We use Bayesian inference to estimate the model parameters by fitting series of deaths while accounting for the combined effects of non-pharmaceutical interventions (NPIs), behavioural change, seasonality and viral evolution. We estimate coefficients of variation which are in agreement with direct measurements based on contact-pattern studies, e.g., (Mossong et al. 2008; Hens et al. 2009; Willem et al. 2012). We show that individual variation in susceptibility or exposure to infection can significantly affect model projections and should be accounted for to describe the dynamics of SARS-CoV-2.

2. Mathematical models

The basic compartmental SEIR model describing the transmission dynamics of SARS-CoV-2 is represented diagrammatically in Fig. 1. Following (Gomes et al. 2020) the model accounts for individual variation in susceptibility or exposure to infection.

Fig. 1.

Susceptible-exposed-infected-recovered (SEIR) compartmental model representing the transmission dynamics of SARS-CoV-2 in a heterogeneous host population.

2.1. Individual variation in susceptibility to infection

Let x denote the individual susceptibility to infection in relation to the mean, which we describe by a gamma distribution q(x) with mean ∫ xq(x)dx = 1 and parametrised by a coefficient of variation, . Susceptible individuals, S(x), become exposed at a rate that depends on their susceptibility x and on the average force of infection λ which accounts for the total number of infectious individuals in the population over time. Upon exposure, susceptible individuals enter an incubation phase, E(x), during which they gradually become infectious (Wei et al. 2020; To et al. 2020; Arons et al. 2020; He et al. 2020). The infectiousness in this phase is made to be half that in the following stage (ρ = 0.5), to which individuals transit within an average of 5.5 days (δ = 1/5.5) (McAloon et al. 2020). The fully infectious state is denoted by I(x). Infected individuals are eventually removed, on average approximately 4 days after becoming fully infectious (γ = 1/4) (Nishiura et al. 2020; Lauer et al. 2020; Li et al. 2020). A small fraction ϕ(x) die to COVID-19 while the remaining majority recover into R(x) where they are noninfectious and temporarily resistant to reinfection due to acquired immunity. The model is represented diagrammatically in Fig. 1 and mathematically by the infinite system of ordinary differential equations: The average force of infection upon susceptible individuals in a population of size N and transmission coefficient β is defined by: An epidemic is simulated by introducing a small seed of infectious individuals in a susceptible population. Initial growth of infected numbers is near exponential but decelerates as individuals are removed from the susceptible pool by infection and immunity. With variation in susceptibility, highly susceptible individuals tend to be infected earlier, leaving behind a residual pool of lower mean susceptibility. This selective depletion intensifies the deceleration of epidemic growth and gives an efficient head start to the acquisition of population immunity. Eventually the epidemic will subside and the herd immunity threshold, defining the percentage of the population that needs to be immune to reverse epidemic growth and prevent future waves, is lower when variation in susceptibility is higher.

The basic reproduction number, defined as the number of infections caused by an average infected individual in a totally susceptible population, is written for system (1)-(4) with force of infection (5) as: This is a crude indicator of early transmissibility but its use quickly becomes cumbersome. Several factors, such as NPIs, human behaviour, seasonality and viral evolution, affect ℛ₀ in a time-dependent manner. We denote the resulting quantity by ℛ_c(t) = c(t) ℛ₀, where c(t) > 0 describes the basic risk of infection at time t in relation to baseline.

In the estimation of ℛ_c(t) we assume a profile for c(t) as illustrated in Fig. 2: T₀ is the time when ℛ₀ begins to show decrease due to behavioural change or seasonality; L₁ is the period of maximal contact restrictions due to lockdown (48 days in England, from 26 March to 12 May, and 66 days in Scotland, from 24 March to 28 May, 2020) and c₁ ≤ 1 is the value of ℛ_c(t) during L₁ in relation to the initial ℛ₀; T₁ is the time elapsed between T₀ and L₁, over which transmission is allowed to decrease linearly. After L₁, contact restrictions are progressively relaxed and we allow transmission to begin a linear increase such that c(t) reaches 1 in T₂ days, which may or may not be within the range of the study. Changes in other factors that affect transmission (such as seasonality or viral evolution) are inseparable from contact changes in this framework and are also accounted for by c(t).

Fig. 2.

Schematic illustration of factor c(t) representing the combined effects of NPIs, seasonality and viral evolution on the reproduction number.

The model will be used to analyse COVID-19 deaths recorded over approximately one year. Mathematically this is constructed as: Second and third lockdowns in the autumn and winter season are implemented more simply as a further reduction in transmission (by a factor c₂) over the stipulated time periods. Specifically: A finite version of system (1)-(4) and (5) can be derived exactly (Novozhilov 2008): If ℛ₀ had remained constant throughout the duration of the study, a herd immunity threshold (HIT) would be derived as in (Montalb’san et al. 2020): The expectation, however, is that ℛ₀ changes due to seasonal effects and viral evolution. These effects are currently inseparable from those of NPIs and behavioural change and, consequently, we cannot obtain a time-depend ℛ₀. Although our results for variable susceptibility models will be accompanied by HIT estimates calculated according to formula (12) we highlight that these refer to a virus as transmissible as the SARS-CoV-2 of early 2020. Once reliable estimates are available for evolving transmissibility, HIT estimates can be updated.

In reality, as infection spreads, the susceptible compartment S is depleted and recovered individuals populate compartment R where they are protected by acquired immunity. Eventually they lose that protection as immunity wanes or is evaded by new viral variants. This is omitted in this version of the model given our purpose to analyse data reported over one year when the frequency of reinfection has been relatively low in our study setting (Hall et al. 2021). In Supplementary Information, however, we formulate an extended model with reinfection to show that the addition does not change our conclusions.

2.2. Individual variation in exposure to infection

In a directly transmitted infectious disease, such as COVID-19, variation in exposure to infection is primarily governed by patterns of connectivity among individuals. We incorporate this in system (1)-(4) assuming that individuals mix at random (Pastor-Satorras and Vespignani 2001; Miller et al. 2012). In a separate study we developed an assortative mixing version of the model adopted here and did not find the results of interest to change (Aguas et al. 2020). Under random mixing and heterogeneous connectivity, the force of infection is written as The basic reproduction number is and ℛ_c(t) = c(t) · ℛ₀ is as above.

As with variable susceptibility, model (1)-(4) with variable exposure (13) can be reduced to a 3-dimensional system of ODEs: the effective reproduction number written as: and the herd immunity threshold derived as in (Montalb’san et al. 2020):

3. Data

We used publicly available epidemiological data from the UK coronavirus dashboard [https://coronavirus.data.gov.uk/] describing the unfolding of the SARS-CoV-2 epidemic, to estimate relevant transmission related parameters for the larger nations: England (56 million inhabitants) and Scotland (5.5 million). Namely, we collected datasets containing daily deaths (deaths within 28 days of positive test by date of death), , where k = 1 is the day when the cumulative moving average of death numbers exceeded 5 · 10⁻⁸ of the population in both nations (10 March 2020).

Model outputs would then be fitted to the raw series of daily deaths between 10 March 2020 and 1 February 2021 (n = 329 days in total) adopting an infection fatality ratio of 0.9% (Ward et al. 2021) throughout the study period and initial conditions: where η is the excess duration of a fatal infection relatively to non-fatal. In Supplementary Information we explore the sensitivity of the heterogeneous susceptibility results to changing the value of IFR.

4. Model fitting and parameter estimation

In order to preserve identifiability, we made five simplifying assumptions: (i) the infection fatality ratio (IFR) is constant throughout the study period; (ii) natural (seasonality and viral evolution) and interventional (NPI) modulators of the reproduction number are encapsulated into a single time varying parameter c(t) as illustrated in Fig. 2; (iii) excess transmission from critically ill stages is negligible; (iv) reinfection is negligible; (v) vaccine effects are negligible during the fitted period, which in section 5.1 ends on 1 February 2021 (less than 1% fully vaccinated in the UK) and in section 5.2 ends on 1 July 2020 (no vaccines were in use).

Parameter estimation was performed with the software MATLAB (MathWorks, Natick, MA) using the PESTO (Parameter EStimation TOolbox) package (Stapor et al. 2018). We assumed that the number of SARS-CoV-2 infections are Poisson distributed.

We try to reproduce the dynamics of COVID-19 deaths by estimating the set of parameters θ that maximises the log-likelihood (LL) of observing the daily numbers of reported deaths Y : in which y^E(k, θ) and y^S(k, θ) are the simulated model output numbers of COVID-19 deaths at day k in England and Scotland for the set of parameters θ, and are the numbers of daily reported deaths, and n is the total number of days included in the analysis.

The set of parameters to be estimated is: To ensure that the estimated maximum is a global maximum, we performed 50 multistart optimisations with initialisation parameters sampled from a Latin-Hypercube. The combination of parameters resulting in the maximal log-likelihood were used as a starting point for 100, 000 Markov Chain Monte-Carlo iterations. From the resulting posterior distributions, we extracted the median estimates for each parameter and the respective 95% credible intervals. We used uniformly distributed priors with wide ranges.

We apply the outlined fitting procedure using both heterogeneity models (specifically individual variation in susceptibility to infection and individual variation in exposure to infection) as well as a homogeneity model where we set the coefficient of variation to zero (CV = 0). The Akaike information criterion (AIC) is then applied to select the best fitting model.

5. Results and Discussion

5.1. Estimated parameters and herd immunity thresholds

Variable susceptibility, variable connectivity and homogeneous models were fitted to series of COVID-19 death reported in England and Scotland until 1 February 2021. The fits are shown in Figs. 3, 4 and 5 (fitted data points in green), and the estimated parameters in Table 1. Maximum log-likelihoods are also displayed, as well as AIC scores for model selection. We conclude that variable susceptibility and variable connectivity models are better supported by the data (lower AIC in bold) than the homogeneous model\as found previously (Aguas et al. 2020; Colombo et al. 2020). Variable connectivity is slightly better supported although the reality most likely combines the two forms of heterogeneity.

View this table:

Table 1.

Model parameters estimated by Bayesian inference based on daily deaths until 1 February 2021. Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores. Infection fatality ratio, IFR = 0.9%. ℋ₀, calculated from ℛ₀ and CV using formulas (12) or (19), as appropriate.

Fig. 3. SARS-CoV-2 transmission in England and Scotland with individual variation in susceptibility to infection.

Susceptibility factors implemented as gamma distributions. Modelled trajectories of COVID-19 deaths (black and red curves) and cumulative percentage infected (blue). Dots are data for daily reported deaths: fitted (green); posterior to fitted time period (yellow). Basic reproduction numbers under control (ℛ_c) are displayed on shallow panels underneath the main plots. Left panels represent fitted segments as solid curves and projected scenarios as dashed: without vaccination (black); with a vaccination programme that effectively immunises 8% of the unvaccinated population per month from February onwards (red). Right panels prolong those projections further in time assuming ℛ_c(t) = ℛ₀ (heavier curves) and explore additional vaccination scenarios (thin curves), from top to bottom (in % of unvaccinated population per month): 0.5, 1, 1.5, 2, 3, 4, 5, 6, 7 (black); 9, 10, 11, 12, 13, 14, 15, 16 (red). Inputed parameter values: δ = 1/4 per day; γ = 1/5.5 per day; ρ = 0.5; and infection fatality ratio IFR = 0.9%. Inicial basic reproduction numbers, coefficients of variation and control parameters estimated by Bayesian inference (estimates in Table 1). Fitted curves represent best fitting trajectories and shades are 95% credible intervals generated from 100, 000 posterior samples.

Fig. 4. SARS-CoV-2 transmission in England and Scotland with individual variation in exposure to infection.

Connectivity factors implemented as gamma distributions. Modelled trajectories of COVID-19 deaths (black and red curves) and cumulative percentage infected (blue). Dots are data for daily reported deaths: fitted (green); posterior to fitted time period (yellow). Basic reproduction numbers under control (ℛ_c) are displayed on shallow panels underneath the main plots. Left panels represent fitted segments as solid curves and projected scenarios as dashed: without vaccination (black); with a vaccination programme that effectively immunises 8% of the unvaccinated population per month from February onwards (red). Right panels prolong those projections further in time assuming ℛ_c(t) = ℛ₀ (heavier curves) and explore additional vaccination scenarios (thin curves), from top to bottom (in % of unvaccinated population per month): 0.5, 1, 1.5, 2, 3, 4, 5, 6, 7 (black); 9, 10, 11, 12, 13, 14, 15, 16 (red). Inputed parameter values: δ = 1/4 per day; γ = 1/5.5 per day; ρ = 0.5; and infection fatality ratio IFR = 0.9%. Inicial basic reproduction numbers, coefficients of variation and control parameters estimated by Bayesian inference (estimates in Table 1). Fitted curves represent best fitting trajectories and shades are 95% credible intervals generated from 100, 000 posterior samples.

Fig. 5. SARS-CoV-2 transmission in England and Scotland under homogeneity.

Modelled trajectories of COVID-19 deaths (black and red curves) and cumulative percentage infected (blue). Dots are data for daily reported deaths: fitted (green); posterior to fitted time period (yellow). Basic reproduction numbers under control (ℛ_c) are displayed on shallow panels underneath the main plots. Left panels represent fitted segments as solid curves and projected scenarios as dashed: without vaccination (black); with a vaccination programme that effectively immunises 8% of the unvaccinated population per month from February onwards (red). Right panels prolong those projections further in time assuming ℛ_c(t) = ℛ₀ (heavier curves) and explore additional vaccination scenarios (thin curves), from top to bottom (in % of unvaccinated population per month): 0.5, 1, 1.5, 2, 3, 4, 5, 6, 7 (black); 9, 10, 11, 12, 13, 14, 15, 16 (red). Inputed parameter values: δ = 1/4 per day; γ = 1/5.5 per day; ρ = 0.5; and infection fatality ratio IFR = 0.9%. Inicial basic reproduction numbers and control parameters estimated by Bayesian inference (estimates in Table 1). Fitted curves represent best fitting trajectories and shades are 95% credible intervals generated from 100, 000 posterior samples.

Given the estimated values for ℛ₀ and CV we derive the herd immunity thresholds by natural infection by applying formulas (12) or (19) as appropriate, obtaining ℋ₀ = 29% in England and ℋ₀ = 31% in Scotland. If population immunity was to be acquired by random vaccination alone, selection would not play a role and HIT would be given by 1 - 1/ℛ₀, resulting in 71% in England and 73% in Scotland. In reality, both natural infection and vaccination contribute to population immunity, and HIT will be somewhere in between. The homogeneous model suggests ℋ₀ = 73% in England and ℋ₀ = 75% in Scotland, in agreement with conventional expectations (Kwok et al. 2020).

We then prolong model trajectories (dashed curves) over another 4 months (until 1 June 2021) to compare with data beyond the fitted period (yellow). All models project more deaths than observed, as expected given that the UK initiated a mass vaccination campaign in late 2020 which should start impacting the epidemic by February 2021. To illustrate this we simulate the effective immunisation of 8% of the unvaccinated population per month from February onwards (crude approximation for the UK programme, where 32% were fully vaccinated in the 4-month period February-May) and depict the result by the red dashed curve in the figures. Agreement with data is visually good for the heterogeneous models but insufficient under homogeneity.

The most timely question is perhaps whether achieved population immunity is enough to prevent an exit wave as contact restrictions are lifted. To contribute towards an answers we plot the cumulative number of infections estimated by the model (blue) and find the percentage of the population infected by May 2021 to remain below HIT (more so in Scotland than in England). Hence without vaccination an exit wave might have been expected. To visualise its magnitude we include separate panels on the right where the model is run for 8 months, using as initial conditions the end conditions of the left panels and ℛ_c(t) = ℛ₀. This is done without vaccination (heavy black) and with vaccination (effective immunisation of 8% per month; heavy red). We explore additional vaccination scenarios (thin curves), from top to bottom (in % of unvaccinated population per month): 0.5, 1, 1.5, 2, 3, 4, 5, 6, 7 (black); 9, 10, 11, 12, 13, 14, 15, 16 (red). We find 1 - 2% effective immunisation by the vaccines per month to be sufficient to prevent the exit wave over the first 8 months, according to the heterogeneous models. This is in stark contrast with the homogeneous scenario, which suggest that only a vaccination programme twice as efficient as the current (e.g., approximately 16% effective immunisation per month) would prevent the exit wave.

In supplementary Information we show that these results are robust to changing IFR (to 0.7% and 1.1%) and including reinfection (with a risk of 0.1 (Hall et al. 2021) relative to the average risk of first infection). As expected, assuming a higher IFR results in lower HIT and assuming a lower IFR results in higher HIT. However, when we fit the model with a different IFR, all parameters readjust and the exit wave appears relatively invariant to IFR. The same happens when reinfection is included. The sensitivity analysis is presented for the heterogeneous susceptibility model but replication with heterogeneous connectivity and homogeneity showed similar robustness.

So far we have considered that ℛ_c(t) would always be contained below the original ℛ₀. Given current concerns that the virus may evolve towards higher transmissibility, such as the emergence of the B.1.1.7 variant (Volz et al. 2021; Davies et al. 2021; Richard et al. 2021), we replicate the above exploration of exit scenarios assuming 50% higher transmissibility (ℛ_c(t) = 1.5 · ℛ₀) (Fig. 6).

Fig. 6. Projected SARS-CoV-2 trajectories with 50% increase in ℛ₀ in relation to that estimated for the early phase of the pandemic.

Assuming ℛ_c(t) = 1.5 ℛ₀: without vaccination (black heavy curve) and with a vaccination programme that effectively immunises 8% of the unvaccinated population per month from February onwards (red heavy curve). Thin lines explore additional vaccination scenarios, from top to bottom (in % of unvaccinated population per month): 0.5, 1, 1.5, 2, 3, 4, 5, 6, 7 (black); 9, 10, 11, 12, 13, 14, 15, 16 (red). Inputed parameter values: δ = 1/4 per day; γ = 1/5.5 per day; ρ = 0.5; and infection fatality ratio IFR = 0.9%. (a) Individual variation in susceptibility to infection. (c) Individual variation in exposure to infection. (c) Homogeneous susceptibility and exposure to infection.

5.2. Parameter estimation early in the pandemic

In a pandemic it is important to estimate model parameters early when data series are relatively short. To test the suitability of our methods for that task, we apply the fitting procedure to series of COVID-19 deaths in England and Scotland until 1 July 2020, as Europe was just recovering from the first wave (Fig. 7 and Table 2). The results are remarkably similar to those obtained with longer series above (Figs. 3, 4. 5 and Table 1). As before the heterogeneous models are selected as better models than the homogeneous as they have lower AIC. However the difference is not as great as with larger series and in some situations it may not be possible to discriminate.

View this table:

Table 2.

Model parameters estimated by Bayesian inference based on daily deaths until 1 July 2020. Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores. Infection fatality ratio, IFR = 0.9%. ℋ₀, calculated from ℛ₀ and CV using formulas (12) or (19), as appropriate.

Fig. 7. Model-based estimates based on first wave of SARS-CoV-2 pandemic.

Modelled trajectories of COVID-19 deaths (black) and cumulative percentage infected (blue). Dots are data for daily reported deaths. Basic reproduction numbers under control (ℛ_c) are displayed on shallow panels underneath the main plots. Inputed parameter values: δ = 1/4 per day; γ = 1/5.5 per day; ρ = 0.5; and infection fatality ratio IFR = 0.9%. Inicial basic reproduction numbers, coefficients of variation and control parameters estimated by Bayesian inference (estimates in Table 2). Fitted curves represent best fitting trajectories and shades are 95% credible intervals generated from 100, 000 posterior samples. (a) Individual variation in susceptibility to infection. (c) Individual variation in exposure to infection. (c) Homogeneous susceptibility and exposure to infection.

As an alternative we run the same fits assuming two scenarios for a fixed ramp of contact restriction relaxation out of lockdown (i.e., T₂ fixed). The first is motivated by Table 2, where we find the estimated T₂ to be very large or, equivalently, the ramp to be practically horizontal. Hence we run a scenario where ℛ_c(t) remains strictly horizontal until the end of the fitting period. The support for the heterogeneous models becomes stronger and the estimated parameters (Table 3) remain similar to when T₂ was estimated (Table 2). Second we assume a slope such that if maintained ℛ_c(t) would intersect the original ℛ₀ in 120 days (i.e., T₂ = 120 days). In this case there is also strong support for the heterogeneous models but the estimated coefficients of variation are larger which is reflected in lower HIT (Table 4).

View this table:

Table 3.

Model parameters estimated by Bayesian inference based on daily deaths until 1 July 2020, assuming constant ℛ_c(t) from the first lockdown onwards (i.e. T₂ → ∞). Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores. Infection fatality ratio, IFR = 0.9%. ℋ₀, calculated from ℛ₀ and CV using formulas (12) or (19), as appropriate.

View this table:

Table 4.

Model parameters estimated by Bayesian inference based on daily deaths until 1 July 2020, assuming that after the first lockdown ℛ_c(t) begins a gradual return to the baseline ℛ₀ at a fixed rate (T₂ = 120 days in this case). Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores. Infection fatality ratio, IFR = 0.9%. ℋ₀, calculated from ℛ₀ and CV using formulas (12) or (19), as appropriate.

This suggests that in the eventually of future pandemics these models can be used with similar confidence after one or two waves. Although earlier applications might benefit more from knowledge of time-dependent effects of contact restrictions on transmission this information does not appear critical.

5.3. Coefficients of variation from contact surveys

Contact patterns provide one of the easiest sources of heterogeneity to study directly. One approach is to use large-scale diary experiments to collect self-reported logs of close or physical contact from study participants. In this section we show data from several of these contact-pattern studies, listed in Table 5. In Fig. 8 we show gamma-fits for the contact distribution of each study on a log scale, along with the CV for each empirical distribution. These empirically measured contact distributions reveal CV between 0.7 and 1.5 (depending on study and setting).

View this table:

Table 5.

Contact pattern studies.

Fig. 8. Gamma fits for the included contact surveys.

In addition to the magnitude of individual variation in connectivity, the time scale of that variation is another important determinant of the effect of selection in accelerating the acquisition of population immunity (Tkachenko et al. 2021). The referenced studies typically report contact patterns for individuals over a very short (e.g., 1-day) period which is insufficient for assessing persistence of the measured variation. One of the studies (Hens et al. 2009) made an extra step and measured contact patterns for each individual on two different days (one weekday and one weekend day). In Fig. 9, we show fits for contact patterns by these two days individually, and for the average. The CV for the contact heterogeneity that persists over the two days is approximately 1.1 (in contrast with the larger 1.4 or 1.6 for each day alone).

Fig. 9. Persistence of contact heterogeneity.

When averaged over two separate diary days the contact distribution CV is 17-28% lower.

For a heterogeneous model of an epidemic which unfolds over a timescale like a year, local dynamics are driven by assumptions about heterogeneity in short-term (e.g., week-long) averages in contact patterns, while global dynamics depend on assumptions about persistence of heterogeneity in those short-term averages over the timescale of the simulation. However, these long-term averages are frequently not evaluated directly by contact diary experiments.

One way to estimate persistent contact heterogeneity from below is to bin contact data by age groups. For example, heterogeneity in contact patterns which persists in contact data after binning in 5-year age groups represents population-level heterogeneity in contact patterns that is persistent on multi-year time scales. Of course, this approach only captures heterogeneity mediated by age; i.e., it would only capture the full extent of heterogeneity in contact patterns if there was no within-age-group variation in contact patterns. As such we should expect age-binned contact data to underestimate the level of persistent heterogeneity in contact patterns (Britton et al. 2020), perhaps quite substantially. In Fig. 10 we show the effect on CV of binning the data from the studies in Table 5 in 1- and 10-year age groups, respectively. Generally, binning by larger periods tends to reduce heterogeneity substantially.

Fig. 10. Binning by age.

There have also been detailed studies that specifically measure mobility which suggest higher levels of individual variation in number of contacts (Eubank et al. 2004).

An entirely different approach to measure heterogeneity is to trace contacts of infected individuals and count how many secondary infections each has caused. Adam et al (Adam et al. 2020) conducted such contact tracing in Hong Kong and estimated a coefficient of variation of 2.5. This is expected to measure more variation as it captures infectiousness as well. The question remains as how persistent this variation is and hence how responsive to selection.

Our model-based inference of persistent heterogeneity based on epidemic trajectories indicates CV = 1.1 for England and Scotland (Table 1, heterogeneous connectivity) which appears generally consistent with what is currently available from empirical studies. This level of individual variation lies above that inputed in common epidemic models (whether compartmental or individual-based) that implement contact heterogeneity reduced to 5-year averages and below estimates based on contact tracing which combine the effects of heterogeneity in infectiousness.

6. Conclusion

For over a century, mathematical epidemiologists have realised that individual variation in susceptibility and exposure to infection are key determinants of the shape of epidemic curves. Models that underrepresent these forms of variation tend to overpredict epidemic sizes and consequently inflate the effects attributed to control measures. Infectious disease models can include many interacting processes, informed by many data sources, to aid understanding of epidemic dynamics but complexity does not necessarily make them more suitable for predictive purposes. Predictive ability in population dynamics is critically dependent on the complete account of forms of heterogeneity that are under selection. In infectious disease the most impactful selection is that exerted by the force of infection on individual susceptibility and exposure.

With the aim of capturing heterogeneity in full we opted for simple model formalisms with inbuilt distributions of susceptibility or exposure in such a way that we could seek to estimate coefficients of variation by fitting to epidemic curves. We have used the COVID-19 pandemic to test the approach as epidemic trajectories unfold. We estimate final sizes of unmitigated epidemics to be less than half those predicated by studies based on homogeneous models, which is naturally linked to lower herd immunity thresholds. Homogeneous models with realistic vaccination rates also fail to reproduce the low levels of infection and deaths registered over recent months in England and Scotland.

We estimate coefficients of variation consistent with a panoply of empirical studies of contact patterns resulting in herd immunity thresholds by natural infection around 30%, in England and Scotland. Given that vaccination is also contributing to immunisation, herd immunity thresholds are reached with lower infected percentages. Based on forward simulations with a range of vaccination scenarios we conclude HIT is close to being achieved in both nations, with a lower infection burden and greater contribution of vaccination in Scotland. Several countries in Europe and America are in similar situations (Washburne et al. 2021) and may be in a comfortable position to redirect vaccines to more susceptible countries.

Data Availability

All data referred to in the manuscript are publicly available.

https://coronavirus.data.gov.uk/

Acknowledgements

We thank Rodrigo Corder and Antonio Montalbásan for valuable discussions throughout this study.

Footnotes

E-mail: muferrei{at}usp.br
E-mail: mchikina{at}pitt.edu
E-mail: wes{at}math.cmu.edu
E-mail: Ricardo{at}tropmedres.ac

References

↵
Aalen, O. O. (1988) Heterogeneity in survival analysis. Stat. Med., 7, 1121–1137.
OpenUrl CrossRef PubMed Web of Science
↵
Aalen, O. O., Valberg, M., Grotmol, T. and Tretli, S. (2015) Understanding variation in disease risk: the elusive concept of frailty. Int. J. Epidemiol., 4, 1408–1421.
OpenUrl
↵
Adam, D. C., Wu, P., Wong, J. Y., Lau, E. H. Y., Tsang, T. K., Cauchemez, S., et al. (2020) Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat. Med., 26, 1714–1719.
OpenUrl PubMed
↵
Aguas, R., Corder, R. M., King, J. G., Gonçalves, G., Ferreira, M. U. and Gomes, M. G. M. (2020) Herd immunity thresholds for SARS-CoV-2 estimated from unfolding epidemics. Preprint medRxiv: doi:10.1101/2020.07.23.20160762.
OpenUrl Abstract/FREE Full Text
↵
Arons, M. M., Hatfield, K. M., Reddy, S. C., Kimball, A., James, A., Jacobs, et al. (2020) Public Health–Seattle and King County and CDC COVID-19 Investigation Team. Presymptomatic SARS-CoV-2 infections and transmission in a skilled nursing facility. N. Engl. J. Med., 382, 2081–2090.
OpenUrl CrossRef PubMed
↵
Ball, F. (1985) Deterministic and stochastic epidemic models with several kinds of susceptibles. Adv. Appl. Probab., 17, 1–22.
OpenUrl
B’ seraud, G., Kazmercziak, S., Beutels, P., Levy-Bruhl, D., Lenne, X., Mielcarek, N., et al. (2015) The French connection: the first large population-based contact survey in France relevant for the spread of infectious diseases. PLOS One, 10, e0133203.
OpenUrl CrossRef PubMed
↵
Britton, T., Ball, F. and Trapman, P. (2020) A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. PLOS One, 369, 846–849.
OpenUrl
↵
Colombo, M., Mellor, J., Colhoun, H. M., Gomes, M. G. M. and McKeigue, P. M. (2020) Trajectory of COVID-19 epidemic in Europe. Preprint medRxiv: doi:10.1101/2020.09.26.20202267.
OpenUrl Abstract/FREE Full Text
↵
Coutinho, F. A. B., Massad, E., Lopez, L. F., Burattini, M. N., Struchiner, C. J. and Azevedo-Neto, R. S. (1999) Modelling heterogeneities in individual frailties in epidemic models. Math. Comput. Model., 30, 97–115.
OpenUrl CrossRef
↵
Davies, N. G., Abbott, S., Barnard, R. C., Jarvis, C. I., Kucharski, A. J., Munday, J. D., et al. (2021) Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science, 372, eabg3055.
OpenUrl Abstract/FREE Full Text
↵
Diekmann, O., Heesterbeek, H. and Britton, T. (2013) Mathematical Tools for Understanding Infectious Disease Dynamics. Princeton University Press, Princeton, New Jersey.
Dodd, P. J., Looker, C., Plumb, I. D., Bond, V., Schaap, A., Shanaube, K., et al. (2015) Age-and sex-specific social contact patterns and incidence of Mycobacterium tuberculosis infection. Am. J. Epidemiol., 2, 156–166.
OpenUrl
↵
Dwyer, G., Elkinton, J. S. and Buonaccorsi, J. P. (1997) Host heterogeneity in susceptibility and disease dynamics: Tests of a mathematical model. Am. Nat., 150, 685–707.
OpenUrl CrossRef PubMed Web of Science
↵
Eubank, S., Guclu, H., Kumar, V. S. A., Marathe, M. V., Srinivasan, A., Toroczkai, Z., et al. (2004) Modelling disease outbreaks in realistic urban social networks. Nature, 429, 180–184.
OpenUrl CrossRef PubMed Web of Science
↵
Finkenstadt, B. F. and Grenfell, B. T. (2000) Time series modelling of childhood diseases: a dynamical systems approach. Appl. Statist., 49, 187–205.
OpenUrl
↵
Gart, J. J. (1968) The mathematical analysis of an epidemic with two kinds of susceptibles. Biometrics, 24, 557–566.
OpenUrl CrossRef PubMed Web of Science
↵
Gart, J. J. (1971) The statistical analysis of chain-binomial epidemic models with several kinds of susceptibles. Biometrics, 28, 921–930.
OpenUrl
↵
Gomes, M. G. M., Corder, R. M., King, J. G., Langwig, K. E., Souto-Maior, C., Carneiro, J., et al. (2020) Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. Preprint medRxiv: doi:10.1101/2020.04.27.20081893.
OpenUrl Abstract/FREE Full Text
Grijalva, C. G., Goeyvaerts, N., Verastegui, H., Edwards, K. M., Gil, A. I., Lanata, C.F., et al. (2015) A household-based study of contact networks relevant for the spread of infectious diseases in the highlands of Peru. PLOS One, 10, e0118457.
OpenUrl CrossRef PubMed
↵
Hall, V. J., Foulkes, S., Charlett, A., Atti, A., Monk, E. J., Simmons, R., et al. (2021) SARS-CoV-2 infection rates of antibody-positive compared with antibody-negative health-care workers in England: a large, multicentre, prospective cohort study (SIREN). Lancet, 397, 1459–1469.
OpenUrl CrossRef
↵
He, X., Lau, E. H. Y., Wu, P., Deng, X., Wang, J., Hao, X., et al. (2020) Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med., 26, 672–675.
OpenUrl CrossRef PubMed
↵
Hens, N., Goeyvaerts, N., Aerts, M., Shkedy, Z., Van Damme, P. and Beutels, P. (2009) Mining social mixing patterns for infectious disease models based on a two-day population survey in Belgium. BMJ Infect. Dis., 9, 1–18.
OpenUrl
Horby, P., Thai, P. Q., Hens, N., Yen, N. T. T., Thoang, D. D., Linh, N. M., et al. (2011) Social contact patterns in Vietnam and implications for the control of infectious diseases. PLOS One, 6, e16965.
OpenUrl CrossRef PubMed
↵
Katriel, G. (2012) The size of epidemics in populations with heterogeneous susceptibility. J. Math. Biol., 65, 237–262.
OpenUrl CrossRef PubMed
↵
Kermack, W. O. and McKendrick, A. G. (1927) A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. A, 115, 700–721.
OpenUrl CrossRef
↵
Kwok, K. O., Lai, F., Wei, W. I., Wong, S. Y. S. and Tang, J. (2020) Herd immunity – estimating the level required to halt the COVID-19 epidemics in affected countries. J. Infect., 80, e32–e33.
OpenUrl CrossRef PubMed
↵
Lauer, S. A., Grantz, K. H., Bi, Q., Jones, F. K., Zheng, Q., Meredith, H. R., Azman, A. S., et al. (2020) The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med., 172, 577–582.
OpenUrl CrossRef PubMed
Leung, K., Jit, M., Lau, E. H. Y. and Wu, J. Y. (2017) Social contact patterns relevant to the spread of respiratory infectious diseases in Hong Kong. Sci. Rep, 7, 1–12.
OpenUrl CrossRef PubMed
↵
Li, Q., Guan, X., Wu, P., Wang, X., Zhou, L., Tong, Y., et al. (2020) Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med., 382, 1199–1207.
OpenUrl CrossRef PubMed
Litvinova, M., Liu, Q.-H., Kulikov, E. S., Evgeny, S. and Ajelli. M. (2019) Reactive school closure weakens the network of social interactions and reduces the spread of influenza. Proc. Natl. Acad. Sci. U.S.A., 27, 13174–13181.
OpenUrl
↵
McKendrick, A. G. (1939) The dynamics of crowd infection. Edinb. Med. J., 47, 117–136.
OpenUrl
Mahikul, W., Kripattanapong, S., Hanvoravongchai, P., Meeyai, A., Iamsirithaworn, S., Auewarakul, P., et al. (2020) Contact Mixing Patterns and Population Movement among Migrant Workers in an Urban Setting in Thailand. Int. J. Environ. Res. Public Health, 17, 2237.
OpenUrl
↵
McAloon, C., Collins, A., Hunt, K., Barber, A., Byrne, A. W., Butler, F., et al. (2020) Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ Open, 10, e039652.
OpenUrl Abstract/FREE Full Text
Melegaro, A., Del Fava, E., Poletti, P., Merler, S., Nyamukapa, C., Williams, J., et al. (2017) Social contact structures and time use patterns in the Manicaland Province of Zimbabwe. PLOS One, 12, e0170459.
OpenUrl CrossRef PubMed
↵
Miller, J. C., Slim, A. C. and Volz, E. M. (2012) Edge-based compartmental modelling for infectious disease spread. J. R. Soc. Interface, 9, 890–906.
OpenUrl CrossRef PubMed
↵
Montalb’ san, A., Corder, R. M. and Gomes, M. G. M. (2020) Herd immunity under individual variation and reinfection. Preprint arxiv:2008.00098v2.
↵
Mossong, J., Hens, N., Jit, M., Beutels, P., Auranen, K., Mikolajczyk, R., et al. (2008) Social contacts and mixing patterns relevant to the spread of infectious diseases. PLOS Med., 5, e74.
OpenUrl CrossRef PubMed
↵
Nishiura, H., Linton, N. M. and Akhmetzhanov, A. R. (2020) Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis., 93, 284–286.
OpenUrl CrossRef PubMed
↵
Novozhilov, A. S. (2008) On the spread of epidemics in a closed heterogeneous population. Math. Biosci., 215, 177–185.
OpenUrl CrossRef PubMed
↵
Pastor-Satorras, R. and Vespignani, A. (2001) Epidemic dynamics and endemic states in complex networks. Phys. Rev. E, 63, 066117.
OpenUrl
↵
Richard, D., Shaw, L. P., Lanfear, R., Acman, M., Owen, C.J., Tan, C. C. S., et al. (2021) A phylogeny-based metric for estimating changes in transmissibility from recurrent mutations in SARS-CoV-2. Preprint bioRxiv: doi:10.1101/2021.05.06.442903.
OpenUrl Abstract/FREE Full Text
↵
Stapor, P., Weindl, D., Ballnus, B., Hug, S., Loos, C., Fiedler, A., et al. (2018) PESTO: Parameter EStimation TOolbox. Bioinformatics, 34, 705–707.
OpenUrl
↵
Tkachenko, A. V., Maslov, S., Elbanna, A., Wong, G. N., Weiner, Z. J. and Goldenfeld, N. (2021) Time-dependent heterogeneity leads to transient suppression of the COVID-19 epidemic, not herd immunity. Proc. Natl. Acad. Sci. U. S. A., 118, e2015972118.
OpenUrl Abstract/FREE Full Text
↵
To, K. K., Tsang, O. T., Leung, W. S., Tam, A. R., Wu, T. C., Lung, D. C., et al. (2020) Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect. Dis., 20, 565–574.
OpenUrl CrossRef PubMed
↵
Vaupel, J. H., Manton, K. G. and Stallard. E. (1979) The impact of heterogeneity in individual frailty in the dynamics of mortality. Demography, 16, 439–454.
OpenUrl CrossRef PubMed Web of Science
↵
Volz, E., Mishra, S., Chand, M., Barrett, J. C., Johnson, R., Geidelberg, L., et al. (2021) Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature, doi:10.1038/s41586-021-03470-x.
OpenUrl CrossRef PubMed
↵
Ward, H., Atchison, C., Whitaker, M., Ainslie, K. E. C., Elliott, J., Okell, L., et al. (2021) SARS-CoV-2 antibody prevalence in England following the first peak of the pandemic. Nat. Commun., 12, 905.
OpenUrl CrossRef PubMed
↵
Washburne, A., Silverman, J., Lourenço, J. and Hupert, N. (2021) Analysis and visualization of epidemics on the timescale of burden: derivation and application of Epidemic Resistance Lines (ERLs) to COVID-19 outbreaks in the US. Preprint medRxiv: doi:10.1101/2021.05.03.21256542.
OpenUrl Abstract/FREE Full Text
↵
Wei, W. E., Li, Z., Chiew, C. J., Yong, S. E., Toh, M. P. and Lee, V. J. (2020) Presymptomatic transmission of SARS-CoV-2 - Singapore, January 23-March 16, 2020. MMWR Morb. Mortal. Wkly. Rep., 69, 411–415.
OpenUrl CrossRef PubMed
↵
Willem, L., Van Kerckhove, K., Chao, D.L., Hens, N. and Beutels, P. (2012) A nice day for an infectionã Weather conditions and social contact patterns relevant to influenza transmission. PLOS One, 7, e48695.
OpenUrl CrossRef PubMed
Zhang, J., Litvinova, M., Wang, W., Wang, Y., Deng, X., Chen, X., et al. (2020) Evolving epidemiology and transmission dynamics of coronavirus disease 2019 outside Hubei province, China: a descriptive and modelling study. Lancet Infect. Dis., 20, 793–802.
OpenUrl PubMed

View the discussion thread.

Posted July 25, 2021.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (322)
Allergy and Immunology (626)
Anesthesia (162)
Cardiovascular Medicine (2359)
Dentistry and Oral Medicine (286)
Dermatology (206)
Emergency Medicine (377)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (833)
Epidemiology (11748)
Forensic Medicine (10)
Gastroenterology (700)
Genetic and Genomic Medicine (3714)
Geriatric Medicine (348)
Health Economics (632)
Health Informatics (2384)
Health Policy (928)
Health Systems and Quality Improvement (892)
Hematology (340)
HIV/AIDS (776)
Infectious Diseases (except HIV/AIDS) (13297)
Intensive Care and Critical Care Medicine (767)
Medical Education (364)
Medical Ethics (104)
Nephrology (397)
Neurology (3477)
Nursing (197)
Nutrition (522)
Obstetrics and Gynecology (670)
Occupational and Environmental Health (661)
Oncology (1813)
Ophthalmology (534)
Orthopedics (218)
Otolaryngology (286)
Pain Medicine (232)
Palliative Medicine (66)
Pathology (445)
Pediatrics (1027)
Pharmacology and Therapeutics (426)
Primary Care Research (417)
Psychiatry and Clinical Psychology (3166)
Public and Global Health (6122)
Radiology and Imaging (1269)
Rehabilitation Medicine and Physical Therapy (742)
Respiratory Medicine (825)
Rheumatology (379)
Sexual and Reproductive Health (371)
Sports Medicine (320)
Surgery (398)
Toxicology (50)
Transplantation (171)
Urology (145)

[1] ↵
Aalen, O. O. (1988) Heterogeneity in survival analysis. Stat. Med., 7, 1121–1137.
OpenUrl CrossRef PubMed Web of Science

[2] ↵
Aalen, O. O., Valberg, M., Grotmol, T. and Tretli, S. (2015) Understanding variation in disease risk: the elusive concept of frailty. Int. J. Epidemiol., 4, 1408–1421.
OpenUrl

[3] ↵
Adam, D. C., Wu, P., Wong, J. Y., Lau, E. H. Y., Tsang, T. K., Cauchemez, S., et al. (2020) Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat. Med., 26, 1714–1719.
OpenUrl PubMed

[4] ↵
Aguas, R., Corder, R. M., King, J. G., Gonçalves, G., Ferreira, M. U. and Gomes, M. G. M. (2020) Herd immunity thresholds for SARS-CoV-2 estimated from unfolding epidemics. Preprint medRxiv: doi:10.1101/2020.07.23.20160762.
OpenUrl Abstract/FREE Full Text

[5] ↵
Arons, M. M., Hatfield, K. M., Reddy, S. C., Kimball, A., James, A., Jacobs, et al. (2020) Public Health–Seattle and King County and CDC COVID-19 Investigation Team. Presymptomatic SARS-CoV-2 infections and transmission in a skilled nursing facility. N. Engl. J. Med., 382, 2081–2090.
OpenUrl CrossRef PubMed

[6] ↵
Ball, F. (1985) Deterministic and stochastic epidemic models with several kinds of susceptibles. Adv. Appl. Probab., 17, 1–22.
OpenUrl

[7] B’ seraud, G., Kazmercziak, S., Beutels, P., Levy-Bruhl, D., Lenne, X., Mielcarek, N., et al. (2015) The French connection: the first large population-based contact survey in France relevant for the spread of infectious diseases. PLOS One, 10, e0133203.
OpenUrl CrossRef PubMed

[8] ↵
Britton, T., Ball, F. and Trapman, P. (2020) A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. PLOS One, 369, 846–849.
OpenUrl

[9] ↵
Colombo, M., Mellor, J., Colhoun, H. M., Gomes, M. G. M. and McKeigue, P. M. (2020) Trajectory of COVID-19 epidemic in Europe. Preprint medRxiv: doi:10.1101/2020.09.26.20202267.
OpenUrl Abstract/FREE Full Text

[10] ↵
Coutinho, F. A. B., Massad, E., Lopez, L. F., Burattini, M. N., Struchiner, C. J. and Azevedo-Neto, R. S. (1999) Modelling heterogeneities in individual frailties in epidemic models. Math. Comput. Model., 30, 97–115.
OpenUrl CrossRef

[11] ↵
Davies, N. G., Abbott, S., Barnard, R. C., Jarvis, C. I., Kucharski, A. J., Munday, J. D., et al. (2021) Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science, 372, eabg3055.
OpenUrl Abstract/FREE Full Text

[12] ↵
Diekmann, O., Heesterbeek, H. and Britton, T. (2013) Mathematical Tools for Understanding Infectious Disease Dynamics. Princeton University Press, Princeton, New Jersey.

[13] Dodd, P. J., Looker, C., Plumb, I. D., Bond, V., Schaap, A., Shanaube, K., et al. (2015) Age-and sex-specific social contact patterns and incidence of Mycobacterium tuberculosis infection. Am. J. Epidemiol., 2, 156–166.
OpenUrl

[14] ↵
Dwyer, G., Elkinton, J. S. and Buonaccorsi, J. P. (1997) Host heterogeneity in susceptibility and disease dynamics: Tests of a mathematical model. Am. Nat., 150, 685–707.
OpenUrl CrossRef PubMed Web of Science

[15] ↵
Eubank, S., Guclu, H., Kumar, V. S. A., Marathe, M. V., Srinivasan, A., Toroczkai, Z., et al. (2004) Modelling disease outbreaks in realistic urban social networks. Nature, 429, 180–184.
OpenUrl CrossRef PubMed Web of Science

[16] ↵
Finkenstadt, B. F. and Grenfell, B. T. (2000) Time series modelling of childhood diseases: a dynamical systems approach. Appl. Statist., 49, 187–205.
OpenUrl

[17] ↵
Gart, J. J. (1968) The mathematical analysis of an epidemic with two kinds of susceptibles. Biometrics, 24, 557–566.
OpenUrl CrossRef PubMed Web of Science

[18] ↵
Gart, J. J. (1971) The statistical analysis of chain-binomial epidemic models with several kinds of susceptibles. Biometrics, 28, 921–930.
OpenUrl

[19] ↵
Gomes, M. G. M., Corder, R. M., King, J. G., Langwig, K. E., Souto-Maior, C., Carneiro, J., et al. (2020) Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. Preprint medRxiv: doi:10.1101/2020.04.27.20081893.
OpenUrl Abstract/FREE Full Text

[20] Grijalva, C. G., Goeyvaerts, N., Verastegui, H., Edwards, K. M., Gil, A. I., Lanata, C.F., et al. (2015) A household-based study of contact networks relevant for the spread of infectious diseases in the highlands of Peru. PLOS One, 10, e0118457.
OpenUrl CrossRef PubMed

[21] ↵
Hall, V. J., Foulkes, S., Charlett, A., Atti, A., Monk, E. J., Simmons, R., et al. (2021) SARS-CoV-2 infection rates of antibody-positive compared with antibody-negative health-care workers in England: a large, multicentre, prospective cohort study (SIREN). Lancet, 397, 1459–1469.
OpenUrl CrossRef

[22] ↵
He, X., Lau, E. H. Y., Wu, P., Deng, X., Wang, J., Hao, X., et al. (2020) Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med., 26, 672–675.
OpenUrl CrossRef PubMed

[23] ↵
Hens, N., Goeyvaerts, N., Aerts, M., Shkedy, Z., Van Damme, P. and Beutels, P. (2009) Mining social mixing patterns for infectious disease models based on a two-day population survey in Belgium. BMJ Infect. Dis., 9, 1–18.
OpenUrl

[24] Horby, P., Thai, P. Q., Hens, N., Yen, N. T. T., Thoang, D. D., Linh, N. M., et al. (2011) Social contact patterns in Vietnam and implications for the control of infectious diseases. PLOS One, 6, e16965.
OpenUrl CrossRef PubMed

[25] ↵
Katriel, G. (2012) The size of epidemics in populations with heterogeneous susceptibility. J. Math. Biol., 65, 237–262.
OpenUrl CrossRef PubMed

[26] ↵
Kermack, W. O. and McKendrick, A. G. (1927) A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. A, 115, 700–721.
OpenUrl CrossRef

[27] ↵
Kwok, K. O., Lai, F., Wei, W. I., Wong, S. Y. S. and Tang, J. (2020) Herd immunity – estimating the level required to halt the COVID-19 epidemics in affected countries. J. Infect., 80, e32–e33.
OpenUrl CrossRef PubMed

[28] ↵
Lauer, S. A., Grantz, K. H., Bi, Q., Jones, F. K., Zheng, Q., Meredith, H. R., Azman, A. S., et al. (2020) The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med., 172, 577–582.
OpenUrl CrossRef PubMed

[29] Leung, K., Jit, M., Lau, E. H. Y. and Wu, J. Y. (2017) Social contact patterns relevant to the spread of respiratory infectious diseases in Hong Kong. Sci. Rep, 7, 1–12.
OpenUrl CrossRef PubMed

[30] ↵
Li, Q., Guan, X., Wu, P., Wang, X., Zhou, L., Tong, Y., et al. (2020) Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med., 382, 1199–1207.
OpenUrl CrossRef PubMed

[31] Litvinova, M., Liu, Q.-H., Kulikov, E. S., Evgeny, S. and Ajelli. M. (2019) Reactive school closure weakens the network of social interactions and reduces the spread of influenza. Proc. Natl. Acad. Sci. U.S.A., 27, 13174–13181.
OpenUrl

[32] ↵
McKendrick, A. G. (1939) The dynamics of crowd infection. Edinb. Med. J., 47, 117–136.
OpenUrl

[33] Mahikul, W., Kripattanapong, S., Hanvoravongchai, P., Meeyai, A., Iamsirithaworn, S., Auewarakul, P., et al. (2020) Contact Mixing Patterns and Population Movement among Migrant Workers in an Urban Setting in Thailand. Int. J. Environ. Res. Public Health, 17, 2237.
OpenUrl

[34] ↵
McAloon, C., Collins, A., Hunt, K., Barber, A., Byrne, A. W., Butler, F., et al. (2020) Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ Open, 10, e039652.
OpenUrl Abstract/FREE Full Text

[35] Melegaro, A., Del Fava, E., Poletti, P., Merler, S., Nyamukapa, C., Williams, J., et al. (2017) Social contact structures and time use patterns in the Manicaland Province of Zimbabwe. PLOS One, 12, e0170459.
OpenUrl CrossRef PubMed

[36] ↵
Miller, J. C., Slim, A. C. and Volz, E. M. (2012) Edge-based compartmental modelling for infectious disease spread. J. R. Soc. Interface, 9, 890–906.
OpenUrl CrossRef PubMed

[37] ↵
Montalb’ san, A., Corder, R. M. and Gomes, M. G. M. (2020) Herd immunity under individual variation and reinfection. Preprint arxiv:2008.00098v2.

[38] ↵
Mossong, J., Hens, N., Jit, M., Beutels, P., Auranen, K., Mikolajczyk, R., et al. (2008) Social contacts and mixing patterns relevant to the spread of infectious diseases. PLOS Med., 5, e74.
OpenUrl CrossRef PubMed

[39] ↵
Nishiura, H., Linton, N. M. and Akhmetzhanov, A. R. (2020) Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis., 93, 284–286.
OpenUrl CrossRef PubMed

[40] ↵
Novozhilov, A. S. (2008) On the spread of epidemics in a closed heterogeneous population. Math. Biosci., 215, 177–185.
OpenUrl CrossRef PubMed

[41] ↵
Pastor-Satorras, R. and Vespignani, A. (2001) Epidemic dynamics and endemic states in complex networks. Phys. Rev. E, 63, 066117.
OpenUrl

[42] ↵
Richard, D., Shaw, L. P., Lanfear, R., Acman, M., Owen, C.J., Tan, C. C. S., et al. (2021) A phylogeny-based metric for estimating changes in transmissibility from recurrent mutations in SARS-CoV-2. Preprint bioRxiv: doi:10.1101/2021.05.06.442903.
OpenUrl Abstract/FREE Full Text

[43] ↵
Stapor, P., Weindl, D., Ballnus, B., Hug, S., Loos, C., Fiedler, A., et al. (2018) PESTO: Parameter EStimation TOolbox. Bioinformatics, 34, 705–707.
OpenUrl

[44] ↵
Tkachenko, A. V., Maslov, S., Elbanna, A., Wong, G. N., Weiner, Z. J. and Goldenfeld, N. (2021) Time-dependent heterogeneity leads to transient suppression of the COVID-19 epidemic, not herd immunity. Proc. Natl. Acad. Sci. U. S. A., 118, e2015972118.
OpenUrl Abstract/FREE Full Text

[45] ↵
To, K. K., Tsang, O. T., Leung, W. S., Tam, A. R., Wu, T. C., Lung, D. C., et al. (2020) Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect. Dis., 20, 565–574.
OpenUrl CrossRef PubMed

[46] ↵
Vaupel, J. H., Manton, K. G. and Stallard. E. (1979) The impact of heterogeneity in individual frailty in the dynamics of mortality. Demography, 16, 439–454.
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Volz, E., Mishra, S., Chand, M., Barrett, J. C., Johnson, R., Geidelberg, L., et al. (2021) Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature, doi:10.1038/s41586-021-03470-x.
OpenUrl CrossRef PubMed

[48] ↵
Ward, H., Atchison, C., Whitaker, M., Ainslie, K. E. C., Elliott, J., Okell, L., et al. (2021) SARS-CoV-2 antibody prevalence in England following the first peak of the pandemic. Nat. Commun., 12, 905.
OpenUrl CrossRef PubMed

[49] ↵
Washburne, A., Silverman, J., Lourenço, J. and Hupert, N. (2021) Analysis and visualization of epidemics on the timescale of burden: derivation and application of Epidemic Resistance Lines (ERLs) to COVID-19 outbreaks in the US. Preprint medRxiv: doi:10.1101/2021.05.03.21256542.
OpenUrl Abstract/FREE Full Text

[50] ↵
Wei, W. E., Li, Z., Chiew, C. J., Yong, S. E., Toh, M. P. and Lee, V. J. (2020) Presymptomatic transmission of SARS-CoV-2 - Singapore, January 23-March 16, 2020. MMWR Morb. Mortal. Wkly. Rep., 69, 411–415.
OpenUrl CrossRef PubMed

[51] ↵
Willem, L., Van Kerckhove, K., Chao, D.L., Hens, N. and Beutels, P. (2012) A nice day for an infectionã Weather conditions and social contact patterns relevant to influenza transmission. PLOS One, 7, e48695.
OpenUrl CrossRef PubMed

[52] Zhang, J., Litvinova, M., Wang, W., Wang, Y., Deng, X., Chen, X., et al. (2020) Evolving epidemiology and transmission dynamics of coronavirus disease 2019 outside Hubei province, China: a descriptive and modelling study. Lancet Infect. Dis., 20, 793–802.
OpenUrl PubMed

Frailty variation models for susceptibility and exposure to SARS-CoV-2

Summary

1. Introduction

2. Mathematical models

2.1. Individual variation in susceptibility to infection

2.2. Individual variation in exposure to infection

3. Data

4. Model fitting and parameter estimation

5. Results and Discussion

5.1. Estimated parameters and herd immunity thresholds

5.2. Parameter estimation early in the pandemic

5.3. Coefficients of variation from contact surveys

6. Conclusion

Data Availability

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area