Herd immunity thresholds for SARS-CoV-2 estimated from unfolding epidemics

Ricardo Aguas; Guilherme Gonçalves; Marcelo U. Ferreira; M. Gabriela M. Gomes

doi:10.1101/2020.07.23.20160762

Abstract

Variation in individual susceptibility or frequency of exposure to infection accelerates the rate at which populations acquire immunity by natural infection. Individuals that are more susceptible or more frequently exposed tend to be infected earlier and hence more quickly selected out of the susceptible pool, decelerating the incidence of new infections as the epidemic progresses. Eventually, susceptible numbers become low enough to prevent epidemic growth or, in other words, the herd immunity threshold (HIT) is reached. We have recently proposed a method whereby mathematical models, with gamma distributions of susceptibility or exposure to SARS-CoV-2, are fitted to epidemic curves to estimate coefficients of individual variation among epidemiological parameters of interest. In the initial study we estimated HIT around 25-29% for the original Wuhan virus in England and Scotland. Here we explore the limits of applicability of the method using Spain and Portugal as case studies. Results are robust and consistent with England and Scotland, in the case of Spain, but fail in Portugal due to particularities of the dataset. We describe failures, identify their causes, and propose methodological extensions.

Introduction

Selection acting on unmeasured individual variation is a well-known source of bias in the analysis of populations. It has been shown to affect measured rates of mortality (Keyfitz and Littman; Vaupel et al 1979; Vaupel and Yashin 1985), the survival of endangered species (Kendall and Fox 2002; Jenouvrier et al 2018), the scope of neutral theories of biodiversity and molecular evolution (Steiner and Tuljapurkar 2012, Gomes et al 2019), the measured risk of diseases whether non-communicable (Aalen et al 2015; Stensrud and Valberg 2017) or infectious (Anderson et al 1986; Dwyer et al 1997; Smith et al 2005; Bellan et al 2015; Gomes et al 2019; Corder et al 2020; Montalbán et al 2020), and the efficacy of interventions such as vaccines (Halloran et al 1996; O’Hagan et al 2012; Gomes et al 2014; Gomes et al 2016; Langwig et al 2017) or symbionts (Pessoa et al 2016; King et al 2018). Building on this knowledge, we previously addressed how selection on individual variation might affect the course of the coronavirus disease (COVID-19) pandemic (Gomes et al 2022).

COVID-19 is an infectious respiratory disease caused by a virus (severe acute respiratory syndrome coronavirus 2 [SARS-CoV-2]), which was first identified in China in late 2019 and has since spread worldwide leading to considerable human suffering and social disruption. European and American continents have been the most affected, with 0.16% and 0.20% of the respective total populations having died as of the 15 July 2021 (WHO 2021). Here we analyse series of daily deaths attributed to COVID-19 in Spain and Portugal (Iberian Peninsula) to study how individual variation in susceptibility and exposure to a respiratory virus affects its epidemic trajectory. Besides adding to the compendium of neglected effects of selection in population dynamics we hope to stimulate a new approach to study epidemic dynamics.

The essence to the approach is that individual variation in susceptibility or exposure (connectivity) accelerates the acquisition of immunity in populations. More susceptible and more connected individuals have a higher propensity to be infected and thus are likely to become immune earlier. Due to this selective immunization by natural infection, heterogeneous populations acquire herd immunity more efficiently than suggested by models that do not fully account for these types of variation. Here we integrate a continuous distribution of susceptibility or connectivity in an otherwise basic COVID-19 epidemiological model, which necessarily accounts for non-pharmaceutical intervention effects, and generate three types of results. First, at national levels the herd immunity threshold by natural infection declines from around 70% to 20-30%. This is newly reported here for Spain and in agreement with recent estimates for England and Scotland (Gomes et al 2022). Second, these inferences can be made relatively early in the pandemic, such as between first and second waves, provided the first wave is sufficiently large and spatially synchronous. Third, we include a selection of results for Portugal to illustrate how the inferential procedure degenerates when national data do not meet certain conditions.

Individual variation in SARS-CoV-2 transmission

SARS-CoV-2 is transmitted primarily by respiratory droplets and modelled as a susceptible-exposed-infectious-recovered (SEIR) process.

Variation in susceptibility to infection

Individual variation in susceptibility is integrated as a continuously distributed factor that multiplies the force of infection upon individuals (Diekmann et al 1990) in the form of an infinite system of ordinary differential equations (ODEs): where S(x) is the density of individuals with susceptibility x, E(x) and I(x) are the densities of individuals who originally had susceptibility x and became exposed and infectious, while R(x) represents those who have recovered and have their susceptibility reduced to a reinfection factor σ due to acquired immunity. Parameter δ is the rate of progression from exposed to a period of maximal infectiousness (= 1/5.5 per day [McAloon et al. 2020; Lauer et al. 2020]), γ is the rate of recovery from maximal infectiousness (= 1/4 per day [Nishiura et al. 2020; Li et al. 2020]), ϕ is the proportion of individuals who die as a result of infection (= 0.008 [Pastor-Barriuso et al. 2021]), and: is the average force of infection upon susceptible individuals in a population of approximately constant size N and transmission coefficient β. Standardizing so that susceptibility distributions have mean ∫ xg(x) dx = 1, given a probability density function g(x), the basic reproduction number, defined as the expected number of secondary infections generated by an infected individual in a population that has no specific immunity to the virus (Diekmann et al 1990), is: where ρ is a factor measuring the infectiousness of individuals in compartment E in relation to those in I (= 0.5 [Wei et al. 2020; To et al. 2020; Arons et al. 2020; He et al. 2020]). The coefficient of variation (CV) in individual susceptibility is also treated as a parameter.

The basic reproduction number ℛ₀ is a theoretical framework. It is usually estimated from the initial growth in case numbers. However, as the virus spreads through the population, infected and immune individuals accumulate, reducing the availability of susceptible hosts. As a result, growth in case numbers deviates from being a direct indication of ℛ₀ but rather of a so-called effective reproduction number ℛ_eff.

When susceptibility is given by a gamma distribution and acquired immunity is totally protective (σ = 0), the effective reproduction number is: where S(t) = ∫ S(x, t) dx is the total number of susceptible individuals at time t, and (Equations 1-4) reduce exactly to a finite system of ODEs: where S, E, I and R are the total numbers of susceptible, exposed, infectious and recovered individuals, respectively (Novozhilov 2008; Montalbán et al. 2020).

Variation in connectivity

In a directly transmitted infectious disease, such as COVID-19, variation in exposure to infection is governed primarily by patterns of connectivity among individuals. Here we incorporate this in the system (Equations 1-4) under the assumption that individuals mix at random (Pastor-Satorras and Vespignani 2001; Miller et al. 2012), while in Supplementary Information we conduct some sensitivity analyses to this assumption. Under random mixing and heterogeneous connectivity, the force of infection is written as: and the basic reproduction number is: In this setup, when connectivity is given by a gamma distribution and acquired immunity is totally protective, the effective reproduction number is approximated by: and (Equations 1-4) reduce approximately to the finite system of ODEs: where S, E, I and R are the total numbers of susceptible, exposed, infectious and recovered individuals, respectively (Montalbán et al. 2020).

We have combined the basic models described by the reduced systems in Equations 8-11 and Equations 15-18 with non-pharmaceutical interventions (NPIs) to produce COVID-19 transmission models. We then estimate the relevant parameters for those models by fitting to data series of daily deaths from each of the study countries.

Non-pharmaceutical interventions and other transmissibility modifiers

NPIs designed to control transmission typically reduce β and hence ℛ₀. Denoting the time-dependent reproduction number when control measures are in place by ℛ_c(t), the modified effective reproduction number is obtained by replacing ℛ₀ with ℛ_c(t) in (Equation 7) and (Equation 14) as appropriate. For the estimation of ℛ_c(t) we introduce flexible transmissibility profiles c(t) as illustrated in Figure 1.

Figure 1: Transmissibility profile.

Schematic illustration of factor c(t) representing the combined effects of NPIs, seasonality and viral evolution on the reproduction number ℛ_c(t). T₀ is the time when ℛ_c begins to decrease due to behavioural change or seasonality (estimated); T₁ (> T₀) is the day first lockdown begins (informed by data); c₁ ≤ 1 is the average c(t) achieved during the first lockdown; L₁, L₂ and L₃, denote the length in days of the successive periods of strictest NPI measures (L₁ being the first lockdown). These profiles are adopted in fits until: 1 July 2020 (top); 1 March 2021 (bottom).

One-wave transmissibility profile

When the model is applied to the first pandemic wave only (until 1 July 2020) we use the top profile in Figure 1. T₀ is the time when ℛ₀ begins to decrease due to behavioural changes or seasonality; T₁ (> T₀) is the day first lockdown begins (transmission is allowed to decrease between T₀ and T₁); c₁ ≤ 1 is the average c(t) from the beginning of the first lockdown onwards (14 March in Spain, 19 March in Portugal). Mathematically this is constructed as:

Two-wave transmissibility profile

Applying the model over longer periods which capture multiple waves and multiple lockdowns requires additional features on the transmissibility profile. Denoting by L₁ the duration of the first lockdown (29 days in Spain, 44 days in Portugal), we allow restrictions to be progressively relaxed at the end of this period by letting transmission begin a linear increase such that c(t) reaches 1 in T₂ days, which may or may not be within the range of the study. Changes in other factors that affect transmission (such as seasonality or viral evolution) are inseparable from contact changes in this framework and are also accounted for by c(t). Mathematically this is constructed as: Second and third lockdowns in the autumn and winter season are implemented as a further reduction in transmission (by factors c₂ and c₃, respectively) over the stipulated time periods (L₂ and L₃ in the bottom panel of Figure 1): In Spain, second and third lockdowns were effectively a single intervention, moderately interrupted by a short relaxation over Christmas, and hence we assume c₂ = c₃ in this country. This is contrasted by Portugal where the third lockdown was much stricter than the second and estimated independently.

Herd immunity thresholds

Individual variation in risk of acquiring infection is under selection by the force of infection, whether individual differences are due to biological susceptibility, exposure, or both. The most susceptible or exposed individuals are selectively removed from the susceptible pool as they become infected and eventually recover with immunity (some die), resulting in decelerated epidemic growth and accelerated acquisition of immunity in the population. The herd immunity threshold (HIT) defines the percentage of the population that needs to be immune to reverse epidemic growth and prevent future waves. In the absence of NPIs or other transmissibility modifiers, if individual susceptibility or connectivity is gamma-distributed and mixing is random, basic HIT curves (ℋ) can be derived analytically (Montalbán et al 2020) from the model systems (Equations 1-4, with the respective forces of infections). In the case of variation in susceptibility to infection we obtain while variable connectivity results in a different exponent: In less straightforward cases, such as when the characteristics follow a distribution other than gamma (Gomes et al. 2022), when mixing is not random (Supplementary Information), when both distributions in susceptibility and connectivity are considered, or when contact networks are rewired as result of NPIs or otherwise, ℋ can be obtained numerically.

In the absence of reinfection (σ = 0), both (Equation 23) and (Equation 24) convey substantial declines in HIT as individual variation increases (Figure 2, Gomes et al. 2022 and Montalbán et al. 2020), most strikingly over relatively low CV (from ν = 0 up to 1 or 2). For concreteness, when ℛ₀ = 3, ℋ = 67% for ν = 0, while ν = 1 brings ℋ down to 42% for heterogeneous susceptibility and 30% for heterogeneous connectivity, and ν = 2 brings ℋ further down to 20% and 11%, respectively. Accounting for reinfection (σ > 0) might moderate these reductions. In any case, such differences in HIT are an indication of strong sensitivity of the epidemic dynamics to the parameter ν over a range that appears realistic (Gomes et al. 2022).

Figure 2: Herd immunity threshold and epidemic final size.

Herd immunity thresholds (solid curves) are calculated according to (Equation 23) for heterogeneous susceptibility and (Equation 24) for heterogeneous connectivity, assuming ℛ₀ = 3 for concreteness. Final sizes of the corresponding unmitigated epidemics are also shown (dashed). Curves are generated for different values of the efficacy of immunity conferred by natural infection (1 − σ) as displayed in the legend: 100% (blue); 90% (green); 80% (yellow); 70% (orange); 67% (corresponding to 1 − 1/ℛ₀ in this case; red).

We emphasise, nevertheless, that ℋ is a theoretical framework to the extent that ℛ₀ is a theoretical framework. It cannot be measured directly when epidemic trajectories are affected by interventions, but it can be inferred indirectly from epidemiological data. By construction, ℋ changes if the parameters that determine its value change. Most notably, natural changes in ℛ₀ through time, which can happen due to seasonal forces or viral evolution, transfer to ℋ according to (Equation 23), (Equation 24), or even their homogeneity equivalent 1 − 1/ℛ₀. As a result, the percentage of the population immune required to prevent sustained epidemic growth may deviate from the initial ℋ. Notwithstanding, a model with lower ℋ results in smaller epidemics than a model with higher ℋ, all non-basic processes being the same.

The phenomenon of variation and selection which accounts for lower HIT was widely explained around mid-2020 (Hartnett 2020) and generated broad public interest in the context of COVID-19. By early 2021, vaccines had become available, and a competing belief emerged to imply that the HIT might be unachievable for COVID-19 (Aschwanden 2021). While Hartnett (2020) writes about using the basic ℋ to assess pandemic potential, stressing how that is weakened by variation and selection by natural infection, Aschwanden (2021) focuses on the imperfect nature of both vaccine induced and natural immunity to endorse that reinfection may become frequent enough to make herd immunity unachievable. In the light of the theory presented here – specifically (Equation 23) and (Equation 24) – these views are orthogonal and do not contradict each other.

Data

We use publicly available epidemiological data from the coronavirus dashboards for Spain [https://cnecovid.isciii.es/covid19] and Portugal [https://covid19.min-saude.pt/ponto-de-situacao-atual-em-portugal] to fit the models and estimate parameters of interest. Namely, we fit model reconstructed mortality timeseries assuming a fixed infection fatality ratio (IFR) to datasets containing daily deaths, , where k = 0 is the day when the cumulative moving average of death numbers exceed 5 · 10⁻⁷ of the population (7 March in Spain, 19 March 2020 in Portugal).

Model fits were carried out to the raw series of daily deaths until the 1 July 2020 in the first instance (to cover the first wave of the epidemic in the study countries), and until the 1 March 2021 in an extended analysis (as a compromise between having a series sufficiently long to capture much of the second wave and not so long that it would be affected by vaccination and require the vaccine to be modelled). We defined the initial conditions as: where η is the excess duration of a fatal infection relative to non-fatal, y₀ is the number of deaths in the first day of the study, and the population size N was obtained from the most recent respective censuses (approx. 46.94 million in Spain, 10.28 million in Portugal, 3.57 million in the North Region of Portugal, 3.66 million in Lisbon and Tagus Valley Region of Portugal).

Model fitting and parameter estimating

We assumed that reinfection was negligible throughout the study period. A study conducted in England (Hall et al. 2021), between June 2020 and January 2021, concluded that previous SARS-CoV-2 infection induced 84% effective immunity to future infections. In (Gomes et al. 2122) we fit models to daily COVID-19 deaths in England and Scotland, assuming no reinfection (i.e. 100% effective immunity) or 90% effective immunity, and found it to have no significant effect on projected model trajectories. Basically, the fit readjusts the parameters when reinfection is added to the model in such a way that the HIT remains similar.

Parameter estimation was performed with the software MATLAB by employing a multi-start local optimization approach followed by Markov chain Monte Carlo (MCMC) posterior distribution sampling, using the PESTO (Parameter EStimation Toolbox) package (Stapor et al. 2018). We assumed the daily number of SARS-CoV-2 infections to be Poisson distributed.

We approximate the dynamics of COVID-19 deaths by estimating the set of parameters θ that maximises the log-likelihood (LL) of observing the daily numbers of reported deaths Y: where are the simulated model output numbers of COVID-19 deaths at day k for the set of parameters θ, are the numbers of daily reported deaths, and n is the total number of days included in the analysis.

Fitting models to one pandemic wave

The models exploring heterogeneity in susceptibility (Equations 8-11) and connectivity (Equations 15-18) both with transmissibility profile as in (Equation 19), were fit to COVID-19 daily reported deaths in Spain and Portugal recorded until 1 July 2020. A homogeneous version obtained by setting ν = 0 in either model was also fitted. Results for Spain are shown in Table 1 and Figure 3, and for Portugal in Table 2 and Figure 4.

View this table:

Table 1: Model parameters for Spain (one wave).

Estimated by Bayesian inference based on daily deaths until 1 July 2020. Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores (best in red, second best in blue). Herd immunity threshold (ℋ) derived from estimated ℛ₀ and CV (ν).

View this table:

Table 2: Model parameters for Portugal (one wave).

Figure 3: Estimating SARS-CoV-2 transmission in Spain by fitting one wave of COVID-19 deaths.

Variation in susceptibility (top panels); variation in connectivity (middle panels); and homogeneous model (bottom panels). Susceptibility or connectivity factors implemented as gamma distributions. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 1). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

Figure 4: Estimating SARS-CoV-2 transmission in Portugal by fitting one wave of COVID-19 deaths.

Variation in susceptibility (top panels); variation in connectivity (middle panels); and homogeneous model (bottom panels). Susceptibility or connectivity factors implemented as gamma distributions. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 2). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

We estimate the basic reproduction number ℛ₀ with 95% credible intervals (CI) around 3.5 − 3.8 in Spain and 2.3 − 3.4 in Portugal. For the minimal transmissibility factor c₁ we estimate 0.21 − 0.27 when individual variation is allowed and 0.18 − 0.19 with the homogeneity constraint in Spain, while in Portugal we estimate the wider intervals 0.26 − 0.39. For coefficients of variation in Spain, we estimate CV in the range 1.3 − 2.4 under heterogeneous susceptibility and 1.0 − 1.7 under heterogeneous connectivity. In Portugal, we obtain the much wider and uninformative ranges 0.0 − 3.2.

Left plots in Figures 3 and 4 show the best fitting model solutions generated from the median posterior estimates of each parameter in the respective countries as well as the 95% CI generated from 100,000 posterior samples. The herd immunity thresholds ℋ, calculated from ℛ₀ and CV estimates, are ℋ = 19% (95% CI, 13-32%) under heterogeneous susceptibility, ℋ = 19% (95% CI, 13-36%) under heterogeneous connectivity, and ℋ = 72% (95% CI, 71-73%) when homogeneity is imposed, in Spain. In Portugal, we obtain ℋ = 19% (95% CI, 5-69%) under heterogeneous susceptibility, ℋ = 13% (95% CI, 3-69%) under heterogeneous connectivity, and ℋ = 67% (95% CI, 66-69%) when homogeneity is imposed. Credible intervals for ℋ in Portugal are wide and uninformative when individual variation is allowed which is expected given the wide ranges obtained for CV. NPIs in Portugal were initiated very early in the epidemic which resulted in transmissibility reductions blending with ℛ₀, making parameter identification a major challenge from the data accessible to us.

In Spain, where the three models provide good fits to the data, model selection criteria such as AIC (Akaike information criterion) support the heterogeneous implementations. To better distinguish the various models, we run the respective systems of equations forward, under a set of conventions, and compare the respective projected outcomes. Right plots in Figures 3 were generated by taking the end conditions of the left plots, moving all exposed and infectious individuals to recovered (except a residual proportion to seed a new outbreak) and running each model with the estimated ℛ₀ and CV until the susceptible pool has been effectively depleted in all implementations. In this manner we can visualise how much more burden of infection appears to be ahead when models are constrained to be homogeneous (a manifestation of their relatively higher ℋ). Roughly, epidemics peak one order of magnitude higher when models are homogeneous. This must have broad implications, which we believe remain largely unappreciated, for how a population will experience an epidemic, irrespective of how this basic scenario is adapted to specific factors such as behavioural patterns, seasonality, viral evolution, or vaccination.

The same analysis applied to Portugal, selects in favour of the homogeneity assumption and larger projected waves. We recall, however, the great uncertainty associated with these specific results. We investigate this further through fittings to longer data series, in the first instance.

Fitting models to two serial pandemic waves

Here we take the models with heterogeneity in susceptibility (Equations 8-11) and connectivity (Equations 15-18) and apply the transmissibility profile in (Equations 20-22) to both before fitting the model outputs to COVID-19 deaths recorded daily until 1 March 2021, in Spain and Portugal. As before a homogeneous version obtained by setting ν = 0 was also fitted. Results for Spain are shown in Table 3 and Figure 5, and for Portugal in Table 4 and Figure 6.

View this table:

Table 3: Model parameters for Spain (two waves).

Estimated by Bayesian inference based on daily deaths until 1 March 2021. Model selection based on maximum log-likelihood (LL) and Akaike information criterion (AIC). Best fitting models have lower AIC scores (best in red, second best in blue). Herd immunity threshold (ℋ) derived from estimated ℛ₀ and CV (ν).

View this table:

Table 4: Model parameters for Portugal (two waves).

Figure 5: Estimating SARS-CoV-2 transmission in Spain by fitting two serial waves of COVID-19 deaths.

Variation in susceptibility (top panels); variation in connectivity (middle panels); and homogeneous model (bottom panels). Susceptibility or connectivity factors implemented as gamma distributions. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 3). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

Figure 6: Estimating SARS-CoV-2 transmission in Portugal by fitting two serial waves of COVID-19 deaths.

Variation in susceptibility (top panels); variation in connectivity (middle panels); and homogeneous model (bottom panels). Susceptibility or connectivity factors implemented as gamma distributions. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 4). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

We estimate ℛ₀ with 95% CI around 3.9 − 4.4 in Spain and 2.7 − 4.6 in Portugal. For the minimal transmissibility factor c₁ we estimate 0.20 − 0.21 when individual variation is allowed and 0.17 with the homogeneity constraint in Spain, while in Portugal we obtain 0.23 − 032 with individual variation and 0.16 − 018 without. For coefficients of variation in Spain, we estimate CV around 2.0 under heterogeneous susceptibility and 1.3 under heterogeneous connectivity. In Portugal, the estimates are around 0.5.

Left plots in Figures 5 and 6 show best fitting model solutions as well as the respective 95% CI. In Spain, basic herd immunity thresholds calculated from best fitting ℛ₀ and CV are ℋ = 25% under heterogeneous susceptibility, ℋ = 27% under heterogeneous connectivity, and ℋ = 76% when homogeneity is imposed. In Portugal, we obtain again wider and higher ranges: ℋ = 56% (95% CI, 52-57%) under heterogeneous susceptibility, ℋ = 56% (95% CI, 54-57%) under heterogeneous connectivity, and ℋ = 77% (95% CI, 76-78%) when homogeneity is imposed.

Puzzling, as in the case of shorter data series, model selection supports the homogeneous model for Portugal while still favouring the incorporation of individual variation in Spain. Moreover, for each country, results are consistent whether we base our estimates on one or two waves of the national epidemic. This consistency was also verified in England and Scotland (Gomes et al 2022). This suggests that the earlier initiation of NPIs may not fully explain why results for Portugal contrast with those for other countries studied. In the next section we explore whether this may be due to the asynchrony between the two largest regions (comprising approximately 66% of the total population), which could ultimately result in a mis-specified model for this country.

Fitting models to regional data in Portugal

Intrigued by the puzzling results for Portugal at country level, we gathered regional data. We found that the epidemic dynamics were considerably different between the two largest regions: North, home to roughly one third of the Portuguese population; and Lisbon and Tagus Valley, home to another third. Asynchrony of epidemic dynamics between regions of similar sizes may require disaggregated analyses. We then decided to fit the mortality data for the two regions simultaneously, estimating common parameters to describe country-wide lockdowns, and regions-specific parameters to describe the basic transmission dynamics (ℛ₀ and CV). The results are provided in Tables 5, 6 and Figures 7-9. According to these analyses, the best-fitting models include heterogeneity.

View this table:

Table 5: Model parameters for the North and Lisbon regions of Portugal (one wave).

View this table:

Table 6: Model parameters for the North and Lisbon regions of Portugal (two waves).

Figure 7: Estimating SARS-CoV-2 transmission in the two larger regions of Portugal.

Heterogeneous susceptibility implemented as a gamma distribution. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 5). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

Figure 8: Estimating SARS-CoV-2 transmission in the two larger regions of Portugal.

Heterogeneous connectivity implemented as a gamma distribution. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 5). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

Figure 9: Estimating SARS-CoV-2 transmission in the two larger regions of Portugal.

Homogeneous model. Controlled (ℛ_c) and effective (ℛ_eff) reproduction numbers are displayed on shallow panels underneath the main plots. Basic reproduction number, coefficients of variation and transmissibility profile parameters estimated by Bayesian inference as described in Methods (estimates in Table 5). Curves represent model reconstructions from the median posterior parameter estimates. Shades represent 95% credible intervals from 100,000 posterior samples.

Herd immunity thresholds inferred by fits to the longer series (until 1 March 2021) are around 25 − 34% in the North (similar as Spain, England and Scotland) and 47 − 52% in Lisbon and Tagus Valley. The higher ℋ in the capital region results from the estimation of a lower CV. This may be real and due to the more urban character of Lisbon and Tagus Valley, or, in contrast, it may be a spurious result of the lack of a first wave in the region to inform the model. It would be interesting to replicate the regional analysis in other countries to investigate to what extent more urban regions have higher HIT. As for the North, ℋ is much closer to that of Spain, England and Scotland, but slightly higher, nevertheless. This may also be a slightly spurious consequence of the first wave being more suppressed there (although not as much as in Lisbon and Tagus Valley) than in those other nations. Alternatively, there may be differences in reporting across countries (particularly those affecting whether deaths are declared with or from COVID-19 [Ferreira 2022; Gonçalves 2022]) influencing the accuracy of estimated model parameters.

Fits to the shorter series (until 1 July 2020) are again inconclusive. Not only the uncertainty around parameter estimates is large but some estimates are implausible. First, ℛ₀ around 2 or less is on the low end of consensus estimates (Flaxman et al. 2020; Keeling et al. 2020; Viana et al. 2021; Wood 2021). More strikingly, the algorithm is incapable of estimating CV from these regional series, resulting in convergence to the upper and lower limits of the prior distribution (uniform between ) in the North and Lisbon regions, respectively.

In summary, the results reported in this section support two notions. First, asynchronous dynamics may compromise parameter estimation based on model fittings to aggregated data. This should depend on whether the asynchrony in question is between similar-sized regions. Second, the early estimation of parameters for heterogeneous models (especially CV) may require a larger first wave than that required by models that are either homogeneous (Flaxman et al. 2020; Wood 2021) or have their heterogeneity informed directly by specific data (Keeling et al. 2020). The downside of these other approaches, however, is that heterogeneity is either absent or possibly incomplete, resulting in reduced selection and biased estimates. Among the studies we have completed, England, Scotland and Spain had sufficiently sized first waves to inform the inference of CV while Portugal did not.

Discussion

We fitted SEIR models, with inbuilt distributions of individual susceptibility or exposure to infection, to daily series of COVID-19 deaths in Spain and Portugal. We estimated relevant transmission parameters, such as the basic reproduction number ℛ₀, and a time dependent transmissibility profile c(t) which multiplies ℛ₀ to account for effects of NPIs, seasonality, viral evolution, or any unexplicit factor that modifies the ability of the virus to infect new hosts. In addition, as in (Gomes et al. 2022) we estimated coefficients of variation that characterise distributions of unmeasured individual susceptibility or connectivity.

The inference of selectable variation as presented here is uncommon in infectious disease modelling. Prior to attempting this in the context of the COVID-19 pandemic, we and others have conducted related studies in systems that were either experimentally controlled (Dwyer et al. 1997; Ben-Ami et al. 2008; Zwart et al. 2011; Pessoa et al. 2014; Langwig et al. 2017; King et al. 2018) or already endemic (Smith et al. 2005; Bellan et al. 2015; Corder et al. 2020). Several aspects of the pandemic made it more challenging. There was an urgency for early results from inherently scarce data, a challenge amplified by non-pharmaceutical interventions designed to suppress the epidemic. In countries where interventions began earlier (relative to the epidemic momentum), such as Portugal, it has been impossible to reach conclusive results without finer data and methods. In Spain (this study) and England and Scotland (Gomes et al 2022), on the other hand, interventions started later, and results were consistent, both internally and between each other. Our coefficients of variation for individual connectivity are similar to those measured directly by contact surveys (Gomes et al 2022).

Another group of authors highlighted the importance of considering the interplay between social dynamics and spread of infection when interpreting coefficients of variation (Tkachenko 2021). If society changed over time in such a way that individuals with low susceptibility/exposure in one wave became high susceptibility/exposure in a later wave, then coefficients of variation estimated from two-wave fits should be lower than those estimated from one-wave fits, resulting in higher herd immunity thresholds. Our results for Spain are not indicative of this process playing a significant role in COVID-19. We estimate CV around 1.41 (95%, 0.97 − 1.74) when we fit the first wave only, and 1.30 (95%, 1.28 − 1.31) when we fit two waves. The two-wave estimate is not significantly lower than that obtained from the one-wave analysis, suggesting that the postulated mechanism is not affecting our inferences. We saw the same consistency in our previous analysis of England and Scotland (Gomes et al. 2022). We find this unsurprising given the amply reported evidence of socioeconomic determinants as key drivers of heterogeneity in infectious diseases (e.g., Millett et al. 2020, Xia et al. 2022). Societal changes may not significantly impact our inferences unless major inversions in socioeconomic gradients had occurred which is unimaginable in the time scale of a pandemic. On the contrary, the opposite seems more plausible as more disadvantaged social groups suffer more from both disease and containment measures, exacerbating preexisting heterogeneity (Okonkwo et al. 2021).

The exploration presented here for Spain confirms recent findings for England and Scotland that the original SARS-CoV-2 had a herd immunity threshold in the range 20-30%. The main specificity of the underlying studies is to include individual variation in susceptibility and exposure to infection in the set of model parameters being estimated. The modelling approach is relatively new, and its limits of applicability remain a subject for research and further methodological developments. Here we adopt a dataset from Portugal to highlight features that compromise the applicability of the basic method and, in some instances, propose refinements to push those limits.

Data Availability

Datasets are publicly available at the respective national ministry of health websites.

Author contributions

M.G.M.G. conceived the study. R.A. and M.G.M.G. performed the analyses. All authors interpreted the data and wrote the paper.

Competing interests

The authors declare no competing interests.

Data availability

We used publicly available data from the coronavirus dashboards for Spain [https://cnecovid.isciii.es/covid19] and Portugal [https://covid19.min-saude.pt/ponto-de-situacao-atual-em-portugal]. Population sizes were obtained from recent censuses: 46,771,836 for Spain; 10,196,709 for Portugal; 3,573,000 for Portugal-North; and 3,447,173 for Lisbon and Tagus Valley.

Acknowledgements

We thank Rodrigo Corder, Jessica King and Antonio Montalbán for technical discussions and contributions to related research.

Footnotes

The text has been revised to emphasise that this is an exploratory study of the limites of applicability of a method previously published in: Gomes, M. G. M., et al. (2022) Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. J. Theor. Biol. 540, 111063.

References

1.
Keyfitz, N. & Littman, G. (1979) Mortality in a heterogeneous population. Popul. Stud. 33, 333–342.
OpenUrl
2.↵
Vaupel, J., Manton, K. & Stallard, E. (1979) Impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 16, 439–454.
OpenUrl CrossRef PubMed Web of Science
3.↵
Vaupel, J., & Yashin, A. (1985) Heterogeneity ruses – some surprising effects of selection on population dynamics. Am. Stat. 39, 176–185.
OpenUrl CrossRef PubMed Web of Science
4.↵
Kendall, B. E. & Fox, G. A. (2002) Variation among individuals and reduced demographic stochasticity. Conserv. Biol. 16, 109–116.
OpenUrl CrossRef Web of Science
5.↵
Jenouvrier, S, Aubry, L. M., Barbraud, C, Weimerskirch, H & Caswell, H. (2018) Interacting effects of unobserved heterogeneity and individual stochasticity in the life history of the southern fulmar. J. Anim. Ecol. 87, 212–222.
OpenUrl
6.↵
Steiner, U. K. & Tuljapurkar, S. (2012) Neutral theory for life histories and individual variability in fitness components. Proc. Natl. Acad. Sci U. S. A. 109, 4684–4689.
OpenUrl Abstract/FREE Full Text
7.↵
Gomes, M. G. M., King, J. G., Nunes, A., Colegrave, N. & Hoffmann, A. (2019) The effects of individual nonheritable variation on fitness estimation and coexistence. Ecol. Evol. 16, 8995–9004.
OpenUrl
8.↵
Aalen, O. O., Valberg, M., Grotmol, T. & Tretli, S. (2015) Understanding variation in disease risk: the elusive concept of frailty. Int. J. Epidemiol. 4, 1408–1421.
OpenUrl
9.↵
Stensrud, M. J. & Valberg, M. (2017) Inequality in genetic cancer risk suggests bad genes rather than bad luck. Nat. Commun. 8, 1165.
OpenUrl
10.↵
Anderson, R. M., Medley, G. F., May, R. M. & Johnson, A. M. (1986) A preliminary study of the transmission dynamics of the human immunodeficiency virus (HIV), the causative agent of AIDS. IMA J. Math. Appl. Med. Biol. 3, 229–263.
OpenUrl CrossRef PubMed
11.↵
Dwyer, G., Elkinton, J. S. & Buonaccorsi, J. P. (1997) Host heterogeneity in susceptibility and disease dynamics: Tests of a mathematical model. Am. Nat. 150, 685–707.
OpenUrl CrossRef PubMed Web of Science
12.↵
Smith, D. L., Dushoff, J., Snow, R. W. & Hay, S. I. (2005) The entomological inoculation rate and Plasmodium falciparum infection in African children. Nature 438, 492–495.
OpenUrl CrossRef PubMed Web of Science
13.↵
Bellan, S. E., Dushoff, J., Galvani, A. P. & Meyers, L. A. (2015) Reassessment of HIV-1 acute phase infectivity: accounting for heterogeneity and study design with simulated cohorts. PLOS Med. 12, e1001801.
OpenUrl CrossRef PubMed
14.
Gomes, M. G. M., et al. (2019) Introducing risk inequality metrics in tuberculosis policy development. Nat. Commun. 10, 2480.
OpenUrl
15.↵
Corder, R. M., Ferreira, M. U. & Gomes, M. G. M. (2020) Modelling the epidemiology of residual Plasmodium vivax in a heterogeneous host population: a case study in the Amazon Basin. PLOS Comput. Biol. 16, e1007377.
OpenUrl PubMed
16.↵
Halloran, M. E., Longini, I. M. Jr.. & Struchiner, C. J. (1996) Estimability and interpretability of vaccine efficacy using frailty mixing models. Am. J. Epidemiol. 144, 83–97.
OpenUrl CrossRef PubMed Web of Science
17.↵
O’Hagan, J. J., Hernán, M. A., Walensky, R. P. & Lipsitch, M. (2012) Apparent declining efficacy in randomized trials: Examples of the Thai RV144 HIV vaccine and CAPRISA 004 microbicide trials. AIDS 26, 123.
OpenUrl CrossRef PubMed
18.↵
Gomes, M. G. M., et al. (2014) A missing dimension in measures of vaccination impacts. PLOS Pathog. 10, e1003849.
OpenUrl CrossRef
19.↵
Gomes, M. G. M., Gordon, S. B. & Lalloo, D. G. (2016) Clinical trials: the mathematics of falling vaccine efficacy with rising disease incidence. Vaccine 34, 3007.
OpenUrl CrossRef
20.↵
Langwig, K. E., et al. (2017) Vaccine effects on heterogeneity in susceptibility and implications for population health management, mBio 8, e00796–17.
OpenUrl
21.↵
Pessoa, D., et al. (2016) Unveiling time in dose-response models to infer host susceptibility to pathogens. PLOS Comput. Biol. 10, e1003773.
OpenUrl
22.↵
King, J. G., Souto-Maior, C., Sartori, L. M., Maciel-de-Freitas, R. & Gomes, M. G. M. (2018) Variation in Wolbachia effects on Aedes mosquitoes as a determinant of invasiveness and vectorial capacity. Nat. Commun. 9, 1–8.
OpenUrl CrossRef PubMed
23.↵
Gomes, M. G. M., et al. (2022) Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. J. Theor. Biol. 540, 111063.
OpenUrl
24.
World Health Organization (2021) COVID-19 weekly epidemiological update: https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---13-july-2021.
25.↵
Diekmann, O., Heesterbeek, J. A. P. & Metz, J. A. J. (1990) On the definition and computation of the basic reproduction ratio R₀ in models for infectious diseases in heterogeneous populations. J. Math. Biol. 28, 365–382.
OpenUrl CrossRef PubMed Web of Science
26.↵
McAloon, C., et al. (2020) Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ Open 10, e039652.
OpenUrl Abstract/FREE Full Text
27.↵
Nishiura, H., Linton, N. M. & Akhmetzhanov, A. R. (2020) Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis. 93, 284–6.
OpenUrl CrossRef PubMed
28.↵
Lauer, S. A., et al. (2020) The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application. Ann. Intern. Med. 172, 577–582.
OpenUrl CrossRef PubMed
29.
Li, Q., et al. (2020) Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 382, 1199–1207.
OpenUrl CrossRef PubMed
30.↵
Wei, W. E., et al. (2020) Presymptomatic Transmission of SARS-CoV-2 — Singapore, January 23–March 16, 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 411–415.
OpenUrl CrossRef PubMed
31.↵
To, K. K. W., et al. (2020) Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect. Dis. 20, 565–74.
OpenUrl CrossRef PubMed
32.↵
Arons, M. M., et al. (2020) Presymptomatic SARS-CoV-2 Infections and Transmission in a Skilled Nursing Facility. N. Engl. J. Med. 382, 2081–2090.
OpenUrl CrossRef PubMed
33.↵
He, X., et al. (2020) Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med. 26, 672–675.
OpenUrl CrossRef PubMed
34.↵
Pastor-Barriuso, R., et al. (2021) SARS-CoV-2 infection fatality risk in a nationwide seroepidemiological study. medRvix doi:10.1101/2020.08.06.20169722.
OpenUrl Abstract/FREE Full Text
35.↵
Novozhilov, A. S. (2008) On the spread of epidemics in a closed hoterogeneous population. Math. Biosci. 215, 177–185.
OpenUrl CrossRef PubMed
36.
Montalbán, A., Corder, R. M. & Gomes, M. G. M. (2022) Herd immunity under individual variation and reinfection. J. Math. Biol. 85, 2.
OpenUrl
37.↵
Pastor-Satorras, R. & Vespignani, A. (2001) Epidemic dynamics and endemic states in complex networks. Phys. Rev. E 63, 066117.
OpenUrl
38.↵
Miller, J. C., Slim, A. C. & Volz, E. M. (2012) Edge-based compartmental modelling for infectious disease spread. J. R. Soc. Interface 9, 890–906.
OpenUrl CrossRef PubMed
39.
Britton, T., Ball, F. & Trapman, P. (2020) A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science 369, 846–849.
OpenUrl Abstract/FREE Full Text
40.↵
Hartnett, K. (2020) The tricky math of herd immunity for COVID-19. Quanta Magazine. https://www.quantamagazine.org/the-tricky-math-of-covid-19-herd-immunity-20200630/
41.↵
Aschwanden, C. (2021) Five reasons why COVID herd immunity is probably impossible. Nature 591, 520–522. https://www.nature.com/articles/d41586-021-00728-2
OpenUrl CrossRef PubMed
42.↵
Hall, et al. (2021) SARS-CoV-2 infection rates of antibody-positive compared with antibody-negative health-care workers in England: a large, multicentre, prospective cohort study (SIREN). Lancet 397, 1459–1469.
OpenUrl CrossRef PubMed
43.↵
Ferreira, M. L. Mortes por Covid ou com Covid? DGS diz que só entram no boletim óbitos causados pelo vírus. Como funciona a equipa que tem a última palavra. Observador 21 feb 2022. Available at: https://observador.pt/especiais/mortes-por-covid-ou-com-covid-dgs-diz-que-so-entram-no-boletim-obitos-causados-pelo-virus-como-funciona-a-equipa-que-tem-a-ultima-palavra/
44.↵
Gonçalves, J. Todos os dias há mortes declaradas no boletim da DGS que não foram por Covid-19”, garante médico infecciologista. Rádio Renascença 08 feb 2022. Available at: https://rr.sapo.pt/especial/pais/2022/02/08/todos-os-dias-ha-mortes-declaradas-no-boletim-da-dgs-que-nao-foram-por-covid-19-garante-medico-infecciologista/271641/
45.↵
Flaxman, S., et al. (2020) Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 584, 257–261.
OpenUrl CrossRef PubMed
46.↵
Keeling, M. J., et al. (2020) Fitting to the UK COVID-19 outbreak, short-term forecasts and estimating the reproductive number. medRvix doi:10.1101/2020.08.04.20163782.
OpenUrl Abstract/FREE Full Text
47.↵
Viana, J. et al. (2021) Controlling the pandemic during the SARS-CoV-2 vaccination rollout. Nat. Commun. 12, 3674.
OpenUrl
48.↵
Wood, S. N. (2021) Inferring UK COVID-19 fatal infection trajectories from daily mortality data: Were infections already in decline before the UK lockdowns? Biometrics doi:10.1111/biom.13462.
OpenUrl CrossRef
49.↵
Tkachenko, A. V., Maslov, S., Elbanna, A., Wong, G. N., Weiner, Z. J. & Goldenfeld, N. (2021) Time-dependent heterogeneity leads to transient suppression of the COVID-19 epidemic, not herd immunity. Proc. Natl. Acad. Sci. U.S.A. 118, e2015972118.
OpenUrl Abstract/FREE Full Text
50.↵
Millett, G. A., et al. (2020) Assessing differential impacts of COVID-19 on black communities. Ann. Epidemiol. 47, 37–44.
OpenUrl PubMed
51.↵
Xia, Y., et al. (2022) Concentration of SARS-CoV-2 cases by social determinants of health in metropolitan areas in Canada: a cross-sectional study. CMAJ 194, E195–E204.
OpenUrl Abstract/FREE Full Text
52.↵
Okonkwo, N. E., et al. (2021) COVID-19 and the US response: accelerating health inequalities. BMJ EBM 26, 176–179.
OpenUrl

View the discussion thread.

Posted August 30, 2022.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (330)
Allergy and Immunology (651)
Anesthesia (174)
Cardiovascular Medicine (2503)
Dentistry and Oral Medicine (307)
Dermatology (210)
Emergency Medicine (386)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (884)
Epidemiology (11985)
Forensic Medicine (10)
Gastroenterology (721)
Genetic and Genomic Medicine (3906)
Geriatric Medicine (365)
Health Economics (653)
Health Informatics (2521)
Health Policy (973)
Health Systems and Quality Improvement (935)
Hematology (350)
HIV/AIDS (813)
Infectious Diseases (except HIV/AIDS) (13486)
Intensive Care and Critical Care Medicine (778)
Medical Education (386)
Medical Ethics (106)
Nephrology (416)
Neurology (3661)
Nursing (205)
Nutrition (546)
Obstetrics and Gynecology (709)
Occupational and Environmental Health (681)
Oncology (1904)
Ophthalmology (555)
Orthopedics (230)
Otolaryngology (299)
Pain Medicine (243)
Palliative Medicine (71)
Pathology (463)
Pediatrics (1074)
Pharmacology and Therapeutics (445)
Primary Care Research (434)
Psychiatry and Clinical Psychology (3293)
Public and Global Health (6338)
Radiology and Imaging (1341)
Rehabilitation Medicine and Physical Therapy (780)
Respiratory Medicine (846)
Rheumatology (389)
Sexual and Reproductive Health (384)
Sports Medicine (334)
Surgery (426)
Toxicology (51)
Transplantation (179)
Urology (156)

[1] 1.
Keyfitz, N. & Littman, G. (1979) Mortality in a heterogeneous population. Popul. Stud. 33, 333–342.
OpenUrl

[2] 2.↵
Vaupel, J., Manton, K. & Stallard, E. (1979) Impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 16, 439–454.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Vaupel, J., & Yashin, A. (1985) Heterogeneity ruses – some surprising effects of selection on population dynamics. Am. Stat. 39, 176–185.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Kendall, B. E. & Fox, G. A. (2002) Variation among individuals and reduced demographic stochasticity. Conserv. Biol. 16, 109–116.
OpenUrl CrossRef Web of Science

[5] 5.↵
Jenouvrier, S, Aubry, L. M., Barbraud, C, Weimerskirch, H & Caswell, H. (2018) Interacting effects of unobserved heterogeneity and individual stochasticity in the life history of the southern fulmar. J. Anim. Ecol. 87, 212–222.
OpenUrl

[6] 6.↵
Steiner, U. K. & Tuljapurkar, S. (2012) Neutral theory for life histories and individual variability in fitness components. Proc. Natl. Acad. Sci U. S. A. 109, 4684–4689.
OpenUrl Abstract/FREE Full Text

[7] 7.↵
Gomes, M. G. M., King, J. G., Nunes, A., Colegrave, N. & Hoffmann, A. (2019) The effects of individual nonheritable variation on fitness estimation and coexistence. Ecol. Evol. 16, 8995–9004.
OpenUrl

[8] 8.↵
Aalen, O. O., Valberg, M., Grotmol, T. & Tretli, S. (2015) Understanding variation in disease risk: the elusive concept of frailty. Int. J. Epidemiol. 4, 1408–1421.
OpenUrl

[9] 9.↵
Stensrud, M. J. & Valberg, M. (2017) Inequality in genetic cancer risk suggests bad genes rather than bad luck. Nat. Commun. 8, 1165.
OpenUrl

[10] 10.↵
Anderson, R. M., Medley, G. F., May, R. M. & Johnson, A. M. (1986) A preliminary study of the transmission dynamics of the human immunodeficiency virus (HIV), the causative agent of AIDS. IMA J. Math. Appl. Med. Biol. 3, 229–263.
OpenUrl CrossRef PubMed

[11] 11.↵
Dwyer, G., Elkinton, J. S. & Buonaccorsi, J. P. (1997) Host heterogeneity in susceptibility and disease dynamics: Tests of a mathematical model. Am. Nat. 150, 685–707.
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Smith, D. L., Dushoff, J., Snow, R. W. & Hay, S. I. (2005) The entomological inoculation rate and Plasmodium falciparum infection in African children. Nature 438, 492–495.
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Bellan, S. E., Dushoff, J., Galvani, A. P. & Meyers, L. A. (2015) Reassessment of HIV-1 acute phase infectivity: accounting for heterogeneity and study design with simulated cohorts. PLOS Med. 12, e1001801.
OpenUrl CrossRef PubMed

[14] 14.
Gomes, M. G. M., et al. (2019) Introducing risk inequality metrics in tuberculosis policy development. Nat. Commun. 10, 2480.
OpenUrl

[15] 15.↵
Corder, R. M., Ferreira, M. U. & Gomes, M. G. M. (2020) Modelling the epidemiology of residual Plasmodium vivax in a heterogeneous host population: a case study in the Amazon Basin. PLOS Comput. Biol. 16, e1007377.
OpenUrl PubMed

[16] 16.↵
Halloran, M. E., Longini, I. M. Jr.. & Struchiner, C. J. (1996) Estimability and interpretability of vaccine efficacy using frailty mixing models. Am. J. Epidemiol. 144, 83–97.
OpenUrl CrossRef PubMed Web of Science

[17] 17.↵
O’Hagan, J. J., Hernán, M. A., Walensky, R. P. & Lipsitch, M. (2012) Apparent declining efficacy in randomized trials: Examples of the Thai RV144 HIV vaccine and CAPRISA 004 microbicide trials. AIDS 26, 123.
OpenUrl CrossRef PubMed

[18] 18.↵
Gomes, M. G. M., et al. (2014) A missing dimension in measures of vaccination impacts. PLOS Pathog. 10, e1003849.
OpenUrl CrossRef

[19] 19.↵
Gomes, M. G. M., Gordon, S. B. & Lalloo, D. G. (2016) Clinical trials: the mathematics of falling vaccine efficacy with rising disease incidence. Vaccine 34, 3007.
OpenUrl CrossRef

[20] 20.↵
Langwig, K. E., et al. (2017) Vaccine effects on heterogeneity in susceptibility and implications for population health management, mBio 8, e00796–17.
OpenUrl

[21] 21.↵
Pessoa, D., et al. (2016) Unveiling time in dose-response models to infer host susceptibility to pathogens. PLOS Comput. Biol. 10, e1003773.
OpenUrl

[22] 22.↵
King, J. G., Souto-Maior, C., Sartori, L. M., Maciel-de-Freitas, R. & Gomes, M. G. M. (2018) Variation in Wolbachia effects on Aedes mosquitoes as a determinant of invasiveness and vectorial capacity. Nat. Commun. 9, 1–8.
OpenUrl CrossRef PubMed

[23] 23.↵
Gomes, M. G. M., et al. (2022) Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold. J. Theor. Biol. 540, 111063.
OpenUrl

[24] 24.
World Health Organization (2021) COVID-19 weekly epidemiological update: https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---13-july-2021.

[25] 25.↵
Diekmann, O., Heesterbeek, J. A. P. & Metz, J. A. J. (1990) On the definition and computation of the basic reproduction ratio R₀ in models for infectious diseases in heterogeneous populations. J. Math. Biol. 28, 365–382.
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
McAloon, C., et al. (2020) Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ Open 10, e039652.
OpenUrl Abstract/FREE Full Text

[27] 27.↵
Nishiura, H., Linton, N. M. & Akhmetzhanov, A. R. (2020) Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis. 93, 284–6.
OpenUrl CrossRef PubMed

[28] 28.↵
Lauer, S. A., et al. (2020) The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application. Ann. Intern. Med. 172, 577–582.
OpenUrl CrossRef PubMed

[29] 29.
Li, Q., et al. (2020) Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 382, 1199–1207.
OpenUrl CrossRef PubMed

[30] 30.↵
Wei, W. E., et al. (2020) Presymptomatic Transmission of SARS-CoV-2 — Singapore, January 23–March 16, 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 411–415.
OpenUrl CrossRef PubMed

[31] 31.↵
To, K. K. W., et al. (2020) Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect. Dis. 20, 565–74.
OpenUrl CrossRef PubMed

[32] 32.↵
Arons, M. M., et al. (2020) Presymptomatic SARS-CoV-2 Infections and Transmission in a Skilled Nursing Facility. N. Engl. J. Med. 382, 2081–2090.
OpenUrl CrossRef PubMed

[33] 33.↵
He, X., et al. (2020) Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med. 26, 672–675.
OpenUrl CrossRef PubMed

[34] 34.↵
Pastor-Barriuso, R., et al. (2021) SARS-CoV-2 infection fatality risk in a nationwide seroepidemiological study. medRvix doi:10.1101/2020.08.06.20169722.
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Novozhilov, A. S. (2008) On the spread of epidemics in a closed hoterogeneous population. Math. Biosci. 215, 177–185.
OpenUrl CrossRef PubMed

[36] 36.
Montalbán, A., Corder, R. M. & Gomes, M. G. M. (2022) Herd immunity under individual variation and reinfection. J. Math. Biol. 85, 2.
OpenUrl

[37] 37.↵
Pastor-Satorras, R. & Vespignani, A. (2001) Epidemic dynamics and endemic states in complex networks. Phys. Rev. E 63, 066117.
OpenUrl

[38] 38.↵
Miller, J. C., Slim, A. C. & Volz, E. M. (2012) Edge-based compartmental modelling for infectious disease spread. J. R. Soc. Interface 9, 890–906.
OpenUrl CrossRef PubMed

[39] 39.
Britton, T., Ball, F. & Trapman, P. (2020) A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science 369, 846–849.
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Hartnett, K. (2020) The tricky math of herd immunity for COVID-19. Quanta Magazine. https://www.quantamagazine.org/the-tricky-math-of-covid-19-herd-immunity-20200630/

[41] 41.↵
Aschwanden, C. (2021) Five reasons why COVID herd immunity is probably impossible. Nature 591, 520–522. https://www.nature.com/articles/d41586-021-00728-2
OpenUrl CrossRef PubMed

[42] 42.↵
Hall, et al. (2021) SARS-CoV-2 infection rates of antibody-positive compared with antibody-negative health-care workers in England: a large, multicentre, prospective cohort study (SIREN). Lancet 397, 1459–1469.
OpenUrl CrossRef PubMed

[43] 43.↵
Ferreira, M. L. Mortes por Covid ou com Covid? DGS diz que só entram no boletim óbitos causados pelo vírus. Como funciona a equipa que tem a última palavra. Observador 21 feb 2022. Available at: https://observador.pt/especiais/mortes-por-covid-ou-com-covid-dgs-diz-que-so-entram-no-boletim-obitos-causados-pelo-virus-como-funciona-a-equipa-que-tem-a-ultima-palavra/

[44] 44.↵
Gonçalves, J. Todos os dias há mortes declaradas no boletim da DGS que não foram por Covid-19”, garante médico infecciologista. Rádio Renascença 08 feb 2022. Available at: https://rr.sapo.pt/especial/pais/2022/02/08/todos-os-dias-ha-mortes-declaradas-no-boletim-da-dgs-que-nao-foram-por-covid-19-garante-medico-infecciologista/271641/

[45] 45.↵
Flaxman, S., et al. (2020) Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 584, 257–261.
OpenUrl CrossRef PubMed

[46] 46.↵
Keeling, M. J., et al. (2020) Fitting to the UK COVID-19 outbreak, short-term forecasts and estimating the reproductive number. medRvix doi:10.1101/2020.08.04.20163782.
OpenUrl Abstract/FREE Full Text

[47] 47.↵
Viana, J. et al. (2021) Controlling the pandemic during the SARS-CoV-2 vaccination rollout. Nat. Commun. 12, 3674.
OpenUrl

[48] 48.↵
Wood, S. N. (2021) Inferring UK COVID-19 fatal infection trajectories from daily mortality data: Were infections already in decline before the UK lockdowns? Biometrics doi:10.1111/biom.13462.
OpenUrl CrossRef

[49] 49.↵
Tkachenko, A. V., Maslov, S., Elbanna, A., Wong, G. N., Weiner, Z. J. & Goldenfeld, N. (2021) Time-dependent heterogeneity leads to transient suppression of the COVID-19 epidemic, not herd immunity. Proc. Natl. Acad. Sci. U.S.A. 118, e2015972118.
OpenUrl Abstract/FREE Full Text

[50] 50.↵
Millett, G. A., et al. (2020) Assessing differential impacts of COVID-19 on black communities. Ann. Epidemiol. 47, 37–44.
OpenUrl PubMed

[51] 51.↵
Xia, Y., et al. (2022) Concentration of SARS-CoV-2 cases by social determinants of health in metropolitan areas in Canada: a cross-sectional study. CMAJ 194, E195–E204.
OpenUrl Abstract/FREE Full Text

[52] 52.↵
Okonkwo, N. E., et al. (2021) COVID-19 and the US response: accelerating health inequalities. BMJ EBM 26, 176–179.
OpenUrl

Herd immunity thresholds for SARS-CoV-2 estimated from unfolding epidemics

Abstract

Introduction

Individual variation in SARS-CoV-2 transmission

Variation in susceptibility to infection

Variation in connectivity

Non-pharmaceutical interventions and other transmissibility modifiers

One-wave transmissibility profile

Two-wave transmissibility profile

Herd immunity thresholds

Data

Model fitting and parameter estimating

Fitting models to one pandemic wave

Fitting models to two serial pandemic waves

Fitting models to regional data in Portugal

Discussion

Data Availability

Author contributions

Competing interests

Data availability

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area