Hardness of Herd Immunity and Success Probability of Quarantine Measures: A Branching Process Approach
======================================================================================================

* Sujit Kumar Nath

## Abstract

Herd immunity refers to the collective resistance of a population against the spreading of an infection as an epidemic. Understanding the dependencies of herd immunity on various epidemiological parameters is of immense importance for strategizing control measures against an infection in a population. Using an age-dependent branching process model of infection propagation, we obtain interesting functional dependencies of herd immunity on the incubation period of the contagion, contact rate, and the probability of disease transmission from an infected to a susceptible individual. We show that herd immunity is difficult to achieve in case of a high incubation period of the contagion. We derive a method to quantify the success probabilities of quarantine measures to mitigate infection from a population, before achieving herd immunity. We provide a mechanistic derivation of the distribution of generation time from basic principles, which is of central importance to estimate the reproduction number *R*, but has been assumed in an ad hoc manner in epidemiological studies, by far. This derivation of the generation time distribution has the generality to be applied in the study of many other age-dependent branching processes, such as the growth of bacterial colonies, various problems in evolutionary and population biology etc.

Keywords
*   Age-dependent branching process
*   infectious disease
*   extinction probability
*   herd immunity
*   generation time distribution

## I. INTRODUCTION

The propagation of an infectious disease in a population can be viewed as a branching process where an infected individual comes in contact with susceptible individuals in the population, and transmits the infection to the susceptible individuals with some probability of transmission. Branching process model of infection propagation has been widely used to study the propagation of HIV, Rabies etc. infections, both in the population, as well as in the cells or tissues of an infected individual [1–3]. There are many reasons for which a stochastic model of an infectious disease is much preferable than its deterministic counterparts [4, 5]. Due to the inherent randomness of the infectious diseases, arising from the incubation period of the contagions [6–9], infectious period of the infected individuals [10–13], contact and transmission probabilities etc. [14–16], a stochastic model is preferable than a deterministic one[17–20]. However, as all the models have their own advantages and limitations, both of the stochastic [18, 21] and deterministic [22] models, and sometimes a combination of the two [23], are used to model infectious diseases, as per their suitability.

Here we model the propagation of an infectious disease in a large homogeneous population as an age-dependent branching process. Like every other model, this simplified model also has limitations, and sometimes could be inappropriate to capture the detailed dynamics of some specific disease. However, our goal here is to present some new interesting theoretical understanding of the disease propagation, and the effects of interventions. We derive a few general theorems on age-dependent branching process, and apply them to study our model of infectious disease, to quantitatively understand the difficulty for a population to achieve herd immunity, depending on the incubation period of the contagion, social interactions, and transmission probability of the contagion from an infected to a susceptible individual. We also provide a quantitative method to estimate success probabilities of quarantine measures, depending on the epidemiological state of a population.

The distribution of generation time or serial interval time for an infectious disease [24–27] is a very important quantity to understand the transmission potential, and to calculate the basic and effective reproduction number which is a key parameter to estimate the epi-demiological state of an infection in a population [28, 29]. Until now, the distribution of generation time is estimated by fitting gamma, lognormal, Weibull, or Gaussian distributions to the transmission pairs data [26, 30–32]. However, there is no formal derivation available in favour of the particular choice of these distributions, and hence, they are chosen heuristically. In this paper, a general theorem on generation time is derived which can have large applicability in the study of infectious diseases, as well as any kind of age-dependent population dynamics, such as cell divisions [33, 34], growth of microbial colonies [35, 36] etc. We apply this general theorem to derive the distribution of generation time for our model of infectious disease.

In section II we introduce our age-dependent model of infection propagation. Section III presents the derivation of the functional dependence of herd immunity threshold on the incubation period, contact and transmission probabilities, and fraction of susceptible population. We study the success probabilities of various quarantine measures, depending on the state of the infection in the population, and compare early vs. later lockdown in a hypothetical population in sections IV and V, respectively. In section VI we prove a general theorem to derive the distribution of generation time from the generating function of the first generation progenies of any general age-dependent branching process, and apply it to derive the distribution of generation time for our model. We present a concise summary of this work in section VII.

## II. MODEL OF INFECTION PROPAGATION USING AN AGE-DEPENDENT BRANCHING PROCESS

To map the infection propagation with a branching process, we identify an infected individual to be the ancestor of the individuals to whom it transmits the infection directly. The individuals who are directly infected by an ancestor, are called the progenies of the first generation. The individuals who are infected directly by a progeny of the first generation, are called the progenies of the second generation, and so on. The set of all progenies or descendants of an individual is called a line. If the number of descendants at the *r**th* generation is zero for an individual, its line is called extinct at the *r**th* generation. Now, let an ancestor gives rise to *n* number of first generation progenies with probability *p**n*, where *n* ∈ ℤ+, the set of all non-negative integers. Then the function ![Formula][1]</img>  is the generating function for the distribution of number of progenies in the first generation. We now state a well-known theorem which establishes the basic reproduction number as a key parameter to determine the fate of an infection in a population, using the generating function of the first generation progenies.

Theorem 1.
*Let G*(*s*) *be the generating function of a branching process for which the average number of progenies created by an ancestor be µ. If µ* ≤ 1, *the process dies out with probability one. If however, µ* > 1 *the probability x**r* *that the process terminates at or before the r**th* *generation tends to the unique root x* < 1 *of the equation s* = *G*(*s*), *as r* → ∞.

See [37, 38] for a proof of theorem 1.

In the infectious disease literature, the average number of first generation progenies (newly created infections) created by an ancestor (primary infected individual) is termed as basic reproduction number, symbolically denoted by *R* [28, 29]. Therefore, it is clear from theorem 1 that the infection will die out after sufficient time when *R* ≤ 1, which is called the herd immunity.

The assumptions of our model are as follows.

Assumption 1:
*The number of contacts made by an infected individual, with other individuals in the population, is a Poisson process with a rate α*.

Assumption 2:
*The population is large and homogeneous. Contacts made by the individuals in the population are independent of each other*.

Assumption 3:
*The rate of contacts is constant over time, unless any intervention is imposed, such as quarantine measures*.

Assumption 4:
*The incubation period and infectious period are used synonymously, because of the uncertainty of the initiation of the infectiousness in an individual carrying the contagion, and there is no relapse of infection*.

Assumption 5:
*Incubation period is memoryless*.

Assumption 6:
*The fraction of susceptibles in the population varies very slowly with respect to the incubation/infectious time period*.

We now prove a general theorem to derive the generating function for the number of first generation progenies, created by an ancestor, in an age-dependent branching process; and apply it to calculate the number of infectious contacts made by an infected individual for our model.

Theorem 2.
*Let X(t) be a continuous time stochastic process with atomic distribution* ℙ(*X*(*t*) = *n*) = *p**n*(*t*), *and generating function G(s,t), where n is any non-negative integer. Let X*1(*t*) *be the process X*(*t*) *with survival time T which is a continuous non-negative random variable with distribution function F* (*t*) = ℙ(*T* ≤ *t*). *Then the generating function G*1(*s, t*) *of the process X*1(*t*) *is given by* ![Formula][2]</img> 

*Proof*. The given condition, ℙ(*X*(*t*) = *n*) = *p**n*(*t*), implies the probability that *X*(*t*) = *n*, when the process survives at least for time *t*, is *p**n*(*t*). Since the survival time *T* is distributed as *F* (*t*), the probability that *X*1(*t*) = *n* is ![Formula][3]</img> 

Therefore, the generating function for *X*1(*t*) is ![Formula][4]</img> 

Now, identifying ![Formula][5]</img>  as an expectation value on a discrete/atomic measure, and applying Fubini’s theorem for changing the order of summation and integration, we obtain ![Formula][6]</img> 

□

An important point to note here is that, in general, the function *G*1(*s, t*) could be the generating function of an improper distribution [39], i.e., *G*1(1, *t*) ≤ 1. In other words, the total probability that the number of progenies will be *n* = 0, 1, 2, 3, … etc., up to time *t*, could be less than or equal to 1.

Corollary 3.
*If a Poisson process with rate α has exponentially distributed survival time with mean λ, then the generating function for the process at time t is given by* ![Formula][7]</img> 

*Proof*. A Poisson process with rate *α*, has the generating function ![Formula][8]</img> 

Using theorem 2 and equation 4, we therefore obtain the generating function for a Poisson process with exponentially distributed survival time having mean *λ* as ![Formula][9]</img> 

□

Remark 4.
*Since* |*s*| *<* 1 *and α, λ* > 0, *we have* ![Graphic][10]</img>. *Therefore, taking limit t* → ∞, *in equation (3), we obtain the generating function for the process, described in corollary 3, at the steady state as* ![Formula][11]</img> 

## III. HARDNESS OF HERD IMMUNITY

In our model of infection dynamics, an infected individual makes contacts with other individuals as a Poisson process with rate *α*, and remains infectious for an exponentially distributed random time period, having mean *λ*. Therefore, the generating function for the number of infectious contacts (contacts made while the person is still infectious) made by an infected individual up to time *t*, and during its full lifetime are given by *C*(*s, t*) and *C*(*s*) in equations (3) and (5) respectively.

We now recall a theorem for deriving the generating functions for compound distributions, which we shall use for some of our calculations.

Theorem 5.
*Let N be a non-negative integral valued random variable with generating function G(s) and let* {*X**i*} *be a sequence of independent and identically distributed (iid) non-negative integral valued random variables with generating function R*(*s*). *Then the generating function for the compound random variable S**N* = *X*1 + *X*2 + … + *X**N* *is G*(*R*(*s*)).

For a proof of theorem 5 see [37].

With our model assumptions, let us further assume that at time *t* the fraction of susceptibles present in a population be *p**s*, and the probability of disease transmission from an infected individual to a susceptible individual, who have been in an infectious contact, be *p**c*. Therefore, the probability of generating a new infected individual from a random contact (either with susceptible or with already infected/immuned) with an infected individual is *p**s**p**c*. This event of creating a new infected individual, from a random contact with an infected individual, can be thought of as a Bernoulli trial *X* with success probability ℙ(*X* = 1) = *p**s**p**c* and ℙ(*X* = 0) = 1 − *p**s**p**c*. The generating function corresponding to this Bernoulli trial is ![Formula][12]</img> 

Now, the number of newly infected individuals from an existing infected individual, in time *t*, is the sum of *N* number of Bernoulli trials *X*, where *N* is a random number representing the number of infectious contacts made by the infected individual. Therefore, using theorem 5, from equation (3) and (6) we obtain the number of newly infected individuals, from an existing infected individual, in time *t* from the onset of its infection, has generating function ![Formula][13]</img> 

Taking limit *t* → ∞ in equation (7) we get the generating function for the same at steady state as ![Formula][14]</img> 

Equation (8) can also be obtained directly from equation (5), by replacing *s* by *B*(*s*) from equation (6). Note that here we are using assumption 6, i.e. the fraction of susceptibles *p**s*, in the population, is constant over the infectious time period of an individual. Figure 1 shows the probability of the number of first generation progenies created by an infected individual, for different incubation periods of the contagion (evaluating the Taylor series coefficients of *C*(*B*(*s*)) in equation (8)). Equation (8) is the generating function for the number of newly created infected individuals, by an existing infected individual throughout its whole infectious period. Therefore, the average number of new infections generated by an existing infected individual is ![Formula][15]</img> 

![FIG. 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/17/2020.10.22.20216481/F1.medium.gif)

[FIG. 1.](http://medrxiv.org/content/early/2021/06/17/2020.10.22.20216481/F1)

FIG. 1. 
(Color online) Probability distributions of the number of newly created infections from an existing infected individual. Horizontal axis is represents the number of new infections created, and the vertical axis represents the corresponding probabilities. Red circle, blue square, and green diamond plots are for *λ* = 10, 20, and 50, respectively. *α* = 10, *p**s* = 0.5, *p**c* = 0.05 for all plots.

Recalling the definition of basic reproduction number *R* for an infectious disease, as the average number of new infections generated by an existing infected individual, we identify that equation (9) gives the *R* for our model of infection propagation. It is clear from equation (9) that *R* is a function of the fraction of susceptibles present in the population (*p**s*), probability of disease transmission (*p**c*), the rate of contacts between the individuals in the population (*α*), and the average incubation period (*λ*). Now, using theorem 1 we can conclude that the disease will die out over time, with probability one, if *R* = *αλp**s**p**c* ≤ 1, i.e. if ![Formula][16]</img> 

Inequality (10) has many significant implications about the achievement of herd immunity. First of all, it gives an upper bound for the fraction of susceptibles in the population, below which the infection in the population dies out with probability one, which is known as the herd immunity in epidemiological terminology. It is also evident from inequality (10) that when the mean incubation period (*λ*) is high, the upper bound of *p**s* is small. This means, to achieve herd immunity in the case of a contagion with high incubation period, (1 − *p**s*) (which is close to 1) fraction of total population must be either immuned or infected, and hence, very hard to achieve. Proper use of personal protective equipments (PPE), like masks, hand sanitizers, face shields etc. can reduce the probability of disease transmission (*p**c*), and hence helps to achieve herd immunity easily. Alternatively, imposing quarantine measures can reduce the rate of contacts between infected and susceptible population (*α*), and helps to achieve herd immunity. Therefore, in absense of any pharmaceutical interventions, like vaccines or other medicines, the alternative way to mitigate infection is by the use of PPE and/or quarantine measures.

## IV. SUCCESS PROBABILITY OF QUARANTINE MEASURES

Denoting the generating function for the number of first generation progenies created by an ancestor as *G*(*s*), we now recall a theorem for calculating the extinction probability of the line of a single ancestor.

Theorem 6.
*For a branching process where the number of progenies produced by each particle are iid random variables, with generating function G*(*s*) *for the first generation progenies, the extinction probability of a line is given by the minimum of the positive roots of the equation x* = *G*(*x*), *and* 1.

See [37, 38] for a proof of theorem 6.

In our model of infection propagation, the generating function for the number of first generation progenies (newly generated infections) from a single infected individual is given by *C*(*B*(*s*)) in equation (8). Therefore, the extinction probability of all progenies (of all generations, i.e. the line of an ancestor) of an infected individual is obtained by solving the equation *x* = *C*(*B*(*x*)), which is same as the quadratic equation ![Formula][17]</img> 

Here we make a strong use of assumption 6, which assumes that *p**s* varies so slowly that it remains almost constant for a line of an ancestor. This assumption can be supported by realizing that the line of an ancestor is likely to be in the same region or environment in the population. Equation (11) has two roots, 1 and 1*/αλp**s**p**c*. Therefore, with the help of theorem 6, the extinction probability for all progenies or line of an infected individual is given by ![Formula][18]</img> 

We now intend to calculate the number of individuals that will remain infected after time *t* = *T* if we start observing a fixed *N**I* number of infected individuals from time *t* = 0.

Theorem 7.
*Let the incubation period T**I* *of a contagion has a distribution function* ℙ(*T**I* ≤ *t*) = *F* (*t*). *If we start observing N**I* *number of infected individuals, each infected from time t* = 0, *then the generating function for the number of individuals who will still remain infected after time t* = *T is given by* ![Formula][19]</img> 

*Proof*. An infected individual remains infectious after time *T*, only when *T**I* *> T*. Since the incubation time *T**I* has distribution function *F* (*t*), ![Formula][20]</img> 

Now, we can think of *N**I* number of Bernoulli trials with success probability (1 − *F* (*T*)). The total number of successes of these Bernoulli trials results the number of individuals who will still remain infectious after time *T*. In other words, this number is given by the random variable ![Graphic][21]</img>, where each *X**i* is a Bernoulli random variable with ℙ(*X**i* = 1) = 1 − *F* (*T*), and ℙ(*X**i* = 0) = *F* (*T*). The generating function for each Bernoulli trial *X**i* is [*F* (*T*) + (1 − *F* (*T*))*s*]. Therefore, using theorem 5 we obtain the generating function of ![Graphic][22]</img> as ![Formula][23]</img> 

□

Corollary 8.
*Let the incubation period of a contagion be exponentially distributed with mean λ. If we start observing N**I* *number of infected individuals from time t* = 0, *then the generating function for the number of individuals who will still remain infected after time t* = *T is given by* ![Formula][24]</img> 

*Proof*. As the exponential distribution is memoryless, we need not worry about the instants when each individual got infected. Therefore, we can assume that all of the *N**I* individuals got the infection, simultaneously at *t* = 0, i.e. the time we start our observation. Since the incubation period *T**I* is exponentially distributed with mean *λ*, its distribution function is given by ![Formula][25]</img> 

Therefore, using theorem 7, we obtain the desired result. □

Figure 2 shows the probabilities of the number of remaining infected individuals (among those who are being observed) after the time duration of observation, by plotting the Taylor series coefficients of equation (14).

![FIG. 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/17/2020.10.22.20216481/F2.medium.gif)

[FIG. 2.](http://medrxiv.org/content/early/2021/06/17/2020.10.22.20216481/F2)

FIG. 2. 
(Color online) Probability distribution for the number of infected individuals remaining after the withdrawal of quarantine measure. The horizontal axis denotes the number of infected, remaining after *T* = 40 days of quarantine. The average incubation period *λ*, of the contagion, is taken to be 20 days. The number of infected *N**I* at time *t* = 0 is taken to be 100, 200, 300, 400, and 500, for the red circle, blue square, green diamond, orange inverted triangle, and black triangle plots, respectively.

We now prove a theorem which will be used for calculating the extinction probability of the infectious disease by quarantine measures.

Theorem 9.
*Let N number of iid Bernoulli trials, each having success probability p, are performed, where N is a positive integral valued random variable with generating function Q*(*s*). *Then the probability of obtaining all success is Q*(*p*).

*Proof*. Let ℙ(*N* = *k*) = *q**k*, for *k* = 0, 1, 2, Then by the definition of generating function ![Formula][26]</img> 

By conditioning on the number of trials *N*, we obtain the probability of all success in random number of Bernoulli trials as ![Formula][27]</img> 

□

Theorems 7 and 9 together can be used to estimate the success probability of any quarantine measure in mitigating infection. Here we assume an ideal quarantine measure where no contact between an infected person and a susceptible person occurs. Let at some point of time there be *N**I* number of infected individuals present in the population. Let an ideal quarantine measure is imposed at that time for next *T* days, and after *T* days the quarantine is withdrawn. During the quarantine period *T*, some infected individuals can be recovered and others will remain infected. The generating function for the number of individuals (*N**R*), who will still remain infectious after *T* days of quarantine, is given by *H*(*s*) in equation (14). After the quarantine is withdrawn, the remaining infected individuals will again start interacting with the susceptible individuals in the population and create new infections. The probability that the number of all progenies of a single infected individual will be zero, after sufficient time, is given by *p**e* in equation (12). Now, the event that the total number of progenies of all the remaining *N**R* number of infected individuals is zero after sufficient time, is equivalent to the event that *N**R* number of iid Bernoulli trials, each having success probability *p**e*, results all successes. Since *N**R* has the generating function same as in equation (14), using theorem 9, we obtain the probability that the infection will be mitigated after sufficient time from the withdrawal of quarantine is ![Formula][28]</img> 

As *T* → ∞ in equation (15), the extinction probability *H*(*p**e*) → 1. This implies that as the duration of quarantine increases, the probability of infection mitigation becomes higher. Also, when *p**e* = 1, i.e. *αλp**s**p**c* ≤ 1 (by equation (12)), and hence herd immunity is achieved, *H*(*p**e*) = 1, i.e. the infection will be mitigated with probability one. Figure 3 shows the probability of zero infection as a function of the duration of quarantine, before achieving herd immunity.

![FIG. 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/17/2020.10.22.20216481/F3.medium.gif)

[FIG. 3.](http://medrxiv.org/content/early/2021/06/17/2020.10.22.20216481/F3)

FIG. 3. 
(Color online) Extinction probabilities of the infection as a function of the duration of quarantine. Horizontal axis represents the duration of quarantine, and vertical axis represents the corresponding extinction probabilities. Black solid, red dashed, blue dotted, and green dot-dashed curves are for *N**I* = 10, 100, 500, and 1000 respectively. *α* = 10, *λ* = 20, *p**s* = 0.5, *p**c* = 0.05 are taken for all plots.

## V. EARLY IMPOSITION OF LOCKDOWN CAN SOMETIMES BE LESS EFFECTIVE THAN A DELAYED IMPOSITION

We now study our infection propagation model in a toy population, and see some counter-intuitive results. Let the size of the total population be *N* = 10000, among which *N**I* number of individuals are infected at time *t* = 0. Therefore, the number of susceptibles at time *t* = 0 is *N* − *N**I*. Hence, the fraction of susceptibles in the population is *p**s* = (*N* − *N**I*)*/N*. Let the transmission probability *p**c* be 0.05. Let us consider that there is no relapse of infection, i.e., if someone has ever been infected, they cannot be susceptible anymore. Consequently, the fraction of susceptibles cannot be higher than (*N* −*N**I*)*/N* for any *t* ≥ 0. If we now calculate the extinction probability as in equation (15), as a function of lockdown/quarantine duration, we obtain Figure 4. We see in Figure 4 that when *N**I* = 8500, the infection dies out with higher probability than the situation when *N**I* = 5000 or 6000, upon the imposition of quarantine measures of same duration. It is to be noted that none of these three cases have achieved herd immunity threshold, as *αλp**s**p**c* *>* 1 for all three cases. The reason for this is, as the size of the infected (or immune, after recovery from infection) population increases, the fraction of susceptibles *p**s* decreases, and so is the probability of infectious contacts with susceptibles. Hence, it is a tug of war between the number of infected *N**I*, and the fraction of remaining susceptibles *p**s* = (*N* − *N**I*)*/N*, in the population, to maximize the probability of total extinction, *H*(*p**e*) in equation (15).

![FIG. 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/17/2020.10.22.20216481/F4.medium.gif)

[FIG. 4.](http://medrxiv.org/content/early/2021/06/17/2020.10.22.20216481/F4)

FIG. 4. 
(Color online) Extinction probabilities of the infection as a function of the duration of quarantine, in a toy population of size *N* = 10000. Horizontal axis represents the duration of quarantine, and vertical axis represents the corresponding extinction probabilities. Black solid, red dashed, and blue dot-dashed curves are for *N**I* = 5000, 6000, and 8500, respectively. *α* = 10, *λ* = 20, *p**c* = 0.05 are taken for all plots.

This, therefore, implies that imposing an early quarantine measure may not always be most effective, unless the quarantine is maintained sufficiently long. Otherwise, it is a better strategy to let the infection spread into the population up to some time to ensure the number of remaining susceptibles is relatively low, and then impose the quarantine measure for the same duration. The later strategy will then have a higher probability of success in mitigating infection from the population.

It is to be noted here that after the withdrawal of lockdown the infection still keeps on propagating in the population to some extent, even before eradication. Therefore, it needs a detailed calculation to comment on the total number of infections occurring before it is eradicated from the population (known as the final size of infection, in the literature) [17, 18]. However, it is also to be understood that when total *N**I* number of people become infected, up to some point of time from the initiation of the infection in the population, not all of them remains infected at that time instant since some of them have already recovered and have become immune to the infection. So, the extinction probabilities in figure 4 is prone to underestimation. Therefore, to decide on the best strategy for mitigating an infection from a population, using quarantine measures, it needs an extensive analysis, to save both economy and healthcare system in an optimized way [40].

## VI. THE DISTRIBUTION OF GENERATION TIME

The generation time for an infectious disease is defined as the time interval between the onset of infection in an individual to the first generation of another new infected individual by the primary infected individual [24]. We now calculate the distribution of the generation time for our model of infectious disease. To derive the distribution function of the generation time we need to use the generating function for the tail of a discrete random variable. More specifically, let *N* be a non-negative integral valued random variable having distribution ℙ(*N* = *k*) = *p**k* (possibly improper distribution), for *k* ∈ {0, 1, 2, …}. Let *M* be defined as the tail of *N* having distribution ℙ(*M* = *k*) = ℙ(*N > k*) = *p**k*+1 + *p**k*+2 + … = *q**k*, for *k* ∈ {0, 1, 2, …}. We now state a theorem which relates the generating functions of *N* and *M*.

Theorem 10.
*If the generating function of N is P* (*s*), *then the generating function for its tail M is given by* ![Formula][29]</img> 

*Proof*. The proof goes exactly in the same line as in [37], where the generating function for the tail distribution is derived for a proper distribution. By definition, ![Formula][30]</img>  where *q**k* = *p**k*+1 +*p**k*+2 +…, for *k* ∈ {0, 1, 2, …}. Therefore, the coefficient of *s**n* in (1−*s*)*Q*(*s*) equals *q**n* − *q**n*−1 = −*p**n* when *n* ≥ 1, and equals *q* = *p*1 + *p*2 + *p*3 + … = *P* (1) − *p* when *n* = 0. Therefore, ![Formula][31]</img>  and hence the desired result. □

We now derive a general theorem to calculate the distribution of generation time, with the help of theorem 10.

Theorem 11
(Generation time distribution). *Let G*(*s, t*) *be the generating function for the number of first generation progenies, created by an ancestor of an age-dependent branching process up to time t, with steady state generating function G*(*s*), *i*.*e*. lim*t*→∞ *G*(*s, t*) = *G*(*s*). *Let the number of progenies created by the ancestor throughout its lifetime be greater than or equal to m. Then the time τ**m*, *required to create the first m number of progenies by the ancestor, has the distribution* ![Formula][32]</img>  *and hence, the generation time τ has the distribution* ![Formula][33]</img> 

*Proof*. The event that *τ**m* ≤ *t* is same as the event that the number of progenies *N* (*t*), created up to time *t*, is greater than or equal to *m*. This implies ![Formula][34]</img> 

Identifying ℙ(*N* (*t*) *> m* − 1) as the tail distribution of the branching process under consideration, at time *t*, using theorem 10 we conclude that its generating function has the form ![Formula][35]</img>  and ℙ(*N* (*t*) *> m* − 1) is the coefficient of *s**m*−1 of the Taylor series expansion of *Q*(*s, t*) with respect to *s*. Similarly, ℙ(*N* (∞) *> m* − 1) is the coefficient of *s**m*−1 in the Taylor series expansion of *Q*(*s*), where *Q*(*s*) = lim*t*→∞ *Q*(*s, t*). Hence, substituting these values of ℙ(*N* (*t*) *> m* − 1) and ℙ(*N* (∞) *> m* − 1) in equation (19) we obtain equation (17). Finally, we obtain equation (18) from equation (17), for *m* = 1. □

Having obtained the general procedure to derive the distribution function of generation time, we now apply theorem 11 to derive the distribution of generation time for our model.

In our model we have *G*(*s, t*) = *C*(*B*(*s*), *t*). Therefore, using equations (7) and (18), we obtain the distribution function for generation time as ![Formula][36]</img> 

Therefore, the density function for the generation time is given by ![Formula][37]</img> 

Figure 5 plots the probability density function *g*(*t*), for the generation time, in case of two different incubation periods. As seen from the plot that the mode of the distribution does not change much with the incubation period. Dependence on other parameters can also be checked, and it can be shown that decreasing *α, p**s*, and *p**c* will shift the mode towards right, as it is evident that decreasing these parameters will delay the generation of a new infection.

![FIG. 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/17/2020.10.22.20216481/F5.medium.gif)

[FIG. 5.](http://medrxiv.org/content/early/2021/06/17/2020.10.22.20216481/F5)

FIG. 5. 
(Color online) Probability density function for the generation time. Black solid curve is for *λ* = 10; and red dashed curve is for *λ* = 20. Other parameters for both the curves are taken to be *α* = 10, *p**s* = 0.5, *p**c* = 0.05.

## VII. CONCLUSION

Modeling the propagation of an infectious disease, in a population, as an age-dependent branching process, we obtain the functional dependence of herd immunity on various epidemiological parameters. We show that herd immunity is difficult to achieve when the incubation period is high. We show that the mass use of PPE can help to achieve herd immunity, and to eradicate the infection faster from the population. We provide a method to estimate the success probabilities of various quarantine measures, and show by considering a hypothetical situation that an early imposition of lockdown may not always be a better strategy against a delayed imposition of lockdown of the same duration. We derive a general theorem to calculate the distribution of generation time, which can be used in the study of many systems, modeled as age-dependent branching process. Using this theorem we calculate the generation time distribution for our model of infection propagation, and obtain a two parameter (effectively) distribution which can be used in epidemiological studies, as a logical replacement of hitherto used heuristic distributions, such as gamma, lognormal, Weibull, Gaussian etc. [26, 27, 31, 32, 41].

## Data Availability

The manuscript uses no data files.

## Footnotes

*   The abstract and introduction are rewritten in a better way and figures are made a little bigger in size for better visibility purpose. One trivial plot is removed as it was not adding much extra information to the discussion. All other technical discussions are kept same as the previous version.

*   Received October 22, 2020.
*   Revision received June 16, 2021.
*   Accepted June 17, 2021.


*   © 2021, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/)

## References

1.  [1]. R. Bartoszynski, Proceedings of the Fifth Berkeley symposium,, 259 (1967).
    
    
2.  [2]. R. Bartoszyński, Mathematical Biosciences 24, 355 (1975).
    
    
3.  [3]. S. J. Merrill, Journal of computational and applied mathematics 184, 242 (2005).
    
    
4.  [4]. T. Britton, Mathematical biosciences 225, 24 (2010).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.mbs.2010.01.006&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20102724&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

5.  [5]. T. Britton and  D. Lindenstrand, Mathematical biosciences 222, 109 (2009).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.mbs.2009.10.001&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19837097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000273101900005&link_type=ISI) 

6.  [6]. J. A. Backer,  D. Klinkenberg, and  J. Wallinga, Eurosurveillance 25, 2000062 (2020).
    
    
7.  [7]. P. E. Sartwell et al., American Journal of Epidemiology 83, 204 (1966).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/oxfordjournals.aje.a120576&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=5930773&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A19667894400003&link_type=ISI) 

8.  [8]. C. McAloon,  Á. Collins,  K. Hunt,  A. Barber,  A. W. Byrne,  F. Butler,  M. Casey,  J. Griffin,  E. Lane,  D. McEvoy,  P. Wall,  M. Green,  L. O’Grady, and  S. J. More, BMJ Open 10 (2020), doi:10.1136/bmjopen-2020-039652, [https://bmjopen.bmj.com/content/10/8/e039652.full.pdf](https://bmjopen.bmj.com/content/10/8/e039652.full.pdf).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMjoiMTAvOC9lMDM5NjUyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDYvMTcvMjAyMC4xMC4yMi4yMDIxNjQ4MS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

9.  [9]. J. Lessler,  N. G. Reich,  R. Brookmeyer,  T. M. Perl,  K. E. Nelson, and  D. A. Cummings, The Lancet infectious diseases 9, 291 (2009).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(09)70069-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19393959&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000265805200017&link_type=ISI) 

10. [10]. M. J. Lydeamore,  P. T. Campbell,  D. J. Price,  Y. Wu,  A. J. Marcato,  W. Cuningham,  J. R. Carapetis,  R. M. Andrews,  M. I. McDonald,  J. McVernon, et al., PLOS Computational Biology 16, e1007838 (2020).
    
    
11. [11]. I. Czumbel, (2016).
    
    
12. [12]. A. L. Lloyd, Theoretical population biology 60, 59 (2001).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/tpbi.2001.1525&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11589638&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000171555800004&link_type=ISI) 

13. [13]. B. F. Nielsen,  L. Simonsen, and  K. Sneppen, Physical Review Letters 126, 118301 (2021).
    
    
14. [14]. K. Pandey,  S. Basu,  R. Pinto, and  S. P. Pandey, Bulletin of the American Physical Society (2020).
    
    
15. [15]. M. G. Hudgens,  I. M. Longini Jr.,  M. E. Halloran,  K. Choopanya,  S. Vanichseni,  D. Kitayaporn,  T. D. Mastro, and  P. A. Mock, Journal of the Royal Statistical Society: Series C (Applied Statistics) 50, 1 (2001).
    
    
16. [16]. M. Abkarian and  H. Stone, Physical Review Fluids 5, 102301 (2020).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1103/PhysRevFluids.5.102301&link_type=DOI) 

17. [17]. T. Sellke, Journal of Applied Probability, 390 (1983).
    
    
18. [18]. F. Ball, Advances in Applied Probability, 289 (1986).
    
    
19. [19]. F. Ball, Advances in applied probability, 1 (1985).
    
    
20. [20]. P. Picard and  C. Lefevre, Advances in Applied Probability 22, 269 (1990).
    
    
21. [21]. C. L. Addy,  I. M. Longini Jr, and  M. Haber, Biometrics, 961 (1991).
    
    
22. [22]. W. O. Kermack and  A. G. McKendrick, Proceedings of the royal society of london. Series A, Containing papers of a mathematical and physical character 115, 700 (1927).
    
    
23. [23]. Y. Cai,  Y. Kang,  M. Banerjee, and  W. Wang, Journal of Differential Equations 259, 7463 (2015).
    
    
24. [24]. H. Nishiura, Mathematical Biosciences & Engineering 7, 851 (2010).
    
    
25. [25]. Y. Deng,  C. You,  Y. Liu,  J. Qin, and  X.-H. Zhou, Biometrics (2020).
    
    
26. [26]. X. He,  E. H. Lau,  P. Wu,  X. Deng,  J. Wang,  X. Hao,  Y. C. Lau,  J. Y. Wong,  Y. Guan,  X. Tan, et al., Nature medicine 26, 672 (2020).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/M20-3012&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

27. [27]. N. G. Davies,  P. Klepac,  Y. Liu,  K. Prem,  M. Jit,  R. M. Eggo, C. C.-. working group, et al., Nature Medicine 26, 1205 (2020).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-0962-9&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32546824&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

28. [28]. K. Dietz, Statistical methods in medical research 2, 23 (1993).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/096228029300200103&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8261248&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

29. [29]. G. N. Milligan and  A. D. Barrett,.
    
    
30. [30]. A. Cori,  N. M. Ferguson,  C. Fraser, and  S. Cauchemez, American Journal of Epidemiology 178, 1505 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwt133&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24043437&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

31. [31]. Z. Du,  X. Xu,  Y. Wu,  L. Wang,  B. J. Cowling, and  L. A. Meyers, Emerging infectious diseases 26, 1341 (2020).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F17%2F2020.10.22.20216481.atom) 

32. [32]. P. Van Mieghem and  Q. Liu, Physical Review E 100, 022317 (2019).
    
    
33. [33]. D. G. Kendall, Biometrika 35, 316 (1948).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biomet/35.3-4.316&link_type=DOI) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1948UZ95400013&link_type=ISI) 

34. [34]. D. G. Hoel and  K. S. Crump, Biometrics, 125 (1974).
    
    
35. [35]. C. Kelly and  O. Rahn, Journal of Bacteriology 23, 147 (1932).
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czoyOiJqYiI7czo1OiJyZXNpZCI7czo4OiIyMy8yLzE0NyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA2LzE3LzIwMjAuMTAuMjIuMjAyMTY0ODEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

36. [36]. R. Cowan, Biometrics, 681 (1985).
    
    
37. [37]. W. Feller, An introduction to Probability theory and its applications, Vol. 1 (John Wiley and Sons, Inc., New York, 1950).
    
    
38. [38]. T. E. Harris, The theory of branching process (Rand Corporation, 1964).
    
    
39. [39]. W. Feller, An introduction to probability theory and its applications, Vol. 2 (John Wiley and Sons, Inc., New York, 1965).
    
    
40. [40]. J. Samuel and  S. Sinha, Physical Review E 103, L010301 (2021).
    
    
41. [41]. T. Ganyani,  C. Kremer,  D. Chen,  A. Torneri,  C. Faes,  J. Wallinga, and  N. Hens, Eurosurveil-lance 25, 2000257 (2020).

 [1]: /embed/graphic-1.gif
 [2]: /embed/graphic-2.gif
 [3]: /embed/graphic-3.gif
 [4]: /embed/graphic-4.gif
 [5]: /embed/graphic-5.gif
 [6]: /embed/graphic-6.gif
 [7]: /embed/graphic-7.gif
 [8]: /embed/graphic-8.gif
 [9]: /embed/graphic-9.gif
 [10]: /embed/inline-graphic-1.gif
 [11]: /embed/graphic-10.gif
 [12]: /embed/graphic-11.gif
 [13]: /embed/graphic-12.gif
 [14]: /embed/graphic-13.gif
 [15]: /embed/graphic-15.gif
 [16]: /embed/graphic-16.gif
 [17]: /embed/graphic-17.gif
 [18]: /embed/graphic-18.gif
 [19]: /embed/graphic-19.gif
 [20]: /embed/graphic-20.gif
 [21]: /embed/inline-graphic-2.gif
 [22]: /embed/inline-graphic-3.gif
 [23]: /embed/graphic-21.gif
 [24]: /embed/graphic-22.gif
 [25]: /embed/graphic-23.gif
 [26]: /embed/graphic-25.gif
 [27]: /embed/graphic-26.gif
 [28]: /embed/graphic-27.gif
 [29]: /embed/graphic-30.gif
 [30]: /embed/graphic-31.gif
 [31]: /embed/graphic-32.gif
 [32]: /embed/graphic-33.gif
 [33]: /embed/graphic-34.gif
 [34]: /embed/graphic-35.gif
 [35]: /embed/graphic-36.gif
 [36]: /embed/graphic-37.gif
 [37]: /embed/graphic-38.gif