ABSTRACT
We apply the simple logistic model to the four waves of COVID-19 taking place in South Africa over the period 2020 January 1 through 2022 January 11. We show that this model provides an excellent fit to the time history of three of the four waves. We then derive a theoretical correlation between the growth rate of each wave and its duration, and demonstrate that it is well obeyed by the South Africa data.
We then turn to the data for the United States. As shown by Roberts (2020a, 2020b), the basic logistic model provides only a marginal fit to the early data. Here we break the data into six “waves,” and treat each one separately. For four of the six the logistic model is useful, and we present full results. We then ask if these data provide a way to predict the length of the ongoing Omicron wave in the US (commonly called “wave 4,” but the sixth wave as we have broken the data up). Comparison of these data to those from South Arica, and internal comparison of the US data, suggest that this last wave will die out by about 2022-January-20.
1. Introduction
Since the spring of 2020 a pandemic of infection of a novel coronavirus (SARS-CoV-2) has overspread the world. Epidemiologists have struggled to describe adequately this pandemic and to predict its future course. This is a complex undertaking, involving the mathematics of epidemiology and a huge amount of uncertain data that serve as input to the models. In this paper we examine the ability of a simple epidemiological model, the logistic model, to describe the course of the pandemic in the Republic of South Africa and the United States. After showing that it is adequate to the task for South Africa, we derive a theoretical correlation between the growth rate of each wave of the pandemic and its duration, and show that it is well obeyed for the South Africa data.
2. The Logistic Model
2.1. Derivation
The exposition in this section is from Roberts (2020).
A simple model for the evolution of an pandemic is based on the logistic differential equation. This describes an pandemic that begins with a small number f0 of infected individuals, and subsequently spreads through a population. The motivation for this model is as follows. If the population of infected individuals as a function of time is f(t), simple exponential growth with growth rate r is determined by the differential equation with the growing exponential solution where f0 = f(0). This is what happens with an infinite pool of subjects. However, for a finite pool of subjects, as the population of infected individuals grows the number of subjects available to be infected gets smaller. This is taken into account by modifying the exponential differential equation to become where K is the total available pool of individuals. The solution of this equation2 is which satisfies the required limits f(0) = f0 and f(∞) = K. The time course described by Equation 2 is the familiar “S curve” used to describe bacterial growth and other phenomena (see Figure 3).
An analysis of the total number of cases as a function of time f(t) is just one way to compare model and data. Instead we can examine the number of new cases per day as a function of time; this is the time derivative of f(t).3 This is easily found by substituting Equation 2 into Equation 1,
In either case, for the logistic model the three parameters to be adjusted are f0, K, and r.
2.2. COVID data for the United States and the Republic of South Africa
2.3. Fit to the COVID-19 Pandemic in the Republic of South Africa
When applied to the ongoing COVID-19 pandemic in the United States the logistic model does only a fair job of accounting for the actual history of the total number of infected individuals as a function of time in the first wave of COVID-19, which took place in the first half of 2020 (see Roberts 2020a). It fails even more spectacularly for later waves (Roberts 2020b).
2.4. RSA Wave 1
2.5. RSA Wave 2
2.6. RSA Wave 4
3. Correlation of the Growth Rate and Duration for the COVID-19 Waves in the Republic of South Africa
It is natural that the growth rate and the duration of the waves of a pandemic should be correlated, higher growth rates leading to shorter durations. It can be shown (see the appendix) that for a logistic model the daily cases distribution has a full-with-half-maximum given by
Note that this result is independent of f0 and K. The data and the prediction of Eq.4 are compared in Figure 6, where we see that the agreement is excellent.
4. Six Waves in the United States
4.1. USA Wave 1
4.2. USA Wave 3
4.3. USA Wave 5
5. Predictions for United States Wave 6
Fig.. 12 Shows the correlation between growth and duration of the various waves in the United States. Because the various waves tend to overlap, for all but wave 6 we can find only lower limits to the widths. The curve is Eq. 4, identical to the one in Fig. 6.
In Figs. 13 & 14 we show the predictions for wave 6 that follow from the L1 and LSQ solutions. The latter provides a better fit to the data (see Figs. 10 & 11), so we prefer the predictions made from those solutions. Thus we expect the Omicron wave in the US to be dramatically reduced within about ten days from now, roughly 22 January 2022. Needless to say, this would be very welcome.
6. Conclusions
In this paper we have used the logistic model for epidemics to describe the COVID-19 outbreaks in the Republic of South Africa and in the United States. We find a universal analytic relationship between the growth rates and the durations of each wave, and this is closely followed by the (very clean) data from South Africa. When applied to the United States, the data are messier, but a tentative prediction is possible for the expected duration of the current Omicron wave in the US – it should be mostly over by about ten days from now (roughly 22-January-2022).
7. Acknowledgements
We thank Brian Boyle, Mary Roberts, and Bob Sauer for their helpful comments and suggestions.
9. Appendix
Derivation of Equation 4
The differential distribution for the logistic function is4 and is derivative is
This is zero at and df /dt at this maximum is
Setting df /dt equal to half of this value and solving for the locations yields a complicated expression that reduces to
Note that neither f0 nor K enters this expression.