Summary
All the newly reported COVID-19 cases of April in the U.S. have not acquired the virus in the same month. We estimate that there was an average of 29,000/day COVID-19 cases in the U.S. transmitted from infected to susceptible during April 1–24, 2020 after adjusting for under-reported and under-diagnosed. We have provided model-base d predictions of COVID-19 for the low and high range of transmission rates and with varying degrees of preventive measures including the lockdowns. We predict that even if 10% of the susceptible and 20 % of the infected who were not identified as of April 23, 2020, do not adhere to proper care or do not obey lockdown, then by the end of May and by end of June 50,000 and 55,000 new cases will emerge, respectively. These values for the months of May and June with worse adherence rates of 50% by susceptible and infected (but not identified) will be 251,000 and 511,000, respectively. Continued and serious lockdown measures could bring this average daily new cases to a further low at 4,300/day to 8,000/day in May.
Introduction
As the cumulative number of novel coronavirus cases in the U.S. is still very high at more than 895,766 as of April 24, 2020, and more than 709,665 of these cases were reported during April 1–24, 2020 [1], we have asked a couple of questions: namely, did the lockdown implemented by various states in the U.S. by the end of March 2020 have any impact so far, especially in preventing the new COVID-19 cases in the recent weeks in the U.S.? Will there be any degree of damage in further controlling the spread of COVID-19 in the U.S. by relaxing the lockdown measures to keep-up the economy? We found that relaxing ongoing efforts on controlling the pandemic in the U.S. could have striking impact on the number of new cases for the months of May and June 2020. A 50% relaxed approach in the lockdown or poor preventive measures in the U.S. could lead to additional new COVID cases up to 200,000 to 370,000. We provide several other scenarios too which are in Table 1 to Table 6.
It is not always easy to conduct experiments to get the parameters of transmissions during the lockdowns [2]. One can indirectly obtain the number of people who were infected in a geographical area within a specified time and obtain population-level incidence rates based on the newly reported cases. But we know that often not all cases are reported or diagnosed or both [3]. That means a true incidence rate in the population is not easy to obtain unless we adjust the reported cases with the number not reported and retrospectively adjust the past reported data [3].
Methods, Models and Data
We want to understand through simple transmission dynamics principles what fraction of these cases were possibly acquired during April. We want to understand these numbers by adjusting for under-reporting including the under-diagnosis during the period. Further, we develop age-structured population models of COVId-19 spread and, combining the outcomes with wavelet analysis, we have provided impact of not adhering to lockdown and other preventive measures combined in the U.S. for the period May-June, 2020. Our methods, models and the data are described in the next paragraphs.
Let fn be the cumulative number of COVID-19 cases reported at the end of the nth day and f0 = 0 be the number of COVID-19 cases at the beginning. Let f1 be the number of cases at the end of 1st day. Let Ii = fi+1 − fi for i = 0,1,2,…,n − 1 be the new cases on the each day. Let ci ∊ (0,1) be the fraction of Ii those are infected on the ith day such that where d is the average incubation period. That is, is satisfied. Through the equation (A.1), we have distributed newly reported cases on the ith day into ith day and d − 1 number of days prior to ith day. We assumed that ci is constant in (1), so that, Ii is uniformly distributed during [ith day, d days prior]. Instead of uniform distribution of Ii as considered in (1), one can consider a skewed or non-uniform or some random distribution if there is strong evidence of the same. Under such non-uniform situations as well we will have , and . Once each Ii value is partitioned into d − 1 days prior to the ith day, then the number of transmuted cases in each day is computed using the below set of equations:
These continue until the last reported cases are available for a given population. For the computations in this article we considered uniform ci values over d days by assuming ci = c within the period of study. Then we assumed that the daily new cases April 1 to April 13 were uniformly distributed into 1/14th, 2/14th, 3/14th and so on until 13/14th fractions of the reported and we assumed all the cases from April 14 were the result of those transmitted during the month of April. A similar backward adjustment of reported cases was theoretically demonstrated in our earlier article with advanced network structures [5].
Let Y be the total number of newly infected cases who were not traced into the system. Let Y = Y1 + Y2+Y3, where Yk is the size of infected in the ag-group k for k =0–17, 18–64, 65+, respectively. Let X be the total number of susceptible individuals and , where Xk is the size of susceptible in the ag-group k. Let L(Xk) = ØkXk be the fraction Øk of susceptible Xk who do not practice lockdown or other preventive measures in the age-group k, such that where is the total fraction of susceptibles in the population who do not adhere to any preventive measures including the lockdown. Similarly, we will assume that where is the total fraction of infected (who are not traced) in the population who do not adhere to any preventive measures including the lockdown. The differential equations describing the transmission dynamics from infected to susceptible are given below:
Here βkl is the average transmission rate from an infected in the age group k to a susceptible in the age group l and γk is the recovery rate for the age-group k. Similar age structure population models for the COVID-19 were developed by us in predicting the U.S. population aged 65+ with underlying medical conditions [7]. Once we obtain predicted values of new COVID-19 cases we will use Meyer’s wavelets to demonstrate the difference of magnitudes between observed and predicted cases and wavelets for the difference between various degrees of not adhering to lockdown or other preventive measures. Meyer’s wavelets are a natural improvement over the Fourier series and transforms. A Fourier series f(x) is written in terms of coefficients a0, an and bn for n ∈ Ν, the set of natural numbers, and with trigonometric functions as below where cos nx + i sin nx = einx. Further, if f(x) is Lebesgue integrable on an interval [0, a] with a period a, then we write where
Once we have Fourier series, one can obtain Fourier transformations, fT, through (9), written as where z is a complex numbers and f(s) is a Fourier series function (8).
The Meyer wavelets ψ(ω) also use trigonometric functions which are infinitely differentiable on a certain domain. Several useful resources on wavelets are available in [8–14]. We have used such wavelets in COVID-19 modeling [3,5,7,15]. These wavelets when plotted helps us to distinguish the degree of difference in the magnitude between various levels of relaxation of lockdown and other preventive measures during COVID-19 lockdowns. The Meyer wavelets U(ω) together with accompanying function ν are given below.
What is interesting about this new wavelet technique is the following. The traditional trigonometric functions sine and cosine do not localize well. Once we know a trigonometric function on a small interval then it is uniquely determined on the entire real line. But wavelets localize nicely in both the space and phase variables. This makes them particularly useful for image compression, signal processing, and data analysis. Wavelets are the key to our new approach to understanding the corona pandemic.
Here ν(x) = 0 for x < 0, and
General population age-structure models are well-known in epidemiology, population biology, ecology, see for example [16–25]. However, the novelty of our work lies in linking the wavelets in conceptualizing the differences between the magnitudes of L(Y1), [L(Y2), L(Y3) with varying Ø and ψ values. Such exposure of Meyer’s wavelets or other wavelets were done earlier. For a more general application potential and beauty of applicability of wavelets one can refer to [31].
Data
As of April 23, 2020 the age distribution of the COVID-19 cases in the U.S. among those whose age can be ascertained are as follows: 2,791 (aged <18), 475,659 (aged 18–64), 149,243 (aged 65+). The number of infected during April 1–22, 2020 who were not reported are estimated at 224,785 by our previous calculation of under-reporting in the U.S. [6]. These new cases of April we have divided into the age groups as per the CDC data above. This would provide us Y1 = 999, Y2 = 170,339, and Y3 = 53,446. Demographic data from census.gov [26] and adjusting it for annual growth that was observed over the past decadal growth gives us susceptible populations X1, X2, X3. We considered the lack of adherence from 10% to 70% for both susceptible and infected. We could provide combinations of these and provide more predictions, but we prepared for these finite number of combinations. One can try all other permutations and combinations. Distributing Ø and ψ within each age-groups, we have considered the following ratio [10%: 60%: 30%]. The β values for low range of predictions considered are
These low range of predictions are calibrated just as in our previous medium range of values in the study [7]. Originally, these transmission rates were calibrated earlier for the U.S. data on reported COVID-19 cases during March 1–14 and March 15-April 6, 2020 which were calibrated from our paper [3]. The high range transmission parameters considered are,
Results
Out of the 709,665 COVID-19 cases newly reported during April 1–24, 2020, we have estimated that 560,524 cases were the result of transmissions within this period. To do this we have assumed a few things, the first being the average incubation period as per CDC as 2–14 days [4]. Adjusting the reported cases with the not-reported cases computed as of April 6, 2020, in the ratio of 1:1.5 as noted in [6] the true acquired estimated numbers during April 1–19 stands at 674,354. According to the CDC data released on April 23, 23% of the cases are in the age group 65+ [1]. This implies that 23% of 551,630, that is, 155,101 number of new cases who are aged 65+ occurred during the period. This number is consistent with another modeling study on 65+ [7] which predicts the number of new cases during April – June 2020.
In-addition to backward computation of newly transmitted cases in the month o April 2020 explained above and also in the Appendix, we have also developed dynamics models to understand the spread of COVID-19 for the immediate months in the U.S. We further used wavelets to demonstrate the difference in the spread that could occur due to varying degrees of seriousness of the lockdown implementation by susceptible and infected (who are not aware of their status). We assumed all the detected cases are been either self-quarantined or adhering to the preventive measures.
Serious preventive measures and continued lockdown could bring down the new cases in May at In-addition to the main results of our study mentioned in the introduction. We also note that relaxed approach towards lockdown and other preventive measures might be a burden on the hospital system.
Concluding Remarks
In summary, we see that on average about 29,000/day new cases occurred that were transmitted during April 1–23, 2020. This is very high. This possibly indicates that the lockdowns and prevention measures have not been adhered to at the fullest. A complete lockdown could bring new infections to a much lower level in the next few weeks. Average daily cases with strict measures can be brought down to 4,300/day to 8,000/day in May. There is reason to believe that under-reporting is skewing our understanding of coronavirus infection and spread. It can be shown that lockdown measures are among the most effective for controlling virus development and spread.
As an ending remark, we want to emphasize that one could build more complex models for the transmission dynamics of COVID-19 in the US or for any population if there is strong evidence of transmission level parameters at the sub-population level. Making models more complex unnecessarily without any supportive evidence would not yield any better results and sometimes such exercises could bring misleading results [33]. There are two levels of complexities one can introduce, one at the level of the parameters and another at the subpopulation selection level but that needs very careful assessment of the situation, and modeling should not stand as a mere mathematical exercise.
Premature lifting of the lockdown could lead to disaster.
Data Availability
Yes
Authors contributions
Both the authors contributed in writing. ASRS Rao designed the study, developed the methods, models, collected data, performed analysis, computing, creating Figures and Tables, wrote the first draft. SG Krantz participated in the design, interpretation of results, and contributed in editing the draft, approving the methods. Both the authors approved the final manuscript.
Conflicts
None.
Funding
None to report to this study.