Abstract
Mandated social distancing has been globally applied to limit the spread of corona virus disease 2019 (COVID-19) from highly pathogenic severe acute respiratory syndrome (SARS)-associated coronavirus 2 (SARS-CoV-2). The benefit of this community-based intervention in limiting COVID-19 has not been proven nor quantified. We examined the effect of timing of mandated social distancing on the rate of COVID-19 in 119 geographic regions derived from 41 states within United States and 78 countries. We found that highest number of new COVID-19 cases per day per million persons was significantly associated with total number of COVID-19 cases per million persons on the day before mandated social distancing (β=0.66, p<0.0001). Our findings suggest that the initiation of mandated social distancing for each doubling in number of existing COVID-19 cases would result in eventual peak with 58% higher number of COVID-19 infections per day. Subgroup analysis on those regions where the highest number of new COVID-19 cases per day have peaked increased β to .85 (p<0.0001). We demonstrate that initiating mandated social distancing at a 10 times smaller number of COVID-19 cases will reduce the number of daily new COVID-19 cases at peak by 80% highlighting the importance of this community-based intervention.
Introduction
Quarantine and isolation are standard procedures to avoid transmission of infectious disease from infected to non-infected persons and have been used in numerous epidemics.1 Social distancing has been another method for reducing frequency and closeness of contact between people in order to decrease the risk of transmission of disease. Social distancing has been used against influenza and corona virus disease 2019 (COVID-19) pandemics (caused by severe acute respiratory syndrome (SARS)-associated coronavirus 2 (SARS-CoV-2). Social distancing can be voluntary at individual level or mandated at a community level by governing authorities. Mandated social distancing comprises of a combination of travel restrictions, closure of non-essential group meeting venues (restaurants, schools, shops) and steps to avoid close contact at essential meeting venues (hospitals, food supply, pharmacies). Mandated social distancing is also referred to as societal lockdown and may have variable effect on disease transmission depending upon mode of transmission and ability to identify and isolate persons infected by the disease.2 Critical analysis of mandated social distancing in 17 cities in United States during the 1918 pandemic (caused by H1N1 influenza A virus) found cities in which mandated social distancing were implemented at an early phase of the epidemic had peak death rates of 50% lower than those cities that did not implement such steps.3 Although, the results from 1918 pandemic, influenzae pandemics, and SARS have been used to justify mandated social distancing in various parts of the world, limited analysis of the effect of mandated social distancing on COVID-19 pandemic are available. The value of mandated social distancing requires a critical assessment for each pandemic because of adverse psychological and health consequences on individuals4,5 and financial effects on society.6 We examine the effect of timing of mandated social distancing on the rate of COVID-19 infection in 119 geographic regions derived from 41 states within United States and 78 other countries.
Methods
Daily cumulative COVID-19 case volumes for individual regions (countries and individual states of U.S.) since Jan 22,2020 are publicly available.7,8 Mandated social distancing start dates for different regions have been compiled and are also available.9 For this analysis, we included regions for which both mandated social distancing start dates and daily cumulative case volumes for COVID-19 were available. Except for the United States, we used national mandated social distancing start dates and national COVID-19 case volumes. For France, Denmark, Netherlands and United Kingdom, overseas regions were not included in calculation of national case volumes.
New COVID-19 cases per day were calculated from cumulative daily case volumes. We used 2019 population estimates for states in United States and other countries to calculate daily new and cumulative total COVID-19 case volumes per million persons residing within the region.10,11 For further analysis, data was smoothed using moving average to remove daily fluctuations in reported COVID-19 cases. Smoothed data was plotted over raw data for all geographical regions to ensure that it is representative of raw data (appendix A). We excluded China as the curve was visually different from other regions and above methodology could not be reliably applied to that curve.
We used total number of COVID-19 cases per million on the day before mandated social distancing was implemented as the independent variable and predictor for our analysis. Peak of smoothed curve was used to determine highest number of new COVID-19 cases per day (expressed in per million persons) and used as the dependent variable. Due to the skewness in both the dependent and independent variables, log transformation was applied. To determine if the number of daily new cases have plateaued or are still increasing, we used linear regression for last 13 days. Last 13 days was selected after visually checking the trend for all geographic regions and repeating linear regression for various intervals ranging from 5 to 13 days. Linear positive trend for last 13 days (April 12th to 25th) correlated best with visual interpretation of upward trend.
For all regression analysis we used log-transformed per million person values of highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing. We used linear regression analysis to predict highest number of new COVID-19 cases per day using total number of COVID-19 cases on the day before mandated social distancing as predictor(Model A). Additional analysis of this association was performed after adjustment for the day mandated social distancing started in the course of COVID-19 pandemic -- calculated as number of days since 1/22/2020, log transformed population of geographic region and proportion of persons living in urban area (Model B).12,13
We repeated the analyses after classifying the geographic regions into those where the daily new COVID-19 case volume has plateaued and those regions where the COVID-19 was still increasing.
Using internet search, we manually abstracted individual elements of mandated social distancing for each of the geographical regions included in the analysis (Appendix B) and performed additional analysis after adjusting for these elements.
For regions where average (of last 5 days) daily new case volume has trended down to less than 20% of the peak daily new case volume (reached tail end), we performed linear regression analysis to predict the overall number of new COVID-19 cases per million from total number of COVID-19 cases per million persons on the day before mandated social distancing after log transformation of both variables.
Results
Mandated social distancing dates of 85 countries and 42 states were available. Daily COVID-19 case volume data was available for 183 countries and all 52 states. After merger, both mandated social distancing starting dates and daily COVID-19 case census were available for 78 countries and 41 states. After excluding three regions for which the date of peak number of daily new cases was either after (Israel and Maine) or on the start day of mandated social distancing (Eritrea), number of days from the start date of mandated social distancing to the peak in daily new COVID-19 cases ranged from (1 – 45 days) (Figure 1).
Mandated social distancing start dates within individual states of United States ranged from 3/17/2020 to 4/3/2020 and mandated social distancing dates of other countries ranged from 3/9/2020 to 4/15/2020. The total number of COVID-19 cases ranged from no cases to 1571 cases per million persons on the day before the start date of mandated social distancing, (Figure 2). Highest number of new COVID-19 cases per day ranged from 0.10 to 503 per million persons (Figure 3). There was clear trend towards association between total number of COVID-19 cases on the start date of mandated social distancing with highest number of new COVID-19 cases per day when plotted on logarithmic scale on scatter plot (Figure 5).
Results of linear regression analysis with different models are reported in Table 1. In Model A, highest number of new COVID-19 cases per day was significantly associated with total number of COVID-19 cases on the day before mandated social distancing (β=0.66, p<0.0001). Model B improved the adjusted R2 from 0.59 to 0.72 but did not change the β for total number of COVID-19 cases on the day before mandated social distancing. Subgroup analysis on those regions where the daily new COVID-19 cases have already peaked increased β for total number of COVID-19 cases on the day before mandated social distancing to .85 for both unadjusted and adjusted model (p<0.0001).
Similar results from analysis for states within United States are reported in Table 1. There was less clear association between highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing (β=0.3, p<0.001) in unadjusted model, but was stronger in adjusted model (β=0.72, p<0.0001). In a model (not shown in table) adjusted for only the day of mandated social distancing, the association between highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing was strong (β=0.78, p<0.0001). Daily COVID-19 case volume had plateaued in only 13 states, and both unadjusted (Model A) and adjusted (Model B) association between highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing was stronger in states where number of new cases had plateaued compared with states where the number of new COVID-19 case per day had not plateaued (Table 2).
Internationally, there was a strong association between the highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing both in the unadjusted or adjusted models (Table 3). This association was stronger for countries where the number of new COVID-19 cases per day has already plateaued (β=0.85, p<0.0001)
Addition of individual elements of mandated social distancing (closure of educational institutes, public transport, restaurants and other shops) did not affect the association between highest number of new COVID-19 cases per day and total number of COVID-19 cases on the day before mandated social distancing. Visually Australia was noted to have plateaued but based on a positive trend on last 13 days of regression it was classified as not plateaued. We repeated analysis of plateaued regions after manual addition of Australia and noticed no change in the above noted results.
For 17 regions (including 3 states of the United States), the daily new case volume was reduced to less than 20% of the peak daily new case volume. Log-transformed total number of cases was strongly predicted by total number of COVID-19 cases on the day before mandated social distancing (adjusted R-Square 0.87, F=112, (3=0.97, p<0.0001).
Discussion
Our analysis confirmed the benefit and provided a quantitative estimate of value of mandated social distancing. Our findings suggest that the initiation of mandated social distancing for each doubling in number of existing COVID-19 cases would result in eventual peak with 60% higher number of COVID-19 infections per day. We found that that initiation of mandated social distancing at two folds higher number of existing COVID-19 cases would result in eventual peak with 58% higher number of COVID-19 cases (using (β of 0.66). If mandated social distancing is started when 100 persons are infected with COVID-19 and the highest number of cases is 1000 persons, initiating mandated social distancing when 200 persons are infected would increase the peak number of cases to 1580 persons. An example is mandated social distancing in New York which was initiated on day 61 when there were 10,356 cases. According to our analysis, if mandated social distancing was initiated on day 50 (142 cases), then the maximum number of cases per day would have been reduced by a factor of 16 (31 per million compared with 500 per million persons). We also identified what we consider is a spillover effect. There was blunting of effect in states within United States in the quantitative value of mandated social distancing when mandated social distancing was initiated later in the course of pandemic. We think that the blunting of effect was confounded by earlier mandated social distancing in surrounding states which resulted in mitigating the effect by reducing inflow of infected COVID-19 patients. This effect was not seen between countries where boundaries between countries may serve to insulate by restricting travel into the country. There are no restrictions between movement between states in United States enhancing the spillover effect.
Ferguson et al.1 estimated that combining school and workplace closure with area quarantine and antiviral prophylaxis can result in 90% containment of infection (assuming the infection has R0 = 1.9), and when containment was initiated with less than 200 detected cases. The model was based on spread of H5N1 highly pathogenic avian influenza in wild and domestic poultry in Southeast Asia. Longini et al14 modeled the Avian influenza A (subtype H5N1) outbreaks in Southeast Asia. The reported that local household quarantine is effective at containing the epidemic if R0 ≤ 2.1 but is not as effective at R0 = 2.4. However, a combination of 80% antiviral prophylaxis plus quarantine is effective at an R0 as high as 2.4, and adding previous vaccination makes antiviral prophylaxis plus quarantine even more effective. Both analyses mentioned that one of the reasons that limit the effect of mandated social distancing is the increase in contacts between household and neighborhood during social distancing which in highly infectious agents may offset the benefit. Ferguson et al.1 assumed in their model that household and random contact rates increase by 100% and 50%, respectively, for individuals no longer able to attend school or work. Previous models have been based on H1N1 epidemiological experience. The R0 H1N1 flu has ranged between 1.25 in Canada15, 1.682 in China16, 1.96 in New Zealand17, 1.6 in Mexico18 and 1.7 in United States19. One of the surprising findings is the benefit of mandated social distancing in COVID-19 pandemic despite the high infectivity of the SARS-CoV-2. The R0 of the SARS-CoV-2 infection was originally estimated between 2.2 and 2.7.20-25 More recent data that suggest that the R0 of SARS-CoV-2 infection may be as high at 5.7.20 The R0 of SARS-Co-V2 is higher than the threshold of 2.4 estimated by Longini et al14 and 1.8 for new viral strains estimated by Ferguson et al.1 where mandated social distancing will lose the beneficial effect.
There may be other reasons which may explain the beneficial effect of mandated social distancing in COVID-19 pandemic. Ridenhour et al.26 had stressed upon the role of transmission rate, recovery rate, and size of population in the overall speed of the epidemic independent of Ro. Tang et al16 had stressed upon the role of asymptomatic patients and those who are in prodromal period without symptoms in spread of pandemic H1N1 influenza in the province of Shaanxi. The beneficial effect of mandated social distancing may also be related to relatively long prodromal period and high proportion of asymptomatic SARS-CoV-2 infected patients. The time between transmission and symptoms ranges between 2–14 days for SARS-CoV-2.27 Data on 468 COVID-19 transmission events reported in mainland China outside of Hubei Province found that in 59 (12.6%) of the 468 patients developed symptoms before the potential source developed symptoms suggesting that transmission occurred in the prodromal period.28 There have been small case studies highlighting that the COVID-19 can be acquired from patients who are and will remain asymptomatic.29'31 The estimated asymptomatic proportion of asymptomatic COVID-19 was 17.9% based on screening of travelers on board a cruise ship32, and 30.8% from data of Japanese citizens evacuated from Wuhan.33 However, the viral loads in upper respiratory specimens appeared similar in symptomatic and asymptomatic persons.34 We think that the beneficial effect of mandated social distancing may to related to reducing contact between asymptomatic person infected SARS-CoV-2. Another unique aspect of the SARS-CoV-2 is ability to persist on various surfaces and thus be transmitted by indirect contact from high touch surfaces.35,36 SARS-CoV-2 can persist on plastic, stainless steel, copper and cardboard, and viable virus was detected up to 72 hours after application to these surfaces. The longest viability was on stainless steel and plastic; the estimated median half-life of SARS-CoV-2 was approximately 5.6 hours on stainless steel and 6.8 hours on plastic. Therefore, the mandated social distancing is likely to reduce contamination and transmission from high touch surfaces in society.
One of the limitations of our model is variability in policies pertaining to mandated social distancing and compliance to the policies in various geographic regions. The compliance to mandated social distancing introduced is an important factor in determining success of intervention.1 There is also variability in exposure risk reduction among a given population as each individual within the population does not have the same chance of coming in contact with others.26 There appears to be a difference according to age of the individuals37 and population structure such as number of household, workplace, school, and community groups.38 Differences in age and population structure between geographic regions may confound the results. There is also a confounding effect of case identification and isolation and robustness of testing for asymptomatic persons which may vary in various geographic units in our analysis. The Center for Disease Control and Prevention (CDC) concluded that the degree to which COVID-19 cases might go undetected or unreported varies in geographic regions because testing practices differ widely and might contribute significantly to the observed variations.39,40. For example, the state of New York (excluding NYC) reported administering 4.9 tests per 1,000 population, which was higher than the national average of 1.6 (CDC, unpublished data, March 25, 2020).
The variability in highest number of new cases per day that was not explained by our statistical models is likely due to variability in mandating social distancing in different regions. Although most of the businesses were closed during mandated social distancing, certain businesses like meat and poultry processing facilities were recognized as critical for infrastructure and permitted to continue work with precautions. Outbreaks in such places resulted in increasing number of new cases per day not explained by our model.41,42 We also noted that in some regions (excluded from the analysis), highest number of new cases per day plateaued prior to mandated social distancing. This suggests that there may be other mechanisms that can reduce the number of new cases in certain regions. There were certain analyses which could not be performed for all the regions included in the analysis as the pandemic is ongoing. In subgroup analysis, it was clear that the relationship was strongest when the highest number of new cases per day had reached its peak. Some regions were still in the period where the highest number of new cases per day may continue to increase. The other issue was the total number of COVID-19 cases in a region which can only be determined after pandemic subsides. We did analyzel7 regions where daily new cases have reached to less than 20% of the highest number of new cases per day observed (tail end of pandemic). There was a clear relationship between total cases before the start date of mandated social distancing and overall total number of case in the region indicating that early mandated social distancing also reduces the total number of COVID-19 infected persons affected overtime.
The value of mandated social distancing in reducing the spread of COVID-19 has been questioned at multiple levels due to widespread effects on individuals' wellbeing and sustenance and financial consequences on society. We demonstrate that initiating mandated social distancing when smaller number of COVID-19 cases are present will reduce the highest number of new cases per day and perhaps even the overall total number of COVID-19 cases in the region highlighting the importance of this community-based intervention.
Data Availability
All data used for the analysis is either publicly available or included in the manuscript. I am in possession of all the data used in the manuscript and can provide upon request.