Abstract
Importance In the United States, schools closed in March 2020 to reduce the burden of COVID-19. They are now reopening amid high incidence in many places, necessitating analyses of the associated risks and benefits.
Objective To determine the impact of school reopening with varying levels of operating capacity and face-mask adherence on COVID-19 burden.
Design Modeling study using an agent-based model that simulates daily activities of the population. Transmission can occur in places such as schools, workplaces, community, and households. Model parameters were calibrated to and validated against multiple types of COVID-19 data.
Setting Indiana, United States of America.
Participants Synthetic population of Indiana. K-12 students, teachers, their families, and others in the state were studied separately.
Interventions Reopening of schools under three levels of school operating capacity (50%, 75%, and 100%), as well as three assumptions about face-mask adherence in schools (50%, 75%, and 100%). We compared the impact of these scenarios to reopening at full capacity without face masks and a scenario with schools operating remotely, for a total of 11 scenarios.
Main outcomes SARS-CoV-2 infections, symptomatic cases, and deaths due to COVID-19 from August 24 to December 31.
Results We projected 19,527 (95% CrI: 4,641–56,502) infections and 360 (95% CrI: 67–967) deaths in the scenario where schools operated remotely from August 24 to December 31. Reopening at full capacity with low face-mask adherence in schools resulted in a proportional increase of 81.7 (95% CrI: 78.2–85.3) times the number of infections and 13.4 (95% CrI: 12.8–14.0) times the number of deaths. High face-mask adherence resulted in a proportional increase of 3.0 (95% CrI: 2.8–3.1) times the number of infections. Operating at reduced capacity with high face-mask adherence resulted in only an 11.6% (95% CrI: 5.50%-17.9%) increase in the number of infections.
Conclusions and Relevance Reduced capacity and high face-mask adherence in schools would substantially reduce the burden of COVID-19 in schools and across the state. We did not explore the impact of other reopening scenarios, such as alternating days of attendance. Heterogeneous decisions could be made across different districts throughout the state, which our model does not capture. Hence, caution should be taken in interpreting our results as specific quantitative targets for operating capacity or face-mask adherence. Rather, our results suggest that schools should give serious consideration to reducing capacity as much as is feasible and enforcing adherence to wearing face masks.
Question What will be the impact of reopening schools on SARS-CoV-2 infections and COVID-19 mortality in Indiana?
Findings Reopening schools at full capacity with low (50%) face-mask adherence could have a substantial effect on the burden of COVID-19 across Indiana, leading to 83 times more infections and 13 times more deaths than if schools operated remotely. Reopening at 50% capacity and with 100% face-mask adherence could mitigate much of this risk, leading to 12% more infections and 5% more deaths than if schools operated remotely.
Meaning Schools should make efforts to reduce capacity and enforce face-mask adherence to prevent increases in the statewide burden of COVID-19.
1 Introduction
The United States has been the country most severely impacted by the COVID-19 pandemic in terms of total reported cases and deaths, with over 5.2 million cases and 167,000 deaths through August 14 [1]. This severity led to social interventions on an unprecedented scale, including restrictions on mass gatherings, bans on non-essential travel, and school closures [2, 3, 4, 5]. While such restrictions were initially successful in reducing transmission, the subsequent relaxation of restrictions on mass gatherings and movement were followed by large increases in notified cases and, more recently, deaths [1, 3, 6, 7]. The state of Indiana reported its highest daily number of cases to date (1,249) on August 6 [8].
It is within this context of intense community transmission that attention has turned to the reopening of schools for the start of the academic year in August [9, 10, 11]. During influenza epidemics, school closures have been estimated to reduce transmission community-wide [12, 13, 14]. In general, schools are seen as key drivers of the transmission of respiratory pathogens due to close contact among children at school [15, 16, 17]. However, several factors complicate the effect of school reopenings on SARS-CoV-2 transmission. In particular, children and adolescents appear less susceptible to infection and are much less likely to experience severe outcomes following infection [18, 19, 20, 21, 22, 23]. It is also still unclear what their contribution to transmission is, but several studies suggest they can play an important role [18, 24, 25, 26]. There are additional economic and social factors to consider, too, such as the economic costs of school closures for families that must then stay home from work, and the nutritional benefits of school reopening for children who rely on free and subsidized school meals [27, 28, 29].
Our objective in this study was to explore the impacts of strategies for school reopening on the burden of COVID-19 in the state of Indiana, USA. We synthesized evidence around features of COVID-19 epidemiology in children [19] in an agent-based model originally developed for pandemic influenza [30] that is equipped to translate such evidence into projections of community-wide transmission under alternative scenarios about school reopening. We focused on the role of reducing the capacity of classrooms and adherence to wearing face masks in schools, given that both physical distancing and face masks have been shown to reduce transmission of SARS-CoV-2 in community settings [31]. Our main outcome measures were changes in statewide totals of SARS-CoV-2 infections, symptomatic infections, and deaths, across different scenarios of school capacity and face-mask adherence, compared with a baseline scenario in which schools operate remotely.
2 Methods
Approach
Our approach to modeling SARS-CoV-2 transmission is based on the Framework for Reconstructing Epidemic Dynamics (FRED) model [30]. Using this model, we simulated the spread of SARS-CoV-2 in the population of Indiana, USA using a synthetic population with demographic and geographic characteristics of the population, including age, household composition, household location, and occupation [32]. We analyzed the impact of school reopening from August 24 (first day of classes in Marion County, the most populous) to December 31, 2020 in the overall population of Indiana, as well as in students, teachers, and their households. We quantified impact as the difference in the number of COVID-19 infections, symptomatic infections, and deaths, between each scenario and the baseline scenario.
Agent-based model
We chose an agent-based model for this analysis to address key issues such as heterogeneity of transmission within a population due to population density, age, occupation, and contact network. We used a synthetic population of Indiana to realistically represent characteristics of the population [32]. FRED simulates pathogen spread in a population by recreating interactions among people on a daily basis. Each human is modeled as an agent who visits a set of places defined by their activity space (houses, schools, workplaces, and neighborhood locations). Transmission can occur when an infected person visits the same location as a susceptible person on the same day, with numbers of contacts per person specific to each location type. For instance, school contacts depend not on the size of the school but on the age of the student. Every day of the week, students and teachers visit their school, and students are assigned to classrooms based on their age. Given that schools are closed during the weekends, community contact is increased by 50% [30]. For both schools and other locations, we adopted contact rates for each location type that were previously calibrated to attack rates for influenza specific to each location type [30, 33].
Once infected, each individual had latent and infectious periods drawn from distributions calibrated so that the average generation interval distribution matched estimates from Singapore (mean = 5.20, standard deviation = 1.72) [34]. The absolute risk of transmission depended on the number and location of an infected individual’s contacts and a parameter that controls SARS-CoV-2 transmissibility upon contact, which we calibrated. A proportion of the infections were asymptomatic [35]. We assumed these infections were as infectious as symptomatic infections and had identical incubation and infectious period distributions [36, 37, 26, 38]. We assumed that children were less susceptible to infection than adults, which we modeled with a logistic function calibrated to model-based estimates of this relationship by Davies et al. [19]. We assumed that severity of disease increased with age, consistent with statistical analyses performed elsewhere [39, 40, 41, 21].
Agent behavior in FRED has the potential to change over the course of an epidemic. Following the onset of symptoms, infected agents self-isolate at home according to a fixed daily rate, whereas others continue their daily activities [42, 43]. This rate was chosen so that, on average, 68% of agents will self-isolate at some point during their symptoms, assuming that all individuals who develop a fever will isolate at some point during their symptoms [43]. Agents can also respond to public health interventions, including school closure, shelter in place, and a combination of mask-wearing and social distancing. School closures occur on specific dates [44], resulting in students limiting their activity space to household and neighborhood locations. Shelter-in-place interventions reduce some agents’ activity spaces to their households only, whereas others continue with their daily routines. We used mobility reports from Google [45] to drive daily compliance with shelter-in-place, such that shelter-in-place compliance in our model accounts for both the effects of shelter-in-place orders and some people deciding to continue staying at home after those orders are lifted [46]. We used Google Trends data for Indiana using the terms “face mask” and “social distancing” [47] to capture temporal trends and survey data [48] to capture the magnitude of these behaviors. Further details about the model are available in the Supplementary Text.
Data and outcomes
We obtained daily incidence of death from the New York Times COVID-19 database [1]. Hospitalizations and the age distribution of deaths reported in Indiana were obtained from the Indiana COVID-19 dashboard [8]. Daily numbers of tests performed in the state were available from The Covid Project [49]. We calibrated the model to daily values of deaths, hospitalizations, and test positivity through August 10 and to the age distribution of deaths cumulative through July 13 to estimate nine model parameters using a sobol design sampling algorithm [50, 51] (Supplementary Material). We validated the model by comparing its predictions of the infection attack rate overall and by age to results from two statewide serological surveys undertaken in late April and early June [52, 53].
To account for uncertainty in the calibrated parameters in our future projections, we simulated nine scenarios about school reopening with values of the model’s nine free parameters sampled from the calibrated distribution. From August 24 to December 31, we outputted numbers of daily infections, symptomatic cases, hospitalizations, and deaths, structured by age, place of infection (school, home, other), and affiliation with schools (student, teacher, none). For our analysis, we focused on infections, symptomatic infections, and deaths in the overall population and the subgroups of students, teachers, and their families.
We explored three scenarios for school operating capacity (50%, 75%, or 100% of students receiving in-person instruction) and three scenarios for face-mask adherence in schools (50%, 75%, or 100%). We compared each of the nine combinations of these scenarios to a scenario in which schools reopened normally (100% capacity, 0% face-mask adherence) and to a scenario in which schools operated remotely until the end of the calendar year. We analyzed the sensitivity of the model outcomes to parameter uncertainty by calculating the partial rank correlation coefficient [54] of each calibrated parameter and two outcomes: the cumulative number of infections between August 24 and December 31, and the proportion of infections acquired in schools during that period.
3 Results
Our model was generally consistent with the data to which it was calibrated, capturing trends over time in daily deaths, hospitalizations, and test positivity (Fig. 1A-C), as well as greater proportions of deaths among older age groups (Fig. 1D). Some trade-offs in the model’s ability to recreate different data types were apparent, such as a recent increase in hospitalizations that the model failed to capture (Fig. 1C), likely due to the predominance of data on deaths in the likelihood. Even so, the model’s predictions reproduced the range of variability in the data, as assessed by coverage probability of its 95% posterior predictive intervals (daily deaths: 0.91; daily hospitalizations: 1.0; daily test positivity: 0.97; cumulative deaths by age: 1.0). The model was also consistent with data withheld from fitting. Across all ages, the 95% credible intervals of the cumulative proportion infected through late April (median: 0.0174; 95% CrI: 0.0058–0.064) and early June (median: 0.0221; 95% CrI: 0.0072–0.08) from two state-wide serological surveys [52] fell within the 95% posterior predictive intervals of our model (Fig. S1A). Our model’s predictions also overlapped with age-stratified estimates from those surveys (Fig. S1B), although it underpredicted infections among individuals aged 40–60 years.
Calibration of the parameter that scaled the magnitude of SARS-CoV-2 importations [55, 56] in our model resulted in a median of 1.118 (95% PPI: 0.654–1.456) imported infections per day from February 1 to August 10. To ensure that the model reliably reproduced the high occurrence of deaths observed in long-term care facilities, we seeded infections into those facilities at a daily rate proportional to the prevalence of infection on that day; this calibrated proportion was 0.046 (95% PPI: 0.007–0.093). On the opposite end of the age spectrum, our calibration resulted in a median estimate of susceptibility among children of 0.357 (95% CrI: 0.263–0.526), compared to 0.788 (95% CrI: 0.651–0.946) in adults (Fig. S2). Our calibration resulted in an estimate of transmissibility (median: 0.641; 95% CrI: 0.546–0.965) that corresponded to values of R(t) during the initial phase of the epidemic in Indiana of 1.67 (95% CrI: 0.47–3.57), which represents an average of daily values across the month of February (Fig. 1A). Driven by a calibrated estimate that the proportion of people sheltering in place rose in early March and peaked at a median of 38.2% (95% CrI: 25.2–68.7%) on April 7 (Fig. S3A), our estimates of R(t) dropped to a low of 0.57 (95% CrI: 0.41–0.75) on April 7 and have remained below 1 since (Fig. 1A). Also impacting our estimates of R(t) is the increasing use of face masks in the community, which we estimated at 0.50 (95% CrI: 0.458–0.545) as of July 19 (Fig. S3B). Note that this estimated distribution of community face-mask adherence does not differ between scenarios and is not the same as the level of adherence in schools, which we imposed at different levels depending on the scenario.
In the event that schools reopen at full capacity and without any use of face masks, our model projects that R(t) would increase to 1.94 (95% CrI: 1.47–2.20) by mid-September (Fig. 2A). This assumes that levels of sheltering in place and face-mask adherence in the community remain at their estimated levels as of August 13 (Fig. S3B). Consistent with our model’s prediction that schools contributed appreciably to transmission early in the epidemic (median: 24.6%; 95% CrI: 23.2–27.7%), this increase in transmission is associated with an increase in the proportion of infections arising in schools upon their reopening (Fig. 2B). The sensitivity of the proportion of infections arising in schools to model parameters was highest for the inflection point of the age-susceptibility relationship, the transmissibility parameter, and the rate at which infections were imported into long-term care facilities (Fig. S7). This increase in infections arising in schools is projected to give rise to additional transmission statewide (Fig. 2C). An example of transmission chains arising from schools and spilling out into school-affiliated families and the broader community is shown in Fig. 3.
In the event that schools remain open throughout the semester and no other policies or behavioral responses occur, the increase in R(t) driven by reopening schools at full capacity and without the use of face masks would be projected to result in 2.49 million (95% CrI: 2.08–2.70 million) infections (Fig. 4A) and 9,117 (95% CrI: 6,117–6,248) deaths (Fig. 4B) from Indiana’s population as a whole between August 24 and December 31. At the other extreme, if schools were to go to remote instruction and all children remained at home, our model projects that R(t) would remain near current levels through the remainder of 2020 (Fig. S4A). Again, this assumes that levels of sheltering in place and face-mask adherence in the community as a whole remain at their estimated levels as of August 13. Under this scenario, transmission would continue through contacts at workplaces, within homes, and elsewhere in the community (Fig. S4B), with 19,527 (95% CrI: 4,651–56,502) infections (Fig. S4C) and 360 (95% CrI: 67–967) deaths between August 24 and December 31.
Differing policies on the capacity at which schools operate in person and enforce the use of face masks have a strong influence on the projected statewide burden of COVID-19. If schools operate at 50% capacity and achieve high face-mask adherence, the number of infections and deaths that we project is similar to what we project under the scenario in which schools operate remotely (Fig. 4, Table S1). Of these two policies, projections of infections and deaths were more sensitive to the capacity at which schools operate, with the worst outcomes projected to occur when schools operate at full capacity and with low face-mask adherence. Under this scenario, cumulative infections statewide were projected to be 300% (95% CrI: 280–310%) greater than if schools operated remotely, and cumulative deaths statewide were projected to be 50% (95% CrI: 50–60%) greater (Table S1). The sensitivity of these results to model parameters was relatively high for parameters that define the age-susceptibility relationship and, in some cases, for the transmissibility and shelter-in-place compliance parameters (Fig. S6).
The burden of COVID-19 associated with reopening schools differed for students, teachers, and their families. Relative to a scenario with remote instruction, risk of infection and symptomatic infection was greatest for students (Fig. 5, left column), with a hundred-fold or greater increase in the risk of infection if schools operate at full capacity with moderate or low face-mask adherence (Table S2). Due to their older ages, teachers and families experienced a much higher risk of death under scenarios with high capacity and moderate or low face-mask adherence, as compared with a scenario with remote instruction (Fig. 5, center & right columns). Under a scenario with high capacity and low face-mask adherence, there was a 227-fold higher risk of death for teachers (Table S3) and a 266-fold higher risk of death for family members (Table S4). At the same time, the risk of death under a scenario with high capacity and low face-mask adherence was around half the risk of death if schools were to operate at full capacity with no masks (Tables S7 & S8).
4 Discussion
Our model provides a detailed, demographically realistic representation of SARS-CoV-2 transmission in Indiana that is consistent both with data to which it was calibrated and to data that was withheld from calibration. In contrast to models that rely on assumptions about intervention impacts or estimate them statistically [2, 57], our model makes predictions about intervention impacts based on first-principles assumptions about individual-level behavior and contact patterns. Consistent with results from other analyses [2, 57, 20], the inputs and assumptions in our model led to a prediction that schools made a considerable contribution to SARS-CoV-2 transmission in February and early March, prior to large-scale changes in behavior. Extending that, a primary result of our analysis is that schools could, once again, make a considerable contribution to SARS-CoV-2 transmission if they resume normal activity in the fall semester.
Our results indicate that operating at reduced capacity and achieving high face-mask adherence would reduce the burden of COVID-19 in schools and across the state. In the event that both interventions are pursued fully, our model projects that infections and deaths statewide would be around 10% greater than under a scenario with fully remote instruction. In the event that schools operate at full capacity, our model projects that infections and deaths statewide could be one to two orders of magnitude greater than under a scenario with fully remote instruction, especially if there is poor face-mask adherence in schools. The impacts associated with reducing capacity result from reductions in both the number of contacts within the school and the probability that an infected student would be in attendance in the first place, similar to the logic behind why smaller gatherings are associated with reduced risk of transmission [3, 58, 59].
Although the scenarios we considered resulted in projected impacts spanning nearly the full range between fully remote instruction and fully in-person instruction with no face masks, they are a simplification of the full range of scenarios of how schools could operate this fall. Scenarios that we did not explore include different groups of students attending in person or remotely [60], varying degrees of modularization within schools [61], and the implementation of testing-based control strategies in schools [62]. A related simplification of our statewide analysis is that the state will, in reality, consist of a patchwork of policies across districts. In light of this complexity that our model does not capture, our results should be interpreted with caution in setting specific, quantitative targets for capacity or face-mask adherence. For any scenario though, our results illustrate the importance of reducing capacity and maximizing face-mask adherence to the extent possible, as do other modeling studies [60, 61, 62, 63, 64].
The burden of COVID-19 associated with reopening schools is not expected to fall evenly across the state’s population. Under scenarios with schools operating at full capacity, our model projects that hundreds of thousands of children could be infected during the fall semester. Whereas very few deaths are expected among infected children, the numbers of deaths among teachers and school-affiliated families could number in the hundreds. In comparison, the total number of deaths projected across the state is projected to be in the low thousands, meaning that adults with close ties to schools could represent a sizable fraction of deaths across the state in coming months, if schools operate at full capacity and with low face-mask adherence. Importantly, the magnitude of these projections depend further on factors outside the control of schools [2]. In the absence of school reopening, our projections assume steady, controlled levels of transmission with R(t) < 1. Changes in government policies, individual behavior, and face-mask adherence in the community all have the potential to alter this trajectory for the state as a whole, as well as its schools [2, 31, 65].
A critical assumption of our analysis is that children are capable of being infected with SARS-CoV-2 and transmitting it to others at meaningful levels. Although the burden of severe disease skews strongly towards older ages [22, 66, 8], there are other lines of evidence that support our assumption. These include a contact-tracing study that found no distinguishable difference between infectivity of children and adults [26], several studies that found no distinguishable difference in viral load between children and adults [67, 37, 36, 68], a study that observed a greater secondary attack rate among children in homes [26], and a modeling study that found no evidence that children were less infectious [69]. More direct evidence comes from COVID-19 outbreaks that have already been observed in schools, including one in a high school in Israel in which 13.2% of students and 16.6% of staff were infected in just 10 days [70]. Even more pertinent, some schools in Indiana recently reopened and have already reported cases of COVID-19 among students [71]. Our analysis offers perspective on what those initial cases could give rise to in coming months, depending on the degree to which schools choose to operate at reduced capacity and enforce face-mask adherence.
There is now a growing body of evidence that school closures contributed to mitigating the first wave of the epidemic and could lead to rising case numbers if relaxed [6, 62]. Our study adds to this evidence, and suggests an even greater impact of school reopening than several other studies [64, 62, 61, 72, 60]. This is due in part to our assumption that asymptomatic and symptomatic infections contribute similarly to transmission [26, 67, 37, 36, 68], and in part to our model’s ability to capture chains of transmission within schools and extending out into the community. Our study echoes several modeling studies in emphasizing the importance of reducing school capacity to impede transmission [61, 62, 63, 64, 72]. Taking all this evidence together, those deciding on strategies to safely reopen schools should strongly consider operating at reduced capacity and strictly enforcing face-mask adherence. Both schools, and the communities in which they are embedded, stand to be affected by these policy choices.
Data Availability
The data is publicly available
5 Acknowledgements
This work was supported by an NSF RAPID grant to TAP (DEB 2027718), an Arthur J. Schmitt Fellowship and Eck Institute for Global Health Fellowship to RJO, and a Richard and Peggy Notabaert Premier Fellowship to MP. We thank the University of Notre Dame Center for Research Computing for computing resources.
References
- [1].↵
- [2].↵
- [3].↵
- [4].↵
- [5].↵
- [6].↵
- [7].↵
- [8].↵
- [9].↵
- [10].↵
- [11].↵
- [12].↵
- [13].↵
- [14].↵
- [15].↵
- [16].↵
- [17].↵
- [18].↵
- [19].↵
- [20].↵
- [21].↵
- [22].↵
- [23].↵
- [24].↵
- [25].↵
- [26].↵
- [27].↵
- [28].↵
- [29].↵
- [30].↵
- [31].↵
- [32].↵
- [33].↵
- [34].↵
- [35].↵
- [36].↵
- [37].↵
- [38].↵
- [39].↵
- [40].↵
- [41].↵
- [42].↵
- [43].↵
- [44].↵
- [45].↵
- [46].↵
- [47].↵
- [48].↵
- [49].↵
- [50].↵
- [51].↵
- [52].↵
- [53].↵
- [54].↵
- [55].↵
- [56].↵
- [57].↵
- [58].↵
- [59].↵
- [60].↵
- [61].↵
- [62].↵
- [63].↵
- [64].↵
- [65].↵
- [66].↵
- [67].↵
- [68].↵
- [69].↵
- [70].↵
- [71].↵
- [72].↵