Abstract
Importance This study presents the first set of estimates of predicted health service utilization and deaths due to COVID-19 by day for the next 4 months for each state in the US.
Objective To determine the extent and timing of deaths and excess demand for hospital services due to COVID-19 in the US.
Design, Setting, and Participants This study used data on confirmed COVID-19 deaths by day from WHO websites and local and national governments; data on hospital capacity and utilization for US states; and observed COVID-19 utilization data from select locations to develop a statistical model forecasting deaths and hospital utilization against capacity by state for the US over the next 4 months.
Exposure(s) COVID-19.
Main outcome(s) and measure(s) Deaths, bed and ICU occupancy, and ventilator use.
Results Compared to licensed capacity and average annual occupancy rates, excess demand from COVID-19 at the peak of the pandemic in the second week of April is predicted to be 64,175 (95% UI 7,977 to 251,059) total beds and 17,380 (95% UI 2,432 to 57,955) ICU beds. At the peak of the pandemic, ventilator use is predicted to be 19,481 (95% UI 9,767 to 39,674). The date of peak excess demand by state varies from the second week of April through May. We estimate that there will be a total of 81,114 (95% UI 38,242 to 162,106) deaths from COVID-19 over the next 4 months in the US. Deaths from COVID-19 are estimated to drop below 10 deaths per day between May 31 and June 6.
Conclusions and Relevance In addition to a large number of deaths from COVID-19, the epidemic in the US will place a load well beyond the current capacity of hospitals to manage, especially for ICU care. These estimates can help inform the development and implementation of strategies to mitigate this gap, including reducing non-COVID-19 demand for services and temporarily increasing system capacity. These are urgently needed given that peak volumes are estimated to be only three weeks away. The estimated excess demand on hospital systems is predicated on the enactment of social distancing measures in all states that have not done so already within the next week and maintenance of these measures throughout the epidemic, emphasizing the importance of implementing, enforcing, and maintaining these measures to mitigate hospital system overload and prevent deaths.
Data availability statement A full list of data citations are available by contacting the corresponding author.
Funding Statement Bill & Melinda Gates Foundation and the State of Washington
Question Assuming social distancing measures are maintained, what are the forecasted gaps in available health service resources and number of deaths from the COVID-19 pandemic for each state in the United States?
Findings Using a statistical model, we predict excess demand will be 64,175 (95% UI 7,977 to 251,059) total beds and 17,380 (95% UI 2,432 to 57,955) ICU beds at the peak of COVID-19. Peak ventilator use is predicted to be 19,481 (95% UI 9,767 to 39,674) ventilators. Peak demand will be in the second week of April. We estimate 81,114 (95% UI 38,242 to 162,106) deaths in the United States from COVID-19 over the next 4 months.
Meaning Even with social distancing measures enacted and sustained, the peak demand for hospital services due to the COVID-19 pandemic is likely going to exceed capacity substantially. Alongside the implementation and enforcement of social distancing measures, there is an urgent need to develop and implement plans to reduce non-COVID-19 demand for and temporarily increase capacity of health facilities.
Background
The Coronavirus Disease 2019 (COVID-19) pandemic started in Wuhan, China, in December 20191 and has since spread to the vast majority of countries.2 As of March 24, 5 countries have recorded more than a thousand deaths: China, France, Iran, Italy, and Spain. COVID-19 is not only causing mortality but is also putting considerable stress on health systems with large case numbers. In the US, COVID-19 has spread to all 50 states, with 31 states reporting deaths so far. Estimates of the potential magnitude of COVID-19 patient volume are urgently needed for US hospitals to effectively manage the rising case load and provide the highest quality of care possible.
COVID-19 forecasts have largely been based on mathematical models that capture the probability of moving between states from susceptible to infected, and then to a recovered state or death (SIR models). Many SIR models have been published or posted online.3–20 In general, these models assume random mixing between all individuals in a given population. While results of these models are sensitive to starting assumptions and thus differ between models considerably, they generally suggest that given current estimates of the basic reproductive rate (the number of cases caused by each case in a susceptible population), 25% to 70% of the population will eventually become infected.6,20 Based on reported case-fatality rates, these projections imply that there would be millions of deaths in the United States due to COVID-19. However, individual behavioral responses and government-mandated social distancing (school closures, non-essential service closures, and shelter-in-place orders) can dramatically influence the course of the epidemic. In Wuhan, strict social distancing was instituted on January 23, 2020, and by the time new infections reached 1 or fewer a day (March 15, 2020), the confirmed proportion of the population infected was less than 0.5%. SIR models with assumptions of random mixing can overestimate health service need by not taking into account behavioral change and government-mandated action. Using reported case numbers and models based on those for health service planning is also not ideal because of widely varying COVID-19 testing rates and strategies. For example, South Korea has undertaken aggressive population-based screening and testing, while in the US, limited test availability has led to largely restricting testing to those with more severe disease or those who are at risk of serious complications.
An alternative strategy is to focus on modeling the empirically observed COVID-19 population death rate curves, which directly reflect both the transmission of the virus and the case-fatality rates in each community. Deaths are likely more accurately reported than cases in settings with limited testing capacity where tests are usually prioritized for the more severely ill patients. Hospital service need is likely going to be highly correlated with deaths, given predictable disease progression probabilities by age for severe cases. In this study, we use statistical modeling to implement this approach and derive state-specific forecasts with uncertainty for deaths and for health service resource needs and compare these to available resources in the US.
Methods
The modeling approach in this study is divided into four components: (i) identification and processing of COVID-19 data; (ii) statistical model estimation for population death rates as a function of time since the death rate exceeds a threshold in a location; (iii) predicting time to exceed a given population death threshold in states early in the pandemic; and (iv) modeling health service utilization as a function of deaths.
Data identification and processing
Local government, national government, and WHO websites21–25 were used to identify data on confirmed COVID-19 deaths by day at the first administrative level (state or province, hereafter “admin 1”). Government declarations were used to identify the day different jurisdictions implemented various social distancing policies (stay-at-home or shelter-in-place orders, school closures, closures of non-essential services focused on bars and restaurants, and the deployment of severe travel restrictions) following the New Zealand government schema.26 Data on timings of interventions were compiled by checking national and state governmental websites, executive orders, and newly initiated COVID-19 laws. Data on licensed bed and ICU capacity and average annual utilization by state were obtained from the American Hospital Association.27 We estimated ICU utilization rates by multiplying total bed utilization rates by the ratio of ICU bed utilization rates over total bed utilization rates from a published study.28 Observed COVID-19 utilization data were obtained for Italy21 and the United States,29 providing information on inpatient and ICU use. Data from China30 were used to approximate inpatient and ICU use by assuming that severe patients were hospitalized and critical patients required an ICU stay. Other parameters were sourced from the scientific literature and an analysis of available patient data.31 Age-specific data on the relative population death rate by age are available from China,30 Italy,32 Korea,33 and the US29 and show a strong relationship with age (Figure 1).
Using the average observed relationship between the population death rate and age, data from different locations can be standardized to the age structure using indirect standardization. For the estimation of statistical models for the population death rate, only admin 1 locations with an observed death rate greater than 0.31 per million (e-15) were used. This threshold was selected by testing which threshold minimized the variance of the slope of the death rate across locations in subsequent days.
A covariate of days with expected exponential growth in the cumulative death rate was created using information on the number of days after the death rate exceeded 0.31 per million to the day when 4 different social distancing measures were mandated by local and national government: school closures, non-essential business closures including bars and restaurants, stay-at-home recommendations, and travel restrictions including public transport closures. Days with 1 measure were counted as 0.67 equivalents, days with 2 measures as 0.334 equivalents and with 3 or 4 measures as 0. For states that have not yet implemented all of the closure measures, we assumed that the remaining measures will be put in place within 1 week. This lag between reaching a threshold death rate and implementing more aggressive social distancing was combined with the observed period of exponential growth in the cumulative death rate seen in Wuhan after Level 4 social distancing was implemented, adjusted for the median time from incidence to death. For ease of interpretation of statistical coefficients, this covariate was normalized so the value for Wuhan was 1.
Statistical model for the cumulative death rate
We developed a curve-fitting tool to fit a nonlinear mixed effects model to the available admin 1 cumulative death data. The cumulative death rate for each location is assumed to follow a parametrized Gaussian error function: where the function ψ is the Gaussian error function (written explicitly above), p controls the maximum death rate at each location, t is the time since death rate exceeded 1e-15, β (beta) is a location-specific inflection point (time at which rate of increase of the death rate is maximum), and α (alpha) is a location-specific growth parameter. Other sigmoidal functional forms (alternatives to ψ) were considered but did not fit the data as well. Data were fit to the log of the death rate in the available data, using an optimization framework described in the appendix.
We ensembled two types of models to produce the estimates. We either parametrized the level parameter p or the time-axis shift parameter beta to depend on a covariate based on time from when the initial death rate exceeds 1e-15 to the implementation of social distancing. The value of the covariate multipliers in each type of model was assumed to closely follow the fit obtained from data from Wuhan, which is the time series to reach a stable state in the training dataset. To be specific, the generalizable information from Wuhan was the impact that social distancing had on maximum death rate and time to reach the inflection point. For each type of model, we both considered ‘short-range’ and ‘long-range’ variants, to explain existing data and forecast long- term trends, respectively. In the former case, covariate multipliers could deviate from those fit to Wuhan, while in the latter, the data from Wuhan had a larger impact on the final covariate multiplier. At the draw level, we linearly interpolate between the models as we go from times where we have collected data already to long-term forecasts. Specifically, we take the weighted combination of the daily increment of the log death rates from these models, with the weight linearly transitioning from short-range to long-range. The two remaining parameters (not modeled using covariates) were allowed to vary among locations to explain location-specific data.
Uncertainty in the model estimates is driven by two components: (1) uncertainty from fixed effect estimation and (2) uncertainty from random effects, with the latter dominant because of the high variation between locations. Uncertainty of fixed effects is estimated using asymptotic statistics derived from the likelihood. In every model, we estimated location-specific parameters or multipliers, and therefore used the empirical variance-covariance matrix of these parameters as a prior for location-specific fit. Posterior uncertainty within each location was then obtained using a standard asymptotic approximation at that location. Once we estimated total uncertainty by location at the draw level, overall uncertainty was then obtained by aggregation of draws. The dataset age-standardized to the age-structure of California is shown in Figure 2.
Because of the unique high-intensity epidemic in the Life Care Kirkland facility in Washington state,34,35 we have modeled this facility separately from the general population – see the appendix for details. Furthermore, as our initial development of the model was focused on King and Snohomish counties in Washington state, we have also stratified these 2 counties from the rest of Washington state. In other words, for Washington state, we model 3 populations explicitly: (i) the Life Care Kirkland facility; (ii) the remainder of the King and Snohomish county population; and (iii) all other counties in Washington state.
Time to threshold death rate
Only 27 states have deaths greater than 0.31 per million (e-15) and were included in the model estimation along with data on 44 other admin 1 locations. For other US states, we estimated the expected time from the current case count to reach the threshold level for the population death rate model. Using the observed distribution of the time from each level of case count to the threshold death rate for all admin 1 locations with data, we estimated this distribution. We used the mean and standard deviation of days from a given case count to the threshold death rate to develop the probability distribution for the day each state will cross over the threshold death rate, and then we applied the death rate epidemic curve after crossing the threshold.
Hospital service utilization microsimulation model
From the projected death rates, we estimated hospital service utilization using an individual-level microsimulation model. We simulated deaths by age using the average age pattern from Italy, China, South Korea, and the US (Figure 1) due to the relatively small number of deaths included for the US (n = 46) and the fact that the US age pattern is likely biased toward older-age deaths due to the early nursing home outbreak in Washington state. For each simulated death, we estimated the date of admission using the median length of stay for deaths estimated from the global line list (10 days <75 years; 8 days 75+ years). Simulated individuals requiring admission who were discharged alive were generated using the age-specific ratio of admissions to death (Figure 3), based again on the average across Italy, China, and the US. The age-specific fraction of admissions requiring ICU care was based on data from the US (122 total ICU admissions over 509 total admissions). The fraction of ICU admissions requiring invasive ventilation was estimated as 54% (total n = 104) based on 2 studies from China.36,37 To determine daily bed and ICU occupancy and ventilator use, we applied median lengths of stay of 12 days based on the analysis of available unit record data and 8 days for those admissions with ICU care.37
Results
By aggregating forecasts across states, we determined the overall trajectory of expected health care need in different categories and deaths, as shown in Figure 4. Demand for health services rapidly increases in the last week of March and first 2 weeks of April and then slowly declines through the rest of April and May, with demand continuing well into June. The shape of the curve reflects both the epidemic curves within each state and the staggered nature of the epidemic around the country. Daily deaths in the mean forecast exceed 2,300 by the second week of April. While peak demand will occur at the national level in the second week of April, this varies by state as shown in Figure 5. Peak demand will occur in the first half of April in about a third of states. This includes states like New York which have had early epidemics and a corresponding sharp rise in deaths. In contrast, other states such as Washington State and California with early epidemics have experienced slower increases in deaths.
Figure 6 shows aggregated excess demand for services above the capacity available currently in each state. Excess demand will be above 60,000 beds (64,175 [95% UI 7,977 to 251,059]) at the peak in the second week of April – about 7% of all hospital beds nationally. More concerning is the peak excess demand of more than 17,000 ICU beds (17,380 [95% UI 2,432 to 57,955]) – about a quarter of all ICU beds nationally. The upper bound of the 95% uncertainty intervals suggest the potential for a much more massive overload of the system, particularly amongst many patients needing ICU beds not having this level of care available. We have not been able to estimate current ventilator capacity; however, the number of ventilators implied by the peak (19,481 [95% UI 9,767 to 39,674]; Figure 4) also suggests potentially large gaps in availability of ventilators. Peak excess demand varies considerably by state; Figure 7 and 8 shows the peak % excess demand by state for total beds and ICU beds, respectively. Peak excess demand for total beds is particularly high in states such as New York, New Jersey, Connecticut, and Michigan. Peak excess demand for ICU beds is more of an issue across all states and is highest in the same set of states listed above as well as Louisiana, Missouri, Nevada, Vermont and Massachusetts.
Figure 9 shows the expected cumulative death numbers with 95% uncertainty intervals. The average forecast suggests 81,114 deaths, but the range is large, from 38,242 to 162,106 deaths. The figure shows that uncertainty widens markedly as the peak of the epidemic approaches, given that the exact timing of the peak is uncertain. Based on our projections, the number of daily deaths in the US will likely drop below 10 deaths between May 31 and June 6 (Figure 10). The date at which the projected daily death rate drops below 0.3 per million by state varies from the first half of May to the first of July (Figure 11). Those states where the death rate drops early overlap considerably with those states with large peak excess demand.
Results for each state are accessible through a visualization tool at http://covid19.healthdata.org/projections - the estimates presented in this tool will be continually updated as new data are incorporated and ultimately will supersede the results in this paper. Summary information on cumulative deaths, the date of peak demand, the peak demand, peak excess demand, and aggregate demand are provided for each US state in Table 1.
Discussion
This study has generated the first set of estimates of predicted health service utilization and deaths due to COVID-19 by day for the next 4 months for all US states, assuming that social distancing efforts will continue throughout the epidemic. The analysis shows large gaps between need for hospital services and available capacity, especially for inpatient and ICU beds. A similar or perhaps even greater gap for ventilators is also likely, but detailed state data on ventilator capacity is not available to directly estimate that gap. Uncertainty in the time course of the epidemic, its duration, and the peak of utilization and deaths is large this early in the epidemic. Given this, it is critical to update these projections as new data on deaths in the US are collected. Uncertainty will also be reduced as we gain more knowledge about the course of the epidemic in other countries, particularly in Europe, where countries such as Italy and Spain have a more advanced epidemic than the US. A critical aspect to the size of the peak is when aggressive measures for social distancing are implemented in each state. Delays in implementing government-mandated social distancing will have an important effect on the resource gaps that states’ health systems will have to manage.
Our estimates of excess demand suggest hospital systems will face difficult choices to continue providing high-quality care to their patients in need. This model was first developed for use by the UW Medicine system, and their practical experience provides insight into how it may be useful for planning purposes. From the perspective of planning for the 4 hospitals in the UW Medicine system, these projections immediately made apparent the need to rapidly build internal capacity. Strategies to do so included the suspension of elective and non-urgent surgeries and procedures, while supporting surge planning efforts and reconfiguration of medical/surgical and ICU beds across the system. This information also allowed us to more effectively engage UW Medicine leadership in clinician and staff workforce pooling and redeployment planning and trigger points while focusing efforts to seek and secure necessary personal protective equipment (PPE) and other equipment to close the identified gap. It also supported a proactive discussion regarding the potential shift from current standards of care to crisis standards of care, with the goal to do the most good for the greatest number in the setting of limited resources.
There are a variety of options available to deal with the situation, some of which have already been implemented or are being implemented in Washington and New York. One option is to reduce non-COVID-19 patient use. State governments have cancelled elective procedures38–40 (and many hospitals but not all have followed suit). However, this decision has significant financial implications for health systems, as elective procedures are a major source of revenue for hospitals.41 For example, in the UW Medicine system, non-COVID-19 utilization is down 14% over a 2-week period since elective activity was reduced. Also, aggressive social distancing policies reduce not only the transmission of COVID-19 but will likely have the added benefit of reducing health care utilization due to other causes such as injuries. Reducing non-COVID-19 demand alone will not be sufficient, and strategies to increase capacity are clearly needed. This includes setting up additional beds by repurposing unused operating rooms, pre- and post- recovery rooms, procedural areas, medical and nursing staff quarters, and hallways. For example, in UW Medicine, the use of such strategies has enabled planning to increase bed capacity temporarily by 65%.
Currently, one of the largest constraints on effective care may be the lack of ventilators. One supplement to ventilator capacity is using anesthesia machines freed up by deferring or cancelling elective surgeries. Other options go beyond the capacity or control of specific hospitals. The use of mobile military resources including the National Guard42–44 has the potential to address some capacity limitations, particularly given the differently timed epidemics across states. Other innovative strategies will need to be found, including the construction of temporary hospital facilities as was done in Wuhan,45 Washington state,46 and also New York.44,47
In this study, we have quantified the potential gap in physical resources, but there is an even larger potential gap in human resources (HR). Expanding bed capacity beyond licensed bed capacity may require an even larger increase in the HR to provide care. The average annual bed- day utilization rate in the US is 66% and ranges from 54% (Idaho) to 80% (Connecticut) by state. Most US hospitals are staffed appropriately at their usual capacity utilization rate, and expanding even up to, but then potentially well beyond, licensed capacity will require finding substantial additional HR. Strategies include increasing overtime, training operating room and community clinic staff in inpatient care or physician specialties in COVID-19 patient care, rehiring recently separated workers, and the use of volunteers. For example, UW Medicine has been fortunate that clinical faculty time can be redirected from research and teaching to clinical care during the COVID-19 surge. Other hospitals may not have this same ability. The most concerning HR bottleneck identified for UW Medicine is for ICU nurses, for which there are very limited options for increasing capacity. In addition to HR, what should not be overlooked is the increased demand for supplies ranging from PPE, medication, and ventilator supplies to basics such as bed linen. Add to these the need to expand other infrastructure required to meet the COVID-19 surge, such as information technology (IT) for electronic medical records. The overall financial cost over a short period of time is likely to be enormous, particularly when juxtaposed against the substantial reductions in revenue due to the cancellation of elective procedures.
The timing of the implementation of social distancing mandates may be a critical determinant of peak demand and cumulative deaths. For states that have not implemented 3 of 4 measures (school closures, closing non-essential services, shelter-in-place, and major travel restrictions), we have assumed that they will be implemented within 7 days, given the rapid adoption of these measures in nearly all states. At this point in the epidemic, we have had to make arbitrary assumptions in our model on the equivalency between implementing 1, 2, or 3 measures – and we have implicitly assumed that implementing 3 of 4 measures will be enough to follow a trajectory similar to Wuhan – but it is plausible that it requires all 4 measures. As more data accumulate, especially on the timing of deceleration of daily deaths, we may be able to empirically test which of these measures is more correlated with slowing the epidemic curve and reducing the ultimate death toll. Perhaps as important will be the question of adherence to social distancing mandates; it will take time to evaluate whether social distancing adherence is fundamentally different in the US compared to Wuhan. Even in Wuhan it was a full 27 days from implementation of social distancing to reaching the peak level of daily deaths.
As data on the epidemics in each admin 1 unit accumulate, including data on health service utilization, we will derive important insights into the epidemic trajectories and health service demand. At this early stage, even 1 to 2 days’ more data for a state will improve the estimates of service need and expected deaths. Because of sparse US data for some aspects of health service utilization, we have used data from the US, Italy, and China. As more data on US treatment accumulate, further revisions will be able to more accurately reflect US practice patterns for COVID-19. For this reason, we will revise the model every day, providing an updated forecast for health service providers and the public.
Any attempt to forecast the COVID-19 epidemic has many limitations. Only one location has had a generalized epidemic and has currently brought new cases to 0 or near 0, namely Wuhan. Many other locations, including all other provinces in China, have so far successfully contained transmission, preventing a general outbreak. Modeling for US states based on one completed epidemic, at least for the first wave, and many incomplete epidemics is intrinsically challenging. The consequent main limitation of our study is that observed epidemic curves for COVID-19 deaths define the likely trajectory for US states. In this study, we do include a covariate meant to capture the timing of social distancing measures to take into account that Wuhan implemented 4 out of 4 social distancing measures within 6 days of reaching a threshold death rate of 0.31 per million. Our models explicitly take into account variation in age-structure, which is a key driver of all-age mortality. But these efforts at quantification do not take into account many other factors that may influence the epidemic trajectory: the prevalence of chronic lung disease, the prevalence of multi-morbidity, population density, use of public transport, and other factors that may influence the immune response. We also have not explicitly incorporated the effect of reduced quality of care due to stressed and overloaded health systems beyond what is captured in the data. For example, the higher mortality rate in Italy is likely in part due to policies around restricting invasive ventilation in the elderly. The model ensemble used does suggest that locations with faster increases in the death rate are likely to have more peak case load and cumulative deaths, but our uncertainty intervals are appropriately large.
Conclusion
COVID-19 is an extraordinary challenge to US health and the healthcare system. In this study, we forecast a huge excess of demand for hospital bed-days and ICU bed-days, especially in the second week of April. Our estimate of 81 thousand deaths in the US over the next 4 months is an alarming number, but this number could be substantially higher if excess demand for health system resources is not addressed and if social distancing policies are not vigorously implemented and enforced across all states. This planning model will hopefully provide an up-to- date tool for improved hospital resource allocation.
Data Availability
A full list of data citations are available by contacting the corresponding author.