Abstract
Objectives We identify a correction for excess mortality that takes the sudden unexpected changes in the size of the United States population into account.
Design This is a weekly cross-sectional analysis of all-cause mortality since week 5, 2020. We describe and apply a simple correction that takes population changes into account in order to provide corrected weekly estimates of expected deaths for 2020 and 2021.
Setting The United States.
Participants All United States residents.
Interventions The covid-19 pandemic.
Main outcome measures Expected and excess mortality for the United States during the covid-19 period.
Results As of week 53, 2020 (ending January 2, 2021), approximately >10,200 more excess deaths have occurred in the United States than could be detected if expected deaths projections were not amended to reflect population decreases during 2020. The figure is projected to rise to >12,600 (>600 weekly) by week 5, 2021. Assuming recent excess mortality and pandemic-associated visa reductions continue until the earliest time herd immunity could be approached resulting from a combination of infections and vaccinations (week 17, 2021), if point estimates of expected deaths are not corrected, expected deaths will be overestimated (and therefore potential excess mortality underestimated) by ∼43,000 during 2021, or >53,300 since the outbreak of the pandemic measurement period (beginning week 5, 2020). By late December 2021, weekly expected death differences are projected to approach 1,000 per week.
Conclusions Current models measuring excess mortality should be revised immediately so that public health officials do not lose the ability to detect ongoing excess mortality as the population changes continue to compound, lowering the number of weekly expected deaths. A similar approach should be used in the middle and late phases of all future pandemics.
Introduction
In a pandemic, the ability to detect excess mortality is paramount to policy decision-making. Detection of excess mortality relies on accurate mortality expectations. Unfortunately, covid-19 has corrupted critical assumptions upon which usual predictive models rely. Specifically, there are currently fewer permanent United States residents than projected before the pandemic, owing to excess mortality and decreases in inward migration.
Current Centers for Disease Control and Prevention and other published mortality models do not take real-time population changes into account, assuming that established population trends have not deviated substantially, and therefore minimally affect the accuracy of expected death projections.1–4 Sudden changes in the United States population size during 2020 renders that assumption inadequate.
Excess mortality will be underestimated if expected death projections are not lowered to account for these population changes. The cumulative effect of this substantially blunts the ability to detect excess mortality during periods in which mortality rates only modestly exceed projected expectations. Moreover, the insidious compounding of these effects continuously decreases the population denominator, thereby further evading detection.
Here, we describe a simple correction procedure to improve the estimate of excess mortality during the 2020 covid-19 pandemic period and show its effect. We then conservatively project the anticipated magnitude of effect that is likely to occur during 2021 unless the technique we describe is widely adopted. This approach has widespread applicability to future excess death estimates when population sizes are in flux.
Methods
We performed a cross-sectional analysis of weekly excess deaths in the United States during the covid-19 pandemic period.
First, we determined excess deaths, defined as the difference between observed and expected deaths, using a “standard” approach.5,6 For granularity and accuracy, the United States population was divided into 6 age brackets and further divided by sex for a total of 12 demographic groups as follows: 0-24 female, 0-24 male, 25-44 female, 25-44 male, 45-64 female, 45-64 male, 65-74 female, 65-74 male, 75-84 female, 75-84 male, ≥85 female, and ≥85 male.
For observed mortality we used weekly figures provided by the National Center for Health Statistics at the United States Centers for Disease Control and Prevention. For weeks 6-53 2020 (with week 53 ending January 2nd, 2021, as per the CDC), published weighted figures were used in order to minimize known lag effects.7
To calculate standard expected deaths, we considered ongoing changes due to normal population growth and aging on a weekly basis, as elsewhere.5 Specifically, we applied seasonal autoregressive integrated moving average (ARIMA) to project the weekly population for all 12 groups for 2020-2021 using the data from the US Centers for Disease Control and Prevention for 2014-2019.8,9 The seasonal period was chosen at 52 weeks to project weekly death expectations for 2020 and 2021 for all 12 groups. We then summed the estimated contributions from the groups to obtain a point estimate of the total number of expected weekly deaths for the United States from week 6 of 2020 through Week 52 of 2021. Both 80% and 95% confidence interval estimates were determined by combining the confidence interval from the 12 groups, assuming independence among groups (see Tables S4-S16).
Next, for each of the 12 groups, we calculated “corrected” weekly estimates for expected deaths taking pandemic-associated changes in population into effect. To do this, weekly all-cause mortality incident rates using point estimates calculated via the standard approach above were first determined. Then, for each week, we calculated a revised estimated population for each of the 12 groups by subtracting the cumulative number of excess deaths during the covid-19 pandemic to that point and the cumulative decrease in permanent resident visas granted to persons arriving from abroad (owing to official policies halting such movements) for that group from the corresponding original population projection.9
The decrease in permanent resident visas granted during 2020 was determined by subtracting the month-over-month difference between 2020 and 2019 data, as reported by the United States Department of State (Table S1; see Supplemental Appendix Notes).10 Monthly figures were divided by 4 to create weekly estimates. We then apportioned the resulting difference among the 12 demographic groups according to the demographic breakdown of visas granted in 2019 (Table S2).11 Finally, to calculate the corrected expected number of weekly deaths, the previously determined all-cause mortality incident rate was multiplied by the revised population estimate (i.e. the corrected denominator).
The mathematical description of this procedure is provided as follows:
We assume there were 53 weeks during 2020. For week 6 < i ≤ 53, we defined the original projected population for each of the 12 demographics (denoted by k, where 1 ≤ k ≤ 12) in week i of 2020 to be ni,k; the original point estimate for all-cause deaths to be ; the sum of all previous weeks’ excess death to be Pi-1,k (starting from week 6, 2020); the sum of all previous weeks’ decreases in inward immigration to be Qi-1,k (starting from week 6, 2020). Therefore, the corrected expected deaths for week i (since week 6, 2020) after correction satisfies:
The above equation also holds for the weeks in 2021, where Pi -1,k and Qi -1,k denote the sum of all previous weeks’ excess death and the sum of all previous weeks’ decreases in inward immigration since week 6, 2020.
In order to project expected deaths for 2021, we assumed that pandemic-related population changes would continue through week 17 and cease at that point (the week ending May 1, 2021). For simplicity, excess deaths for week 1-17 of 2021 were estimated using the provisional mean excess mortality for each of the 12 demographics reported for weeks 49-52, 2020 (see Tables S4-S16, orange text), though these data remain incomplete and CDC models suggest this assumption will underestimate mean excess mortality during this time.12 To estimate ongoing decreases in permanent visas for these weeks, we conservatively assumed a decrease 0f 70% for weeks 1-17 of 2021, although the mean during the pandemic period through December 2020 was actually >80% (Supplemental Appendix Notes, Table S3, and Tables S4-S16, orange text).9,10 Expected and excess deaths were reported to the hundreds place in the text and to the ones place in tables and figures.
This study was not subject to institutional review because it relied solely upon data available to the public.
All the analysis was done through R version 4.0.2 and Microsoft Excel version 16.44.
Results
Our study included 3,079,937 all-cause deaths in the United States from week 5, 2020 through week 53 of 2020, based on preliminary CDC/NCHS data as of February 4, 2021.72/10/2021 1:03:00 PM Absent the covid-19 pandemic, we estimated that the United States population would have increased from approximately 329,525,900 in week 5, 2020 to 331,381,200 by week 53, 2020 residents during the study period (Table 1). Based on standard calculations that take this growth into account, approximately 438,200 excess deaths occurred during that period (lower 95% confidence boundary 282,400) based on preliminary data. However, as of week 53 2020, we estimate that the United States population was actually 330,624,000 persons (757,200 fewer than originally projected), owing to the noted pandemic-associated excess mortality and a measured decrease in the number new permanent visa-granted residents arriving to the United States (approximately 316,300 persons) since February 2020 (Table 1, Table S1). As a result of this denominator change, as of week 53, 2020 we measured that approximately >10,200 more excess deaths occurred in the United States in 2020 than detectable using standard uncorrected approaches (Figure 1).
Due to the compounding effect, the number of missed excess deaths will rise to >12,600 (>600 weekly) by week 5, 2021 (Figure 1-2). Assuming recent excess mortality and pandemic-associated visa reductions continue until the earliest feasible time herd immunity could be approached resulting from a combination of infections and vaccinations (week 17, 2021), if point estimates of expected deaths are not corrected to reflect these changes, we project expected deaths will be overestimated (and therefore potential excess mortality underestimated) by ∼43,000 during 2021 (Figure 2), or >53,300 since the outbreak of the pandemic measurement period (beginning week 5, 2021). By late December 2021, weekly expected death differences are projected to approach 1,000 per week (∼61,200 expected deaths using the standard calculation versus ∼60,200 using the population-corrected calculation (Figure 3).
Discussion
We find that excess mortality in the United States has been underestimated during the covid-19 pandemic because current models in use do not consider changes in population that have occurred during the crisis. Here, we show that by taking into account such population changes weekly, the number of expected deaths decreases, meaning that excess mortality is greater than would otherwise be detectable.
We also show that this effect is compounding over time such that these quantitative differences will soon translate into an important qualitative reversal. If the measured number of deaths in a coming month is similar to the originally expected deaths estimate, observers will incorrectly conclude that excess mortality is no longer occurring. However, in comparison to the corrected estimate, excess deaths would, in fact, be occurring (Figure 3, the growing area between blue and gray lines). Such a failing to detect ongoing excess death could lead to premature policy changes such as the early curtailing of non-pharmacologic interventions. Moreover, after the pandemic subsides, failure to make the corrections identified here would eventually and misleadingly seem to appear to epidemiologists and public health officials as deficit mortality (i.e., measuring fewer deaths than expected). This approach would inaccurately depict a scenario in which unadjusted post-pandemic death rates appear to be lower than at any time in history, when they are not. In addition, because much of the excess mortality that has occurred during the covid-19 pandemic has been in persons with medical comorbidities, the surviving post-pandemic population is likely to collectively have a lower crude all-cause mortality rate than the pre-pandemic population because a smaller proportion of them will have significant comorbidities. Therefore, even the corrected estimates of expected mortality in the future we provide may still be overestimated. In fact, our findings obliquely points to the need for a systemic post-pandemic population risk recalibration so that public health professionals can continue to track disease-specific mortality for virtually every cause of death accurately.
A strength of our correction procedure for calculating excess mortality is that it relies upon real data for 2020 (week 5, 2020 to week 53, 2020). A weakness is that we must currently rely upon incomplete provisional data. Therefore, reporting lags render the figures presented to be lower boundary estimates of the effect we have unveiled for 2020, which also artificially lowers our 2021 projections.
An acknowledged weakness in our 2021 projections is that it was necessary to estimate the magnitude of excess mortality and the decrease in permanent visas for weeks 1-17 of 2021. In addition, the assumption that all such pandemic-associated changes would continue until May 1, 2021, and then cease suddenly was arbitrary. However, we believe our approach for this period was conservative; we used average counts from weeks 49-52 of 2020, the most recent for which adequate data is currently available, although the subsequent weeks (January 2021) corresponded to the deadliest weeks of the pandemic in the United States yet recorded.13 Thus, our projections for 2021 are almost certain to underestimate the true magnitude of the effect described here. In addition, the changes due to immigration described herein also represent lower boundary estimates, as we considered only permanent visas granted from persons arriving directly from abroad (See Supplemental Appendix Notes). Based on prior years and government travel data (Table S3), approximately twice as many fewer permanent residents likely arrived to the United States during the pandemic period than would be represent by CDC mortality data.
To our knowledge, this report is the first to describe the underestimation of excess mortality resulting from real-time changes in the population during a pandemic or outbreak of any kind. During the covid-19 pandemic, most of the effect we describe resulted from excess mortality rather than immigration. However, in a future pandemic with higher mortality rates among young and middle-aged adults (who comprise a majority of inward migrants, see Table S2), that balance could change. This means that more inclusive and more definitive real-time reporting on the decreases in inward migration might be more important in accurately measuring this effect in future pandemics. In nations with more outward migration than inward migration, the effects described here could have the opposite effect. However, these potential eventualities are built into our procedure (See Equation 1).
We propose that in the middle and late phases of the covid-19 pandemic—and future pandemics that similarly cause substantial and measurable changes in the population size—the correction methodology described here will help officials more accurately detect excess mortality later in the pandemic. Failing to adopt the approach described here is almost certain to lead to the premature declaration that excess mortality has ceased to occur both now and in any similar instances in the future. That the problem we identify here has not been considered in the past is a result of the fact that there has never been an event with this magnitude of excess mortality during an era in which real-time state, national, and even international data became available to epidemiologists and public officials with such a short time lag and on an ongoing basis.
Excess mortality remains one of the most reliable metrics for measuring a pandemic’s effect, as it is agnostic to cause of death. However, to leverage excess death data properly, the ability to detect it later in pandemics must be maintained. Future research will include updated data and assess whether the effects described herein can be reliably applied at the city or state level, where denominators are smaller. In addition, the correction methodology we have identified may be improved upon by further by considering excess mortality in certain risk pools such as among persons living in long term care facilities, or in certain high-risk areas. To ensure our policy decisions are based on the best available evidence, officials should update United States and global expected death estimates immediately.
Data Availability
The data are available to the public and were available to all authors at all times.
Patient and public involvement statement
It was not appropriate or possible to involve patients or the public in the design, or conduct, or reporting, or dissemination plans of our research.
Contributor statement
Dr. Faust conceived of the project, designed it, and wrote the manuscript in consultation with Dr. Krumholz. Dr. Du, Li, and Lin provided the statistical modeling framework and planning. Dr. Du performed the primary statistical modeling. analysis. All authors were involved in the analysis of the findings and contributed to the manuscript creation process either via direct writing or editing.
Funding
There was no specific funding for this investigation.
Supplemental Appendix
See attached pdf.
Acknowledgements
We wish to thank Lauren M. Rossen, PhD, MS (National Center for Health Statistics), for facilitating the public release of National Center for Health Statistics data used for this study, and Julia Gelatt, PhD (Migration Policy Institute) for expertise and help identifying migration data. Neither individual received compensation for their contributions.