Abstract
Background Covid-19 excess deaths refer to increases in mortality over what would normally have been expected in the absence of the Covid-19 pandemic. Several prior studies have calculated excess deaths but were limited to the national or state level, precluding an examination of area-level variation in excess mortality and excess deaths not assigned to Covid-19. In this study, we take advantage of county-level variation in Covid-19 mortality to estimate excess deaths associated with the pandemic and examine how the extent of excess mortality not assigned to Covid-19 varies across subsets of counties defined by sociodemographic and health characteristics.
Methods and Findings We made use of National Center for Health Statistics (NCHS) data on direct Covid-19 and all-cause mortality in U.S. counties occurring from February 1 to October 17 and reported before December 15, 2020. Our sample included 787 counties with more than 20 Covid-19 deaths. We first modeled the relationship between 2020 all-cause mortality and Covid-19 mortality across all counties and then produced fully stratified models to explore differences in this relationship among strata of sociodemographic and health factors. Overall, we found that for every 100 deaths assigned to Covid-19, 144 all-cause deaths occurred (95% CI, 130 to 159), implying that 31% (95% CI, 24% to 38%) of excess deaths were ascribed to causes of death other than Covid-19 itself. Our stratified models revealed that the percentage of excess deaths not assigned to Covid-19 was substantially higher among counties in rural areas, counties with lower median household incomes and less formal education, counties with more diabetes, obesity, and smoking, and counties in the South. Counties with more non-Hispanic Black residents, who are already at high risk of Covid-19 death based on direct counts, also reported more excess deaths not assigned to Covid-19.
Conclusions Direct Covid-19 death counts in 2020 substantially underestimated total excess mortality attributable to Covid-19. Socioeconomic and racial inequities in Covid-19 mortality also increased when excess deaths not assigned to Covid-19 were considered. Our results highlight the significant role of social determinants of health in Covid-19 mortality and the importance of considering health equity in the policy response to the pandemic.
Introduction
The novel coronavirus disease 2019 (Covid-19) is an international public health emergency caused by the respiratory droplet transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).1 SARS-CoV-2 infects humans through the lung epithelium and is associated with a high incidence of acute respiratory distress syndrome, vascular injury, and death.2 The United States has emerged as an epicenter of the Covid-19 pandemic, with over 18.2 million confirmed cases and 322,218 deaths as of December 22, 2020.3
Vital registration data sourced from death certificates on cause of death are likely to underestimate the mortality burden associated with the Covid-19 pandemic for several reasons.4,5 First, some direct deaths attributable to Covid-19 may be assigned to other causes of death due to an absence of widespread testing and low rates of diagnoses at the time of death.6 Additionally, direct deaths from unfamiliar complications of Covid-19 such as coagulopathy, myocarditis, inflammatory processes, and arrhythmias may have caused confusion and led to attributions of death to other causes, especially early in the pandemic and among persons with comorbid conditions.7–9 Second, Covid-19 death counts do not take into account the indirect consequences of the Covid-19 pandemic on mortality levels.10–12 Indirect effects may include increases in mortality resulting from reductions in access to and use of health care services and psychosocial consequences of stay-at-home orders.13 Increases in stress, depression, and substance use related to the pandemic could also lead to suicides and overdose deaths.14,15 Economic hardship, housing insecurity, and food insecurity may cause indirect deaths, especially among those living with chronic illnesses or who face acute health emergencies and cannot afford medicines or medical supplies.16–18 On the other hand, the pandemic may reduce mortality as a result of reductions in travel and associated motor vehicle mortality, lower air pollution levels, or the possible benefits of Covid-19 mitigation efforts (i.e., mask wearing and physical distancing) on reducing influenza spread.19,20 It is also possible that Covid-19 deaths are over-recorded in some instances, e.g., because some deaths that should have been assigned to influenza were instead assigned to Covid-19.
Finally, the Covid-19 epidemic may reduce mortality from certain other causes of death because of frailty selection; those who die from Covid-19 may have been unusually frail and vulnerable to death from other diseases. Consequently, the rate of death from those diseases may decline and offset some of the increase in all-cause mortality attributable to Covid-19 deaths alone.
The term “excess deaths” refers to differences in mortality relative to what would have been expected in the absence of the Covid-19 pandemic. Typically, excess deaths are computed relative to a recent historical benchmark for the same population. Excess deaths include deaths directly assigned to Covid-19 on death certificates and excess deaths not assigned to Covid-19, which were either misclassified to other causes of deaths or were indirectly related to the Covid-19 pandemic. Using deaths from all causes to measure the excess mortality impact of the Covid-19 epidemic can help circumvent biases in vital statistics, such as low Covid-19 testing rates, reporting lags, and differences in death certification coding practices, and capture excess deaths indirectly related to the Covid-19 pandemic. As such, estimates of excess deaths from all causes associated with the pandemic provide a useful measure of the total mortality burden associated with Covid-19.5
Previous reports have estimated excess deaths in several different ways. The National Center for Health Statistics (NCHS) estimates excess mortality through comparison of mortality levels in 2020 to historical mortality data by week and geographic location.21 They present a range of values for excess deaths based on different historical thresholds, including the average expected count or upper boundary of the uncertainty interval, and apply weights to the 2020 provisional death data to account for incomplete data. In contrast, Weinberger et al. and Woolf et al. use multivariable Poisson regression models to evaluate increases in the occurrence of deaths due to any cause across the US.22,23 Weinberger et al. adjust for influenza activity.22 Kontis et al. apply Bayesian ensemble modeling to obtain smoothed estimates of excess deaths by age and sex in the UK.24 These studies make estimates for each state or country individually and do not allow a relationship between all-cause mortality and Covid-19 mortality to be identified through analysis across smaller area units such as counties. While prior research has documented significant racial and socioeconomic inequities in directly assigned Covid-19 deaths,25–30 few studies have documented how excess mortality in 2020 has differed across sociodemographic or health factors.31
In this paper, we take advantage of county-level variation in Covid-19 mortality to estimate its relationship with all-cause mortality across US counties. We anticipate that counties with higher mortality from Covid-19 will also have experienced greater increases in mortality from other causes of death because the impact of the pandemic is not registered in Covid-19 deaths alone. We use the relationship between Covid-19 mortality and changing mortality from all causes of death to estimate excess mortality that was not directly assigned to Covid-19 as a cause of death. While prior studies generated a prediction of expected mortality in 2020 based on historical trends and then compared expected to observed mortality to estimate excess deaths, our model takes advantage of county variation to estimate simultaneously the mortality trend and the imprint of Covid-19. We then examine how the extent of excess mortality not assigned to Covid-19 varies across subsets of counties defined by area-level sociodemographic and health characteristics, allowing us to identify population subgroups with a disproportionate number of excess deaths that were not directly assigned to Covid-19. Our estimates provide an alternative approach to calculating excess deaths that can complement existing approaches.
Methods
Data Sources
We used NCHS provisional county-level data on all-cause mortality and directly assigned Covid-19 mortality from February 1 to October 17, 2020. The data were considered provisional due to a time lag between the occurrence of deaths and the completion, submission, and processing of death certificates. To account for possible lags in mortality reporting, we used data generated on December 15, 2020 (eight weeks after the end date), meaning that all deaths occurring before October 17 that were reported before December 15 were captured. Prior analyses of NCHS provisional data have found that provisional mortality data can have low completeness in the first month after a death occurs but are more than 75% complete within 8 weeks after a death.32 Of the deaths assigned to Covid-19 by NCHS, 94% had Covid-19 reported as the underlying cause of death; the other 6% had Covid-19 listed elsewhere on the death certificate.33 The original NCHS data file was limited to counties with 10 or more Covid-19 deaths (n=1,239). We then excluded counties with 20 or fewer Covid-19 deaths because death rate estimates based on death counts of fewer than 20 have a high relative standard error and are thus considered unreliable.34
We also excluded counties in North Carolina, Connecticut, and West Virginia, which were flagged in NCHS technical documentation as having little to no provisional data available in recent weeks. The final number of counties included in the analysis was 787, and the exclusion criteria are detailed in Supplemental Figure 1.
To construct a historical comparison period, we utilized county-level data from CDC Wonder reporting all-cause mortality each month from February to October for 2013 through 2018. We also used U.S. Census data on county population and age distribution from 2013 to 2020 and data on sociodemographic and health factors from a variety of consolidated sources including the 2020 RWJ Foundation County Health Rankings. A list of these data sources is provided in Supplemental Table 1. The present investigation relied on deidentified publicly available data and was therefore exempted from review by the Boston University Medical Center Institutional Review Board.
Death Rates
We produced crude death rates for all-cause and directly coded Covid-19 mortality in 2020 using the reported death counts and the estimated county-level population on July 1, 2020. We multiplied the population by 259 days (the number of days from February 1 to October 17, 2020) divided by 365.25 days so that our death rates would be in units of deaths per person-year. To compute an average historical death rate for 2013 to 2018, we added deaths from February to September of each year plus 17/31 of the deaths recorded in October. We then divided the sum of deaths from 2013 to 2018 by the total population from 2013 to 2018.
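The rate construction described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' code: the function names and the example county are hypothetical, and scaling the historical population by the same 259/365.25 exposure fraction is an assumption made here so that the 2020 and historical rates share the same units.

```python
from datetime import date

START = date(2020, 2, 1)
END = date(2020, 10, 17)
DAYS_PER_YEAR = 365.25


def person_years(population, start=START, end=END):
    """Person-years lived by a population observed from start to end."""
    days = (end - start).days  # 259 for February 1 to October 17, 2020
    return population * days / DAYS_PER_YEAR


def crude_death_rate(deaths, population):
    """Deaths per person-year over the 2020 study window."""
    return deaths / person_years(population)


def historical_death_rate(yearly_records, exposure_days=259):
    """Average 2013-2018 rate from per-year records.

    yearly_records: iterable of (deaths_feb_to_sep, deaths_october, population).
    October deaths are prorated to 17 of 31 days, as in the text.
    ASSUMPTION: population is scaled by exposure_days / 365.25 so the
    result is in deaths per person-year, matching crude_death_rate.
    """
    deaths = sum(d + o * 17 / 31 for d, o, _ in yearly_records)
    exposure = sum(p * exposure_days / DAYS_PER_YEAR for _, _, p in yearly_records)
    return deaths / exposure


# Hypothetical county: 100,000 residents, 600 all-cause deaths in the window.
rate_2020 = crude_death_rate(600, 100_000)
rate_hist = historical_death_rate([(500, 62, 100_000)])
print(f"2020: {rate_2020 * 1000:.2f} per 1,000 person-years")
print(f"historical: {rate_hist * 1000:.2f} per 1,000 person-years")
```

Expressing both rates per person-year keeps the 2020 and historical terms of the regression on a common scale.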
Sociodemographic and Health Factors
Prior literature has documented differences in Covid-19 mortality by sociodemographic and health factors.25,35 In this analysis, the sociodemographic factors that we examined were age (% over 65 years), rurality (% rural), population distribution by race/ethnicity (% Hispanic, % non-Hispanic Black, % non-Hispanic white), socioeconomic status (median household income), income inequality (ratio of household income at the 80th percentile to income at the 20th percentile), and housing (% homeownership). For health factors, we examined obesity (% with obesity), smoking (% who smoke), and diabetes (% with diabetes). We stratified counties into population weighted quartiles for each sociodemographic or health factor, with particular attention to counties in the upper and lower 25% of values.
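One way to form population-weighted quartiles is sketched below. The text does not specify how counties that straddle a quartile boundary were handled, so the midpoint rule used here is an assumption, and the function name and example values are hypothetical.

```python
def population_weighted_quartiles(values, populations):
    """Assign each county a quartile (1 to 4) of a county-level factor.

    Counties are sorted by the factor value and assigned so that each
    quartile holds roughly 25% of the total population rather than 25%
    of the counties.
    ASSUMPTION: a county is placed in the quartile containing the
    midpoint of its share of the cumulative population.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    total = sum(populations)
    labels = [0] * len(values)
    cumulative = 0.0
    for i in order:
        midpoint = (cumulative + populations[i] / 2) / total
        labels[i] = min(4, int(midpoint * 4) + 1)
        cumulative += populations[i]
    return labels


# Hypothetical example: eight equally sized counties and a factor such as % rural.
pct_rural = [5, 20, 10, 40, 30, 15, 25, 35]
pops = [100] * 8
quartiles = population_weighted_quartiles(pct_rural, pops)
print(quartiles)
```

With equal populations this reduces to ordinary quartiles; with unequal populations a single large county can occupy most of a quartile on its own.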
Statistical Analysis
We first modeled all-cause mortality in 2020 as a function of historical all-cause mortality, directly assigned Covid-19 mortality in 2020, and an error term:
M(i) = α + β1 M*(i) + β2 C(i) + ε(i)    (1)
where
M(i) = Death rate from all causes in county i in 2020
M*(i) = Average death rate from all causes in county i in 2013-2018
C(i) = Covid-19 death rate in county i in 2020
ε(i) = Error term
The parameters of equation (1) were estimated using Ordinary Least Squares regression with county units weighted by their population size. The value of α represents changes in mortality that are independent of Covid-19 mortality and that are common across regions. The value of β1 represents the extent to which past levels of all-cause mortality in a county are replicated in 2020. If β1 = 1.0, for example, then this would indicate that all-cause mortality in 2020 was on average equal to that in 2013-18, plus or minus the value of α. Together, combinations of α and β1 indicate how mortality changes that are not associated with Covid-19 vary with the level of all-cause mortality in 2013-18. The value of β2 indicates the extent to which mortality from Covid-19 affects all-cause mortality in 2020 after adjusting for historical mortality patterns. If β2 = 1.0, this would imply that each death coded to Covid-19 would be associated with one additional death from all causes combined. Values of β2 greater than 1.0 would suggest that the effect of the Covid-19 pandemic in a county is not fully reflected in deaths assigned to Covid-19 and that excess deaths are being attributed to some other causes of death. Values of β2 less than 1.0 would suggest that Covid-19 is over-recorded as a cause of death or reductions in mortality are occurring for other causes.
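A minimal sketch of a population-weighted least squares fit of equation (1) follows, written in plain Python so that it has no dependencies. It is not the authors' implementation: the helper names are hypothetical, and the six synthetic counties are constructed to lie exactly on a line with α = 0.15, β1 = 1.09, and β2 = 1.44 (values chosen for illustration) so that the fit can be checked by eye.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def weighted_ols(all_cause, historical, covid, populations):
    """Fit M(i) = alpha + beta1 * M*(i) + beta2 * C(i), weighting by population.

    Solves the weighted normal equations (X'WX) b = X'Wy.
    """
    total = sum(populations)
    xtwx = [[0.0] * 3 for _ in range(3)]
    xtwy = [0.0] * 3
    for y, h, c, p in zip(all_cause, historical, covid, populations):
        w = p / total
        x = (1.0, h, c)
        for j in range(3):
            xtwy[j] += w * x[j] * y
            for k in range(3):
                xtwx[j][k] += w * x[j] * x[k]
    return solve_linear_system(xtwx, xtwy)


# Synthetic, noise-free counties (rates per 1,000 person-years; hypothetical).
hist = [8.0, 9.5, 7.2, 10.1, 8.8, 6.9]
covid = [0.5, 1.2, 0.3, 0.9, 2.0, 0.7]
pops = [120_000, 450_000, 60_000, 900_000, 250_000, 75_000]
observed = [0.15 + 1.09 * h + 1.44 * c for h, c in zip(hist, covid)]

alpha, beta1, beta2 = weighted_ols(observed, hist, covid, pops)
print(f"alpha={alpha:.3f}, beta1={beta1:.3f}, beta2={beta2:.3f}")
```

In practice a statistics package would also report standard errors and confidence intervals, which this sketch omits.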
In secondary analyses, we limited our sample to counties with more than 50 Covid-19 deaths to assess the sensitivity of our results to counties reporting relatively small numbers of Covid-19 deaths. We also limited our sample to counties with populations greater than 50,000 to assess the robustness to excluding counties with relatively small populations. In further sensitivity analyses, we performed the regression among all counties in the original NCHS dataset, including counties that were eliminated by our study’s exclusion criteria due to their low data quality. To assess the robustness of our results to alternative modeling approaches, we also estimated the relationship between Covid-19 and all-cause mortality using a Negative Binomial regression model.
We did not control for age in our primary analysis because the effect of age should be partially captured in the historical mortality term. In a sensitivity analysis, we re-estimated the primary OLS model using indirectly age-standardized death rates to adjust for differences in the counties’ age distributions. Deaths by age were not available at the county level so direct standardization of death rates was not possible. Indirect standardization adopts the age-specific death rate schedule for the whole US and applies it to the age distribution of a county to predict the number of deaths in that county.36 It then calculates the ratio of actual deaths in the county to the predicted number of deaths. Finally, it applies that ratio to the US crude death rate to estimate the indirectly age-standardized death rate for the county.37 Death rates and age distributions were employed in 10-year wide age intervals. When the death rate referred to Covid-19 mortality alone, death rates were restricted to that cause of death.
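The three steps of indirect standardization described above can be sketched as follows. The function name, the three broad age groups, and all numbers are hypothetical and chosen only to make the arithmetic easy to follow; the analysis itself used 10-year age intervals.

```python
def indirectly_standardized_rate(county_pop_by_age, county_deaths,
                                 us_rates_by_age, us_crude_rate):
    """Indirectly age-standardized death rate for one county.

    1. Apply the US age-specific death rates to the county's age
       distribution to get the expected number of deaths.
    2. Take the ratio of observed to expected deaths.
    3. Multiply that ratio by the US crude death rate.
    """
    expected = sum(p * r for p, r in zip(county_pop_by_age, us_rates_by_age))
    ratio = county_deaths / expected
    return ratio * us_crude_rate


# Hypothetical inputs with three age groups instead of 10-year intervals.
county_pop = [50_000, 30_000, 20_000]   # young, middle, old
us_rates = [0.001, 0.005, 0.040]        # deaths per person-year by age group
us_crude = 0.0087                       # US crude death rate (illustrative)

rate = indirectly_standardized_rate(county_pop, 1_100, us_rates, us_crude)
print(f"{rate * 1000:.2f} per 1,000 person-years")
```

Here the expected count is 1,000 deaths, so a county with 1,100 observed deaths has a ratio of 1.1 and a standardized rate 10% above the US crude rate.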
To identify county-level characteristics that modified the relationship between direct Covid-19 mortality and excess mortality (the β2 coefficient), we fully stratified our primary regression model into population weighted quartiles of various county-level sociodemographic and health factors. We then produced a forest plot displaying the β2 coefficients for the upper and lower 25% of each county-level factor. Strata with high β2 coefficients represent counties that have a high proportion of excess deaths not assigned to Covid-19 compared to their direct Covid-19 deaths. To understand how this relationship translated to absolute numbers of deaths, we also calculated predicted excess death rates in each of the strata using the fully stratified model coefficients. We determined the average observed directly assigned Covid-19 death rate for each stratum by calculating the population-weighted mean and then found the predicted excess death rate not assigned to Covid-19 by multiplying that average rate by (β2 − 1). In a sensitivity analysis, we repeated both of these analyses using indirectly age standardized death rates.
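The prediction step for a single stratum can be sketched as below. The function names, the two-county stratum, and the β2 value of 2.0 are hypothetical, and weighting the mean by population is an assumption consistent with the population weighting used elsewhere in the analysis.

```python
def weighted_mean(values, weights):
    """Mean of values weighted by weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def predicted_unassigned_excess_rate(covid_rates, populations, beta2):
    """Predicted excess death rate not assigned to Covid-19 for a stratum.

    Each directly assigned Covid-19 death is associated with beta2
    all-cause deaths, so (beta2 - 1) of them are not assigned to Covid-19.
    """
    mean_covid_rate = weighted_mean(covid_rates, populations)
    return mean_covid_rate * (beta2 - 1.0)


# Hypothetical stratum: two counties, rates per 1,000 person-years.
rates = [1.0, 2.0]
pops = [300_000, 100_000]
unassigned = predicted_unassigned_excess_rate(rates, pops, beta2=2.0)
print(f"{unassigned:.2f} per 1,000 person-years")
```

In this example the weighted mean direct rate is 1.25, and a stratum β2 of 2.0 implies an equal predicted rate of excess deaths not assigned to Covid-19.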
Results
Table 1 presents characteristics of the 787 counties included in the dataset, whose distribution across the U.S. is visualized in Supplemental Figure 2. The total number of residents living in these counties was 259.2 million. Among these counties, 15.4% of the population was older than 65 years compared to 16.0% of the population in all counties in the U.S. The counties in the sample were 10.3% rural, compared to 18.6% for all counties in the U.S. On average, the counties in the sample were 21.0% Hispanic, 13.3% non-Hispanic Black and 55.9% non-Hispanic white. The median household income in the counties was $67,665, and the ratio of household income at the 80th percentile to income at the 20th percentile was 4.8. In the counties, 61.9% of residents were homeowners, and 16.6% were living with poor or fair health.
Figure 1 plots the difference between the 2020 all-cause mortality rate and the average 2013-2018 historical all-cause death rate against the Covid-19 mortality rate in the study counties. The area of each point is roughly proportional to the county’s population size. This figure presents evidence that there is a positive relationship between the change in mortality from all causes of death and the level of Covid-19 mortality in a county. The slope of the regression line (solid blue) is steeper than the 45-degree line (dashed grey), indicating that one additional Covid-19 death is associated with more than one additional death from all causes.
Table 2 presents coefficients of the model describing the relationship between the directly assigned Covid-19 mortality rate and all-cause mortality in 2020. The estimated value of α is 0.150 deaths per 1000 people and β1 is 1.09 (95% CI, 1.03 to 1.14). This combination suggests that crude all-cause mortality rose on average across counties between 2013-18 and 2020. The coefficient of β2 is estimated to be 1.44 (95% CI, 1.30 to 1.59). This value suggests that, for every 100 deaths assigned to Covid-19, the number of all-cause deaths rose by 144. This result implies that (144 – 100) / 144 or 31% (95% CI, 24% to 38%) of all excess deaths were not directly assigned to Covid-19 on death certificates. In absolute terms, 199,124 directly assigned Covid-19 deaths occurred in the 787 counties in the study between February 1 and October 17, 2020, meaning there were 88,142 (95% CI, 59,709 to 116,576) excess deaths not directly assigned to Covid-19 for a total of 287,266 (95% CI, 258,833 to 315,700) excess deaths.
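The arithmetic linking β2 to the share of excess deaths not assigned to Covid-19 can be checked with a short calculation. The function name is hypothetical; the inputs are the point estimates reported above. Note that multiplying the direct count by the rounded coefficient (1.44) would not reproduce 88,142 exactly, which suggests the reported count was derived from the unrounded coefficient.

```python
def share_not_assigned(beta2):
    """Fraction of excess deaths not assigned to Covid-19 implied by beta2."""
    return (beta2 - 1.0) / beta2


direct_deaths = 199_124        # directly assigned Covid-19 deaths (from the text)
unassigned_deaths = 88_142     # excess deaths not assigned (from the text)
total_excess = direct_deaths + unassigned_deaths

# beta2 implied by the reported counts, before rounding to two decimals.
implied_beta2 = total_excess / direct_deaths

print(f"share from beta2 = 1.44: {share_not_assigned(1.44):.1%}")
print(f"total excess deaths: {total_excess:,}")
print(f"implied beta2: {implied_beta2:.4f}")
print(f"share from counts: {unassigned_deaths / total_excess:.1%}")
```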
When we limited to counties with more than 50 direct Covid-19 deaths, the coefficient of β2 was 1.37 (95% CI, 1.20 to 1.54). β2 was 1.45 (95% CI, 1.29 to 1.60) when we limited to counties with more than 50,000 residents. Our results were relatively consistent across alternative modeling specifications and with indirect age standardization, with the percent of excess deaths not attributed to Covid-19 ranging from 31% to 33% across the OLS, indirectly age-standardized OLS, and Negative Binomial models (Supplemental Table 2). Including counties that were excluded from the main analysis because of poor quality data led to a substantial increase in β2 (1.61 vs. 1.44 in Table 2).
Figure 2 examines how county-level characteristics modified the β2 coefficient (the relationship between the rate of direct Covid-19 deaths and the rate of excess deaths from all causes). The figure displays estimates of the β2 coefficient across models that were fully stratified into population weighted quartiles for various sociodemographic and health factors. For sociodemographic factors, the β2 coefficient was elevated in counties that were the most rural (2.84 [95% CI, 2.55, 3.12]) compared to counties that were the least rural (1.13 [95% CI, 0.88, 1.39]). The β2 coefficient was also higher in counties with the lowest median household incomes (1.96 [95% CI, 1.69, 2.23]) and the least formal education (2.00 [95% CI, 1.79, 2.20]) than in counties with the highest median incomes (1.38 [95% CI, 1.22, 1.55]) and most education (1.27 [95% CI, 0.96, 1.59]). Regarding health factors, the β2 coefficient was higher in counties with the most obesity (2.67 [95% CI, 2.38, 2.95]), the most smoking (2.72 [95% CI, 2.38, 3.07]), and the most diabetes (2.37 [95% CI, 2.11, 2.63]) compared to counties with the least obesity (1.08 [95% CI, 0.89, 1.27]), the least smoking (1.14 [95% CI, 0.94, 1.35]), or the least diabetes (1.26 [95% CI, 1.00, 1.52]). Regionally, the β2 coefficient was elevated in the South (2.47 [95% CI, 2.17, 2.77]) compared to the Northeast (1.14 [95% CI, 0.78, 1.49]). Supplemental Figure 3 presents the stratified models calculated using indirectly age standardized death rates.
Figure 3 decomposes the 2020 excess death rate into the observed excess death rate directly assigned to Covid-19 and the predicted excess death rate not assigned to Covid-19 and compares counties in the upper and lower 25% of county-level factors. Predictions were generated using the fully stratified models presented in Figure 2. Supplemental Figure 4 presents decomposed death rates with indirect age standardization. Comparing counties with the most non-Hispanic Black residents to counties with the fewest non-Hispanic Black residents, substantial racial inequities were observed in direct Covid-19 mortality and in excess deaths not assigned to Covid-19. As shown in Supplemental Table 3, racial inequities for the non-Hispanic Black population grew larger when excess deaths not assigned to Covid-19 were considered. For direct Covid-19 death rates, the risk ratio for mortality in counties with the most non-Hispanic Black residents compared to counties with the fewest non-Hispanic Black residents was 1.93. With the inclusion of excess deaths not assigned to Covid-19, the risk ratio was 2.18. For counties with lower median incomes and less education, socioeconomic disparities in death rates grew larger when excess deaths not assigned to Covid-19 were considered. For rural areas, when excess deaths not assigned to Covid-19 were excluded from death rates, the risk ratio for the most rural counties vs. the least rural counties was 0.61. However, when excess deaths not assigned to Covid-19 were included, the risk ratio increased to 1.52, showing that death rates were higher in rural areas only after excess deaths not assigned to Covid-19 were considered.
Discussion
We estimated that 31% of excess deaths attributable to the Covid-19 pandemic were not assigned to Covid-19 on death certificates. Prior estimates of excess mortality not assigned to Covid-19 have been similar despite being based on different methodological approaches. An analysis by Woolf et al., based on data from March 1 through April 25, 2020, found that of 87,001 excess deaths, 30,755 excess deaths or 35% were not assigned to Covid-19.23 A subsequent analysis by Woolf et al. using data from March 1 through August 1 found that of 225,530 excess deaths, 150,541 were directly assigned to Covid-19, suggesting that 33% of excess deaths were not assigned to Covid-19.38 Weinberger et al. identified 95,235 directly coded Covid-19 deaths from March 1 through May 30, and 122,300 total excess deaths attributable to the pandemic.22 This indicated that 27,065 deaths or 22% of excess deaths were not assigned to Covid-19. According to updated data from Weinberger et al. through November 21, 2020, the team’s excess death estimate rose to 355,800 deaths, with 246,966 deaths directly assigned to Covid-19, suggesting that 31% of excess deaths were not assigned to Covid-19.39 Lastly, the NCHS has identified 288,287 directly coded Covid-19 deaths and between 291,208 and 400,791 total excess deaths as of December 22, 2020.40 These data suggest that between 1% and 28% of excess deaths were not directly assigned to Covid-19.
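The percentages quoted for prior studies follow from the reported counts, as the short check below shows. The dictionary labels and the function name are hypothetical; the counts are those cited in the paragraph above.

```python
def percent_not_assigned(total_excess, direct_covid):
    """Percent of excess deaths not directly assigned to Covid-19."""
    return 100 * (total_excess - direct_covid) / total_excess


# (total excess deaths, directly assigned Covid-19 deaths) as cited above.
# For the first Woolf et al. estimate the text gives the unassigned count
# (30,755), so the direct count is obtained by subtraction.
prior_estimates = {
    "Woolf et al., Mar 1 to Apr 25": (87_001, 87_001 - 30_755),
    "Woolf et al., Mar 1 to Aug 1": (225_530, 150_541),
    "Weinberger et al., Mar 1 to May 30": (122_300, 95_235),
    "Weinberger et al., through Nov 21": (355_800, 246_966),
    "NCHS, lower bound": (291_208, 288_287),
    "NCHS, upper bound": (400_791, 288_287),
}

percents = {
    label: round(percent_not_assigned(total, direct))
    for label, (total, direct) in prior_estimates.items()
}
for label, pct in percents.items():
    print(f"{label}: {pct}%")
```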
The coefficients from our primary model indicate that mortality would have risen between 2013-18 and 2020 even in the absence of the Covid-19 pandemic. This prediction is consistent with a rising trend in national crude death rates between 2013 and 2018. The crude death rate rose from 821.5 deaths per 100,000 people in the year 2013 to 867.8 deaths per 100,000 people in 2018.41 Estimates of excess deaths that fail to account for these increases, which are likely attributable primarily to population aging, may result in overestimates of the percentage of excess deaths not assigned to Covid-19.42
As noted earlier, excess deaths not assigned to Covid-19 could include deaths involving Covid-19 that were misclassified to other causes of death and deaths indirectly related to the Covid-19 pandemic. The NCHS has examined excess deaths not assigned to Covid-19 by cause of death nationally. As of December 22, 2020, NCHS has attributed 38,115 excess deaths to Alzheimer’s disease and related dementias, 22,661 excess deaths to hypertensive diseases, 14,684 excess deaths to ischemic heart disease, 14,194 excess deaths to diabetes, and 3,060 excess deaths to influenza and pneumonia.40 It is possible that a substantial fraction of the deaths of individuals with pre-existing chronic conditions who acquire Covid-19 and die as a result are ascribed to the pre-existing condition. These may constitute many of the excess deaths not attributed to Covid-19. This explanation is consistent with our finding that excess mortality not assigned to Covid-19 was higher in counties with higher levels of smoking, obesity, and diabetes.
In this study, we found that rural counties, counties with low median incomes and less formal education, and counties in the South reported high numbers of excess deaths not assigned to Covid-19 compared to direct Covid-19 deaths, suggesting that Covid-19 mortality may be especially undercounted in these areas. Determining the cause of potential undercounting is an important area for future research. Possible factors could include lower rates of Covid-19 testing in these populations,43 reduced access to health care,44,45 regional diagnostic and coding differences,46 or political attitudes about the Covid-19 pandemic.47,48
Indirect deaths, such as suicide, homicide, or drug overdose and deaths related to reductions in health care use, may also be higher in these areas. Faust et al. found that only 38% of excess deaths in 2020 among adults aged 25 to 44 years were attributed to Covid-19, suggesting substantial mortality among younger adults during the pandemic that is not accounted for in Covid-19 death tallies.49 Analyses of multiple cause-of-death data, when available, will help to shed additional light on the contribution of Covid-19 to US mortality and help to explain why excess all-cause mortality in the U.S. is high compared to other countries including those with high directly assigned Covid-19 mortality.50
Previous research has shown that counties with a higher percentage of Black residents have reported more mortality attributable to Covid-19.25,51 According to underlying cause of death data, Black people are 1.7 times more likely to die from Covid-19 than white people.52 Racial inequities in Covid-19 mortality relate to structural racism that has made Black people more likely to be exposed to Covid-19 at work, in transportation, and in housing during the pandemic.25,51,53,54 Another factor is racial health inequities in asthma, COPD, hypertension and diabetes, which are risk factors for Covid-19.55,56 The presence of these comorbidities could also reduce the likelihood of assigning Covid-19 as a cause of death. Our analysis suggests that the substantial racial inequities observed in directly assigned Covid-19 death rates for the non-Hispanic Black population are even larger in excess death rates not assigned to Covid-19.
Our analysis had several limitations. The 2020 all-cause mortality and Covid-19 mortality data used were provisional. Delays in reporting death certificate data may differ by county, state, rurality, or other area-level factors. Counties that are currently reporting lower all-cause mortality in 2020 than in the historical period are particularly notable. While it is possible that these counties reflect random annual variation in death rates or incomplete data, they could also have experienced reductions in mortality as a result of strong public health measures that reduced other causes of death such as influenza.20 Future research should investigate this subset of counties in more detail to understand what led to their apparent success relative to other counties. If differential reporting delays occurred for direct Covid-19 mortality but not for all-cause mortality, we may have overestimated the percent of excess deaths that would not be assigned to Covid-19 when final data are available. To address this potential limitation, we used data that had an eight-week lag, meaning that all deaths occurring before October 17, 2020 which were reported and processed before December 15, 2020 were included. Another potential source of error in our analysis is that death rates calculated from small numbers of cases may be less accurate than death rates calculated for areas with larger numbers of cases. To account for this possibility, we limited our dataset to counties with more than 20 direct Covid-19 deaths. We also conducted sensitivity analyses where we limited to counties with more than 50,000 residents and restricted to counties with more than 50 direct Covid-19 deaths. Results were not sensitive to these alternative restrictions. Our analytic approach may also be affected by measurement error in Covid-19 mortality in a way that differs from its effect on other estimates.
In general, random measurement error in an independent variable (such as Covid-19 mortality in our analysis) is expected to bias the coefficient of that variable towards the null, or zero.57 If our estimate of β2 is biased downwards, then we will have underestimated the magnitude of excess deaths. Another limitation of this study is that we lacked disaggregated data on county-level mortality by age, sex, race/ethnicity, and sociodemographic and health characteristics. As a result, age-compositional effects were addressed using an indirect age standardization procedure and predictors were assessed based on their distributions at the county level. Future research based on county-level data with further disaggregation is needed to confirm our findings. Lastly, the sociodemographic and health factors examined in this analysis were based on data from 2010 through 2018. County-level distribution of these factors may have changed between that time and 2020, the year covered by the mortality data.
Conclusion
In line with previously published studies, our findings suggest that the overall mortality burden of Covid-19 considerably exceeds reported Covid-19 deaths. Using provisional vital statistics county-level data on Covid-19 and all-cause mortality in 2020 from the NCHS, we estimated that 31% of all excess deaths in the United States from February 1 to October 17, 2020 were excess deaths not assigned to Covid-19. Socioeconomic and racial inequities in Covid-19 mortality also increased when excess deaths not assigned to Covid-19 were considered. Our findings emphasize the significant role of social determinants of health in Covid-19 mortality and the importance of considering health equity in the policy response to the pandemic, such as in access to Covid-19 vaccines. As the impact of Covid-19 continues to spread in the U.S., analysis of excess deaths will continue to be a valuable proxy measure for assessing the overall mortality burden of Covid-19. This study highlights the importance of considering excess deaths beyond those directly assigned to Covid-19 in the overall assessment of the mortality impact of the Covid-19 pandemic and provides a new method for doing so.
Data Availability
All data used in this manuscript are publicly available with the exception of the 2020 county population data, which are available through a special request to the U.S. Census Bureau. Further details about the data used in this analysis are provided at the linked GitHub repository.
https://github.com/pophealthdeterminantslab/covid-19-county-analysis
Conflicts of Interest
ACS reported receiving grants from Ethicon Inc outside the submitted work. No other disclosures were reported.
Disclaimer
The interpretations, conclusions, and recommendations in this work are those of the authors and do not necessarily represent the views of the Robert Wood Johnson Foundation.
Acknowledgements
The Robert Wood Johnson Foundation supported the research reported in this publication (Grant #77521). ITE was also supported by National Institute on Aging R01 AG060115 “Causes of Geographic Divergence in American Mortality Between 1990 and 2015: Health Behaviors, Health Care Access and Migration.”
The authors would like to thank Farida Ahmad, Robert Anderson, Magali Barbieri, Courtney Boen, Dana Glei, Josh Goldstein, Michelle Guillot, Patrick Heuveline, Anna McGregor, Jennifer Weuve and Wubin Xie for their valuable feedback on the manuscript.