Abstract
Background The Covid-19 pandemic has claimed many lives in the UK and globally. The objective of this paper is to study whether the number of deaths not registered as covid-19-related has increased compared to what would have been expected in the absence of the pandemic. This may be a result of some covid-19 deaths being unreported or spillover effects on other causes of death (or both). Reasons behind this might include covid-19 underreporting, avoiding visits to hospitals or GPs, and the effects of the lockdown.
Methods I used weekly ONS data on the number of deaths in England and Wales that did not officially involve covid-19 over the period 2015-2020. Simply observing trends is not sufficient as spikes in deaths may occasionally occur. I thus followed a differences-in-differences econometric approach to study whether there was a relative increase in deaths not registered as covid-19-related during the pandemic, compared to a control. As an additional approach, an interrupted time series model was also used.
Results Results suggest that there are an additional 968 weekly deaths that officially did not involve covid-19, compared to what would have otherwise been expected. This increase is also confirmed by the interrupted time series analysis.
Discussion The number of deaths not officially involving covid-19 has demonstrated an absolute and relative increase during the pandemic. It is possible that some people are dying from covid-19 without being diagnosed, and that there are excess deaths due to other causes as a result of the pandemic. Analysing the cause of death for any excess non-covid-19 deaths will shed light upon the reasons for the increase in such deaths and will help design appropriate policy responses to save lives.
1. Background
Over 4 million Covid-19 cases have been reported globally, leading to 300,000 deaths. In the United Kingdom, the death toll has reached 33,000, while over 220,000 people have been diagnosed with the virus.1 The novel coronavirus is directly claiming lives, and it is also possible that some covid-19 patients may have died without being diagnosed. However, this unprecedented situation and the lockdown might also be triggering additional health problems. People with other, unrelated health conditions may be reluctant to visit their GP or a hospital in order to avoid the risk of contracting the virus,2 thus remaining undiagnosed or not receiving the medical treatment they might need. Furthermore, to increase capacity for the overstretched NHS, routine operations have been postponed.3 The lockdown may also have unintended health effects. Lack of social contact can affect mental health,4 and big events or disasters at the national level can have a similar impact.5 Staying at home can limit physical activity, which has been associated with obesity6 and mental health.7 There are also reports of a rise in domestic abuse,8 while the current financial and public health situation may also cause additional uncertainty and stress.9-10 Apart from the negative effects, there may also be some improvement in certain areas. The lockdown has reduced traffic volume and may thus lead to a decrease in motor vehicle collisions and related deaths. Reduced traffic has also led to lower levels of air pollution, which is associated with mortality.11 The lockdown may have also helped reduce crime rates.
The objective of this paper is to study whether, and to what extent, the number of deaths not registered as covid-19-related has increased compared to what would have been expected in the absence of the virus. This may be a result of some covid-19 deaths being unreported or spillover effects on other causes of death (or both).
2. Data and Methods
This study used weekly (provisional) mortality data from England and Wales for the years 2015 to 2020, obtained from the Office for National Statistics (ONS).12 Data were extracted on 7 April 2020 and updated on 14, 21 and 28 April and on 5 and 12 May 2020. Data used in this study are based on the release of 12 May 2020, and values included in this dataset may be changed in later releases, as is sometimes the case. Data were reported by gender, age group and Region. I used the total number of deaths (regardless of cause) as well as the number of deaths where Covid-19 was mentioned on the death certificate, in order to calculate the number of deaths that were not officially related to Covid-19. Data on Covid-19 deaths are also available from the Department of Health and Social Care,13 but the latter exclude those deaths that occurred outside hospital, which is why I preferred the ONS data. According to the ONS, data by gender or age group may be incomplete, so they might not necessarily sum to the total number of deaths.
Studying trends in a variable alone before and after a “treatment” can be misleading as there may be other factors driving any change. For that purpose, a control group can help filter out any other effects. Such a control group will have to remain unaffected by the treatment. The covid-19 pandemic is a major global crisis, and has caused a lockdown throughout the UK, so identifying a control population for the same period seems impossible as it would be highly likely to be contaminated. Instead, I followed an approach similar to that by Metcalfe et al5 and Powdthavee et al14 who used trends in the same variable, in earlier years, as a control group. Likewise, I used deaths in the first 18 weeks of previous years as a control group for non-covid-19 deaths in the first 18 weeks of 2020. The “treatment” period starts in week 10 of the year, when the first covid-19 death occurred in England and Wales.15 Summary statistics are presented in Table 1.
In order to compare trends in deaths excluding covid-19 deaths to the control group, I used a difference-in-differences (D-I-D) econometric approach. I used the average number of deaths in the previous five years as a control group, which also helps smooth out any short-term spikes (possibly due to a bad flu season16). A difference-in-differences approach requires that the trends (rather than absolute values) in treatment and control groups are parallel prior to the intervention. To test whether this common trend assumption is met, I followed the approach by Autor,17 who used a model with interactions including lags and leads (prior to and after the treatment). Results of the common trend assumption test are presented in Table A1 in the Appendix. All interaction lags are insignificant, suggesting that there is indeed a common trend in the two groups prior to the intervention. Trends can also be observed graphically (Figure 1).
The dependent variable is the number of deaths in each of the first 18 weeks of the calendar year, excluding any deaths that mentioned covid-19 on the death certificate. The difference-in-differences model includes a “treatment group” dummy variable, which takes the value of 1 for the group that is affected by the intervention, and zero otherwise. In this case, observations in 2020 take the value of 1, and observations in previous years take the value of zero. Another dummy that is included is an “after” variable, which takes the value of 1 in the period after an intervention (for both groups, 2020 and other years), and zero otherwise. I consider the treatment period to start in week 10, as that is the week when the first covid-19 death was reported, thus indicating an escalating situation and capturing any spillover effects of the virus. There were five covid-19-related deaths reported in week 10; 41 in week 11; 397 in week 12; 1,838 in week 13; 5,079 in week 14; 8,073 in week 15; 8,121 in week 16; 6,746 in week 17 and 4,744 in week 18. One might argue that the treatment period should start later, when the number of deaths started increasing steeply, but the question remains where to draw the line. The answer would possibly relate to the cause of excess deaths, which is currently unknown, so identifying where the treatment period should start becomes particularly challenging. To be on the safe side, I followed the most conservative approach, i.e. a treatment period that starts with the first death, rather than when the number of deaths shows large increases. This might underestimate the magnitude of any effect on non-covid-19 deaths, but is unlikely to exaggerate any findings.
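The coding of the two dummy variables can be illustrated with a minimal sketch. The function and constant names are hypothetical and are not part of the original analysis; the weekly covid-19 counts are the ones quoted in the paragraph above.

```python
# Weekly covid-19 deaths for weeks 10-18 of 2020, as quoted in the text.
COVID_DEATHS = {10: 5, 11: 41, 12: 397, 13: 1838, 14: 5079,
                15: 8073, 16: 8121, 17: 6746, 18: 4744}

# Assumption taken from the text: treatment starts with the first covid-19 death.
TREATMENT_START_WEEK = 10


def code_observation(year, week):
    """Return the (treatment, after, treatment*after) dummies for one observation."""
    treatment = 1 if year == 2020 else 0
    after = 1 if week >= TREATMENT_START_WEEK else 0
    return treatment, after, treatment * after


def cumulative_covid_deaths(up_to_week):
    """Total covid-19 deaths registered from week 10 through up_to_week."""
    return sum(n for week, n in COVID_DEATHS.items() if week <= up_to_week)


print(code_observation(2020, 12))   # (1, 1, 1)
print(code_observation(2018, 12))   # (0, 1, 0)
print(cumulative_covid_deaths(13))  # 2281
```

Only observations from week 10 onwards in 2020 receive an interaction value of 1. The cumulative count also shows how few covid-19 deaths fall in the first weeks of the treatment period, which is why an early start date is the conservative choice.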
The interaction of these two dummy variables (treatment*after) is the main variable of interest. I also used dummy variables for gender and week dummies, to address seasonality. Robust standard errors were used in all regressions.
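To make the role of the interaction term concrete, the sketch below computes a difference-in-differences estimate on synthetic data (not ONS figures). It is a simplified illustration, not the model estimated in this paper: it omits the gender and week dummies and the robust standard errors. In such a 2x2 model without further covariates, the OLS coefficient on treatment*after equals the difference of mean differences computed here. The function name `did_estimate` is hypothetical.

```python
from statistics import mean


def did_estimate(rows):
    """Difference-in-differences from rows of (treatment, after, deaths).

    Equals the OLS coefficient on treatment*after in a 2x2 model
    with no additional covariates.
    """
    def cell(t, a):
        return mean(y for tr, af, y in rows if tr == t and af == a)

    return (cell(1, 1) - cell(1, 0)) - (cell(0, 1) - cell(0, 0))


# Synthetic data for illustration only: a control series with a seasonal
# decline, and a 2020 series that sits 100 above it before week 10 and
# 1,000 above it from week 10 onwards.
rows = []
for week in range(1, 19):
    after = 1 if week >= 10 else 0
    control = 6000 - 50 * week
    treated = control + 100 + 900 * after
    rows.append((0, after, control))
    rows.append((1, after, treated))

print(did_estimate(rows))
```

The pre-existing gap of 100 between the two series is differenced out, so only the additional 900 weekly deaths built into the synthetic post-treatment period are attributed to the treatment.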
Finally, for completeness, I also employed an interrupted time series model, using all weekly observations (not the average) from the first week of 2015 until the 18th week of 2020 (279 observations in total). Again, the treatment period starts in the tenth week of 2020, which does indeed leave a very short post-treatment period (9 weeks) compared to the pre-treatment period (270 weeks). However, this approach is used as an additional check rather than as the main analysis.
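The exact specification of the interrupted time series model is not spelled out above. As an assumption, the sketch below uses a common segmented-regression form with an intercept, a linear time trend, a level-change dummy and a slope-change term, fitted by ordinary least squares on synthetic, noise-free data (not ONS figures). All names are hypothetical, and exact rational arithmetic is used only to keep this small example free of rounding error.

```python
from fractions import Fraction


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [[Fraction(x) for x in row] + [Fraction(b[i])] for i, row in enumerate(a)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def ols(X, y):
    """Ordinary least squares via the normal equations (X'X) beta = X'y."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# 279 weekly observations: 270 pre-treatment and 9 post-treatment, as in the text.
N_WEEKS, FIRST_POST = 279, 271

X, y = [], []
for t in range(1, N_WEEKS + 1):
    post = 1 if t >= FIRST_POST else 0
    since = (t - FIRST_POST + 1) if post else 0  # weeks elapsed since treatment began
    X.append([1, t, post, since])
    # Synthetic outcome: baseline trend, then a level jump and a steeper slope.
    y.append(10000 + 2 * t + 300 * post + 150 * since)

intercept, trend, level_change, slope_change = (float(b) for b in ols(X, y))
print(intercept, trend, level_change, slope_change)
```

With only nine post-treatment observations, the level-change and slope-change terms are identified from very few data points, which is consistent with treating this model as a secondary check.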
3. Results
There are 3,268 additional deaths that did not officially involve covid-19 in week 18 of 2020, compared to the same week in years 2015-2019 on average. Figure 1 shows the weekly number of deaths by gender in England and Wales (excluding any covid-19 deaths) in the first 18 weeks of 2020 and the average weekly deaths for the period 2015-2019. From week 14 of 2020 onwards, there is a jump in non-covid-19 deaths, compared to the trend in previous years. This increase only started in week 14, i.e. in the fifth week of reported covid-19 fatalities. In week 13 of 2020 (which was the fourth week of covid-19 fatalities), the number of non-covid-19 deaths demonstrated a relative decrease. Figure A1 in the Appendix provides trends in non-covid-19 deaths by age group and gender, for age groups 65-74; 75-84; and 85 or over, which account for over 85% of all deaths.
Results of the baseline difference-in-differences econometric analysis are presented in Table 2, where weekly deaths enter the model by gender. There is an increase in deaths not reported as covid-19-related in the post-treatment period compared to the control group [D-I-D coeff: 967.50; 95%CI: 470.55 to 1464.45].
Could this relative increase in deaths be random? To answer this, I performed a placebo test, restricting the sample to the pre-treatment period (up to week 9, i.e. before the first covid-19 death), using an earlier random treatment period starting in week 7. Finding no effect in this case would lend additional support that the findings of the baseline model are not random. Results are provided in Table 3, and indeed, there is no effect in this placebo regression [D-I-D coeff: -23.08; 95%CI: -463.44 to 417.27].
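The logic of the placebo test can be sketched as follows, again on synthetic data rather than ONS figures and without the covariates or standard errors of the reported regressions. The sample is restricted to the weeks before the true treatment and a false start week is imposed. The function name `did_with_cutoff` and the two series are hypothetical.

```python
from statistics import mean


def did_with_cutoff(series_2020, series_control, start_week, last_week):
    """DiD on weeks 1..last_week, with the 'after' period starting at start_week.

    Both series are dicts mapping week number -> weekly deaths.
    """
    gaps = {w: series_2020[w] - series_control[w] for w in range(1, last_week + 1)}
    before = mean(g for w, g in gaps.items() if w < start_week)
    after = mean(g for w, g in gaps.items() if w >= start_week)
    return after - before


# Synthetic series for illustration: the 2020 series departs from the
# control only from week 10 onwards.
control = {w: 6000 - 50 * w for w in range(1, 19)}
y2020 = {w: control[w] + (1000 if w >= 10 else 100) for w in range(1, 19)}

baseline = did_with_cutoff(y2020, control, start_week=10, last_week=18)
placebo = did_with_cutoff(y2020, control, start_week=7, last_week=9)
print(baseline, placebo)
```

In this constructed example the placebo estimate is zero because the gap between the two series is constant before week 10. In real data it would be judged against its confidence interval, as in Table 3.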
I also performed the analysis at the Region level (Wales and nine regions in England). Results are reported in Table A2 in the Appendix, and confirm the findings of the baseline model [D-I-D coeff: 166.90; 95%CI: 122.06 to 211.74].
Results of the interrupted time series analysis are reported in Table A3 in the Appendix and suggest a similar effect, indicating an increase in the weekly deaths trend compared to the pre-treatment period [coef: 656.78; 95% CI: 410.63 to 902.94].
4. Discussion
This paper studied whether, during the covid-19 pandemic, there was an increase in deaths that have not officially been linked to the virus. Using a differences-in-differences econometric approach by comparing trends in 2020 to the average trends in the previous five years, I find that there are an additional 968 weekly deaths not officially registered as covid-19 compared to what would have been expected in the absence of the pandemic. Therefore, apart from the official covid-19 death toll, there are additional deaths that might be somehow linked, either directly or indirectly, to covid-19.
There are two possible reasons for this excess mortality. First, some people might have died from covid-19 without being diagnosed. Second, there may be spillover effects on other causes, such as patients postponing treatment for unrelated health conditions in order to avoid contracting the virus in hospitals or GP clinics; prioritisation of covid-19 patients by health services; stress and anxiety related to the current financial and public health environment; domestic violence; and lack of activity or other effects due to the lockdown.2-10 This relative increase in deaths occurs despite reasonably expecting a reduction in some types of mortality, such as motor vehicle collisions, crime, pollution and smoking.
The way deaths are reported or registered is central to this research question. In the ONS data, coronavirus deaths are those that mention covid-19 on the death certificate, meaning that one may have died due to other causes, after having tested positive for covid-19. In any case, reporting deaths is challenging, and misclassification can often play a role in empirical results.18-20 Furthermore, weekly data updates often include revisions on provisional figures from previous weeks.
It is worth noting that the increase in deaths seems to only occur from week 14 onwards, i.e. in the fifth week since the first covid-19 death in the country - when there were already over 2,000 covid-19 deaths in England and Wales. For any excess deaths that might have been a result of not visiting a hospital or GP clinic due to fear of contracting a disease, it may be that either people changed their behaviour only when covid-19 deaths started rising steeply, or that any untreated health issues led to death with a time lag. Using a treatment period in the empirical model that would start later into the pandemic would show an even larger magnitude of non-covid-19 deaths.
If people are dying of covid-19 without being diagnosed, mortality may actually be higher than what we currently think. If people without covid-19 are also dying as a result of the virus, we need to act urgently to minimise these tragic spillover effects. Access to data on the causes of non-covid-19 deaths would allow us to understand the mechanism behind this phenomenon and would help design appropriate targeted responses to save lives.
Data Availability
The data used in this study are freely available online from the Office for National Statistics.
Conflict of interest
None
Funding
None
Ethics approval
The data used were aggregate anonymous data from a public source so ethics approval was not required.
Checklist
There is no relevant checklist of observational studies.