Abstract
As both testing for SARS-CoV-2 and death registrations are incomplete or not yet available in many countries, the full impact of the Covid-19 pandemic is currently unknown in many world regions.
We studied Covid-19 and all-cause mortality in 19 Indian states (combined population of 1.27 billion) with available all-cause mortality data during the pandemic for the entire state or for large cities. Excess mortality was calculated by comparison with available data from years 2015-2019. The known Covid-19 deaths reported by the Johns Hopkins University Center for Systems Science and Engineering for a state were assumed to be accurate, unless excess mortality data suggested a higher toll during the pandemic. Data from one state were not included in the final model due to anomalies.
In several regions, fewer deaths were reported in 2020 than expected. The excess mortality in Mumbai (in Maharashtra) in 2020 was 137.0 / 100K. Areas in Andhra Pradesh, Delhi, Haryana, Karnataka, Madhya Pradesh, Tamil Nadu, and Kolkata (in West Bengal) saw spikes in mortality in the spring of 2021.
The pandemic-related mortality through August 31, 2021 in 18 Indian states was estimated to be 198.7 per 100,000 population (range 146.1 to 263.8 per 100K). If these rates apply to India as a whole, then 2.69 million people (range 1.98 to 3.57 million) may have perished in India as a result of the Covid-19 pandemic by August 31, 2021.
Introduction
As both testing for SARS-CoV-2 and death registrations are incomplete or not yet available in many countries, the full impact of the Covid-19 pandemic is unknown in many world regions. One approach to assess the impact of the evolving pandemic is to examine excess mortality from all causes. An increase in all-cause mortality during the pandemic is assumed to be a direct result of infection with the SARS-CoV-2 virus, or an indirect effect of health system overload or social responses to the pandemic.
For many countries, national tallies of mortality during phases of the pandemic have already become available. Early in the pandemic, India was believed to have suffered Covid-specific per-capita mortality well below that of the United States.1 Regional government websites and data journalists are publishing up-to-date mortality figures for an ever-increasing number of cities and states in India. We sought to integrate these data to estimate the impact of the Covid-19 pandemic in India as a whole. We understand that the picture might change as the pandemic proceeds, and as more data become available.
Methods
Sources of All-Cause Mortality Data
We used the publicly available mortality figures published by regional governments and by data journalists in India, often obtained from Right-to-Information (RTI) requests (Supplemental References, Table S1). Many of these data are stored on the websites maintained by the Local Mortality project of Ariel Karlinsky,2,3 or the Development Data Lab.4 These were supplemented by data from government hospitals, funeral counts, and handwritten death registers when available, as detailed below.
We acquired mortality data from 19 states (or union territories) with 1.27 billion population, either for the entire state, or for large cities or districts within the state (Table 1). As a convenient shorthand, we refer to these 19 administrative regions as “states”, even though two of them (Chandigarh and Delhi) are actually union territories. The inclusion of a state or large city in the analysis was based on whether reliable mortality data were available. Some smaller states were not included because governments, reporters, nonprofits and academicians have not yet published relevant mortality data for the pandemic period.
All of these publicly available regional-level mortality data contain no individually-identifiable information. The study was approved by the Office of Research Subjects Protection of Virginia Commonwealth University.
For Chhattisgarh, mortality data from the online portal of the state CRS have been released,4 and we included these data in the appendix. However, as the baseline data for this online portal appear to be only about one tenth complete, as compared with the national vital statistics registry, we deemed the Chhattisgarh data too unreliable to include in the model.
For Gujarat, journalists tabulated deaths from March 2020 through April 2021 for 68 of the 170 municipalities, which comprised 6.01% of the state’s population, from the bound handwritten official municipal death registers (Table S1, Supplemental References). For 3 of these Gujarat municipalities (Chorvad, Idar, and Khedabrahma), death register data were available from May 1 to June 10, 2021.
For the urban portions of 25 districts in Madhya Pradesh, the numbers of funerals in April 2021 have been tabulated (Supplemental References).
For Uttar Pradesh, the raw mortality data obtained from a Right-to-Information request contained anomalies, such as multiple districts with zero deaths for numerous months. Therefore, the Uttar Pradesh data were analyzed, but were not included in the top-line model.
Reported Covid-19 Mortality
Mortality attributed to Covid-19 has been tabulated by the Johns Hopkins University Center for Systems Science and Engineering (CSSE, Table S2).5 Our model assumed that the CSSE figures for Covid-19 mortality accurately reflected the pandemic-related mortality in a given state for each year of the pandemic (2020 and 2021), unless the excess mortality data suggested a higher toll.
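The selection rule described above can be sketched in a few lines of Python. This is only an illustration of the stated assumption, not the code used for the analysis; the function name and the example figures are hypothetical.

```python
def pandemic_mortality(reported_covid_per_100k, excess_per_100k=None):
    """Pick the pandemic-related mortality for one state and one year.

    The reported (CSSE) Covid-19 mortality is used unless the excess
    mortality estimate is higher. If no excess mortality estimate is
    available (None), the reported figure is used.
    """
    if excess_per_100k is None:
        return reported_covid_per_100k
    return max(reported_covid_per_100k, excess_per_100k)


# Hypothetical figures, per 100,000 population.
print(pandemic_mortality(12.0, -5.0))   # negative excess: reported figure is kept
print(pandemic_mortality(12.0, 90.0))   # excess is higher: excess figure is used
```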
The CSSE in turn obtains Covid mortality data from the governmental health authorities of the respective countries.6 In the case of India, the CSSE links to the Covid-19 webpage for the Health Ministry of India.7 According to reports, local physicians and health authorities in India were in some cases not reporting deaths as caused by Covid-19 if SARS-CoV-2 tests were not performed or if the patient had contributing comorbidities.8
Analysis of Mortality
For some states, several data sources were available, which permitted the calculation of multiple estimates of per-capita excess mortality. In this case, we presented both the median estimate and the lowest and highest estimates available.
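As a minimal sketch of this summary step, assuming the per-capita estimates for a state are held in a simple list (the function name and the values shown are hypothetical):

```python
from statistics import median


def summarize_estimates(estimates_per_100k):
    """Return (lowest, median, highest) of the available estimates."""
    return (min(estimates_per_100k),
            median(estimates_per_100k),
            max(estimates_per_100k))


# Three hypothetical excess mortality estimates for one state (per 100K).
low, mid, high = summarize_estimates([80.0, 120.0, 95.0])
print(low, mid, high)
```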
For Gujarat and the urban portions of 25 districts in Madhya Pradesh, mortality data from before 2020 were available only as totals for entire year(s).9 Therefore, to estimate excess mortality for portions of 2021, it was necessary to assume that baseline mortality was evenly distributed throughout the year.
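Under this even-distribution assumption, the expected deaths for part of a year are simply a proportional share of the annual total. The sketch below illustrates the arithmetic; the function name, the use of a 365-day year, and the numbers are assumptions made for illustration.

```python
def expected_deaths_for_period(annual_expected_deaths, days_in_period,
                               days_in_year=365):
    """Share of the annual expected deaths falling in a period,
    assuming deaths are spread evenly across the year."""
    return annual_expected_deaths * days_in_period / days_in_year


# Hypothetical: 3,650 expected deaths per year, for the 30 days of April.
print(expected_deaths_for_period(3650, 30))
```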
For the analysis of data from 68 municipalities in Gujarat, the per-capita mortality for 2021 was estimated by summing the mortality rate through April 2021 with that from available municipalities for May 1 through June 10 (Table S1).
We calculated excess mortality in a region by comparing the mortality for a given time period in 2020 or 2021 with the value expected based on the years 2015 to 2019. If data from more than one year before 2019 were available, the expected value was calculated by creating a trend line for mortality by linear regression for the years 2015 to 2019, and carrying this trend one year (for 2020) or two years (for 2021) into the future. Carrying the trend line two years into the future for 2021 yielded conservative estimates of excess deaths.
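The trend-line projection can be illustrated with an ordinary least-squares fit. The following is a sketch under stated assumptions rather than the analysis code itself: the function names and the death counts are hypothetical, and the fit is written out by hand so that the example has no dependencies.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, mean_x, mean_y)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx, mean_x, mean_y


def expected_deaths(years, deaths, target_year):
    """Carry the linear trend of the baseline years to target_year."""
    slope, mean_x, mean_y = fit_line(years, deaths)
    return mean_y + slope * (target_year - mean_x)


def excess_per_100k(observed, expected, population):
    """Excess deaths expressed per 100,000 population."""
    return (observed - expected) / population * 100_000


# Hypothetical registrations for a region of 1 million people.
years = [2015, 2016, 2017, 2018, 2019]
deaths = [6000, 6100, 6200, 6300, 6400]
exp_2020 = expected_deaths(years, deaths, 2020)   # one year forward
exp_2021 = expected_deaths(years, deaths, 2021)   # two years forward
print(exp_2020, exp_2021)
print(excess_per_100k(7500, exp_2020, 1_000_000))
```

With a rising baseline, the 2021 projection is higher than the 2020 projection, so the same observed count yields a smaller excess in 2021.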
For some states, reported mortality from the state government websites or right-to-information (RTI) requests was only available for 2018 and 2019, which was too short a period to generate a robust trend line. Moreover, the numbers of deaths from the state sources did not match the central government figures exactly, because the state information systems did not capture all of the registered deaths. In these cases, the vital statistics reports for India were used to generate a trend line for expected deaths, using the data from 2015 to 2019. The expected number of deaths was scaled up or down by multiplying the 2015 to 2019 trend line by the ratio of deaths in the state and federal systems for 2018 and 2019. For instance, if the state website average mortality for 2018 and 2019 was 97% of the figures for 2018 and 2019 in the federal reports, the trend line was multiplied by 0.97. This method was used to scale the trend line for Delhi, Bengaluru, Mumbai, Nagpur, Ahmedabad (for 2020), Madhya Pradesh, Tamil Nadu, for 6 city hospitals in Tamil Nadu, and for Madurai district.
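The scaling step can be sketched as follows, assuming the state and federal counts for 2018 and 2019 are supplied as two short lists. The function names and the counts are hypothetical; the ratio of the two-year averages equals the ratio of the two-year sums.

```python
def scaling_ratio(state_deaths, federal_deaths):
    """Ratio of deaths captured by the state system to those in the
    federal vital statistics reports, over the same years."""
    return sum(state_deaths) / sum(federal_deaths)


def scaled_expected_deaths(federal_trend_value, ratio):
    """Scale a federal trend-line value to the state system's coverage."""
    return federal_trend_value * ratio


# Hypothetical counts for 2018 and 2019.
ratio = scaling_ratio([97_000, 97_000], [100_000, 100_000])
print(ratio)
print(scaled_expected_deaths(102_000, ratio))
```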
Completeness of death registrations has been estimated by the Indian government in the vital statistics reports for each state by comparison with the Sample Registration System (Table S1).9 For years in which the completeness was less than 100%, the total number of deaths for each year 2015 to 2019 was determined by dividing the death registrations by the completeness fraction. Unlike the trend line for unadjusted death registrations, the trend line for the adjusted registrations did decrease over time for some states. In order to ensure the estimated excess deaths were conservative, the expected deaths for 2020 and 2021 were the maximum of the 2019 value and the value predicted by the trend line. The completeness fraction for 2020 and 2021 was extrapolated by linear regression from the 2015 to 2019 completeness fraction. Once again, in order to be conservative, the maximum of the 2019 value and the linear extrapolation was used.
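A minimal sketch of the completeness adjustment and of the conservative choice of expected value is shown below. The function names and figures are hypothetical, and leaving registrations unadjusted when completeness is 100% or more is our reading of the rule for the purposes of this illustration.

```python
def adjust_for_completeness(registered_deaths, completeness):
    """Estimate total deaths from registrations.

    completeness is a fraction (e.g. 0.8 for 80%). Registrations are
    divided by it only when it is below 1.0.
    """
    if completeness < 1.0:
        return registered_deaths / completeness
    return registered_deaths


def conservative_value(value_2019, trend_prediction):
    """Use the larger of the 2019 value and the trend-line prediction."""
    return max(value_2019, trend_prediction)


# Hypothetical: 80,000 registrations at 80% completeness.
print(adjust_for_completeness(80_000, 0.8))
# Hypothetical: a declining trend predicts fewer deaths than in 2019,
# so the 2019 value is retained as the expected value.
print(conservative_value(100_000, 98_500))
```

The same maximum rule is applied to the extrapolated completeness fraction for 2020 and 2021.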
The national mortality rate was estimated by summing the estimated pandemic-related deaths for the states analyzed and then dividing by the population of these states. The population size of Indian states was taken from the Hopkins mortality dataset.5 Raw data are tabulated in the appendix (Table S1).
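In other words, the pooled rate is a population-weighted average rather than a simple mean of the state rates. A short sketch, with hypothetical state names and figures:

```python
def pooled_rate_per_100k(states):
    """Pooled mortality per 100,000 population.

    states maps a state name to a tuple of
    (estimated pandemic-related deaths, population).
    """
    total_deaths = sum(deaths for deaths, _ in states.values())
    total_population = sum(pop for _, pop in states.values())
    return total_deaths / total_population * 100_000


# Hypothetical inputs: state rates are 100 and 300 per 100K.
example = {
    "State A": (3_000, 3_000_000),
    "State B": (3_000, 1_000_000),
}
print(pooled_rate_per_100k(example))
```

In this example the pooled rate (150 per 100K) lies closer to the rate of the more populous state than the unweighted mean of 200 per 100K would.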
Graphical analysis of mortality
In order to present the timing of the mortality graphically, the per-capita mortality rates (total monthly deaths divided by total population) were calculated for the states for which monthly data were available from Jan 2019 through May 2021 (Andhra Pradesh, Bihar, Chandigarh, Delhi, Haryana, Himachal Pradesh, Karnataka, Kerala, Madhya Pradesh, Maharashtra, Punjab, Rajasthan, Tamil Nadu, and West Bengal). For subsequent months, the per-capita mortality was calculated from states with data available (Punjab through June 2021, Andhra Pradesh through July 2021, Karnataka and Tamil Nadu through August 2021, and Odisha in July and August 2021). For the purposes of generating this figure, the Odisha cumulative annual mortality for August 1, 2021 was estimated by linear interpolation from the July 1 and August 8 values. This graphical analysis was completely separate from the tabulated estimate of total mortality in India.
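The interpolation used for Odisha can be sketched as follows. The function name and the cumulative counts are hypothetical; only the three dates are taken from the text.

```python
from datetime import date


def interpolate_cumulative(d0, v0, d1, v1, target):
    """Linearly interpolate a cumulative count between two dates."""
    fraction = (target - d0).days / (d1 - d0).days
    return v0 + fraction * (v1 - v0)


# Hypothetical cumulative annual deaths on July 1 and August 8, 2021.
estimate = interpolate_cumulative(
    date(2021, 7, 1), 200_000,
    date(2021, 8, 8), 238_000,
    date(2021, 8, 1),
)
print(estimate)
```

August 1 falls 31 days into the 38-day interval, so the estimate takes 31/38 of the increase between the two reported values.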
Results
Excess Mortality
We studied 19 states, with a population of 1.27 billion, for which excess mortality could be estimated for at least a portion of the state (Tables 1, 2). Excess mortality could be estimated for regions in all 19 states in 2020, and in 18 states in 2021 (Table 1).
The timing of the excess per-capita mortality during the pandemic is illustrated in Figure 1. In 2020, there was a slight increase in all-cause mortality from August through October, as compared with 2019. However, the most prominent rise in all-cause mortality came in the spring of 2021, and was evident in April and May (Figure 1). After May 2021, the available data suggest that mortality registrations continued to rise through June, but had begun to wane in the summer of 2021 (Figure 1).
For Chandigarh, Delhi, Kerala, Madhya Pradesh, Punjab, and Uttar Pradesh, the excess mortality based on registered deaths was actually negative for 2020 (Table 1). This may be because fewer people were willing to register deaths during lockdown, or because fewer people died from accidents and other causes during lockdown.
At the other extreme, the excess mortality in Mumbai (in Maharashtra) was 137.0 / 100K in 2020 (Table 1). Similarly, Andhra Pradesh had an excess mortality of 121.5 / 100K in 2020 (Table 1).
For 2020, intermediate levels of excess mortality were seen for Kolkata in West Bengal (46.1 / 100K), Chennai in Tamil Nadu (83.5 / 100K), and Hyderabad in Telangana (88.2 / 100K) (Table 1).
For 2021, a prominent peak in all-cause mortality was seen for March through June for Chennai in Tamil Nadu, Kolkata in West Bengal, Delhi, Madhya Pradesh, Haryana, Punjab, and Andhra Pradesh (Table 1). These findings correspond with news reports of increasing severity of the pandemic in India. Data available by August 31, 2021 suggested excess mortality of at least 64.5 / 100K for Kolkata, 120.3 / 100K for Punjab, 145.5 / 100K in Haryana, 212.2 / 100K for Madhya Pradesh, 230.3 / 100K for Delhi, 265.0 / 100K for Tamil Nadu, and 321.8 / 100K for Andhra Pradesh (Table 1).
Reported Covid-19 Mortality
The mortality related to Covid-19, based on viral testing and the clinical picture, as tabulated by Johns Hopkins, was relatively low: 10.9 / 100K in 2020, and 20.4 / 100K in 2021, for a combined total of 31.4 / 100K for the pandemic, as of August 31, 2021 (Table S2). There was some variation, with lower mortality rates in Assam, Bihar, Gujarat, Odisha, Rajasthan, Telangana, Uttar Pradesh, and West Bengal and higher mortality rates in Delhi, Maharashtra, and Punjab (Table S2).
Integrated Model of Covid-19-related Mortality
The best available estimates of the pandemic-related mortality, whether based on reported Covid-19 deaths, or on excess mortality, are presented in Table 2. We also presented the range of mortality estimates for each state and for India as a whole (Tables 3, S3). Table 3 presents the top-line model to estimate excess mortality during the pandemic in India.
Generally, the excess mortality exceeded the Covid-19 mortality figure reported by Hopkins, and was therefore taken as the pandemic-related mortality. However, the Covid-19 deaths reported by Hopkins were used in the model for Chandigarh, Delhi, Himachal Pradesh, Kerala, and Punjab in 2020 (Table 2).
Despite the wide uncertainty ranges for several states and time periods, the overall uncertainty range was narrower. In the primary model, data from Uttar Pradesh were excluded because of identified anomalies. For 2020, the pandemic-related mortality for 18 states with a population of 1.03 billion was estimated to be 64.3 / 100K (range 47.5 to 79.3 / 100K, Tables 2, 3). For 2021, through August 31, the pandemic-related mortality for 17 states with a population of 995 million was estimated to be 134.4 / 100K (range 98.6 to 184.4 / 100K, Tables 2, 3).
Summing these estimates for 2020 and 2021, we estimate the pandemic-related mortality to be 198.7 / 100K (range 146.1 to 263.8 / 100K) population for the entire pandemic (through August 31, 2021, Tables 2, 3). Assuming a population of India of 1,352,642,280, these rates correspond with 2.69 million people (range 1.98 to 3.57 million) perishing during the pandemic in India from Covid-19 by August 31, 2021.
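The conversion from per-capita rates to national totals can be checked with a few lines of arithmetic. The rates and the population below are the values stated in this section; the function name is ours and is only illustrative.

```python
INDIA_POPULATION = 1_352_642_280  # population assumed in the text


def deaths_in_millions(rate_per_100k, population=INDIA_POPULATION):
    """Convert a rate per 100,000 population to deaths in millions."""
    return rate_per_100k / 100_000 * population / 1_000_000


# Central estimate and range for the entire pandemic (per 100K).
for rate in (198.7, 146.1, 263.8):
    print(rate, round(deaths_in_millions(rate), 2))
```

The central rate is the sum of the annual estimates (64.3 + 134.4 = 198.7 per 100K).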
The estimated Covid-19 mortality can also be expressed as a fraction of the baseline mortality, taken from the 2019 national vital statistics reports (Table S3). The estimated Covid-19 mortality represented an increase over the baseline annual mortality of 10.70% (range 7.91% to 13.21%) in 2020 and 22.40% (range 16.44% to 30.73%) in 2021 (as of August 31). It should be noted that pandemic-related deaths in the final 3 months of 2021 will increase these values.
If the data from Uttar Pradesh are included, then the estimated pandemic-related mortality was 165.3 / 100,000 population through August 31, 2021 (range 122.8 to 217.9 / 100K, Tables 2, S3), corresponding with a mortality of 2.24 million people (range of 1.66 to 2.95 million) during the pandemic in India from Covid-19.
Discussion
This analysis of excess mortality found that 2.69 million people (range 1.98 to 3.57 million) may have perished in India as a result of the Covid-19 pandemic, as of August 31, 2021.
Data from Uttar Pradesh contained anomalies and had an estimated excess mortality lower than in other regions, in part because Uttar Pradesh data from after April 2021 were not available. However, if the Uttar Pradesh data are included in the model, the estimated pandemic-related mortality in India through August 31, 2021 was 2.24 million people (range 1.66 to 2.95 million).
This mortality level is well above the reported Covid-19 mortality of 438,560 in India as of August 31, 2021.5 The Institute for Health Metrics and Evaluation (IHME) at the University of Washington currently estimates that the excess mortality in India was 1.23 million persons on August 31, 2021.10 It should be noted, however, that the IHME model did not look directly at all-cause mortality in India. Rather, the IHME model extrapolated Indian all-cause mortality based on factors such as test positivity rates in India, and all-cause mortality data from other countries, such as Mexico, Brazil, and the United States.11 Our analysis was based on actual counts of mortality in India, and therefore was a more direct approach to estimation.
One strength of assessing the pandemic impact primarily through excess mortality is the potential to distinguish deaths caused by Covid-19 from “deaths with Covid-19”. Covid-19 patients who likely would have died during the study period due to their comorbidities even if they had never been infected should not result in excess deaths.
Our analysis used the Covid-19 mortality reported by Hopkins University, unless the excess mortality data suggested a higher pandemic-related toll. Our method of determining the expected baseline, by making a projection using linear regression one year (for 2020) or two years (for 2021) into the future, was more conservative than simply using the baseline average mortality, or projecting just one year forward (even for 2021). Thus, our estimates of pandemic-related mortality for various regions are more conservative (i.e. lower) than some other studies and news reports. For instance, Deshmukh and colleagues estimated excess mortality in India of 3.2 million through June 2021 based on Civil Registration System data from 5 states and 5 cities.12 Anand and colleagues estimated excess deaths of 3.4 million through June 2021 based on death registrations from seven states.13 If the number of death registrations increased by a fixed amount every year between 2015 and 2021 in a given state, our study would have reported no excess deaths, while the above studies12,13 would have attributed each annual step increase to the pandemic. Their approach may ultimately prove to generate more accurate estimates, but in the face of uncertainty, we elected to take a more conservative approach. Our definition of the baseline by linear regression was used previously in the World Mortality Dataset.2
Based on mortality data from the online Health Management Information System of the Ministry of Health and Family Welfare, excess deaths in India through June 2021 were estimated to be 2.7 million.12
Survey data can supplement mortality estimates from death registrations. Based on the consumer pyramid household survey (CPHS) produced by the Center for the Monitoring of the Indian Economy (CMIE), Anand estimated excess mortality in India during the pandemic of 4.9 million persons as of April 2021.13 Based on a national telephone survey conducted by Cvoter India OmniBus, the excess deaths in India were estimated to be 3.1 to 3.4 million through June 2021.12 One limitation of this survey is that it included deaths outside the immediate household.12
Seroprevalence data can also help to assess the extent of infection in a population. Based on application of international age-specific infection fatality rates to Indian demography and seroprevalence, the Covid-19 related mortality in India through June 2021 was estimated to be 4.0 million.13 One limitation of this approach is that the infection fatality rates may differ from country to country, depending on the medical system and host factors. The pediatric mortality from Covid-19 in India was higher than in comparison countries in one survey.14 Another limitation of mortality analyses based on seroprevalence is that antibody responses wane over time.15 If seroprevalence is assessed well after the epidemic peak in the study population, overall mortality will be underestimated. Conversely, if seroprevalence is assessed in the reference population well after the epidemic peak, mortality in the study population could be overestimated.
Guilmoto analyzed Covid-19 mortality in several well-defined Indian populations: deaths in Kerala, elected representatives, Indian Railways personnel, and teachers in Karnataka.16 Application of these age-specific mortality rates to the Indian population yielded a mortality estimate of 2.2 million persons by late May 2021.16
Our analysis has a number of limitations. The data analyzed are still incomplete for many regions and times. There may be delays in registering deaths. In addition, available data obtained from regional government websites, central government compilations, and by reporters through RTI requests are not in complete agreement. Some of these early data may contain errors. All-cause mortality may be higher not only due to infection with the SARS-CoV-2 virus, but also because of health system overload, delays in patients entering the health system for other conditions,17,18 or social changes, such as lockdowns. In principle, excess mortality might be observed due to other diseases, war, or environmental factors such as heat waves, though these factors are not known to have played a significant role during the Covid-19 pandemic in India.
A number of countries, such as Australia and New Zealand, have experienced lower than normal mortality during the pandemic.2 Lower mortality rates may occur because there are fewer accidents or homicides, etc. On the other hand, other factors may lead to higher mortality rates during lockdowns in some regions.
Time will tell what the true death toll has been, as the early data are confirmed, additional regions provide more complete mortality data, and data from diverse sources are reconciled. Additional surveys and seroprevalence data can supplement the estimates from death registrations.
Data Availability
Data are in the appendix of the article.
Funding
None.
Disclosures
The authors have no conflicts of interest.
Author contact information
Christopher T. Leffler, MD, MPH. Department of Ophthalmology. Virginia Commonwealth University. Richmond, VA 23298. chrislefflermd{at}gmail.com
Joseph D. Lykins V, MD. Department of Internal Medicine, Virginia Commonwealth University. Richmond, VA 23298. Joseph.Lykins{at}vcuhealth.org
Edward Yang, BA. School of Medicine, Virginia Commonwealth University, Richmond, VA 23298. yange3{at}vcu.edu
Acknowledgments
The authors would like to acknowledge the data journalists, academicians, and nonprofits in India who have brought these data to light, such as Rukmini S. (@Rukmini), Chinmay Tumbe (@ChinmayTumbe), Vignesh Radhakrishnan (@VigneshJourno), Srinivasan Ramani (@vrsrini), Deepak Patel (@deepakpatel_91), Murad Banaji (@muradbanaji), Sumitra Debroy (@debroysumitra), Dhanya Rajendran (@dhanyarajendran), Shiba Kurian (@shiba_kurian), and Arappor Iyakkam. In addition, we acknowledge the important work of Ariel Karlinsky and colleagues, and the Development Data Lab for preparing many of the mortality databases we used. However, all errors in the paper are the responsibility of the authors.
Appendix
Footnotes
None of the authors has any conflicts of interest to disclose.
This version updates the data through August 31, 2021, and also includes a figure.