Abstract
Background As both testing for SARS Cov-2 and death registrations are incomplete or not yet available in many countries, the full impact of the Covid-19 pandemic is currently unknown in many world regions.
Methods We studied the Covid-19 and all-cause mortality in 18 Indian states (combined population of 1.26 billion) with available all-cause mortality data during the pandemic for the entire state or for large cities: Gujarat, Karnataka, Kerala, Maharashtra, Tamil Nadu, West Bengal, Delhi, Madhya Pradesh, Andhra Pradesh, Telangana, Assam, Bihar, Odisha, Haryana, Rajasthan, Himachal Pradesh, Punjab, and Uttar Pradesh. Excess mortality was calculated by comparison with available data from years 2015-2019. The known Covid-19 deaths reported by the Johns Hopkins University Center for Systems Science and Engineering for a state were assumed to be accurate, unless excess mortality data suggested a higher toll during the pandemic. Data from Uttar Pradesh were not included in the final model due to anomalies.
Results In several regions, fewer deaths were registered in 2020 than expected. The excess mortality in Mumbai (in Maharashtra) in 2020 was 137.0 / 100K. Areas in Tamil Nadu, Kolkata (in West Bengal), Delhi, Madhya Pradesh, Karnataka, Haryana, and Andhra Pradesh saw spikes in mortality in the spring of 2021.
Conclusions The pandemic-related mortality through June 30, 2021 in 17 Indian states was estimated to be 132.9 to 194.4 per 100,000 population. If these rates apply to India as a whole, then between 1.80 to 2.63 million people may have perished in India as a result of the Covid-19 pandemic by June 30, 2021. This per-capita mortality rate is similar to that in the United States and many other regions.
Introduction
As both testing for SARS Cov-2 and death registrations are incomplete or not yet available in many countries, the full impact of the Covid-19 pandemic is unknown in many world regions. One approach to assess the impact of the evolving pandemic is to examine excess mortality from all causes. When all-cause mortality increases during the pandemic, this is assumed to be a direct result of infection with the Sars Cov-2 virus, or indirect effects from health system overload or social responses to the pandemic.
For many countries, national tallies of mortality during portions of the pandemic have already become available. In India, regional government websites and data journalists are publishing up-to-date mortality figures for an ever-increasing number of cities and states. We sought to integrate these data to estimate the impact of the Covid-19 pandemic in India as a whole. We understand that the picture might change as the pandemic proceeds, and as more data become available.
Methods
We used the publicly available mortality figures published by regional governments (Kerala, Odisha, Karnataka, and Tamil Nadu) and by data journalists in India, often obtained from Right-to-Information (RTI) requests (Hindustan Times 2021; Ramani “Hindu” 2021; Ramani “docs” 2021; Ramani “Rajasthan” 2021; Ramani “Haryana” 2021; Ramani “Punjab”; Scroll 2021; Nagar 2021; The Hindu 2020; Khanna 2020; Saikia 2021; Radhakrishnan 2021, Rukmini 2021; Rukmini “Andhra” 2021). Much of these data are stored on the Local Mortality GitHub websites maintained by Ariel Karlinsky, Dmitry Kobak, and colleagues (Karlinsky 2021; Karlinsky & Kobak 2021) or on the Development Data Lab website (devdatalab).
We assumed that the mortality rate related to the Covid-19 pandemic in a given region was equivalent to the mortality reported by the Johns Hopkins University Center for Systems Science and Engineering (CSSE) (based on positive viral testing and clinical symptoms) (2021), unless the excess mortality data suggested a higher toll. When data for a given region and time period were conflicting or contradictory, we reported a range of estimates to reflect the uncertainty.
All of these publicly available regional-level mortality data contain no individually-identifiable information. The study was approved by the Office of Research Subjects Protection of Virginia Commonwealth University.
For Chhattisgarh, mortality data from the online portal of the state CRS have been released by the Development Data Lab, and we included these data in the appendix. However, as the baseline data for this online portal appear to be only about one tenth complete, as compared with the national vital statistics registry, we deemed the Chhattisgarh data too unreliable to include in the model.
For Uttar Pradesh, the raw mortality data obtained from a Right-to-Information request contained anomalies, such as multiple districts with zero deaths for numerous months. Therefore, the Uttar Pradesh data were analyzed, but were not included in the top-line model.
For the urban portions of 25 districts in Madhya Pradesh, the numbers of funerals in April 2021 have been tabulated (Datta 2021). For Gujarat and the urban portions of 25 districts in Madhya Pradesh, mortality from entire year(s) before 2020 was available (Vital Statistics reports). Therefore, to estimate excess mortality for portions of 2021, it was necessary to assume that mortality was evenly distributed throughout the year.
We calculated excess mortality in a region by comparing the mortality for a given time period in 2020 or 2021 with the value expected based on the years 2015 to 2019. If data from more than one year before 2019 was available, the expected value was calculated by creating a trend line for mortality by linear regression for the years 2015 to 2019, and carrying this trend one year (for 2020) or two years (for 2021) into the future. Carrying the trend line two years into the future for 2021 yielded conservative estimates of excess deaths.
For some states, reported mortality from the state government websites or right-to-information (RTI) requests was only available for 2018 and 2019, which was too short a period to generate a robust trend line. Moreover, the numbers of deaths from the state sources did not match the central government figures exactly, because the state information systems did not capture all of the registered deaths. In these cases, the vital statistics reports for India were used to generate a trend line for expected deaths, using the data from 2015 to 2019. The expected number of deaths was scaled up or down by multiplying the 2015 to 2019 trend line by the ratio of deaths in the state and federal systems for 2018 and 2019. For instance, if the state website average mortality for 2018 and 2019 was 97% of the figures for 2018 and 2019 in the federal reports, the trend line was multiplied by 0.97. This method was used to scale the trend line for Delhi, Bengaluru, Mumbai, Nagpur, Ahmedabad (for 2020), Madhya Pradesh, Tamil Nadu, for 6 city hospitals in Tamil Nadu, and for Madurai district.
Completeness of death registrations has been estimated in the vital statistics reports for each state by comparison with the Sample Registration System (Table S1). For years in which the completeness was less than 100%, the total number of deaths for each year 2015 to 2019 was determined by dividing the death registrations by the completeness fraction. Unlike the trend line for unadjusted death registrations, the trend line for the adjusted registrations did decrease over time for some states. In order to ensure the estimated excess deaths were conservative, the expected deaths for 2020 and 2021 were the maximum of the 2019 value and the value predicted by the trend line. The completeness fraction for 2020 and 2021 was extrapolated by linear regression from the 2015 to 2019 completeness fraction. Once again, in order to be conservative, the maximum of the 2019 value and the linear extrapolation was used.
The national mortality rate was estimated by summing the estimated pandemic-related deaths for the states analyzed and then dividing by the population of these states. The population of Indian states was taken from the Hopkins mortality dataset. Raw data are tabulated in the appendix (Table S1).
Results
We studied 17 states for which excess mortality data were available for at least a portion of the state: Gujarat, Karnataka, Kerala, Maharashtra, Tamil Nadu, West Bengal, Delhi, Madhya Pradesh, Andhra Pradesh, Telangana, Assam, Bihar, Odisha, Rajasthan, Haryana, Punjab and Uttar Pradesh. These 17 states have a combined population of 1.26 billion people (Table 1).
Reported Covid-19 Mortality
The mortality related to Covid-19, based on viral testing and the clinical picture, as tabulated by Johns Hopkins, was reasonably low: 11.0 / 100K in 2020, and 17.6 / 100K in 2021, for a combined total of 28.6 / 100K for the pandemic, as of June 30, 2021 (Table 1). There was some variation, with lower mortality rates in Gujarat, West Bengal, Telangana, Assam, Odisha, Uttar Pradesh, Rajasthan, and Bihar and higher mortality rates in Maharashtra, Delhi, and Punjab (Table 1).
Excess Mortality
Excess mortality data was available for regions of 18 states, including 17 states in 2020 (Table 2), and 17 states in 2021 (Table 3). The best available estimates of the mortality range, whether based on reported Covid-19 deaths, or on excess mortality, are tabulated in Table 4. Table 5 presents the top-line model to estimate excess mortality during the pandemic in India.
For Kerala, Delhi, Madhya Pradesh, Punjab, and Uttar Pradesh, the excess mortality based on registered deaths was actually negative for 2020. This may be because fewer people were willing to register deaths during lockdown, or because fewer people died from accidents and other causes during lockdown (Table 1).
At the other extreme, the excess mortality in Mumbai (in Maharashtra) was 137.0 / 100K in 2020 (Tables 2 and 4). Similarly, Bengaluru in Karnataka had an excess mortality of 117.3 / 100K in the first 7 months of 2020 (Tables 2 and 4).
For 2020, intermediate levels of excess mortality were seen for Kolkata in West Bengal (46.1 / 100K), Chennai in Tamil Nadu (83.5 / 100K), and Hyderabad in Telangana (88.2 / 100K) (Tables 2, 4).
For 2021, a significant peak in all-cause mortality was seen for March through June for Chennai in Tamil Nadu, Kolkata in West Bengal, Delhi, Madhya Pradesh, Haryana, Punjab, and Andhra Pradesh (Tables 3, 4). These findings correspond with news reports of increasing severity of the pandemic in India. During the first half of 2021, the excess mortality was 64.5 / 100K for Kolkata, 67.1 / 100K for Delhi, 120.3 / 100K for Punjab, 145.5 / 100K in Haryana, 176.8 / 100K for Madhya Pradesh, 197.9 / 100K for Andhra Pradesh, and 282.3 / 100K for Chennai in Tamil Nadu (Tables 3, 4).
Integrated Model of Covid-19-related Mortality
A model of pandemic-related mortality which integrates the available data is shown in Table 4. Generally, the excess mortality exceeded the official Covid-19 mortality figures, and was therefore taken as the pandemic-related mortality. However, the Hopkins data on reported Covid-19 deaths were used for Kerala, Delhi, Himachal Pradesh, and Punjab in 2020, and for the low end of the uncertainty range for several regions.
Despite the wide uncertainty ranges for several states and time periods, the overall uncertainty range was more narrow. In the primary model, data from Uttar Pradesh were excluded because of identified anomalies. For 2020, the pandemic-related mortality for 16 states with a population of 972,303,003 million was estimated to be 54.4 to 77.1 / 100K (Table 4). For 2021, through June 30, the pandemic-related mortality for 16 states with a population of 983,052,298 million was estimated to be 78.5 to 117.3 / 100K (Tables 4,5).
Summing these estimates for 2020 and 2021, we estimate the pandemic-related mortality to be: 132.9 to 194.4 per 100,000 population for the entire pandemic (through June 30, 2021, Table 5). Assuming a population of India of 1,352,642,280, these rates correspond with a mortality range of 1.80 to 2.63 million people perishing during the pandemic in India from Covid-19 by June 30, 2021.
If the data from Uttar Pradesh are included, then the estimated pandemic-related mortality was: 111.6 to 161.0 per 100,000 population for the entire pandemic (through June 30, 2021), corresponding with a mortality range of 1.51 to 2.18 million people perishing during the pandemic in India from Covid-19.
Discussion
This analysis of excess mortality found that between 1.80 to 2.63 million people may have perished in India as a result of the Covid-19 pandemic, as of June 30, 2021. It should be noted that the estimated per-capita mortality of 133 to 194 per 100,000 in India is similar to that of the United States and many other regions.
Data from Uttar Pradesh contained anomalies and the estimated excess mortality was lower than in other regions. However, if the Uttar Pradesh data are included in the model, the estimated pandemic-related mortality in India through June 30, 2021 was 1.51 to 2.18 million people.
This mortality level is well above the reported Covid-19 mortality of 398,454 in India as of June 30, 2021 (Hopkins). The IHME at the University of Washington currently estimates that the excess mortality in India was 1.14 million persons on June 30, 2021 (IHME 2021a). It should be noted, however, that the IHME’s original model did not look directly at all-cause mortality in India. Rather, the IHME model extrapolated Indian all-cause mortality based on factors such as test positivity rates in India, and all-cause mortality data from other countries, such as Mexico, Brazil, and the United States (IHME 2021b). Our analysis was based on actual counts of mortality in India, and therefore was a more direct approach to estimation.
Our method of determining the expected baseline, by making a projection using linear regression one year (for 2020) or two years (for 2021) into the future, was more conservative than simply using the baseline average mortality, or projecting just one year forward (even for 2021). In other words, our analysis used the accepted Covid mortality, unless there was very compelling excess mortality data to reject the official numbers. Thus, our estimates of pandemic-related mortality for various regions are more conservative (i.e. lower) than some other studies and news reports. For instance, Deshmukh and colleagues estimated excess mortality in India of 3.2 million based on Civil Registration System data from 5 states and 5 cities (Deshmukh 2021). Anand and colleagues estimated excess deaths of 3.4 million based on death registrations from seven states, including the somewhat problematical data Chhattisgarh data (Anand 2021). If the number of death registrations increased by a fixed amount every year between 2015 and 2021 in a given state, our study would have reported no excess deaths, while the studies of Deshmukh and Anand would have attributed each annual step increase to the pandemic. Their approach may ultimately prove to generate more accurate estimates, but in the face of uncertainty, we elected to take a more conservative approach. Our definition of the baseline by linear regression was used previously in the World Mortality Dataset (Karlinsky 2021).
Based on mortality data from the online Health Management Information System of the Ministry of Health and Family Welfare, Deshmukh estimated excess deaths in India of 2.7 million through June 2021 (Deshmukh 2021).
Survey data can supplement mortality estimates from death registrations. Based on the consumer pyramid household survey (CPHS) produced by the Center for the Monitoring of the Indian Economy (CMIE), Anand estimated excess mortality in India during the pandemic of 4.9 million persons (Anand 2021). Based on a national telephone survey conducted by Cvoter India OmniBus, Deshmukh estimated 3.1 to 3.4 million excess deaths through June 2021 (Deshmukh 2021). One limitation of this survey is that it included deaths outside the immediate household (Deshmukh 2021).
Seroprevalence data can also help to assess the extent of infection in a population. Based on application of international age-specific infection fatality rates to Indian demography and seroprevalence, Anand estimated the Covid-19 related mortality in India to be 4.0 million (Anand 2021). One limitation of this approach is that the infection fatality rates may differ from country to country, depending on the medical system and host factors.
Guilmoto analyzed Covid-19 mortality in several well-defined Indian populations: deaths in Kerala, elected representatives, Indian Railways personnel, and teachers in Karnataka (Guilmoto 2021). Application of the age-specific mortality rates to the Indian population yielded a mortality estimate of 2.2 million persons by late May 2021 (Guilmoto 2021).
Our analysis has a number of limitations. The data analyzed are still incomplete for many regions and times. There may be delays in registering deaths. In addition, available data obtained from regional government websites, central government compilations, and by reporters through RTI requests are not in complete agreement. Some of the data upon which the model is based may simply be wrong, because early reports may contain errors. All-cause mortality may be higher not only due to infection with the Sars Cov-2 virus, but also because of health system overload, delays in patients entering the health system for other conditions (Wu 2020, Aldujeli 2020), or social changes, such as lockdowns. A number of countries, such as Australia and New Zealand, have experienced lower than normal mortality during the pandemic (Karlinsky “World Mortality”). Lower mortality rates may occur because there are fewer accidents or homicides, etc. On the other hand, other factors may lead to higher mortality rates during lockdowns in some regions.
Time will tell what the true death toll has been, as the early data are confirmed, additional regions provide more complete mortality data, and data from diverse sources are reconciled. Additional surveys and seroprevalence data can supplement the estimates from death registrations.
Data Availability
Data are in the appendix of the article.
Appendix
Acknowledgments
The author would like to acknowledge the data journalists, academicians, and nonprofits in India who have brought these data to light, such as Rukmini S. (@Rukmini), Chinmay Tumbe (@ChinmayTumbe), Vignesh Radhakrishnan (@VigneshJourno), Srinivasan Ramani (@vrsrini), Deepak Patel (@deepakpatel_91), Murad Banaji (@muradbanaji), Sumitra Debroy (@debroysumitra), Dhanya Rajendran (@dhanyarajendran), Shiba Kurian (@shiba_kurian), and Arappor Iyakkam. In addition, we acknowledge the important work of Ariel Karlinsky and colleagues, and the Development Data Lab for preparing many of the mortality databases we used. However, all errors in the paper are the responsibility of the authors.
Footnotes
The author has no conflicts of interest to disclose.