Abstract
After SARS-CoV-2 set foot in India, the Government took a number of steps to limit the spread of the disease in the country. This study involves assessing how the disease affected the population in the initial days of the epidemic. Data was collected from government-controlled and crowdsourced websites and analyzed. Studying age and sex parameters of 413 Indian COVID-19 patients, the median age of the affected individuals was found to be 36 years (IQR, 25-54) with 20-39 years males being the most affected group. The number of affected males (66.34%) was more than that of the females (33.66%). Using Susceptible-Infected-Removed (SIR) model, the range of contact rate (β) of India was calculated and the role of public health interventions was assessed. If current contact rate continues, India may have 5583 to 13785 active cases at the end of 21 days lockdown.
Article Summary Line The study gives the epidemiological characteristics of the SARS-CoV-2 epidemic in India, where unlike other countries, the 20-39 years males are most affected, and the SIR model predicts the probable number of cases of COVID-19 by the end of the 21 days lockdown in the country, which will help to develop appropriate public health interventions to control the COVID-19 epidemic.
INTRODUCTION
In December 2019, a cluster of patients with pneumonia of unknown origin was encountered in Wuhan, China (1). The causative agent was determined to be a novel virus of the Coronaviridae family of RNA viruses and was claimed to be of zoonotic origin (1). Due to its close relationship with the SARS-CoV, this novel coronavirus was named SARS-CoV-2 (2). Since then, SARS-CoV-2 has caused a widespread outbreak of the disease now known as COVID-19 and was declared to be a pandemic by WHO on March 11 2020 (3). Human to human transmission occurs primarily through close-contact with the infected person, through fomites in the immediate surroundings of the infected person and via droplets of respiratory secretions (4, 5), although there is limited evidence pointing to a possibility of airborne and faeco-oral transmission as well (6, 7). According to few case studies, transmission may also occur via viral shedding in “pre-symptomatic” individuals during the incubation period (8, 9).
The incubation period for COVID-19 is thought to be within 14 days of exposure, with a median incubation period of 4-5 days (4, 10, 11). The median age of patients affected by COVID-19 is 47 years with the most common clinical findings being fever and cough (4, 12). Other symptoms include expectoration, headache, myalgia fatigue, and diarrhoea and haemoptysis in rare cases. About 18% of patients develop shortness of breath (4). Severe disease has been reported to occur in 14% of patients with older age and pre-existing chronic disease being risk factors (4, 13).
Critical disease requiring intensive care unit admission has been reported in 5 percent, and overall case-fatality rate as 2.3% (13). Currently, there are no approved treatments for COVID-19 and clinical trials such as the WHO SOLIDARITY trial are underway to evaluate the effectiveness of drugs like lopinavir-ritonavir, remdesivir, hydroxychloroquine and azithromycin (3, 14).
India reported its first case of COVID-19 on 30 January, 2020; a medical student who had travelled from Wuhan, China, the then epicenter of COVID-19 (15). While there were only a few reported imported cases in the month of February, the number began to increase rapidly in March (14). On 15th March 2020, India sealed its borders and stopped all international flights, meaning all the initial imported cases which seeded COVID-19 in India arrived in the Indian subcontinent before 15 March 2020 (16). According to the data available in the public domain, India had approximately 3726 cases (both imported cases and due to person-to-person transmission) as of 5 April 2020 (17). According to the Indian Council of Medical Research, as of 31st March 2020, community transmission has not yet started and India is in category 2 of WHO classification for transmission patterns, i.e., sporadic cases without evidence of community transmission (18).
The most important question in the current scenario concerns the mathematical parameters of the initial spread of COVID-19 in India, and what are the epidemiological aspects that can predict this spread. We acknowledge there are certain difficulties in making precise calculations due to the rapidly changing dynamic of the epidemic in the early stages, limited availability of data in the public domain, absence of robust line listing of cases and limited testing capacity.
Nonetheless, mathematical models with reasonable assumptions based on available information can help in analysis of the currently available data to provide important insights for guiding public health interventions. For a predictive model to be relevant to the reality, the model itself should represent what is happening in the real world. The most basic of these models is the Susceptible-Infected-Removed (SIR) model (19,20), which we’ve used in the current Indian scenario to determine the range in which contact rate β lies and also calculate the range of the current reproduction number, Rt.
In the first part of our paper, we outline the several public health measures taken by India in response to the COVID-19 outbreak. We then study and analyze the epidemiological parameters of COVID-19 in India using publicly available data till 28 March 2020. Finally, we use the SIR model, and run simulations to determine the effective contact rate β, and derive from that the reproduction number Rt, and comment on the result of the nationwide public health measures implemented from mid-March 2020 and the lockdown implemented from 25 March 2020.
INDIA’S OUTBREAK RESPONSE
Since the beginning of the outbreak in India, there have been a number of interventions done at various levels by the state governments and by the Central Government agencies. The number of cases started increasing in March (21), prompting a number of interventions to control the outbreak. The first interventions were mostly related to travel advisories and bans. Gradually there were more social distancing measures in March which were then followed by lockdowns, ultimately culminating in a nation-wide 21 days lockdown from 25 March, 2020 (Table 1)
METHODS
Epidemiological analysis
Data Source
The raw data was collected from the patient database of covid19india.org which is a crowdsourced patient database for positive cases of SARS-CoV-2 which have been confirmed by laboratory testing, including data from state government and central government agencies (https://www.covid19india.org/). The data was taken for cases confirmed on 28 March, 2020 or earlier. Only those patient data were analyzed which had both age and sex data of the patient mentioned. For convenience of analysis, patients whose age data was given as a range were excluded.
Study population
The above method yielded a sample of size n=413. Of these patients, the status of the patient as of 28th March 2020 was not known for 13 patients.
Study Design
For the epidemiological analysis part of the study, we used a descriptive design. After collecting the data analysis was done in regard to the age distribution, status of patients and sex distribution using Microsoft Office Excel 2007. Fatality ratio in any category was found out by dividing the number of deaths in the category by the number of affected individuals of that category. Central tendencies for age of patients of various categories were also calculated.
Mathematical analysis
The spread of the disease in the population is studied by the SIR model (19,20) which divides the (fixed) population of N individuals into three “compartments” which vary as a function of time (For purposes of this study, we have not included vital dynamics like birth and death rate.) –
S(t) - S(t) are those susceptible but not yet infected with the disease (in a novel disease like nCOV-19,the entire population is assumed to be susceptible as there is no pre-existing immunity);
I(t) - I(t) is the number of infectious individuals;
R(t) - R(t) are those individuals who have been removed from the infected population(includes those who have recovered from the disease and also the deaths)
The SIR model describes the change in the population of each of these compartments in terms of two parameters, β and γ.
β-β describes the effective contact rate of the disease: a susceptible individual comes into contact with an infectious individual and acquires the disease. This parameter takes into account both the number of people contacted per unit time, and the effectiveness of transmission in each contact. It reflects the force of infection of the disease and helps us understand at what rate the epidemic is progressing
γ - γ is the mean removal rate: In our model, it is calculated using the removed cases as against the new cases on a daily basis.
β and γ are useful in the SIR model using the following differential equations- Assuming the present scenario in India where I<<N and R<<N, S is almost equal to N. So S/N=1. Putting this in equation 2 and integrating, we get where A is determined the number of cases at t=0.
We have used the data available till 4th April to estimate the two parameters, β and Rt (time varying reproductive number) with the help of SIR model (www.statista.com). We assumed that the recovery rate γ would remain constant for the population. The removal rate followed a normal distribution, and the mean was calculated with the data available from 01 March to April 04, 2020 (22). Since the effect of interventions would reflect in the contact rate β, we then took the value of γ to be constant equal to the mean (0.103) and ran the SIR model multiple times by varying the value of β, and comparing the trends with the real data. We plotted the trendline for the real data using Microsoft Office Excel 2007, and used the equation of the curve to find out the trend of β in India in the present day scenario by comparing it to equation 4.
RESULTS
Among the 413 patients, there were 274 (66.34%) male and 319 (33.66%) female. The patients affected in India had a median age of 36 years (IQR, 25-54). The median age of females was higher (40 years; IQR, 24.5-59) than males (35 years; IQR, 25-50). Nearly half of the patients (49.88%) of affected patients were in the age group of 20-39 years. (refer Table 2). There were sixteen deaths (3.8%). Majority of them (68.75%) were in the 60-79 years age group. The median age of deceased patients was 65 years (IQR, 59.25-69). The mortality rate in male patients was 4.38% and for female patients was 2.88%. Overall case fatality rate was 3.87%. There is a 16.18% mortality rate in the 60-79 years age group and 20.0% in the equal to or above 80 years age group. (refer Table 3)
For recovered patients, most were in the 20-39 years age group followed by the 40-59 years age group. Median age was 36 years (IQR, 21.75-56.25). As per the data as on 28 March, 2020, 14 of these 413 patients had recovered.
The number of hospitalized patients accounted for 370 of the 413 patients. However, status of 13 of the 413 patients was not available.
After running multiple simulations using the SIR model, all assuming different values of β (Appendix Table), we found that the value of β in the current Indian scenario calculated from the trendline of real data lies around 0.258, which is also visible in the graph of real-time active cases lying between β=0.24 and β=0.29 curves (Figure 3).
Also visible in the graph, is the real data line shifting from 0.24 to 0.28 from day 8. Also, as Rt = β/γ, the value of the reproductive number is found to be varying between 2.4-2.9 in the Indian scenario until 4th April 2020. Considering the present trend of β, India can have nearly 7765 active cases by the end of the 21 days lockdown. Considering the range in which contact rate varies, number of active cases after 21 days lockdown period may vary from 5583 to 13785. (Figure3)
DISCUSSION
After the disease set foot in India on 30th January, 2020, the government started taking precautionary measures to control the outbreak of the disease. These interventions included restricting the inflow of international passengers, self quarantine measures, directives on testing and management strategies in-country travel restrictions.
The median age of affected patients in our study sample of 413 was 36 years (IQR, 25-54) and higher in females (40 years) compared to males (35 years). In study cohorts of Wuhan, the median age of affected patients ranges from 49-56 years (4, 23, 24). Thus, there are more people affected at a lower age in India when compared to China. This observation can be explained by the population distribution of India. In previous studies on COVID-19, it has been established that the risk increases with age and comorbidities (13, 23, 24). However, according to the population demographics in India for 2020 (25), the broad based nature of India’s population pyramid means there are more people in the younger age group and very few people in the equal to or above 80 years age group. This is reflected in the number of cases which are more in the younger age groups. On normalizing the percentage of patients in each age group with the corresponding percentage representation of the population, we observed that the highest number of male patients is in the 20-39 years age group category. Interestingly, according to this analysis, males in the 20-39 years age group are affected more than the 60-79 years age group males and equal to or above 80 years age group males. This is something that has not been reported until now and it has to be seen whether this changes as the number of cases in India grows. This is new to the literature of COVID-19 and needs to be studied further. Similarly finding the normalized ratio of affected patients by age group shows that for females, equal to or above 80 years age group is most affected.
As of 30 March, 2020, India has still not officially reported community spread (26). So the cases now are people who were affected in foreign countries or their direct contacts. In 2018, Indian residents between 35 and 49 years of age took the most holidays outside the nation (27). So, it is also possible that the number of young patients is more in India than other countries because they constitute the majority of outbound tourists. If community spread begins, this can get altered.
In our study, we found a case fatality rate of 3.87% which is lower than countries like Italy (28). The explanation for a lower mortality rate due to COVID-19 infection could be the universal immunisation policy against BCG (29). Case fatality rate was higher for males (4.38%) than females (2.88%). In China, the case fatality rate was found as 2.3%, 14.8% in the above 80 years population, and 8.0% in the 70-79 years population (13). Our estimate gives a higher value which may be due a smaller sample size or can also be because mild cases of COVID 19 have so far been missed due to limited testing capacities in India. In our study, the mortality rates for 60-79 years age group (16.18%) and equal to or above 80 years age group (20%) may be an underestimation or overestimation of the actual mortality rate as the total number of deaths is still small and many are still hospitalized. The median age for recovered patients (36 years) is lower than the median ages for the deceased (65 years) implying younger patients have a higher chance of recovery. This is in accordance with findings from other countries (13, 30). If we consider all the patients, males account for 66.34% of patients, implying more males are generally affected. 75.0% of the deceased were also males which show that males are more vulnerable agreeing with previous studies (31, 32).
In the SIR model, the value of β being 0.258 (the range being between 0.24-0.29) and the trend for Rt being about 2.50, shows that the interventions which were put in place by the Indian government (assuming a lag period of 11 days) during the mid-March period were partially effective, preventing the scenario where Rt can reach even more than 4 (33), and further data would tell how effective the lockdown is. The increase in β from day 8 is probably due to the identification of the Tablighi cluster and the cases it added to the Indian data (34).
According to a study (33) in the early days of the epidemic, Wuhan city and Hubei province reported Rt between 1.85 and 4.46 which aligns with our study findings. All over China, the Rt varied from 1.23 to 5.77. South Korea which has high population density like India had a decreasing trend of Rt from 9.72 on 20 February to 1.50 on 7 March. This indicates that the interventions have been helpful in preventing the worst case scenario in India but is unable to prevent the spike in number of cases (34,35).The situation can still be controlled if Rt can be brought down close to 1.This indicates the need for more effective strategies and ensuring optimum testing to avoid underestimation of danger.
The impact of the COVID-19 response (overall quarantine regulations, social distancing, and isolation of infections) in China in this context is encouraging for the many other countries (36) where the Rt gradually stabilized for most provinces ranging between 0.96 to 1.57 where there were more than 100 cases (33), and India should try and replicate this when community transmission starts by strict enforcement of lockdown measures to get the Rt closer to 1,so that COVID would not overwhelm the Indian healthcare system. The Indian government will not be able to minimise both deaths from coronavirus disease 2019 (COVID-19) and the economic impact of viral spread. Keeping mortality as low as possible will be the highest priority for individuals; hence the Indian government must put in place measures to ameliorate the inevitable economic downturn (37). However the Indian government’s timely decision to put a country wide lockdown into place when the number of cases was documented to be only 415,and effective contact tracing definitely helped to reduce the Rt and prevent an Italy/US like situation (however in the context of violations of the lockdown, more data awaited). Since at the end of the 21 days lockdown, India is likely to have significant number of active cases (5583 to 13785), it may enter the exponential growth phase and it would become very difficult to contain it. Hence, appropriate public health interventions like strict social distancing measures are necessary immediately to decrease the Rt in India.
The increase in β can be attributed to two possibilities: the Nizamuddin super cluster and change in the testing strategy. As on 4th April,2020, 1023 COVID-19 positive cases across 17 states in India can be traced to the Tablighi Jamaat congregation in Nizamuddin (38). Additionally, ICMR recently updated the testing strategy which might have increased the detection of the positive cases significantly (39). For now the epidemic is being reported either in clusters or super-clusters and community transmission is still in its infancy. If the disease leaks into the rural areas or urban slums, it will provide a big challenge to the health care delivery system which is not yet ready to combat the epidemic. India might as well enter the exponential phase of growth of the epidemic curve if stringent measures are not in place. Hence, strict social distancing, optimum testing and acceptance and awareness in Indian community are vital elements to control the COVID-19 epidemic in India.
LIMITATIONS
We have used crowd sourced data from covid19india.org due to lack of availability of official data. We could not include all the cases of COVID-19 till 28th March due to unavailability of the demographic details of few patients.
CONCLUSION
COVID-19 epidemic in India is affecting younger age groups more as compared to other countries.. Although the contact rate decreased initially after the Indian government’s stringent measures, the recent increase may be attributed to factors such as increase in testing or the occurrence of superclusters. Our mathematical model predicts that India will have adequate number of active cases after the 21 days lockdown. Hence, there is a need to develop appropriate public health interventions to control the COVID-19 epidemic in India.
Data Availability
Data taken from publicly available crowdsourced and government databases
COMPETING INTERESTS
None
FUNDING
This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors
ETHICS APPROVAL AND CONSENT TO PARTICIPATE
Mathematical modelling was done and available data in the public domain was used for analysis. Ethical approval was not required.
BIOGRAPHICAL SKETCH
Mehak Arora, Archisman Mazumder, Vishwesh Bharadiya and Parul Berry are all medical students (MBBS) at All India Institute of Medical Sciences, New Delhi. Their research interests include epidemiology, public health and clinical investigations.
ACKNOWLEDGEMENTS
We thank Hemant D Shewade of International Union against Tuberculosis and Lung Disease (The Union), Gowtham Balachandhiran of Panimalar Institute of Technology, Pidarithangal, Tamil Nadu, India, Krushna Prasad Pradhan of National Institute of Technology, Rourkela, Odisha, India and members of the Indian COVID Apex Research Team (i-CART) for their valuable feedback and sharing their expertise.
ABBREVIATIONS
- IQR
- Interquartile Range
- SIR
- Susceptible-Infected-Removed