A comparative study and application of modified SIR and Logistic models at Municipal Corporation level database of CoViD-19 in India ==================================================================================================================================== * Naman S. Bajaj * Sujit S. Pardeshi * Abhishek D. Patange * Hrushikesh S. Khade * K. K. Mate ## Abstract The WHO declared a global pandemic owing to the newfound coronavirus, or Covid-19, in March 2020. The disease quickly spread around the world by contagion, and the lack of an appropriate vaccine has led to limited social activities in every track of life. Several national and state-level studies conducted predict the course of the pandemic using machine learning algorithms, most common being the SIR and the Logistic models. However, it is unclear whether these models work for a controlled community like Municipal Corporation or not. With measures now being employed at Municipal levels in India, it only fits to conduct particular research to examine how these models perform at lower jurisdictions. This study provides concrete evidence to show the superiority of the modified SIR model over the Logistic model based on analysis. The models not only give accurate predictions for up to 14 days but can also be used to define and signify the practicality and effectiveness of the decisions taken by the authorities. This feature of the study allows us to justly say that the government action of Unlock 1.0 was not a wise decision considering the nature of the pandemic. This study hopes to help the authorities to take the proper actions to prevent any further aggravation of the spreading virus. In conclusion, Municipal corporations having control should make use of this study to make decisions and test their effectiveness, and more corporations should be empowered to benefit from this study. Keywords * CoViD-19 * Pandemic * SIR model * Logistic model * Statistical approach ## 1. Introduction The WHO declared a global pandemic of the newfound coronavirus, also named as Covid-19, in March 2020. Originated in Wuhan, China, the virus compromises the host’s respiratory system. The virus has had a comparatively lower death rate as compared to previous pandemics. However, the outbreak has disrupted the way of life for many and hit the world as a whole. Covid-19 is contagious and spreads through one’s respiratory droplets by coughing or sneezing. Once infected, it takes about a week or two for symptoms to show up. These can include mild to high fever, weakness, dry coughing, and shortness of breath in severe cases. The lack of a fitting cure or vaccine has limited the social activities in every track of life. Social distancing, self-quarantine, wearing a face mask, and frequent use of sanitizers are practiced extensively for the mitigation and control of the pandemic [1]. Be it the field of science and technology, the health sector, corporate worlds, share markets, or the daily applications used for entertainment and navigation, machine learning has proved to be useful. Also, significant production of data has become possible due to remarkable advances in biotechnology and health sciences [2]. Researchers are coming up with ways to better the algorithms, and scholars are using these methods to make useful predictions such as declaring a tumor fatal, treatability of cancer, to name a few. Pandemics are no exception to this, and machine learning algorithms are used to predict the turn of events. The Susceptible-Infected-Recovered (SIR) model is an effective way of fitting data of the current pandemic. Several research studies were conducted using the same model and have proven to be beneficial. The SIR model is a compartmental model for modeling how a disease spreads through a population [3]. Another method is the logistic plots. A logistic curve fits the data of population growth efficiently. Data for the growth of the infected population exhibits the same nature, and hence logistic models are also used to predict pandemic states. Several studies comparing and proving these methods better than others or studies with adjusted SIR and logistic models are available publicly. This particular study wishes to conclude with a comparative study between the SIR and logistic models. The primary goal is to present the results and help government organizations decide further steps needed to be taken to hinder the spread. Section 2 takes the reader through the recent attempts to use SIR and logistic models for predictive analysis of the Covid-19 pandemic. The mathematics of the models is explained in Section 3. Section 4 details the case study and motivation regarding the same. The results of the models are presented in Section 5. Discussions and inferences are presented in Section 6, and Section 7 follow with the conclusions. ## 2. Literature review As mentioned earlier, an extensive study was conducted on the coronavirus pandemic for various countries and states as well. The SIR model was the primary candidate for this. A simple SIR model for the UK and eight other European countries identified the pandemic characteristics as time-invariant across the world, and some highly variable, and also briefly studied the small but detectable average temperature effects on the probability α of infection [4]. Interestingly enough, the SIR model also predicts that high-risk individuals will be able to leave the lockdown well before vaccine arrival. The model was used to arrive at additional conclusions, presented as an exit time control problem where the lockdown ends with herd immunity [5]. The SIR model is compartmental. Also, many extensions of the same are proposed, in particular adding a compartment of exposed but non-contagious individuals, called the SEIR model [6] [7]. The bell-shaped distribution curves in the SIR model may not fit the Gaussian function as the CoViD-19 life cycle curves are not fully symmetric. A drawback of the SIR model is that it predicts the end dates to be much earlier than the actual end dates [8]. On the other hand, several scholars chose logistics modeling to predict the state of the pandemic. The efficacy of the model is a function of its fit to the available dataset. The question of whether we can find curve parameters that fit the complete dataset was addressed using a generalized logistic curve (or the Gompertz curve) for China and South Korea [9]. Another quantitative mathematical approach was able to derive a logistic model to characterize the age-specific case-fatality rates (CFRs). It was inferred that CFR does indeed increase with age [10]. A second peak was observed in the infections in late May of 2020 due to the relaxation of the mitigation efforts in the US. The basic logistic model fails to fit the data due to such relaxations, and an Adaptive Logistic Model (ALM) was designed, which gives more accurate predictions [11]. A Bayesian hierarchical five-parameter logistic model was fit to and approximated the observed data closely and could derive acceptable predictions [12]. Yet another logistic growth model was built for Italy, France, and Spain to help government authorities decide when to start the lockdown in a way not to exceed the health care capacities of the concerned country [13]. The search for a comparative study between the two models was short-lived for the lack of relevant literature. Forecasting of the spread of coronavirus in 8 different countries was undertaken based on both the logistic model and the SEIR and adjusted SEIR models [14]. An estimation of the final size of the pandemic for China, South Korea, and the rest of the world was done in two separate studies, one with the logistic model and the other with the SIR model [15]. Employing both models, the current stages of the pandemic in states of Karnataka, Kerala, and Maharashtra in India were predicted by a five-stage classification [16]. After realizing the existence of numerous variables at play at the Municipal level, the Indian Government allowed lower jurisdictions to take actions suitable to the concerned province. To explore the dynamics and nature of the pandemic in a controlled locality such as a Municipal Corporation, a comparative study was conducted that revealed exponential curves did not fit the Municipal data. Instead, a cubic curve gave better results, and more interestingly, the active cases followed a multi-peak Gaussian curve [17]. Gathering from above, several studies were conducted with the SIR and logistic models. Many tweaks were employed by scholars to achieve a better prediction from these models. However, there seems to be a lack of a detailed comparative study of these two methods, especially at a lower level. Till now, it is unclear whether these models work for a controlled community like Municipal Corporation. With measures now being employed at Municipal levels in India, it only fits to conduct particular research to examine how these models perform at lower jurisdictions. The aim is to help authorities take the correct steps based on these predictions to curb the mounting pandemic. With this motivation, this study intends to present the outcomes of both models at the National and Municipal level, by mapping and evaluating datasets of Akola, Mira-Bhayandar, Kalyan-Dombivli Municipal Corporation (KDMC), and India. ## 3. Mathematics of models ### 3.1 The SIR model The compartmental nature of the model is seen in the following equations, where the total population is divided into three categories. ![Formula][1] ![Formula][2] ![Formula][3] Here, at time T * *S*(*T*) = *S*: Number of susceptible people * *I*(*T*) = *I*: Number of Infected people * *R*(*T*) = *R*: Number of recovered people * *α*: Contact rate * 1/*β*: Average infectious period The total population size, P is then given as ![Formula][4] The initial conditions are taken as, ![Formula][5] Solving equations (1) and (3) simultaneously and integrating, we get ![Formula][6] MATLAB’s *ode45* function was used to integrate the model equation. Taking the limit as T →∞, ![Formula][7] where *S*∞ is the number of susceptible people remaining and *R*∞ is the end number of recovered people. Here, the susceptible are taken to be the entire population and this is how it differs from the following modified SIR model. ### 3.2 Modified SIR model The mathematics of the modified model is very much similar to that of the SIR model above and continued below. In the above model, the entire population is taken as the number of susceptible. However, younger people are found to be less susceptible to the virus. The COVID-19 pandemic has shown a markedly low proportion of cases among children where the age dependence in the probability of developing clinical symptoms rises from around 20% in under 10s to over 70% in older adults [18]. Also, people infected by the virus but are asymptomatic do not cough as frequently, which significantly reduces the chances of them spreading the virus. It was observed in 13 eligible studies that the symptomatic cases emerged to be a critical factor with very low transmission probability during the asymptomatic phase [19]. Hence, in equation (6), number of susceptible is not the entire population for the modified SIR model. As there are zero infected people in the end, *I*∞ = 0 and from equation (4), ![Formula][8] Plugging value of *S*∞ from equation (6), ![Formula][9] Number of recovered people at *T* = 0 *is R* = 0 and infected people, a constant *I* = *K* We need to estimate the model parameters *α* and *β* and the initial values *S*,*I*. From the data, we can write the total number of cases *K* as ![Formula][10] The estimated values of parameters and initial values give an estimate of *K* as ![Graphic][11] which we compare with the actual *K*. The optimum values of the model parameters and the initial values can be found by minimizing the squared error or difference between the estimated and actual number of cases i.e. ![Formula][12] where, *KT* = (*K*1,*K*2,*K*3,…,*Kn*) are actual cases at times *T*1,*T*2,*T*3,…, *Tn* ![Graphic][13] are respective estimates given by our model for a set of estimates of *α*, *β*, and initial values *S* and *I* The SIR model can be better grasped by observing the following flowchart. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F1) Figure 1: Flow diagram of SIR model SIR and logistic models are often used to model epidemics as both are a good fit for the exponential growth rate [20]. As mentioned earlier, many scholars have used both models extensively, but separately to get appreciable results. Batista [15], in one of the few comparative studies found says, qualitatively, both models show that the epidemic is moderating for the time of February in case of China. The confidence level of a model tells us the range of possible deviation one might observe with respect to the real world data. Both SIR and logistic models give the narrowest confidence levels meaning their results are more reliable [20]. It therefore follows to study the mathematics of the logistic model to make a solid argument as to which model fits better for a Municipal Corporation level data of CoViD-19. ### 3.3 The Logistic model The logistic growth model is as follows, ![Formula][14] where * M – Accumulated number of cases * N – Final epidemic size (N > 0) * *γ* – Infection rate (*γ* > 0) The solution for the non-zero positive initial number of cases is, ![Formula][15] Here, ![Formula][16] *M*(0) = *M* When T<<1, assuming final size is much larger than the initial cases, C>>1 We get natural growth, ![Formula][17] When T →∞, Weibull function is followed as, ![Formula][18] We reach the maxima for growth rate ![Graphic][19] when ![Graphic][20] From this and above expressions of *M*, the growth rate peak is observed when ![Formula][21] At this time *Tp*, ![Formula][22] Also, the growth rate is, ![Formula][23] Often the intensity is judged by the doubling time, i.e., the time it will take to double the current number of the infected population. ![Formula][24] ![Formula][25] Solving for δ*T*, ![Formula][26] The first term represents initial exponential growth beyond which δ*T* increases with *T*. As the infected population tends to the peak, doubling time tends towards infinity, i.e., when *M* → *N*/2,δ*T* →∞. Beyond *N*/2, doubling time is not defined as it represents the region beyond the peak. One would want to know the final size of the pandemic *N* which can be estimated from all final size predictions *N*1, *N*2, *N*3,…, *Nn* at times *T*1,*T*2,*T*3,…,*Tn* respectively. The iterated Shanks’ transformation gives ![Formula][27] Here, care must be taken as there is no natural law behind the Shanks’ transformation, as in cases like *N*<*Mn*, the calculated limit is useless. (*M*1,*M*2,*M*3,…,*Mn*) are the total number of cases at the respective times) The logistic model (12) is non-linear, and the initial guess should be provided carefully. As the equation (13) follows an exponential growth, predicting final size is difficult at the early stages. With enough data, however, one can use the following process to get a good initial condition. Using *T* from equation (12) for 3 equidistant points ![Formula][28] The following solution is acceptable if all unknowns are positive, ![Formula][29] ![Formula][30] ![Formula][31] The above equations help to find an initial guess in the *fitVirus03* program. We choose the first, middle, and end data points for equations in (20). The regression analysis is questionable if this calculation fails. Finally, the parameters of the logistic model *N*,*γ*, and *A* are calculated by a least-square fit using the *lsqcurve* and *fitnlm* functions in MATLAB. ## 4. Methodology The Indian subcontinent occupies most of South Asia and is ranked third in the list of worst-hit countries with over 2.4 million confirmed cases as of 16 August [21]. Data from 29 January to 9 June formed the database of the total reported and recovered cases, deaths, day-wise and active cases for a national-level analysis. Maharashtra stands as one of the most affected states in India, with over 844K confirmed cases as of 05 September [21].Three regions from Maharashtra: Akola, Kalyan-Dombivli Municipal Corporation (KDMC), and Mira-Bhayander were chosen for the statistical analysis. The official websites for the respective regions under scrutiny provided the data [22] [23] [24]. Akola’s statistics from 7 April to 9 June was computed. Data for Kalyan-Dombivli Municipal Corporation (KDMC) from 14 March to 9 June computed the results, whereas that of Mira-Bhayander was from 27 March to 9 June. The total cases, active patients, discharged patients, and deaths were listed in the data for each region for the respective time frames. The start date in each of the data was the date when they classified the first case. The model predicted exponential and non-linear relations, which required sophisticated software to handle operations on data with ease. MATLAB was used for the statistical analysis of the above data and to plot it graphically. ## 5. Results This section covers the results of SIR and Logistic models employed for each of the regions and India. The model parameters, statistics of the total cases and daily cases, and final states predicted by each of the three models are formulated in a table under every sub-section. ### 5.1 India SIR model was used for two different timelines: data from start of pandemic to 30 May (Hereafter, SIR Model 1), and data from start of pandemic to 9 June (Hereafter, SIR Model 2) to incorporate the effects of Unlock 1 that took place on 1 June, 2020. Figure 2 shows the prediction of the pandemic given by the SIR model 1 for the Indian population. The vertical red line in graph (2a) corresponds to the peak of daily new cases showcased by graph (2b). The total cases recorded every day are represented in a bar graph, and the growth rate is presented in percentage below. The news of Unlock 1.0 spread before 1 June and its effect on people was apparent since before 30 May. To highlight the difference between SIR 1 and SIR 2 clearly, data till 24 May was considered for India, as shown in Figure (2a). ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F2) Figure 2: Graphical representations of SIR model 1 30-Jan to 24-May The following set of graphs show the results of the SIR model 2, where the effect of unlock was included. Figure 3 shows the same in a model curve, the bar graph for daily cases where the peaks and their timeline is to be carefully noted, and the growth rate factor. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F3) Figure 3: Graphical representations of SIR model 2 30-Jan to 09-Jun Figure 4 shows the prediction given by the logistic model for India and the bar graph of daily cases. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F4) Figure 4: Graphical representations of logistic model 30-Jan to 09-Jun View this table: [Table 1:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/T1) Table 1: Parameters, statistical values and final state for India The table above formulates all significant statistical values and the model parameters for the models’ SIR 1, SIR 2, and logistic. The difference between SIR models 1 and 2 is clear from Figures 1 and 2 plotted for India. The graphs for the same were plotted for regions below but not shown henceforth. The difference between the two models for the Municipal Corporations is apparent from the results in Table 5. ### 5.2 Akola Figure 5 below shows the prediction by the SIR model 2. Peaks correspond to the dates of significant developments, and the graphs show a change in the daily number of cases. ![Figure 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F5) Figure 5: Graphical representations of SIR model 2 07-Apr to 09-Jun Similarly, Figure 6 shows the pandemic for the same timeline but using the logistic model. ![Figure 6:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F6.medium.gif) [Figure 6:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F6) Figure 6: Graphical representations of logistic model 07-Apr to 09-Jun The model parameters, accuracies and final predictions for Akola are put together in the table below, Table 2. View this table: [Table 2:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/T2) Table 2: Parameters, statistical values and final state for Akola ### 5.3 KDMC The Kalyan-Dombivli Municipal Corporation’s official website provided the data for plotting the predictions of SIR model 2 and the logistic model, as shown in the following set of graphs. The graph (7b) shows the variation of daily new cases in the form of a single peak Gaussian curve. Different parameters were chosen for the model to get a range of predictions to increase the accuracy of predictions. The use of Gaussian is also beneficial to detect any anomalous behavior in the trend of daily new cases, which can then be associated with specific policies employed or actions taken around that timeline. ![Figure 7:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F7.medium.gif) [Figure 7:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F7) Figure 7: Graphical representations of SIR model 2 14-Mar to 09-Jun A logistic model employed for the data of KDMC in the same timeline shows the trend represented by Figure 8. ![Figure 8:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F8.medium.gif) [Figure 8:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F8) Figure 8: Graphical representations of logistic model 14-Mar to 09-Jun As listed for the previous region and India, Table 3 above summarizes the details of the model employed, its results along with the statistical data, including accuracy, for KDMC. View this table: [Table 3:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/T3) Table 3: Parameters, statistical values and final state for KDMC ### 5.4 Mira-Bhayander Results of the Mira-Bhayander region include the depiction of the employed models, i.e., SIR models, with and without consideration of effects of unlock on the behavior of the masses, and the logistic model. The growth factor features the growth in the daily cases relative to the previous day. This is showcased in the graph (9c), where the growth rate is almost constant, over 5%, and then reduces significantly after the bell curve passes the peak. The graph below shows the logistic model for the Mira-Bhayander and the variation of daily cases with time followed by the summary of results in Table 4. ![Figure 9:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F9.medium.gif) [Figure 9:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F9) Figure 9: Graphical representations of SIR model 2 27-Mar to 09-Jun ![Figure 10:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/18/2020.09.12.20193375/F10.medium.gif) [Figure 10:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/F10) Figure 10: Graphical representations of logistic model 27-Mar to 09-Jun View this table: [Table 4:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/T4) Table 4: Parameters, statistical values and final state for Mira-Bhayander All tables are to be read carefully to understand the difference between the results given by the SIR models and the logistic model at the national and the regional level. ## 6. Inferences * The modified SIR models, namely, SIR 1 and SIR 2 were employed for COVID-19 cases till 24 May and 9 June respectively. The predicted cases of SIR 2 model were approximately 60% more than the total cases predicted by SIR 1 model. This extreme and significant deviation in results can be attributed to the fact that SIR 2 incorporates the effect of Unlock 1.0, introduced on 1 June 2020, after the outbreak of COVID-19, whereas SIR 1 doesn’t. * Table 5 depicts a comparison between the actual cases on 23 June 2020 versus the predicted results obtained after employing modified SIR 1, modified SIR 2 and logistic model on their respective datasets. Both SIR 2 and logistic model take into account the impact of Unlock 1.0, thus giving higher accuracies than SIR 1 model. Furthermore, the superiority of modified SIR 2 model over the Logistic model can be concluded on the basis accuracy of the results obtained. * The actual cases obtained for Mira-Bhayandar even on 5 July match the results obtained from modified SIR 2 model with 90% accuracy i.e. 25 days after analysis. * For districts or municipalities showing fairly lesser number of cases, for example Akola, the model starts to fail due to the slow progress of the pandemic. Though for Akola, the accuracy is still 80%. * The sixth degree polynomial regression models will help Indian doctors and the Government in preparing their plans for the next 7 days [25]. According to the study conducted by Bajaj et al. [17], it only fits to consider that the prediction accuracy for Municipal Corporation level would decrease due to an increase in parameters under consideration. However, not only does the model suggested by this study give high accuracy but it also provides prediction up to 14 days, i.e. double the time period of the study mentioned above. View this table: [Table 5:](http://medrxiv.org/content/early/2020/09/18/2020.09.12.20193375/T5) Table 5: Accuracy of models, 23 June 2020 ## 7. Conclusions * From the rise in cases observed and predicted by modified SIR models 1 and 2, it can justly be said that the introduction of Unlock 1.0 was not a wise decision considering the nature of pandemic. * Modified SIR 2 model is far better than Logistic model as it provides prediction result more accurately. Even though this model has previously only been employed to predict the outbreak statistics for larger areas like countries or states, it can be unequivocally used for Municipal Corporations such as Mira-Bhayandar or localities with slow growth of the pandemic. Furthermore, if no significant environmental changes are introduced in controlled areas like Mira-Bhayander, it can provide prediction accuracy of 90% for at least one and at most 2 months rather than previously considered 14 days period. * Analysis of Akola dataset set shows that the accuracy of the model reduces significantly (only 80%) for areas with cases lower than 1000. * Even though the model cannot be utilized for longer time periods, but a remarkable method to use it would involve predicting the number of cases from the day any new policy to combat COVID-19 is introduced and thereby recommending periodic review and analysis. The results obtained from both analysis would define and signify the practicality and effectiveness of the decision. * Further, regions similar to each other in terms of population, location, and available facilities like manpower, can then implement the same decisions which proved beneficial and can help curb the pandemic in all such regions. * Few states in India like Maharashtra have already extended the authority to take required actions against the increasing pandemic to Municipalities. This study will help these government offices to take steps in the right direction. The author recommends corporations that have control to use this study as a tool and take the right decisions. It is also recommended to extend authority to more Municipal Corporations and benefit from the model. ## Data Availability All datasets were acquired from official websites of respective location [https://www.covid19india.org/](https://www.covid19india.org/) [https://www.mbmc.gov.in/master\_c/important\_information](https://www.mbmc.gov.in/master_c/important_information) [https://dashboard.kerala.gov.in/](https://dashboard.kerala.gov.in/) [https://dio-akola.blogspot.com/](https://dio-akola.blogspot.com/) * Received September 12, 2020. * Revision received September 12, 2020. * Accepted September 18, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## 8. References 1. [1]. I. Cooper, A. Mondal and C. G. Antonopoulos, “A SIR model assumption for the spread of COVID-19 in different communities,” Chaos, Solitons & Fractals, p. 110057, 2020. 2. [2]. I. Kavakiotis, O. Tsave, A. Salifoglou, N. Maglaveras, I. Vlahavas and I. Chouvarda, “Machine learning and data mining methods in diabetes research,” Computational and structural biotechnology journal, vol. 15, p. 104–116, 2017. 3. [3]. B. M. Ndiaye, L. Tendeng and D. Seck, “Analysis of the COVID-19 pandemic by SIR model and machine learning technics for forecasting,” *arXiv preprint arXiv:2004.01574*, 2020. 4. [4]. G. Bhanot and C. DeLisi, “Predictions for Europe for the Covid-19 pandemic from a SIR model,” *medRxiv*, 2020. 5. [5]. E. Bayraktar, A. Cohen and A. Nellis, “A Macroeconomic SIR Model for Covid-19,” *Available at SSRN 3633443*, 2020. 6. [6]. H. W. Hethcote, “Three basic epidemiological models,” in Applied mathematical ecology, Springer, 1989, p. 119–144. 7. [7]. M. Lavielle, M. Faron, J.-D. Zeitoun and others, “Extension of a SIR model for modelling the propagation of Covid-19 in several countries.,” *medRxiv*, 2020. 8. [8]. H. Merchant, “CoViD-19 may not end as predicted by the SIR model,” The BMJ, vol. 369, p. m1567–rr, 2020. 9. [9]. M. Villalobos-Arias, “Using generalized logistics regression to forecast population infected by Covid-19,” *arXiv preprint arXiv:2004.02406*, 2020. 10. [10]. X. Gao and Q. Dong, “A logistic model for age-specific COVID-19 case-fatality rates,” JAMIA open, vol. 3, p. 151–153, 2020. 11. [11]. D. H. Roberts, “A New Adaptive Logistic Model for Epidemics and the Resurgence of COVID-19 in the United States,” *medRxiv*, 2020. 12. [12]. L. Kriston and L. Kriston, “Projection of cumulative coronavirus disease 2019 (COVID-19) case growth with a hierarchical logistic model,” Bull World Health Organ COVID-19 Open Preprints. [http://dx.doi.org/10.2471/BLT](http://dx.doi.org/10.2471/BLT), vol. 20, 2020. 13. [13]. S. Lagdali and A. Saidi, “Logistic Growth Model of the COVID-19 Pandemic to Decide When to Start the Lockdown,” Journal homepage: [http://iieta.org/journals/rces](http://iieta.org/journals/rces), vol. 7, p. 26–30, 2020. 14. [14]. X. Zhou, X. Ma, N. Hong, L. Su, Y. Ma, J. He, H. Jiang, C. Liu, G. Shan, W. Zhu and others, “Forecasting the worldwide spread of COVID-19 based on logistic model and SEIR model,” *medRxiv*, 2020. 15. [15]. M. Batista, “Estimation of the final size of the COVID-19 epidemic,” MedRxiv. doi, vol. 10, p. 16–20023606, 2020. 16. [16]. J. Mackolil and B. Mahanthesh, “Mathematical Modelling of Coronavirus disease (COVID-19) Outbreak in India using Logistic Growth and SIR Models,” 2020. 17. [17]. N. S. Bajaj, S. S. Pardeshi, A. D. Patange, D. Kotecha and K. K. Mate, “Statistical analysis of national & municipal corporation level database of COVID-19 cases In India,” *medRxiv*, 2020. 18. [18]. N. G. Davies, P. Klepac, Y. Liu, K. Prem, M. Jit, R. M. Eggo, C. C. O. V. I. D.-1. working group and others, “Age-dependent effects in the transmission and control of COVID-19 epidemics,” *MedRxiv*, 2020. 19. [19]. K. Shah, D. Saxena and D. Mavalankar, “Secondary Attack Rate of COVID-19 in household contacts: Systematic review,” QJM: An International Journal of Medicine, 2020. 20. [20]. J. Ma, “Estimating epidemic exponential growth rate and basic reproduction number,” Infectious Disease Modelling, vol. 5, p. 129–141, 2020. 21. [21].“John Hopkins University and Medicine,” [https://coronavirus.jhu.edu/region/india](https://coronavirus.jhu.edu/region/india), 2020. 22. [22].“Akola Municipal Corporation,” [https://dio-akola.blogspot.com/](https://dio-akola.blogspot.com/) 2020. 23. [23].“Kalyan Dombivli Municipal Corporation,” [https://kdmc-coronavirus-response-skdcl.hub.arcgis.com/](https://kdmc-coronavirus-response-skdcl.hub.arcgis.com/) 2020. 24. [24].“Mira-Bhayander Municipal Corporation,” [https://www.mbmc.gov.in/master\_c/important\_information](https://www.mbmc.gov.in/master_c/important_information) 2020. 25. [25]. R. S. Yadav, “Data analysis of COVID-2019 epidemic using machine learning methods: a case study of India,” International Journal of Information Technology, p. 1–10, 2020. [1]: /embed/graphic-1.gif [2]: /embed/graphic-2.gif [3]: /embed/graphic-3.gif [4]: /embed/graphic-4.gif [5]: /embed/graphic-5.gif [6]: /embed/graphic-6.gif [7]: /embed/graphic-7.gif [8]: /embed/graphic-8.gif [9]: /embed/graphic-9.gif [10]: /embed/graphic-10.gif [11]: /embed/inline-graphic-1.gif [12]: /embed/graphic-11.gif [13]: /embed/inline-graphic-2.gif [14]: /embed/graphic-13.gif [15]: /embed/graphic-14.gif [16]: /embed/graphic-15.gif [17]: /embed/graphic-16.gif [18]: /embed/graphic-17.gif [19]: /embed/inline-graphic-3.gif [20]: /embed/inline-graphic-4.gif [21]: /embed/graphic-18.gif [22]: /embed/graphic-19.gif [23]: /embed/graphic-20.gif [24]: /embed/graphic-21.gif [25]: /embed/graphic-22.gif [26]: /embed/graphic-23.gif [27]: /embed/graphic-24.gif [28]: /embed/graphic-25.gif [29]: /embed/graphic-26.gif [30]: /embed/graphic-27.gif [31]: /embed/graphic-28.gif