SIR-simulation of Corona pandemic dynamics in Europe ==================================================== * Igor Nesteruk ## ABSTRACT The SIR (susceptible-infected-removed) model, statistical approach to the parameter identification and the official WHO daily data about the confirmed cumulative number of cases were used to estimate the characteristics of COVID-19 pandemic in Italy, Spain, Germany, France, Austria and Moldova. The final sizes and durations of epidemic outbreaks in these countries are calculated. Keywords * coronavirus pandemic * epidemic outbreak in Italy * Spain * Germany * France * Austria and Moldova * coronavirus COVID-19 * mathematical modeling of infection diseases * SIR model * parameter identification * statistical methods ## Introduction Here we consider the development of epidemic outbreak in Italy, Spain, Germany, France, Austria and the Republic of Moldova caused by coronavirus COVID-19 (2019-nCoV) (see e.g., [1]). For an epidemic of an infectious disease, the SIR model, connecting the number of susceptible *S*, infected and spreading the infection *I* and removed *R* persons, can be used [2-4]. The unknown parameters of this model can be estimated with the use of the cumulative number of cases *V=I+R* and the statistics-based method of parameter identification developed in [5, 6]. This approach was used in [6-12] to estimate the Corona pandemic dynamics in China, Republic of Korea, Italy, Austria and Ukraine. Usually the number of cases registered during the initial period of an epidemic is not reliable, since many infected persons are not detected. That is why the correct estimations of epidemic parameters can be done with the use of data sets obtained for later periods of the epidemic when the number of detected cases is closer to the real one. This fact necessitates a periodic reassessment of the epidemic’s characteristics and forecasts for its final size and duration. In this paper we will recalculate the pandemic parameters for Italy and Austria, provide estimations for Spain, Germany, France and Moldova, and compare with the further pandemic development. ### Data The official data about the accumulated numbers of confirmed COVID-19 cases *V**j* in Italy, Spain, Germany, France, Austria and in the republic of Moldova from WHO daily situation reports (numbers 33-87), [1] are presented in Tables 1 and 2. The corresponding moments of time *t**j* (measured in days) are also shown in these tables. The data sets presented in Table 1 were used only for comparison with corresponding SIR curves. Table 2 was used for calculations and verifications of predictions. View this table: [Table 1.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/T1) Table 1. Official cumulative numbers of confirmed cases in Italy, Spain, Germany, France, Austria and Moldova during the initial stage of epidemics used only for comparison with SIR curves, [1] View this table: [Table 2.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/T2) Table 2. Official cumulative numbers of confirmed cases in Italy, Spain, Germany, France, Austria and Moldova used for calculations and verifications of predictions, [1] ### SIR model The SIR model for an infectious disease [2-5] relates the number of susceptible persons *S* (persons who are sensitive to the pathogen and **not protected**); the number of infected is *I* (persons who are sick and **spread the infection**; please don’t confuse with the number of still ill persons, so known active cases) and the number of removed *R* (persons who **no longer spread the infection**; this number is the sum of isolated, recovered, dead, and infected people who left the region); *α* and *ρ* are constants. ![Formula][1] ![Formula][2] ![Formula][3] To determine the initial conditions for the set of equations (1–3), let us suppose that at the moment of the epidemic outbreak *t*, [5, 6]: ![Formula][4] The analytical solution for the set of equations (1–3) was obtained by introducing the function *V* (*t*) = *I* (*t*) + *R*(*t*), corresponding to the number of victims or cumulative confirmed number of cases, [5, 6]: ![Formula][5] ![Formula][6] Thus, for every set of parameters *N, ν, α, t* and a fixed value of *V* the integral (6) can be calculated and the corresponding moment of time can be determined from (5). Then functions *I(t)* and *R(t)* can be easily calculated with the of formulas, [5, 6]. ![Formula][7] Function *I* has a maximum at *S* =*ν* and tends to zero at infinity, see [2, 3]. In comparison, the number of susceptible persons at infinity *S*∞ > 0, and can be calculated from the non-linear equation, [5, 6]: ![Formula][8] The final number of victims (final accumulated number of cases) can be calculated from: ![Formula][9] To estimate the duration of an epidemic outbreak, we can use the condition: ![Formula][10] which means that at *t* > *t* *final* less than one person still spread the infection. ### Parameter identification procedure In the case of a new epidemic, the values of this independent four parameters are unknown and must be identified with the use of limited data sets. A statistical approach was developed in [5] and used in [6-12] to estimate the values of unknown parameters. The registered points for the number of victims *V**j* corresponding to the moments of time *t**j* can be used in order to calculate *F*1 *j* = *F*1 (*V**j*, *N,ν*) for every fixed values *N* and *ν* with the use of (6) and then to check how the registered points fit the straight line (5). For this purpose the linear regression can be used, e.g., [13], and the optimal straight line, minimizing the sum of squared distances between registered and theoretical points, can be defined. Thus we can find the optimal values of *α t* and calculate the, correlation coefficient *r*. Then the F-test may be applied to check how the null hypothesis that says that the proposed linear relationship (5) fits the data set. The experimental value of the Fisher function can be calculated with the use of the formula: ![Formula][11] where *n* is the number of observations, *m*=2 is the number of parameters in the regression equation, [13]. The corresponding experimental value *F* has to be compared with the critical value *F**C* (*k*1, *k*2) of the Fisher function at a desired significance or confidence level (*k*1 = *m* −1, *k*2 = *n* − *m*), [14]. When the values *n* and *m* are fixed, the maximum of the Fisher function coincides with the maximum of the correlation coefficient. Therefore, to find the optimal values of parameters *N* and *ν*, we have to find the maximum of the correlation coefficient. To compare the reliability of different predictions (with different values of *n*) it is useful to use the ratio *F* / *F**C* (1, *n* − 2) at fixed significance level, [15]. We will use the level 0.001; corresponding values *F**C* (1, *n* − 2) can be taken from [14]. The most reliable prediction yields the highest *F* / *F**C* (1, *n* − 2) ratio. ## Results Usually the number of cases during the initial period of an epidemic outbreak is not reliable. To avoid their influence on the results, only *V**j* values for the period March 28 – April 10, 2020 (35 ≤ *t* *j* ≤ 48 ; *n*=14; *F**C* (1, *n* − 2) =18.6; see Table 2) were used to calculate the epidemic characteristics in every country. Since during the quarantine, the international people exchange is quite limited, we can apply the SIR model for every country assuming its parameters to be constant (but different for every country) during the fixed period of time. The results of calculations are shown in Tables 3 and 4. To illustrate the influence of data on the results of SIR simulations, other time periods: April 5 – 18, 2020 (43 ≤ *t* *j* ≤ 56 ; *n*=14; *F**C* (1, *n* − 2) =18.6; France, Moldova) and March 29-April 18 (36 ≤ *t* *j* ≤ 56 ; *n*=21; *F**C* (1, *n* − 2) =15.2; Italy, Spain) were used for second series of calculations for France and Moldova. The results are shown in Table 5. View this table: [Table 3.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/T3) Table 3. Epidemic characteristics for Italy, Spain and Germany. Optimal values of parameters, final sizes and durations (last two rows). View this table: [Table 4.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/T4) Table 4. Epidemic characteristics for France, Austria and the Republic of Moldova. Optimal values of parameters, final sizes and durations (last two rows). View this table: [Table 5.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/T5) Table 5. Results of second series of calculations with the use data from period April 5 – 18, 2020 for France and the Republic of Moldova, and March 29-April 18, 2020 for Italy and Spain. Optimal values of parameters, final sizes and durations (last two rows). The SIR curves and markers representing the *V**j* values taken for calculations (“circles”); for comparisons (“triangles”) and verifications of predictions (“stars”) are shown in Figs. 1-6. For Italy, Spain, France and Moldova, the second sets of optimal parameters (from Table 5) were used to calculate SIR curves. ![Fig. 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F1.medium.gif) [Fig. 1.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F1) Fig. 1. Italy, prediction 6: SIR curves (lines) and accumulated number of cases (markers) versus time. Numbers of infected *I* (green), removed *R* (black) and victims *V=I+R* (blue line). It can be seen that the previous predictions for Italy and Austria [11] were more optimistic. Fresh data sets has showed that the final number of cases in Italy could reach 226,000 and their appearance can stop only after August 18, 2020 (see Table 5, prediction 6). The epidemic stop in Austria is expected after May 21, 2020 (see Table 4). These estimations are valid only when the quarantine measures, isolation rate and the coronavisus activity will be same as for the periods taken for calculations. Tables 3-5 and Figs. 1, 2 illustrate that the estimations of parameter *t*** values are very different from the results published in [11]. In particular, according to the prediction in Italy the first COVID-19 cases could happen even after November 27, 2019. This results correlates with the information form Giuseppe Remuzzi, director of the Mario Negri Institute for Pharmacological Research that “virus was circulating before we were aware of the outbreak in China”, [16]. Table 5 and Fig. 3 show that the epidemic outbreak in France could happen around January 25, 2020. This estimation correlates with the results of paper [17], where the first COVID-19 cases with 5 Chinese tourists are described. They were from Wuhan and arrived in Europe on January 17-22, 2020. ![Fig. 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F2.medium.gif) [Fig. 2.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F2) Fig. 2. Austria, prediction 4: SIR curves (lines) and accumulated number of cases (markers) versus time. Numbers of infected *I* (green), removed *R* (black) and victims *V=I+R* (blue). ![Fig. 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F3.medium.gif) [Fig. 3.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F3) Fig. 3. France, prediction 2: SIR curves (lines) and accumulated number of cases (markers) versus time. Numbers of infected *I* (green), removed *R* (black) and victims *V=I+R* (blue). Fig. 4 illustrates that even second prediction for Spain looks too optimistic (“stars” deviate from the blue line more than for other countries). The second prediction for this country has lower value of *F* / *F**C* (1, *n* − 2) in comparison with the first one (see Tables 3 and 5), but for the first prediction the deviations are even larger. Probably the final size and the duration of the epidemic in Spain have to be re-estimated after obtaining new data. ![Fig. 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F4.medium.gif) [Fig. 4.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F4) Fig. 4. Spain, prediction 2: SIR curves (lines) and accumulated number of cases (markers) versus time. Numbers of infected *I* (green), removed *R* (black) and victims *V=I+R* (blue). The *V* and *R* curves are very close in Fig. 5. It means that the time of spreading infection in Germany is very short (in comparison, for example, with Moldova, see Fig. 6). SIR model allows calculating the average time of spreading infection 1/ *ρ*. For Germany this value could be estimated as 0.54 days; for Austria and France approximately 0.8 days; for Italy and Spain – 1.5 days and 3.2 days for the Republic of Moldova (see Table 3-5). By comparison, in South Korea this time was approximately 4.3 hours, [8]. ![Fig. 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F5.medium.gif) [Fig. 5.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F5) Fig. 5. Germany: SIR curves (lines) and accumulated number of cases (markers) versus time Numbers of infected *I* (green), removed *R* (black) and the number of victims *V=I+R* (blue line). ![Fig. 6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/04/25/2020.04.22.20075135/F6.medium.gif) [Fig. 6.](http://medrxiv.org/content/early/2020/04/25/2020.04.22.20075135/F6) Fig. 6. Moldova, prediction 2: SIR curves (lines) and accumulated number of cases (markers) versus time. Numbers of infected *I* (green), removed *R* (black) and victims *V=I+R* (blue line). ## Discussion Figs. 1-6 illustrate that close to the epidemic outbreaks, the information about number of cases is not complete, since it is difficult to detect all the infected persons, especially those with mild illness. As a result, SIR simulation using data sets from the initial epidemic period has limited accuracy, and predictions of final size and duration are too optimistic (see [10-12]). The data quality may be very different and unpredictable. The only way to obtain the reliable results is to compare the *V(t)* curve with *V**j* data obtained after day of calculations. If the discrepancy after some days of observation is too large, new calculations must be performed with the use of fresh data. “Stars” in Figs. 1-3, 5,6 illustrate that the accuracy of calculations is good enough. Probably, the prediction for Spain is too optimistic and can be updated later. It looks, that SIR model can determine the real time of epidemic outbreak. Due to the data incompleteness, the “hidden” period could be very long. For example, the real epidemic outbreaks in Italy and Spain probably happened in November, 2019 and January, 2020 respectively (see Table 5 and Figs. 1, 4). There is no reason to think that the “hidden” period in China was shorter. According to [18], the first laboratory confirmed case was recorded on December 8, 2019. Therefore the real “zero” patient could get infection in October-November, 2019 and probably had no connection with the Wuhan fish market. Chinese authorities notified WHO about the epidemic outbreak only on January 3, 2020 (see [18]). This delay and obstacles to the dissemination of truthful information (an example is doctor Li Wenliang) has led to tragic consequences in Europe and other parts of the world. SIR curves could be useful to estimate the number of persons who are still spreading the infection, so known “hidden” patients (see green lines in Figs. 1-6). These information could be useful to plan the relaxation of quarantine measures. If medical and other services are able to quickly isolate persons who have contacted, for example, with 100 “hidden” patients, then there is no need to wait until the number of patients on the green curve reaches the value of unity. On the other hand, such attenuations may extend the time of occurrence of new cases and the number of deaths. A compromise between medicine and economic interests must be found here. ## Conclusions The SIR (susceptible-infected-removed) model and statistical approach to the parameter are able to make some reliable estimations for the epidemic dynamics, e.g., the real time of the outbreak, final size and duration of the epidemic and the number of persons spreading the infection versus time. This information may be useful to regulate the quarantine activities and to predict the medical and economic consequences of the pandemic. Unfortunately, the number of patients in Europe already exceeds one million. The risk of catching the infection will persist until at least mid-August, 2020. Such fatal consequences could have been avoided if timely and truthful information came from China. ## Data Availability Data is in text ## Acknowledgements I would like to express my sincere thanks to Gerhard Demelmair and Ihor Kudybyn for their help in collecting and processing data. ## Footnotes * *To Wuhan doctor Li Wenliang, who despite the power ban warned of the danger and was killed by coronavirus* * Received April 22, 2020. * Revision received April 22, 2020. * Accepted April 25, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.World Health Organization. “Coronavirus disease (COVID-2019) situation reports”. [https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/](https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/). 2. 2.Kermack. W. O. & McKendrick. A. G. “A contribution to the mathematical theory of epidemics.” Proceedings of the Royal Society. Ser. A. vol. 115. pp. 700–721. 1927. 3. 3.Murray. J. D. Mathematical biology. 3rd ed. 2 v. New York : Springer. 2002–2003. 4. 4.Langemann. D.. Nesteruk. I. & Prestin. J. “Comparison of mathematical models for the dynamics of the Chernivtsi children disease.” Mathematics in computers and simulation. vol. 123. pp. 68–79. 2016. doi:10.1016/j.matcom.2016.01.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.matcom.2016.01.003&link_type=DOI) 5. 5.Nesteruk. I. “Statistics based models for the dynamics of Chernivtsi children disease.” AMMODIT Conference. Kyiv. Ukraine. January 2017. Naukovi visti NTUU KPI. 2017. no. 5. pp. 26–34. doi:10.20535/1810-0546.2017.5.108577. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.20535/1810-0546.2017.5.108577&link_type=DOI) 6. 6.Nesteruk, I. “Statistics-based predictions of coronavirus epidemic spreading in mainland China.” Innovative biosystems and bioengineering. vol. 4, no. 1, pp. 13–18, 2020. doi:10.20535/ibb.2020.4.1.195074. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.20535/ibb.2020.4.1.195074&link_type=DOI) 7. 7.Nesteruk, I. “Characteristics of coronavirus epidemic in mainland China estimated with the use of official data available after February 12, 2020.” [Preprint.] ResearchGate. 2020 Mar. doi:10.13140/RG.2.2.19667.32804. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.19667.32804&link_type=DOI) 8. 8.Nesteruk, I. “Estimations of the coronavirus epidemic dynamics in South Korea with the use of SIR model” [Preprint.] ResearchGate. 2020 Mar. doi: 10.13140/RG.2.2.15489.40807. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.15489.40807&link_type=DOI) 9. 9.Nesteruk, I. “Comparison of the coronavirus epidemic dynamics in Italy and mainland China” [Preprint.] ResearchGate. 2020 March. doi:10.13140/RG.2.2.19152.87049. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.19152.87049&link_type=DOI) 10. 10.Nesteruk, I. “Stabilization of the coronavirus pandemic in Italy and global prospects” [Preprint.] ResearchGate. 2020 March. doi: 10.13140/RG.2.2.13832.98561 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.13832.98561&link_type=DOI) 11. 11.Nesteruk, I. “Long-term predictions for COVID-19 pandemic dynamics in Ukraine, Austria and Italy” [Preprint.] MEDRXIV, 2020 Apr. doi: 10.13140/RG.2.2.31170.53448 [https://www.medrxiv.org/content/10.1101/2020.04.08.20058123v1](https://www.medrxiv.org/content/10.1101/2020.04.08.20058123v1) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.31170.53448&link_type=DOI) 12. 12.Nesteruk, I. “Як довго українці сидітиMуть на карантині? How long will the Ukrainians stay in quarantine?” (in Ukrainian) [Preprint.] ResearchGate. 2020 April. doi: 10.13140/RG.2.2.15732.71046 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.13140/RG.2.2.15732.71046&link_type=DOI) 13. 13. N.R. Draper and H. Smith. Applied Regression Analysis (3rd ed.). John Wiley. 1998. 14. 14.[https://onlinepubs.trb.org/onlinepubs/nchrp/cd-22/manual/v2appendixc.pdf](https://onlinepubs.trb.org/onlinepubs/nchrp/cd-22/manual/v2appendixc.pdf) 15. 15. I. Nesteruk. “Maximal speed of underwater locomotion”. Innov Biosyst Bioeng. 2019. vol. 3. no. 3. pp. 152–167. Doi: [https://doi.org/10.20535/ibb.2019.3.3.177976](https://doi.org/10.20535/ibb.2019.3.3.177976) 16. 16.[https://www.scmp.com/news/china/society/article/3076334/coronavirus-strange-pneumonia-seen-lombardy-november-leading](https://www.scmp.com/news/china/society/article/3076334/coronavirus-strange-pneumonia-seen-lombardy-november-leading) 17. 17. F.-X. Lescure et al. Clinical and virological data of the first cases of COVID-19 in Europe: a case series. [www.thelancet.com/infection](http://www.thelancet.com/infection) Published online March 27, 2020 [https://doi.org/10.1016/S1473-3099(20)30200-0](https://doi.org/10.1016/S1473-3099(20)30200-0) 18. 18. Qun Li, Xuhua Guan, Peng Wu et al. Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia. The new england journal of medicine. January 29, 2020 DOI: 10.1056/NEJMoa2001316 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2001316&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31995857&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F04%2F25%2F2020.04.22.20075135.atom) [1]: /embed/graphic-3.gif [2]: /embed/graphic-4.gif [3]: /embed/graphic-5.gif [4]: /embed/graphic-6.gif [5]: /embed/graphic-7.gif [6]: /embed/graphic-8.gif [7]: /embed/graphic-9.gif [8]: /embed/graphic-10.gif [9]: /embed/graphic-11.gif [10]: /embed/graphic-12.gif [11]: /embed/graphic-13.gif