Predicting COVID-19 peaks around the world ========================================== * Constantino Tsallis * Ugur Tirnakli ## Abstract The official data for the time evolution of active cases of COVID-19 pandemics around the world are available online. For all countries, a peak has been either observed (China and South Korea) or is expected in near future. The approximate dates and heights of those peaks imply in important epidemiological issues. Inspired by similar complex behaviour of volumes of transactions of stocks at NYSE and NASDAQ, we propose a *q*-statistical functional form which appears to describe satisfactorily the available data of all countries. Consistently, predictions become possible of the dates and heights of those peaks in severely affected countries unless efficient treatments or vaccines, or sensible modifications of the adopted epidemiological strategies, emerge. It is possible to predict the thermostatistical properties of uncountable physical systems at thermal equilibrium through the one-body distribution ![Graphic][1], where *ω*(*ε*) is the density of states as a function of the energy ε, multiplied by the celebrated Boltzmann factor *e*−*βε*, *β* being the inverse temperature. The function *ω*(*ε*) comes from mechanical considerations (classical, quantum, relativistic) related with the number of degrees of freedom, and does not depend on the temperature; the exponential weight comes instead from standard statistical-mechanical considerations. In many cases it is, either exactly or approximatively, ![Graphic][2]. For thermostatistical properties of the stationary- or quasi-stationary-state of wide classes of complex systems, the Boltzmann factor is to be generalised into the *q*-exponential factor ![Graphic][3] [1–3]. This procedure yielded quite satisfactory results for the high-frequency stock-market in NYSE, NASDAQ and elsewhere [4–6]. Let us focus now on the data available for the COVID-19 pandemics. Briefly after the beginning of the pandemics, several studies analysing the available data, and employing different models and candidate functions, started to appear in the literature [7–12]. Most of them are interested in the behaviour of total cases and fatality curves. We will concentrate here on the analysis of the active cases and deaths *per day*. The inspection of the public data [13] (updated on a daily basis), in particular of the time evolution of the number *N* of active cases (surely a lower bound of the unknown actual numbers) showed a rather intriguing similarity with the distributions of volumes of stocks. Along this line we adopt the following functional form for each country or region: ![Formula][4] with *C* > 0; *α* > 0; *β* > 0; *γ* > 0, *q* > 1 and *t* ≥ 0. The constant *t* indicates the first day of appearance of the epidemics in that particular region; it is conventionally chosen to be zero for China; for the other countries it is the number of days elapsed between the appearance of the first case in China and the first case in that country. The normalising constant *C* reflects the total population of that particular country. For *α* = 0, if *γ* = 1 we recover the standard *q*-exponential expression; if *γ* = 2, it is currently referred in the literature as *q*-Gaussian; for other values of *γ*, it is referred to as stretched *q*-exponential. Through the inspection of the roles played by the four nontrivial parameters, namely (*α*, *β*, *γ*, *q*), it became rather transparent that (*α*, *β*) depend strongly on the epidemiological strategy implemented in that region, in addition to the biological behaviour of the coronavirus in that geographical climate. In contrast, the parameters (*γ*, *q*) appear to be sort of universal, mainly depending on the coronavirus. Therefore, we investigate several countries, which do not reach their peaks yet, with the basic assumption that these two parameter would not change much from one country to another and we fixed these values at the values that we determine for China since this country has already had nearly the full evolution of the pandemics. This assumption seems to be working. For other contries whose peaks are already reached, we use the same functional form (1) but adjusting all parameters for a better fitting. The results of China and South Korea are given in Fig. 1. It is evident that, although the functional form (1) does yield satisfactory results for both China and South Korea, the (*γ*, *q*) parameter values differ somewhat for each of these countries. On the other hand, as can be easily seen from Fig. 2, our assumption is corroborated by several countries that we have numerically analysed. In Table 1 we present the forecasted dates and heights of the peaks, as well as the values of the fitting parameters using the data accumulated until 28 of April, 2020. ![FIG. 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/04/2020.04.24.20078154/F1.medium.gif) [FIG. 1:](http://medrxiv.org/content/early/2020/05/04/2020.04.24.20078154/F1) FIG. 1: Two possible fits for the COVID-19 active cases evolution in (a) China and (b) South Korea: the dots are the available data at 28 April 2020, and the lines are fittings using Eq. (1). For China a strange kink is present in the ascending part of the data curves, which makes that it is not possible to make a single fitting which would satisfactorily account for both increasing and decreasing parts. We present here two log-log representation of different curves which describe either the increasing part or the decreasing part but not both. Inset: linear-linear representation of the same data. ![FIG. 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/04/2020.04.24.20078154/F2.medium.gif) [FIG. 2:](http://medrxiv.org/content/early/2020/05/04/2020.04.24.20078154/F2) FIG. 2: Fittings of the active cases data available in 28 April 2020 for various severely a ected countries around the world with Eq. (1). The fitting parameters *γ* and *q* are fixed at the values of China for all countries which do not reach their peak values yet. Notice that, in the case of Brazil, a disruption occurred in the publicly available data in middle April. We do not know the cause of this. Coincidentally, however, precisely in those days the President of Brazil decided to change his Minister for Health. View this table: [TABLE I:](http://medrxiv.org/content/early/2020/05/04/2020.04.24.20078154/T1) TABLE I: The values of the parameters and predictions for the maximum number of active cases and for the day at which the maximum will be achieved. The second line in the column of prediction interval of active cases peak height shows the same data divided by the total population of the country. We also test our formula (1) and assumption for the evolution of deaths per day. Again we fixed the (*γ*, *q*) parameter values at China’s results, which can be seen in Fig. 3, and tried to fit the data of the same 6 countries. It is quite surprising to see that the results given in Fig. 4 seem to suggest that the evolution of death per day is also possible to be fitted without changing these two parameters. ![FIG. 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/04/2020.04.24.20078154/F3.medium.gif) [FIG. 3:](http://medrxiv.org/content/early/2020/05/04/2020.04.24.20078154/F3) FIG. 3: Evolution of deaths per day due to COVID-19 in China. The data are those available at 28 April 2020. The fitting parameters *γ* and *q* are fixed at one of the two choices of the values for China, namely (*γ*, *q*) = (3, 1.26). The other fitting parameters, namely (*Cdeath*, *αdeath*, *βdeath*), have been chosen to better fit the available data for China. ![FIG. 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/04/2020.04.24.20078154/F4.medium.gif) [FIG. 4:](http://medrxiv.org/content/early/2020/05/04/2020.04.24.20078154/F4) FIG. 4: Fittings of deaths per day data for various strongly a ected countries around the world with Eq. (1). The data are those available at 28 April 2020. The fitting parameters *γ* and *q* are fixed at one of the two choices of the values for China, namely (*γ*, *q*) = (3, 1.26). The other fitting parameters, namely (*C*death, *α*death, *β*death), have been chosen to better fit the available data for each country We may summarise as follows. The death curve of South Korea is atypical, in the sense that it sensibly differs from all those that we analysed in the present work. Because of that, we have not included it here. We remind that the values for (*γ*, *q*) used for the South Korea evolution curve of active cases also differ from those used for all the other countries, which reinforces the fact that some sort of exceptionality exists there, for reasons that are unknown to us. For all countries that do not reach their peak values yet, we have adopted the values of (*γ*, *q*) obtained from the inspection of the entire curve of China. We verify that the present work appears to belong to the realm of complex systems which includes not only, as mentioned above, high-frequency financial transactions (with *α* > 0 [4, 5], but also anthropological issues such as medieval trading networks and biotech intercorporate networks (with *α* < 0) [14], relaxation in spin-glasses [15], as well as q-Weibull distribution-like systems [16–19], which correspond to the particular case = −1. It remains open as a highly desirable goal to formulate a model which, along lines somewhat similar to epidemiological models such as the SIR one, would predict an evolution curve such as the present Eq. (1). This would clarify important issues such as the conditions under which the values of (*γ*, *q*) could indeed be universal and essentially determined by the biology of the infecting agent, such as the present COVID-19. ## Data Availability Publicly available datasets were analyzed in this study. ## Acknowledgments We have benefitted from useful remarks by E.P. Borges and partial support by CNPq and Faperj (Brazilian Agencies). * Received April 24, 2020. * Revision received April 30, 2020. * Accepted May 4, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. [1]. C. Tsallis, Possible generalization of Boltzmann-Gibbs statistics, J. Stat. Phys. 52, 479 (1988). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF01016429&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1988Q240100029&link_type=ISI) 2. [2]. M. Gell-Mann and C. Tsallis, eds., Nonextensive Entropy - Interdisciplinary Applications, (Oxford University Press, New York, 2004). 3. [3]. C. Tsallis, Introduction to Nonextensive Statistical Mechanics - Approaching a Complex World (Springer, New York, 2009). 4. [4]. C. Tsallis, C. Anteneodo, L. Borland and R. Osorio, Nonextensive statistical mechanics and economics, Physica A 324, 89 (2003). 5. [5]. R. Osorio, L. Borland and C. Tsallis, Distributions of high-frequency stock-market observables, in [2]. 6. [6]. G. Ruiz and A.F. de Marcos, Evidence for criticality in financial data, Eur. Phys. J. B 91, 1 (2018). 7. [7]. V. Zlatic et al., Bi-stability of SUDR+K model of epidemics and test kits applied to COVID-19, arXiv:2003.08479, (2020). year = ”2020”, 8. [8]. G. L. Vasconcelos et al., Modelling fatality curves of COVID-19 and the effectiveness of intervention strategies, doi = [https://doi.org/10.1101/2020.04.02.20051557](https://doi.org/10.1101/2020.04.02.20051557), (2020), preprint. 9. [9]. E. L. Piccolomini and F. Zama, Preliminary analysis of COVID-19 spread in Italy with an adaptive SEIRD model, arXiv:2003.09909, (2020). 10. [10]. M. Perc et al., Forecasting COVID-19, Front. Phys. 8, 127 (2020). 11. [11]. A. Pluchino et al., A Novel Methodology for Epidemic Risk Assessment: the case of COVID-19 outbreak in Italy, arXiv:2004.02739, (2020). 12. [12]. S. B. Bastos and D. O. Cajueiro, Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil, arXiv:2003.14288v2, (2020). 13. [13].[https://www.worldometers.info/coronavirus/#countries](https://www.worldometers.info/coronavirus/#countries) 14. [14]. D.R. White, N. Kejzar, C. Tsallis, D. Farmer and S. White, A generative model for feedback networks, Phys. Rev. E 73, 016119 (2006). 15. [15]. R.M. Pickup, R. Cywinski, C. Pappas, B. Farago and P. Fouquet, Generalized spin glass relaxation, Phys. Rev. Lett. 102, 097202 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1103/PhysRevLett.102.097202&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19392558&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F04%2F2020.04.24.20078154.atom) 16. [16]. S. Picoli Jr.., R.S. Mendes and L.C. Malacarne, *q*-exponential, *Weibull, and q-Weibull distributions: An empirical analysis*, Physica A 324, 678 (2003). 17. [17]. E.M. Assis, E.P. Borges and S.A.B. Vieira de Melo, Generalized q-Weibull model and the bathtub curve, International Journal of Quality & Reliability Management 30, 720 2013. 18. [18]. T.S.V. Podesta, T.V. Rosembach, A.A. Santos and M.L. Martins, Anomalous diffusion and q-Weibull velocity distributions in epithelial cell migration, Plos One (2017). 19. [19]. M. Xu, Using the q-Weibull distribution for reliability engineering modeling and applications, Doctor Thesis (University of Maryland, College Park, 2019). [1]: /embed/inline-graphic-1.gif [2]: /embed/inline-graphic-2.gif [3]: /embed/inline-graphic-3.gif [4]: /embed/graphic-1.gif