Abstract
Italy was the first Western country to be seriously affected by COVID-19, and the first to implement drastic measures, which have successfully curtailed the epidemic. To understand which contain measures effected disease dynamics, we estimate change points in COVID-19 dynamics by fitting a compartmental model to official Italian data. Our results indicate that lockdowns managed to cause the epidemic to peak in late March 2020. Surprisingly, we found a change point during the decay from the peak, which does not correspond to obvious drastic legal interventions, but may be explained by widespread promotion and mandatory use of face masks. We confirm these interpretations at regional levels, and find that the gradual reopening of society since early May has caused no change in disease dynamics. We speculate that widespread use of face masks and other protective means has contributed substantially to keeping the number of new Italian COVID-19 cases under control in spite of society turning towards a new normality.
Introduction
The COVID-19 disease due to the SARS-CoV-2 coronavirus is spreading rapidly across the globe since its outbreak in China, and was declared a pandemic by the WHO on March 11, 2020. After the first severe patient was brought to the hospital of Codogno, Italy on February 20, 2020, and subsequently tested positive for COVID-19, a rapidly increasing number of patients have been identified, initially in Northern Italy and later in the rest of the country and Europe. Italy is one of the most affected European country, with ∼240.000 confirmed cases and nearly COVID-19 related deaths, and was the first to implement drastic contain measures. The imposed restrictions, culminating with complete lockdowns, have turned out to be effective in controlling the epidemic in Italy; the number of new daily cases peaked in late March 2020 at ∼6000 and declined to ∼200 − 300 by early June. The limitations in activities were followed by milder orders and direct invitations to behavioural change, such as distribution of face masks accompanied by their mandatory use, first in the most hit regions and later nationwide. During the month of May 2020, the country reopened many activities without compromising the decay in the number of newly infected individuals. Analyzing the Italian data carefully may therefore provide important insights into the epidemiology of COVID-19, and in particular to investigate if, how and which limitations in activities and other actions affected the disease dynamics in Italy.
Effective measures slow diffusion of the disease. By identifying such change points in COVID-19 spreading [1], it is therefore possible to associate interventions that were able to modify the course of the epidemic without assuming any effect a priori. Further, such an approach may reveal wether e.g. reopening of society lead to changes in disease dynamics, or could hint at change points apparently unrelated to regulations that deserve further investigations. To find such change points, it is advantageous to use relatively simple mathematical models of infectious diseases, which compared to more complex models, can be fitted to data with a minimum number of assumptions on model parameters [1–3]. However, even simple models should respect that nature of the data. There is thus a compromise between using a parsimonious model but sufficiently complex to be based on correct underlying assumptions.
In our setting, to fit the data on identified COVID-19 cases, a SIQR (susceptible – infectious – quarantined – recovered) model [4] is appropriate. In this model, infected individuals may be isolated, entering the “quarantined” subpopulation Q, so that these individuals no longer transmit the disease. Since Italian positive cases have been put in isolation (in hospitals or at home) immediately, the revealed data of active cases thus correspond to the number of individuals in state Q.
By fitting a modified SIQR model to official Italian data, we find change points that correspond well with general lockdowns. Our results indicate further that the mild restrictions imposed in Italy during the first two weeks of the outbreak had negligible effects on the disease dynamics. Surprisingly, we found an acceleration in the decay from the peak, which does not correspond to obvious drastic legal interventions, but may be explained by widespread promotion and mandatory use in face mask use. We confirm these interpretations at regional levels, and find that the gradual reopening of society since early May appears not to have caused any change in the disease dynamics. To be best of our knowledge, this is the first demonstration of a correlation between widespread face mask use and a reduction in COVID-19 transmission dynamics.
Methods
We use a SIQR model [4] to describe COVID-19 dynamics in Italy. Since we will be fitting the cumulative number of official cases, we unite the Q and R compartments in a “cases” compartment C = Q + R. The model equations are
where β is the rate of infection, η models the average rate with which infectious individuals become tested and quarantined, and eventually appear in the official statistics, and α is the rate with which unidentified infectious individuals recover or die from the disease. We do not explicitly model the number of recovered or deceased non-identified COVID-19 patients, but only the rate α with which these patient become non-infectious. Further, N is the total number of individuals in the population, assumed constant since we are studying the early phase of the epidemic. Note that S + I + C≠N. To obtain such a conservation law, an additional compartment RI that model recovered but non-identified patients could be added with dynamics dRI/dt = αI.
Since there is evidence that COVID-19 can be transmitted in the absence of symptoms [5, 6], we do not include an explicit exposed-but-noninfectious (E) state, i.e., we do not consider a SEIQR model [7]. Further, from the Diamond Princess cruiseship and from the Italian village Vo’ Euganeo, it has been found that ∼50% of COVID-19-positive individuals do not develop symptoms [8, 9]. We assume that such positive but asymptomatic individuals can transmit the disease, i.e., the I state includes both individuals that will not develop symptoms, cases that did not develop symptoms yet, and symptomatic patient that still have not been tested positive and isolated.
In spite of the fact that almost 300,000 individuals have been found COVID-19 positive, and possibly a few millions of unidentified cases have occurred, the total number of persons that have had the infection constitute a relatively small fraction of the Italian population of ∼60 million. Thus, it is a reasonable approximation that most of the population is still susceptible, S ≈ N, and hence, as well known, the number of infected individuals follows at any time exponential growth or decay, unless parameters change.
To identify change points, we allow β to be a piecewise constant function of time, thus modelling how contain measures may affect the rate of COVID-19 transmission. We estimate both the time points (change points, T1, T2, T3) where β changes and the values of β = βi in the intervals (Ti−1, Ti] with T0 = 0 (Feb. 21, 2020) and T4 = 122 (June 22, 2020; last data point). The assumption of piecewise constant β is equivalent to I(t) being a piecewise exponential function (under the assumption S ≈ N).
As mentioned, the cumulative number of identified COVID-19 cases corresponds to C. With the above assumptions, we obtain from (2) with S = N and by integrating (3),
where ρi = βi − (α + η), i = 1, 2, 3, and C0 is the initial value of identified cases. We fit this expression to the Italian COVID-19 data from February 22, 2020 through June 18, 2020 (Fig. 1), using the nls function in R [10].
Model fit to national Italian data. A: The cumulative number of confirmed COVID-19 cases (circles) and the best fit (curve) of expression (4) with ηI0 = 66.3/day (SE 11.8/day), T1 = 17.8 days (March 7, 2020; SE 1.5 days), T2 = 28.4 days (March 20, 2020; SE 0.3 days), ρ1 = 0.192/day (SE 0.015/day), ρ2 = 0.104/day (SE 0.008/day), and ρ3 = −0.023/day (SE 0.001/day). The vertical dashed lines indicate the change points T1, T2 and T3. B : The daily number of new confirmed cases on logarithmic scale (circles) and the derivative of the fitted curve in panel A (red lines), i.e., dC/dt = ηI(t). Data from https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series.
We note that the values of η and α are irrelevant for the fitting procedure and the main findings in this manuscript regarding the identification of change points, but they permit us to estimate e.g. the (time varying) basic reproduction number R0 = βi/(α + η) by calculating βi = ρi + α + β. To estimate α and η we use previous findings. The probability that a COVID-19 positive individual is tested and quarantined is δ = η/(η + α). It has been estimated that the average incubation time is ∼5 days [11, 12] and the duration of the milder cases of disease it 5-10 days [5]. Identified cases are mostly symptomatic patients, which we assume are tested and isolated a few days after the incubation time is over and first symptoms appear, i.e., after ∼10 days. We also assume an average time of duration from infection to recovery or death of non-isolated cases of 10 days, i.e., on average infectious individuals are removed from compartment I with rate 0.1/day, either because they recover (milder cases) or become tested and enter compartment C. Hence, α + η = 0.1. Since ∼50-75% of the population is asymptomatic [8, 9], but some milder cases may also go unnoticed and not end in isolation, we assume that δ = 1/3 of infectious individuals are eventually tested. We thus set α = 0.067/day and η = 0.033/day.
Results
We obtained an excellent fit to the Italian data of cumulative COVID-19 cases assuming three change-points (Fig. 1A). Our results indicate that the early Italian containment measures introduced in late February (school closings, hygiene indications, etc.) had negligible, if any, effect on the disease dynamics. The first change points T1 was estimated to fall on March 10, 2020, corresponding very well to the lockdown of the Northern regions on March 8, 2020, which was followed by complete lockdown of Italy within a few days. These interventions lowered the growth rate ρ by approximately one third. However, only later around March 20, 2020, did the growth rate become negative, and the number of infected individuals started to decline. This corresponds reasonably with the more stringent control measures and closure of all non-essential work places introduces around this date. Indeed March 20, 2020, was the last work day before the wider lockdown of all non-essential companies. Another reasonable interpretation is that the two change points reflect a gradual change in dynamics caused by a distributed, delayed effect of the major lockdown of March 8, as discussed further in the Discussion. A third change point was found at April 20, 2020 where the decay was further accelerated.
From the plot of the cumulative number of cases it may be hard to see the abrupt changes in dynamics. In particular, the assumption of piecewise constant ρ (and β) – and hence exponential growth or decay of the number of infectious individuals in each subinterval defined by the change points – is difficult to verify graphically from this figure. Therefore, we plotted the daily number of new cases on logarithmic scale (Fig. 1B) with the derivative of the fit from Fig. 1A. Indeed, straight lines, corresponding to exponential growth or decay in I(t) = (dC/dt)/η, are easily identified, justifying the assumption of piecewise constant parameters.
The latter change point identified at April 20, 2020, is apparently not related to any specific containment measures. We speculated that the introduction of mandatory face mask use also outdoors in Lombardy (from April 4, 2020) and Veneto (from April 13, 2020) might be the cause of the acceleration of the decline occurring from this date. In addition, other hard-hit regions such as Tuscany, Emilia-Romagna, Piedmont and Liguria had distributed free face masks during April, encouraged their use, and some municipalities had made them mandatory in public spaces. This promotion of face masks anticipated mandatory use from April 20, 2020 in Tuscany and from May 4, 2020 nationwide. We thus proceeded to fit the data from these six regions with the largest number of official COVID-19 cases. For Lombardy and Veneto, we were able to identify three change points as for the agglomerated Italian data, whereas for the other four regions only two change points were identified, likely due to the relatively low number of cases in these regions during the first week of March.
We found that all regions had a change point in late March (Lombardy and Veneto had two, similar to what we found for the national data) and another in late April. The change in dynamics at this latter change point was very mild in Lombardy, but in other regions it was very strong and clearly seen in the raw data (Fig. 2). Lombardy introduced mandatory face mask use on April 4, 2020, very close to the epidemic peak where the regional health care system was close to a collapse. The data may therefore be unreliable near the peak, masking the effect of face mask use, or the effect of face masks may be hidden behind the other regional strict containment measures, lockdown and high level of alert in the region. For Veneto, the introduction of the lockdowns in March, which correspond well to the first two identified change points, stabilized the number of daily new cases at ∼400 with a small decline until the change point found at April 20. The number of new cases then shown a marked exponential decline. The identified change point corresponds well to the mandatory use of face masks from April 13 in Veneto.
The daily number of new confirmed cases in the six Italian regions with most COVID-19 cases, corresponding to Fig. 1B. For Emilia-Romagna, Piedmont, Tuscany and Liguria only two change points were identifiable. Data from https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series.
For Tuscany and Piedmont, we found a similar pattern with a near-constant plateau during late March – early April followed by exponential decline. The corresponding latter change points were, respectively, April 14 and April 19. For Tuscany, the date corresponds well with regional order of April 6, 2020, regarding the distribution of face masks in Tuscany from April 7, anticipating mandatory use in single municipalities once the distribution had completed, and from April 20 in the entire Region. Piedmont did not require face masks until May 4, 2020, but the regional government announced April 15 that masks would become mandatory and started their distribution soon after. Similar patterns and explanations hold for Emilia-Romagna (change point April 28) and Liguria (April 26).
Our chosen values for α and η yield an initial infection rate β1 = 0.294/day and consequently the basic reproduction number is estimated to be R0 = β/(α + η) = 2.94, in line with previous estimates of R0 falling between 2 and 4 [11,13–16]. Further, we obtain an estimate of the number of infectious individual at the moment of the outbreak of I0 = 65.0/0.033 ≈ 2000, although with a large uncertainty because of the large standard error on the estimate of ηI0, and because the calculated I0 obviously depends on the value of η. Based on the dynamics before the outbreak (δ = 0), we can estimate that the first infectious case appeared in Italy log(I0)/(β1 − 0.1) ≈ 40 days before the outbreak, i.e., around January 12, 2020.
Discussion and Conclusion
Our data fitting procedure identified three change points where the dynamics of the COVID-19 epidemic changed. The first two correspond with the major lockdowns imposed in Italy during March 2020. We therefore conclude that these were effective in halting the spreading of the disease, which lead to the peak in new cases seen in late March. More surprisingly and in contrast to our expectations, we found a change point around April 20, which at first glance did not correspond to any specific interventions. The major new containment measure introduced in April were the orders of mandatory face mask use in Lombardy, Veneto and Tuscany, and the distribution of free face masks in many regions. Analyzing regional data, we were able to distinguish change points for each region that correspond well to the introduction of face mask distributions or orders in the individual regions. Face masks have been suggested to be important mainly for reducing COVID-19 transmission from asymptomatic individuals [17] that may be responsible for the major number of newly infected cases due to the large fraction of hidden-to-quarantined SARS-CoV-2 positive individuals (∼10-fold difference) [17–19].
We considered alternative explanations for the acceleration of the decline of new cases seen in late April. The number of COVID-19 tests did not decline in correspondence to the identified change point, and the fraction of positive tests showed an acceleration in the decline similar to the number of new cases. Mobility data (https://www.google.com/covid19/mobility/) showed, if anything, increased activity in late compared to early April, excluding that reduced activity underlie the change. The weather was mild and dry in April in Italy (https://www.3bmeteo.com), with little variation in temperature during the month. Virtually no rain fell during the entire month until April 27. Thus, no abrupt change in weather was seen around mid-April, which might have been the cause of the change in dynamics. In summary, no obvious alternative explanations for the acceleration of the end of the epidemic were found.
Our model assumes that the rate of infection changes instantaneously, which is not completely realistic, but simplifies the model. We verified (not shown) that assuming that the change in growth rate occurred in a linear fashion over 2, 5 or 10 days [1] did not change the conclusions. The more graduate change in dynamics smoothened the curve, which made it harder to distinguish the two first change points. The data was well fitted with a first change point shortly after the lockdown on March 8 and a second one around April 15.
We estimated that there were ∼2000 of infectious but undetected individuals in Italy at the time of the outbreak around February 21, 2020. Only when a patient with severe symptoms was hospitalized and tested for COVID-19, and the first infected person died from COVID-19 on the following day, wide testing and isolation efforts started. By backward interpolation, we estimated that the first infectious individual (“patient zero”) appear in Italy around January 12, 2020. This estimate suggests that COVID-19 was present in Italy even earlier than a presumed patient zero, suggested to be a German citizen linked to a cluster in Munich [6] visiting Italy around January 25, 2020 [20].
In conclusion, identifying change points confirmed that strict lockdown measures were infective in slowing the spread of the epidemic, leading to its peak in late March. Face mask use appeared as a plausible explanation for the further acceleration of the decline in the number of new cases in April. The reopening of society in May did not lead to change in the decay rate (Fig. 1), also when analyzing the individual regions (Fig. 2), which may be due to the mandatory use of face masks nationwide from May 4. Our results thus lend further support to the importance of face mask use for controlling COVID-19 during the reopening of societies [17, 21, 22].
Data Availability
All data are publicly available. Links are provided in the manuscript.
https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series
Funding
this work was supported by MIUR (Italian Minister for Education) under the initiative “Departments of Excellence” (Law 232/2016).
Authors contributions
MGP and MM conceived research and discussed all results. MGP developed the theoretical framework and performed parameter estimation. MM developed the simulation framework and compared real data to model results. MGP wrote the paper. MM commented drafts and approved the final version of the paper.
Competing interests
The authors declare to have no competing interests.