Abstract
The arrival of SARS-COV-2 in late March 2020 in the state of Amazonas, Brazil, captured worldwide attention and concern. The rapid growth of the epidemic, a health system that had collapsed, and mass gravesites for coping with growing numbers of dead, were broadcast by the media around the world. Moreover, a majority of the local Amazonian indigenous communities were physically distant from appropriate medical services, to the point where warnings of genocide were issued. In a recent Science paper (December 2020), Buss et al. reported that some 76% of the residents of the city of Manaus, the capital of Amazonas, had been infected by October 2020. This estimate of the COVID-19 attack rate was based on a seroprevalence analysis of blood donor data, which despite its shortcomings was thought to be a sufficiently reliable proxy of the larger population. An attack rate of this magnitude (76%) implied that herd immunity had already been reached and the community was relatively protected from further infection. Yet in December 2020, a harsh second wave of COVID-19 struck Manaus, and currently appears to be even larger than the first wave. Here we use mathematical modelling of mortality data in Manaus, and in various states of Brazil, to understand why a second wave appeared against all expectations. Our analysis is based on estimating a “flexible” reproductive number R0(t) from the mortality data, as it changes in time over the epidemic.
In December 2020, a vicious second wave of COVID-19 erupted in Manaus, Brazil, resulting in >100 residents dying per day, and triggering yet another collapse in the country’s rundown healthcare system1. Awkwardly, this contradicts the recent Science paper of Buss et al. (2020)2 which suggested that any second wave would be highly unlikely. The authors estimated that the first wave infected ∼76% of the city’s population by October 2020 (see also3). With an Attack Rate (AR)∼76%, it appeared that herd immunity had been reached, and no further epidemic should be expected, assuming reinfection is relatively rare. Currently there are only speculations as to why a major second wave then appeared (Sabino et al, 2021)4. For example, measurements of AR via seroprevalence testing could possibly be inaccurate given the biases of the blood donor data5 used by Buss et al. (2020). It is therefore useful to explore alternative and complementary epidemiological modelling analyses of publicly available mortality datasets to help understand why the first wave of the COVID-19 outbreak suddenly “crashed,” and why a secondary wave later unexpectedly appeared in Manaus in December 2020. Here we discuss and expand on our results presented in Ref 6. Elucidating the underlying processes that took place is of central importance for planning future interventions appropriately, not only for the benefit of Manaus which is currently experiencing the full fury of SARS-CoV-2, but also for numerous other potentially high-risk locations worldwide.
We first mention that it is not uncommon for potentially large-scale epidemic outbreaks to die out well before predictions of conventional epidemiological models, and well before herd-immunity can develop7. For example, the Ebola outbreak in Sierra Leone Guinea 20148 began with an initially susceptible population of 5,364,000 residents. While a conventional SEIR-type model predicted 5,157,000 Ebola cases6, the epidemic resulted in “only” 14,124 infected individuals. We documented a similarly perplexing account of a catastrophic epidemic outbreak9 in WW2. Brauer (2019) 6 showed that the early demise of an epidemic can occur in the presence of: i) heterogeneous transmission and super-spreader-like events7,10; ii) human behavioural responses which reduce the size of epidemics by reducing the reproductive number R0(t) over time7,9,11,12, creating a “herd-protection” effect based on behaviour rather than immunity. Brauer (2019)7 modelled behavioural interference by assuming that R0(t) (which is a measure of transmission) reduces exponentially over time at rate e-κt (See also12). Other studies of cholera and influenza use similar parametric approaches12. Here we generalise this principle by fitting a “flexible” R0(t) to the data without pre-imposing any specific functional form 9,11.
Figure 1 plots the COVID-19 dynamics in Brazil for eight selected states (data from14). Each panel displays the number of COVID-confirmed deaths per day for the given state over the epidemic. In theory, mortality data should reflect trends in the disease dynamics more reliably than other surveillance data (e.g., confirmed cases), although under-reporting and disease confirmation issues remain sources of bias. We repeated the plots for all Brazilian states (see Appendix), and it became immediately obvious that the epidemic curve of the first wave was roughly synchronized over all of Brazil in the period May-July. All 27 regions peaked in this 2-month period, some earlier than others, and then declined within several months. The differences in timing between states presumably relate to spatial spread and differences in local NPI policies in terms of initiation and intensity. From this perspective, it would be unusual to argue that the epidemic crashed naturally in Amazonas after reaching herd immunity, while in other locations the outbreak turned around due to the involvement of non-pharmaceutical interventions (NPIs; social-distancing, facemasks, hand-sanitizing, work stoppages, lockdowns, travel bans, restrictions on gatherings, etc.).
Exploring this more deeply, we fitted an SEIR-type epidemiological model (see Appendix) to COVID-19 mortality datasets and estimated R0(t) as it changed, as shown in Figures 1&2 (blue). The model is described in the Appendix and in Refs.7,9&13. In all states, and in the cities of Manaus and Sao Paolo, we found evidence that R0(t) fell substantially with time over the first wave as the epidemic proceeded, and it thus appeared to be a regional phenomenon. In line with Brauer (2019)7, the reduction in R0(t) is likely to be due to behavioural reactions such as state-wide NPI’s, individual behavioural changes, and possibly also combined with seasonal and/or climatic factors. In addition, it was impossible to fit SEIR-type models to the mortality data with a constant R0, as might be a reasonable assumption if interventions or seasonality had little impact, as has been suggested. Any such fit yielded outcomes that did not appear meaningful. For example, for Manaus, the best fit in Figure 2a predicted a large R0(t) = 3.1 over the whole period through to January 2021, and provided a very poor fit to the data. However, with a flexible fitting of R0(t), as in Figure 2b&c, an excellent fit to the data is obtained. Figure 2b fits COVID-19 confirmed mortality data obtained from Ref.16 while Fig.2c fits Severe Acute Respiratory Illness mortality data from Ref.21, both collections for Manaus. Moreover, the SEIR-type model, despite being famous for its ability to fit epidemics of all types, was unable to do so unless R0(t) decreased rapidly in April, thereby allowing a low stationary endemic state to be maintained data from July to November, as had been the case. The model estimated an AR∼30% until October 2020 in Manaus, which would appear to be reasonable given the large ongoing second wave that followed. Mellan et al. (2020)22 predicted a smaller AR. In summary, the first wave of the Manaus epidemic appeared to crash due to a reduction in R0(t), rather than because of a large attack rate and the attainment of herd immunity.
Buss et al. (2020)2 showed that their estimate of a high AR∼76% implies that the infection fatality rate (IFR) must be extremely low for consistency (on average IFR∼0.018-0.26%) – but they did not mechanistically model the epidemic trajectory over time as done here. We were unable to fit the mortality dynamics sensibly with such a low IFR (see Figure S1). Our results indicate that intervention activities implemented in Manaus played an important role in the crash of the first wave seen in early May 2020, as also argued by Mellan et al. (2020)22. Although no national lockdown had officially been set early in the epidemic, states and cities adopted their own measures. Buss et al (2020)2 document that physical distancing increased during March. By March 23, a state of emergency was declared in Amazonas and in fact all of Brazil. Soon after, Manaus introduced a range of NPI’s. There have been valid criticisms concerning how well these NPIs were enforced, their late implementation, and the levels of compliance; some sectors of society generally associated with the far-right, deliberately flouted the most basic rulings. In addition, those sectors most severely suffering from inherent structural inequalities had also showed a lower adherence to lockdowns initially, as many lacked financial security to survive for over a week without income. Yet quarantine was generally favoured in these sectors, and with increasing financial and social aid, adherence rose. Generally, in Manaus, the NPIs were implemented often with the strong support of the Mayor, who was often undermined by the Brazilian President Bolsarano. By the beginning of April, health systems were reaching dangerously close to their threshold of collapse. Nevertheless, by mid-May the hospitals were back on their feet, and the epidemic was declining rapidly (Figure 2), although only after many lives were lost. Similar NPI’s were implemented in nearly all other Brazilian states, roughly synchronously to combat the newly emerging countrywide epidemic, although they have been subject to criticism for poor organization, compliance, and disregard. Nonetheless, these results strongly suggest that the implementation of NPIs was sufficiently effective to provide “herd-protection”, despite the notable political and infrastructural challenges faced by the residence of Manaus.
Our work highlights the difficulties policy makers face when attempting to predict herd immunity, as well as the need for developing and using complementary mechanistic epidemiological modelling tools and datasets.
Data Availability
All data used in this work are publicly available.
Declarations
Ethical approval and consent to participate are not applicable.
Consent for publication
Not applicable.
Availability of data and materials
All data used are publicly available.
Conflict of interests
DH was supported by an Alibaba (China) Co. Ltd Collaborative Research Project. Other authors declared no conflict of interest.
Funding
DH was supported by an Alibaba (China) Co. Ltd Collaborative Research Project (ZG9Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author contributions
LS, DH, YA designed the study, carried out the statistical analysis and wrote the manuscript. SSM participated in data analysis and critically revised the manuscript.
Supplementary Figures
Acknowledgements
Not applicable.
Appendix
Models and Results
We fit a susceptible-exposed-infectious-recovered type model with a flexible time-varying transmission rate β (t) to the reported Severe Acute respiratory syndrome (SARI) mortality data 1: where the compartments S, E, I, and R denote the conventional susceptible, exposed, infectious, and recovered individuals, respectively. The variable N is the total population size of the city or state, H denotes the hospitalized cases (or severe cases), and D denotes the total of SARI-deaths. Parameters β (t), σ, γ, κ denote the transmission rate, the infectiousness emergence rate, the infectiousness disappearance rate, and the removal rate (due to death or recovery) of hospitalized cases, respectively. Parameters θ and π denote the ratio of hospitalized cases out of all infected cases and the proportion of deaths out of hospitalized cases, respectively. Thus, the overall case fatality rate (or infection fatality rate) equals θ π. All parameters are constant except β (t) being time-varying. A similar model was used for the Amazonas and Manaus COVID confirmed mortality data.
We follow closely our previous work for fitting a “flexible” time varying transmission rate β(t) (see 2–4). It requires defining β(t) = exp (cubic_spline) as an exponential cubic spline with n nodes evenly distributed over the study period. We set time step size as one day and integrated for one day and yielded the simulated daily deaths Dt. We defined the reported deaths as Ct, and then we have
Here, τ denotes the overdispersion, and accounts for the measurement noise due to surveillance and heterogeneity among individuals.
The parameter values of, σ, γ, κ are taken as 365/2, 365/3.5, and 365/12 per year, respectively. Thus the mean latent period and mean infectious period (the reciprocals of, σ and γ)) are 2 and 3.5 days, respectively. The generation time GI, the sum of mean latent period and mean infectious period 5, equals 5.5 days, which is in line with three key studies which provided estimated GI 6,10. From the SARI data we noticed a delay between the first symptom onset and the death at 14 days, which justifies the choice of 12 days (the reciprocal of κ) from loss of infectiousness to death. Namely the onset of infectiousness is 2 days ahead of first symptom onset.
Although we do not explicitly model age-class structure here, the model nevertheless has direct relevance for two reasons:
The important Science paper of Earn et al 8 was designed especially to show that the changes in R0(t) have much more impact on the epidemic trajectory than age-structure and should be the first consideration, while the effects of age-structure are in comparison far smaller.
Manaus can be qualitatively approximated as a two age-class system. Some 20% of the population are older than 45 years of age and contributes significantly to COVID-19 related deaths, while the remaining 80% of the population are under 45 years of age and have such low IFR that they barely contribute to the deaths. (In fact <7% of total COVID-19 deaths.) Yet the infection dynamics is far more homogeneous. (Buss et al. suppose all age classes have the same uniform prevalence.) It can be shown that the homogeneous system found from averaging two age classes (themselves homogenous in transmission) can explain the epidemic dynamics of the two age-class system (as shown in eg., Magpantay et al. 20199).
We used the Euler-multinomial simulation approach to simulate our model with a time-step size 1 day. We use the popular iterated filtering method (note that at least 25 applications use this method, see a list at https://en.wikipedia.org/wiki/Iterated_filtering) to fit our model to the observed data and yield the maximum likelihood estimates of unknown parameters, including values of nodes in β (t) and π while holding θ=0.1. Namely, we assume 10% of cases are hospitalized (or severe cases), as widely reported. Namely in this work, we only fit death data rather than case data. The fitting is insensitive to the choice of θ, but sensitive to the product of π and of θ, which is equivalent to the infection fatality rate (IFR). We assume uniform prior for all parameters. We hypothesized that the infection fatality rate (IFR) lay in the interval (0.006, 0.01), and fit the daily SARI mortality data with a flexible transmission rate. The method finds the best fitting parameters including IFR.
Reproductive number R0(t)
For each dataset we plotted , which is the blue line seen in the figures as a function of time (/date). The reproductive number R0(t) should not be confused with the effective reproductive number Re(t) = R0(t)S (t) mentioned in our Note, which we do not plot. The plot of Ro (t) allows us to visualise how the reproductive number changes e.g., as a result of behaviour, or interventions.
For Amazonas and Manaus, we also arrived at estimates for the initial reproductive number in the first phase of the epidemic, using standard techniques7. This is shown in Figure S3 which is based on the COVID-19 mortality data from Amazonas government health department reports (Boletim Diario de casos COVID-19 no Amazonas. http://www.fvs.am.gov.br/publicacoes) 12.
For Figure A1, we used a standard approach, , where M is the moment generating function of the distribution of the generation time which attains a Gamma distribution with a mean of 5.5 days and s.d. 3 days6,10. We used the internal from March 30, 2020 (when there were 2 deaths) to April 12, 2020. Our results for estimates of R0 were:
Amazonas 2.202 [1.712 2.782]
Manaus 2.250 [1.787 2.791]
REFERENCES
References for Appendix
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.