Abstract
We apply Bayesian inference methods to a suite of distinct compartmental models of generalised SEIR type, in which diagnosis and quarantine are included via extra compartments. We investigate the evidence for a change in lethality of COVID-19 in late autumn 2020 in the UK, using age-structured, weekly national aggregate data for cases and mortalities. Models that allow a (step-like or graded) change in infection fatality rate (IFR) have consistently higher model evidence than those without. Moreover, they all infer a close to two-fold increase in IFR. This value lies well above most previously available estimates. However, the same models consistently infer that, most probably, the increase in IFR preceded the time window during which new variant B.1.1.7 became the dominant strain in the UK. Therefore, according to our models, the caseload and mortality data do not offer unequivocal evidence for higher lethality of a new variant. We compare these results for the UK with similar models for Germany and France, which also show increases in inferred IFR during the same period, despite the even later arrival of new variants in those countries. We argue that while the new variant(s) may be one contributing cause of a large increase in IFR in the UK in autumn 2020, other factors, such as seasonality, or pressure on health services, are likely to also have contributed.
Model structure We study a family of non-geographic compartmented models structured by age, and by epidemiological and diagnostic state. They include stages within exposed (E) and infected (I) classes, and separation of symptomatic and asymptomatic infected individuals; E and I classes are subdivided according to whether they have had a positive test.
Disease transmission between compartments is modelled by contact matrices derived from survey data and previous Bayesian inference [1]. Non-pharmaceutical interventions are incorporated via time- and age-dependent transmission rates. This dependence is parameterised by functions that account for lockdowns, seasonal changes, etc., with parameters inferred from the data except where known a priori (e.g. lockdown dates).
The process of diagnosis is modelled explicitly through the subdivided compartments. The targeting of tests towards symptomatic individuals is controlled by a single inferred parameter. Tested individuals have reduced infectivity, as a model for self-isolation. The fraction of asymptomatic cases is presumed fixed. Most other parameters are inferred (including initial conditions in March 2020). Our models are stochastic at all stages, including the infection and the diagnosis processes.
Similar models are considered for Germany and France. All models are analysed with the same Bayesian methodology as outlined below.
Data We analyse age-structured, weekly data for cases and mortalities for the UK, France and Germany starting in early March 2020. Our explicit modelling of diagnostic tests allows us to directly use data on the number of daily tests performed and their outcomes to inform parameter estimation. For the UK, the data we used are (i) ONS data for COVID deaths; (ii) PHE data for total numbers of tests; (iii) PHE data for numbers of positive tests. The time series analysed extends from March 2020 until January 2021.
Methods In line with the data used, we treat diagnosed cases and mortalities as the only observables; other compartments are treated as unobserved, with occupancies to be inferred from data. Parameters governing overdispersion for infection and death are likewise inferred.
We perform inference using a likelihood function that includes the stochasticity in all model transitions (infection, disease progression, testing, death). The likelihood of the observed data is computed in an approximation that becomes exact for large population sizes, taking into account all statistical dependencies in this limit. Inference is performed at the level of maximum a posteriori (MAP) estimates. Using these MAP estimates we approximate the evidence for each model, allowing us to compare the credibility of different models given the same set of data.
We use the software package PyRoss to build all the models and run the inference procedure [2, 3].
Model specifications We compare several distinct models that differ primarily in how interventions and fluctuations are addressed. Some of these variants (Type 1) allow for a time-dependent infection fatality rate (IFR) and some do not (Type 0). Most of our Type 1 models assume a step change in IFR but a ramped (tanh) variation is also studied. In Type 1 models, the values of IFR before and after the change are parameterised in terms of their log-ratio, whose prior is normal distributed with mean zero and standard deviation log(3). The prior for the time of the change is normal with mean 2020-12-12 and standard deviation 2 weeks.
For both Type 0 and Type 1 models we consider several possibilities for the time-dependence of model parameters other than the IFR. For the UK, these are (A) no dependence; (B) changing infectivity; (C) changing overdispersion: (BC) changing infectivity and overdispersion.
Findings Type 1 models for the UK consistently attain a significantly higher model evidence than Type 0 models. (Specifically, they show systematically larger values of the log-posterior.) For each Type 1 model, we separately infer that in the UK, the IFR rose by a factor of about two in late October or early November 2020: the MAP values vary between 1.9-2.2 depending on the model chosen. A similar change is also inferred in Germany and France, but is somewhat less pronounced.
Among the France/Germany models, the ratios of inferred IFRs before/after the step are smaller than in UK but still indicate significant changes, around 1.4 for France and 1.8 for Germany.
The IFR changes in UK are inferred to take place significantly before the B.1.1.7 variant became the dominant UK strain. The significant inferred IFR change in Germany is (to our knowledge) not associated with any new variant.
Interpretation These findings support the hypothesis [4–6] that the lethality of COVID-19 in the UK increased significantly in late autumn 2020. The MAP estimates further suggest that a larger increase than previously suggested [4, 5], of roughly a factor 2, arose with an onset time in early November. However, the fact that this onset time precedes the rise to dominance of new variant B.1.1.7 suggests that this rise is not solely responsible for the change in IFR. Other factors, such as seasonality and/or pressure on health services, may also be important. If these factors came into play significantly before the arrival of B.1.1.7, then the MAP-estimated onset time of early November could represent a compromise between two successive episodes of increasing IFR within models that, as so far implemented, allow only one.
Note that the IFR depends on the true number of infections (not just diagnosed cases as in the case fatality rate) which, in our model, is itself estimated from the data for cases and testing. Underestimation of this number can lead to an overestimate of the IFR. However, to explain the factor-two increase in fitted IFR this way would require implausible assumptions to be made concerning change in testing efficacy in late autumn 2020 (models TT0, TT1, P0).
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was funded in part by the European Research Council under the Horizon 2020 Programme, ERC grant 740269, and by the Royal Society grant RP17002. We thank Microsoft Corporation for a Microsoft Research Award for the project "Building an open platform for pandemic modelling".
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
University of Cambridge
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All data used are in the public domain