Abstract
Compartmental models that describe infectious disease transmission across subpopulations are central for assessing the impact of non-pharmaceutical interventions, behavioral changes and seasonal effects on the spread of respiratory infections. We present a Bayesian workflow for such models, including four features: (1) an adjustment for incomplete case ascertainment, (2) an adequate sampling distribution of laboratory-confirmed cases, (3) a flexible, time-varying transmission rate, and (4) a stratification by age group. We benchmarked the performance of various implementations of two of these features (2 and 3). For the second feature, we used SARS-CoV-2 data from the canton of Geneva (Switzerland) and found that a quasi-Poisson distribution is the most suitable sampling distribution for describing the overdispersion in the observed laboratory-confirmed cases. For the third feature, we implemented three methods: Brownian motion, B-splines, and approximate Gaussian processes (aGP). We compared their performance in terms of the number of effective samples per second, and the error and sharpness in estimating the time-varying transmission rate over a selection of ordinary differential equation solvers and tuning parameters, using simulated seroprevalence and laboratory-confirmed case data. Even though all methods could recover the time-varying dynamics in the transmission rate accurately, we found that B-splines perform up to four and ten times faster than Brownian motion and aGPs, respectively. We validated the B-spline model with simulated age-stratified data. We applied this model to 2020 laboratory-confirmed SARS-CoV-2 cases and two seroprevalence studies from the canton of Geneva. This resulted in detailed estimates of the transmission rate over time and the case ascertainment. Our results illustrate the potential of the presented workflow including stratified transmission to estimate age-specific epidemiological parameters. The workflow is freely available in the R package HETTMO, and can be easily adapted and applied to other surveillance data.
Author summary Mathematical models are a central tool for understanding the spread of infectious diseases. These models can be fitted to surveillance data such as the number of laboratory-confirmed cases and seroprevalence over time. To provide insightful information for managing an epidemic, the models require several crucial features. In our study, we compare the performance of several implementations of two such features. First, we find that a quasi-Poisson distribution describes best how the number of laboratory-confirmed cases of SARS-CoV-2 from the canton of Geneva (Switzerland) are sampled from the total incidence of the infection. Second, we conclude that a B-spline based implementation of time-variation in the transmission rate performs better than a Brownian motion or approximate Gaussian processes based model. Moreover, we confirm that the B-spline based model can recover time-varying transmission also in an age-stratified population. This structural comparison of methods results in a Bayesian workflow. Such a comprehensive workflow is crucial to move the field of mathematical modeling for infectious disease dynamics forward and make methods widely applicable.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study is funded by the Multidisciplinary Center for Infectious Diseases, University of Bern, Bern, Switzerland. JR is supported by the Swiss Federal Office of Public Health (142005806) and by the Swiss National Science Foundation (189498). CA received funding from the European Union’s Horizon 2020 research and innovation program - project EpiPose (No 101003688) and the Swiss National Science Foundation (No 196046). ES acknowledges support in part by the AI2050 program at Schmidt Futures (Grant [G-22-64476]). SB acknowledges support from the MRC Centre for Global Infectious Disease Analysis (MR/R015600/1), jointly funded by the UK Medical Research Council (MRC) and the UK Foreign, Commonwealth and Development Office (FCDO), under the MRC/FCDO Concordat agreement, and also part of the EDCTP2 programme supported by the European Union. SB acknowledges support from the National Institute for Health and Care Research (NIHR) via the Health Protection Research Unit in Modelling and Health Economics, which is a partnership between the UK Health Security Agency (UKHSA), Imperial College London, and the London School of Hygiene and Tropical Medicine (grant code NIHR200908). (The views expressed are those of the authors and not necessarily those of the UK Department of Health and Social Care, NIHR, or UKHSA.). SB acknowledges support from the Novo Nordisk Foundation via The Novo Nordisk Young Investigator Award (NNF20OC0059309). SB acknowledges the Danish National Research Foundation (DNRF160) through the chair grant. SB acknowledges support from The Eric and Wendy Schmidt Fund For Strategic Innovation via the Schmidt Polymath Award (G-22-63345).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
Data and code are available as an R-package on https://github.com/JudithBouman2412/HETTMO.