Abstract
Background Polymerase chain reaction (PCR) cycle threshold (Ct) values can be used to estimate the viral burden of Severe Acute Respiratory Syndrome Coronavirus type 2 (SARS-CoV-2) and predict population-level epidemic trends. We investigated the use of machine learning (ML) and epidemic transmission modeling based on Ct value distribution for SARS-CoV-2 incidence prediction during an Omicron-predominant period.
Methods Using simulated data, we developed a ML model to predict the reproductive number based on Ct value distribution, and validated it on out-of-sample province-level data. We also developed an epidemiological model and fitted it to province-level data to accurately predict incidence.
Results Based on simulated data, the ML model predicted the reproductive number with highest performance on out-of-sample province-level data. The epidemiological model was validated on outbreak data, and fitted to province-level data, and accurately predicted incidence.
Conclusions
These modeling approaches can complement traditional surveillance, especially when diagnostic testing practices change over time. The models can be tailored to different epidemiological settings and used in real time to guide public health interventions.
Funding This work was supported by funding from Genome BC, Michael Smith Foundation for Health Research and British Columbia Centre for Disease Control Foundation to C.A.H. This work was also funded by the Public Health Agency of Canada COVID-19 Immunity Task Force COVID-19 Hot Spots Competition Grant (2021-HQ-000120) to M.G.R.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by funding by Genome BC, Michael Smith Foundation for Health Research and British Columbia Centre for Disease Control Foundation to C.A.H. This work was also funded by the Public Health Agency of Canada COVID-19 Immunity Task Force COVID-19 Hot Spots Competition Grant (2021-HQ-000120) to M.G.R.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This research was approved by University of British Columbia Research Ethics (H20-0297 BCC19C-COVID-19 Research).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The genomic sequencing data is publicly available in GISAID under the submitter British Columbia Center for Disease Control Public Health Laboratory (BCCDC PHL). The individual level demographic and epidemiological data can be made accessible following the data governance and data access policy guidelines (http://www.bccdc.ca/about/accountability/data-access-requests). Code used for study models will be made available upon request to the corresponding author.