ABSTRACT
Predicting epidemic evolution is essential for informed decision-making and guiding the implementation of necessary countermeasures. Computational models are vital tools that provide insights into illness progression and enable early detection, proactive intervention, and targeted preventive measures.
This paper introduces Sybil, a framework that integrates machine learning and variant-aware compartmental models, leveraging a fusion of data-centric and analytic methodologies. To validate and evaluate Sybil’s forecasts, we employed COVID-19 data from two European countries. The dataset included the number of new and recovered cases, fatalities, and variant presence over time. We evaluate the forecasting precision of Sybil in periods in which there is a change in the trend of the pandemic evolution or a new variant appears. Results demonstrate that Sybil outperforms a conventional data-centric approach, being able to forecast accurately the changes in the trend, the magnitude of these changes, and the future prevalence of new variants.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by grants from ″Ripresa delle attività socio-economiche e delle scuole: modelli per la progettazione e supporto di linee guida per la convivenza con il Covid-19″ (Cod. ROL 73459, 2020, PI Matteo Sereno), project funded by CRT foundation.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
COVID-19 data used in this study are available in the COVID19 R library and in the ECDC variants data. The framework developed for the analysis presented in this work is available at https://github.com/daniele-baccega/sybil-forecasting. To reproduce the results, see supplementary material.
https://cran.r-project.org/web/packages/COVID19/index.html
https://www.ecdc.europa.eu/en/publications-data/data-virus-variants-covid-19-eueea