RT Journal Article SR Electronic T1 When are predictions useful? A new method for evaluating epidemic forecasts JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2023.06.29.23292042 DO 10.1101/2023.06.29.23292042 A1 Marshall, Maximilian A1 Parker, Felix A1 Gardner, Lauren M. YR 2024 UL http://medrxiv.org/content/early/2024/08/07/2023.06.29.23292042.abstract AB Background COVID-19 will not be the last pandemic of the 21st century. To better prepare for the next one, it is essential that we make honest appraisals of the utility of different responses to COVID. In this paper we focus specifically on epidemiologic forecasting. Characterizing forecast efficacy over the history of the pandemic is challenging, especially given its significant spatial, temporal, and contextual variability. In this light, we introduce the Weighted Contextual Interval Score (WCIS), a new method for retrospective interval forecast evaluation. The WCIS reflects the potential utility of predictions, resulting in a score that is easily comparable across different pandemic scenarios despite remaining intuitively representative of the in-situ quality of individual forecasts.Methods The central tenet of the WCIS is a direct incorporation of contextual utility into the evaluation. This necessitates a specific characterization of forecast efficacy depending on the use case for predictions, accomplished via defining a utility threshold parameter. In essence, changes in forecast accuracy beyond this threshold do not map to changes in the utility of a prediction. This idea is generalized to probabilistic interval-form forecasts, which are the preferred prediction format for epidemiological modeling, as an adaptation of the existing Weighed Interval Score (WIS).Results We apply the WCIS to two different forecasting scenarios. The first assesses the performance of facility-level COVID-19 hospital bed occupancy predictions for the state of Maryland during the Omicron wave, and the second evaluates state-level hospitalization forecasts drawn from the COVID-19 Forecast Hub. We use these applications to demonstrate the parameterization of contextual utility, compare the WCIS to the WIS, and explore the utility of the WCIS.Conclusions The WCIS provides a pragmatic utility-based characterization of probabilistic predictions. This method is expressly intended to enable practitioners and policymakers who may not have expertise in forecasting but are nevertheless essential partners in epidemic response to use and provide insightful analysis of predictions. We note that the WCIS is intended specifically for retrospective forecast evaluation and should not be used as a minimized penalty in a competitive context as it lacks statistical propriety.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was supported by the National Science Foundation under grant no. 2108526Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study used ONLY openly available human COVID-19 outcome data that were originally located at: https://doi.org/10.5281/zenodo.6301718 AND https://healthdata.gov/d/j4ip-wfsvI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data produced in the present study are available online at https://github.com/cpt-diabetes/wcis https://doi.org/10.5281/zenodo.6301718 https://healthdata.gov/d/j4ip-wfsv