PT - JOURNAL ARTICLE
AU - Marshall, Maximilian
AU - Parker, Felix
AU - Gardner, Lauren M.
TI - When are predictions useful? A new method for evaluating epidemic forecasts
AID - 10.1101/2023.06.29.23292042
DP - 2024 Jan 01
TA - medRxiv
PG - 2023.06.29.23292042
4099 - http://medrxiv.org/content/early/2024/08/07/2023.06.29.23292042.short
4100 - http://medrxiv.org/content/early/2024/08/07/2023.06.29.23292042.full
AB - Background: COVID-19 will not be the last pandemic of the 21st century. To better prepare for the next one, it is essential that we make honest appraisals of the utility of different responses to COVID. In this paper we focus specifically on epidemiologic forecasting. Characterizing forecast efficacy over the history of the pandemic is challenging, especially given its significant spatial, temporal, and contextual variability. In this light, we introduce the Weighted Contextual Interval Score (WCIS), a new method for retrospective interval forecast evaluation. The WCIS reflects the potential utility of predictions, resulting in a score that is easily comparable across different pandemic scenarios while remaining intuitively representative of the in-situ quality of individual forecasts.
Methods: The central tenet of the WCIS is a direct incorporation of contextual utility into the evaluation. This necessitates a specific characterization of forecast efficacy depending on the use case for predictions, accomplished by defining a utility threshold parameter. In essence, changes in forecast accuracy beyond this threshold do not map to changes in the utility of a prediction. This idea is generalized to probabilistic interval-form forecasts, which are the preferred prediction format for epidemiological modeling, as an adaptation of the existing Weighted Interval Score (WIS).
Results: We apply the WCIS to two different forecasting scenarios.
The first assesses the performance of facility-level COVID-19 hospital bed occupancy predictions for the state of Maryland during the Omicron wave, and the second evaluates state-level hospitalization forecasts drawn from the COVID-19 Forecast Hub. We use these applications to demonstrate the parameterization of contextual utility, compare the WCIS to the WIS, and explore the utility of the WCIS.
Conclusions: The WCIS provides a pragmatic utility-based characterization of probabilistic predictions. This method is expressly intended to enable practitioners and policymakers who may not have expertise in forecasting but are nevertheless essential partners in epidemic response to use and provide insightful analysis of predictions. We note that the WCIS is intended specifically for retrospective forecast evaluation and should not be used as a minimized penalty in a competitive context as it lacks statistical propriety.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This study was supported by the National Science Foundation under grant no. 2108526.
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes.
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The study used ONLY openly available human COVID-19 outcome data that were originally located at: https://doi.org/10.5281/zenodo.6301718 AND https://healthdata.gov/d/j4ip-wfsv
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes.
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes.
Data Availability: All data produced in the present study are available online at https://github.com/cpt-diabetes/wcis https://doi.org/10.5281/zenodo.6301718 https://healthdata.gov/d/j4ip-wfsv