PT - JOURNAL ARTICLE AU - Zhang, Sheng AU - Ponce, Joan AU - Zhang, Zhen AU - Lin, Guang AU - Karniadakis, George TI - An integrated framework for building trustworthy data-driven epidemiological models: Application to the COVID-19 outbreak in New York City AID - 10.1101/2021.02.22.21252255 DP - 2021 Jan 01 TA - medRxiv PG - 2021.02.22.21252255 4099 - http://medrxiv.org/content/early/2021/02/24/2021.02.22.21252255.short 4100 - http://medrxiv.org/content/early/2021/02/24/2021.02.22.21252255.full AB - Epidemiological models can provide the dynamic evolution of a pandemic but they are based on many assumptions and parameters that have to be adjusted over the time when the pandemic lasts. However, often the available data are not sufficient to identify the model parameters and hence infer the unobserved dynamics. Here, we develop a general framework for building a trustworthy data-driven epidemiological model, consisting of a workflow that integrates data acquisition and event timeline, model development, identifiability analysis, sensitivity analysis, model calibration, model robustness analysis, and forecasting with uncertainties in different scenarios. In particular, we apply this framework to propose a modified susceptible–exposed–infectious–recovered (SEIR) model, including new compartments and model vaccination in order to forecast the transmission dynamics of COVID-19 in New York City (NYC). We find that we can uniquely estimate the model parameters and accurately predict the daily new infection cases, hospitalizations, and deaths, in agreement with the available data from NYC’s government’s website. In addition, we employ the calibrated data-driven model to study the effects of vaccination and timing of reopening indoor dining in NYC.Competing Interest StatementThe authors have declared no competing interest.Funding StatementWe gratefully acknowledge the support from ARO/MURI grant W911NF-15-1-0562.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This work does not need any approval of the IRB.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.