PT - JOURNAL ARTICLE AU - Payrovnaziri, Seyedeh Neelufar AU - Xing, Aiwen AU - Shaeke, Salman AU - Liu, Xiuwen AU - Bian, Jiang AU - He, Zhe TI - The Impact of Missing Value Imputation on the Interpretations of Predictive Models: A Case Study on One-year Mortality Prediction in ICU Patients with Acute Myocardial Infarction AID - 10.1101/2020.06.06.20124347 DP - 2020 Jan 01 TA - medRxiv PG - 2020.06.06.20124347 4099 - http://medrxiv.org/content/early/2020/06/08/2020.06.06.20124347.short 4100 - http://medrxiv.org/content/early/2020/06/08/2020.06.06.20124347.full AB - Acute Myocardial Infarction (AMI) is responsible for the death of millions of people annually around the world, which makes predictive analyses of AMI mortality risk necessary. Rich clinical data in electronic health records (EHR) makes such predictive modeling possible. However, missing values in EHR data is a major issue. Also, the interpretability of predictive models in medicine and healthcare is vital for medical professionals. Therefore, this study examines the impact of imputing missing values in EHR data on the performance and interpretations of predictive models. Our experiments showed a small standard deviation in root mean squared error of different runs of imputation under similar method does not necessarily imply small standard deviation in prediction models’ performance and interpretation. Our findings reveal that the imputation method and the level of missingness impact not only the predictive models’ performance but also the interpretation of the models in terms of feature importance.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was supported in part by the National Institute on Aging (NIA) of the National Institutes of Health (NIH) under Award Number R21AG061431; and the University of Florida Clinical and Translational Science Institute, which is supported in part by the NIH National Center for Advancing Translational Sciences under award number UL1TR001427. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:IRB is not applicable due to the use of de-identified public dataset.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data are available upon request.