Abstract
We compared the predictive performance of gradient-boosted decision tree (GBDT), random forest (RF), deep neural network (DNN), and logistic regression (LR) with the least absolute shrinkage and selection operator (LASSO) for 30-day unplanned readmission, according to the number of predictor variables and the presence or absence of blood-test results. We used electronic health records of patients discharged alive from 38 hospitals in 2015–2017 for derivation (n=339,513) and in 2018 for validation (n=118,074), including basic characteristics (age, sex, admission diagnosis category, number of hospitalizations in the past year, discharge location), diagnosis, surgery, procedure, and drug codes, and blood-test results. We created six datasets with different numbers of binary variables (those present in ≥5% of patients, ≥1% of patients, or ≥10 patients), each with and without blood-test results. For the dataset with the smallest number of variables (102), the c-statistic was highest for GBDT (0.740), followed by RF (0.734), LR-LASSO (0.720), and DNN (0.664). For the dataset with the largest number of variables (1543), the c-statistic was highest for GBDT (0.764), followed by LR-LASSO (0.755), RF (0.751), and DNN (0.720). We found that GBDT generally outperformed LR-LASSO, but the difference became smaller as the number of variables increased and when blood-test results were used.
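The two quantitative ideas in the abstract, keeping binary variables above a prevalence cut-off and scoring a model by its c-statistic, can be illustrated with a minimal sketch. It is not the authors' code: the function names `select_by_prevalence` and `c_statistic`, the toy variable names, and the toy thresholds are assumptions made for illustration. The c-statistic is computed here as the proportion of concordant (readmitted, not readmitted) pairs, with ties counted as one half.

```python
from typing import Dict, List, Optional, Sequence


def select_by_prevalence(
    columns: Dict[str, Sequence[int]],
    min_fraction: Optional[float] = None,
    min_count: Optional[int] = None,
) -> List[str]:
    """Return names of binary (0/1) variables present in enough patients.

    Hypothetical helper: a variable is kept only if it meets every
    threshold that is supplied (a fraction of patients and/or a count).
    """
    kept = []
    for name, values in columns.items():
        n_positive = sum(values)
        if min_fraction is not None and n_positive < min_fraction * len(values):
            continue
        if min_count is not None and n_positive < min_count:
            continue
        kept.append(name)
    return kept


def c_statistic(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Fraction of (event, non-event) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one event and one non-event")
    concordant = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                concordant += 1.0
            elif p == q:
                concordant += 0.5
    return concordant / (len(pos) * len(neg))


# Toy data for 10 hypothetical patients (illustrative thresholds, not the paper's).
toy_columns = {
    "drug_A": [1, 1, 1, 1, 1, 0, 0, 0, 0, 0],  # present in 5 of 10 patients
    "dx_B": [1, 0, 0, 0, 0, 0, 0, 0, 0, 0],    # present in 1 of 10 patients
    "proc_C": [0] * 10,                         # present in no patient
}
common_only = select_by_prevalence(toy_columns, min_fraction=0.2)
any_use = select_by_prevalence(toy_columns, min_count=1)

# Toy outcome (1 = readmitted) and predicted risks for 4 hypothetical patients.
toy_auc = c_statistic([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

The pairwise loop is quadratic in the number of patients, so for cohorts of the size reported above a rank-based implementation would be the practical choice; the loop is used here only because it mirrors the definition directly.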
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was supported by a Japan Society for the Promotion of Science (JSPS) KAKENHI Grant (No. 19K19430) from the Japanese Ministry of Education, Culture, Sports, Science, and Technology. The funder had no role in study design, data collection, data analysis, data interpretation, or writing.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by the Ethics Committee of the University of Tsukuba (approval no. 1414) in accordance with the Declaration of Helsinki. Because the claims data were anonymized before the researchers received them, the requirement for individual participants' consent was waived according to the ethical guidelines for medical and health research involving human subjects.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Full names and contact information of authors: Masao Iwagami, iwagami-tky{at}umin.ac.jp, Ryota Inokuchi, inokuchi.ryota.ge{at}u.tsukuba.ac.jp, Eiryo Kawakami, eiryo.kawakami{at}chiba-u.jp, Tomohide Yamada, bqx07367{at}yahoo.co.jp, Atsushi Goto, agoto{at}yokohama-cu.ac.jp, Toshiki Kuno, kuno-toshiki{at}hotmail.co.jp, Yohei Hashimoto, yohashimoto1223{at}gmail.com, Nobuaki Michihata, michihata{at}m.u-tokyo.ac.jp, Tadahiro Goto, tag695{at}mail.harvard.edu, Tomohiro Shinozaki, shinozaki{at}rs.tus.ac.jp, Yu Sun, sunyu{at}md.tsukuba.ac.jp, Yuta Taniguchi, taniguchi.yuta.ma{at}alumni.tsukuba.ac.jp, Jun Komiyama, jun.komi33{at}gmail.com, Kazuaki Uda, uda.kazuaki.gn{at}u.tsukuba.ac.jp, Toshikazu Abe, abetoshikazu{at}me.com, Nanako Tamiya, ntamiya{at}md.tsukuba.ac.jp