Discharge prediction of critical patients with spinal cord injury: a machine learning study with 1485 cases

Guoxin Fan; Huaqing Liu; Sheng Yang; Libo Luo; Lunji Wang; Mao Pang; Bin Liu; Liangming Zhang; Lanqing Han; Limin Rong

doi:10.1101/2021.06.26.21259569

Abstract

Objectives Prognostication of spinal cord injury (SCI) is vital, especially for critical patients who need intensive care. The study aims to develop machine-learning (ML) classifiers for discharge prediction of SCI patients in the intensive care unit (ICU).

Methods Clinical data of patients diagnosed with SCI were extracted from publicly available ICU databases. A total of 105 ML classifiers were initially developed to predict the discharge destination (dead, further medical care, home), and the 3 best-performing classifiers were then stacked into an ensemble classifier (Esb-Clf). To balance accuracy and feasibility, the complete Esb-Clf was then reduced to its top 10 features (simplified Esb-Clf). The micro-average area under the curve (AUC) was used to compare the prediction performance of the different ML classifiers with the manual predictions of 6 doctors.

Results A total of 1485 SCI patients were included for the early and the recent prediction of discharge destination. In the early prediction, the micro-average AUCs of the Esb-Clf and the simplified Esb-Clf were 0.846 and 0.835, respectively, during independent testing. In the recent prediction, the corresponding micro-average AUCs were 0.898 and 0.892. The performance of both the Esb-Clf and the simplified Esb-Clf was superior to that of the doctors in both the early and the recent prediction.

Conclusions ML classifiers can discriminate the discharge destination of SCI patients with high accuracy, feasibility and interpretability. Whether the simplified Esb-Clf as an online predictive tool is applicable to guiding clinical management needs further verification.
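The micro-average AUC named in the Methods can be illustrated with a minimal sketch. It is not taken from the authors' code: the function names, the class label strings and the toy probabilities below are assumptions made for illustration only. The sketch pools every (patient, class) pair of the three-class problem into one binary problem and computes a rank-based AUC on the pooled pairs, which is the usual definition of micro-averaging.

```python
from typing import List, Sequence

# Hypothetical label strings for the three discharge destinations.
CLASSES = ("dead", "further medical care", "home")


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based (Mann-Whitney) AUC; tied scores receive average ranks."""
    pairs = sorted(zip(scores, labels))
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over the run of scores tied with position i.
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1
    n_pos = sum(label for _, label in pairs)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative")
    rank_sum_pos = sum(r for r, (_, label) in zip(ranks, pairs) if label == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def micro_average_auc(
    y_true: Sequence[str],
    y_prob: Sequence[Sequence[float]],
    classes: Sequence[str] = CLASSES,
) -> float:
    """Pool all (patient, class) pairs into one binary problem, then score it."""
    flat_labels: List[int] = []
    flat_scores: List[float] = []
    for true_class, probs in zip(y_true, y_prob):
        for idx, cls in enumerate(classes):
            flat_labels.append(1 if true_class == cls else 0)
            flat_scores.append(probs[idx])
    return binary_auc(flat_labels, flat_scores)


# Toy example: 4 patients, predicted probabilities ordered as in CLASSES.
y_true = ["dead", "home", "further medical care", "home"]
y_prob = [
    [0.7, 0.2, 0.1],
    [0.1, 0.3, 0.6],
    [0.2, 0.5, 0.3],
    [0.3, 0.4, 0.3],
]
print(round(micro_average_auc(y_true, y_prob), 3))
```

Because the pooling weights every patient-class pair equally, the most frequent discharge destination contributes the most positive pairs. This is one reason a micro-average can differ from a macro-average, which weights each class equally, when the classes are imbalanced.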

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was funded by the National Key Research and Development Program of China (Grant no. 2017YFA0105403), the Key Research and Development Program of Guangdong Province (Grant no. 2019B020236002), the Clinical innovation Research Program of Guangzhou Regenerative Medicine and Health Guangdong Laboratory (Grant no. 2018GZR0201006) and Guangzhou Health Care Cooperative Innovation Major Project (Grant no. 201704020221) granted to LR; by the China Postdoctoral Science Foundation (Grant no. 2019M663261) and Guangdong Basic and Applied Basic Research Foundation (Grant no. 2019A1515111171) granted to GF; and by the Guangzhou Science and Technology Project (Grant no. 202102080212) and the Medical Scientific Research Foundation of Guangdong Province (Grant no. A2018547) granted to MP. The funders had no role in study design, data collection, data analysis, interpretation, writing of this report, or the decision to submit the paper for publication.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This study protocol, which relies on two deidentified public databases, was deemed exempt by the Institutional Review Board of Sun Yat-sen University.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The online predictive tool is available at https://colab.research.google.com/github/Huatsing-Lau/Prognosis_Prediction_Tool_of_SCI_Patients/blob/main/demo.ipynb, and its code is available from https://github.com/Huatsing-Lau/Prognosis_Prediction_Tool_of_SCI_Patients. Original data can be obtained from the MIMIC database (https://mimic.physionet.org/ for MIMIC-III and https://mimic-iv.mit.edu/ for MIMIC-IV) and the eICU database (https://eicu-crd.mit.edu/). Data not provided in the article because of space limitations may be shared (anonymized) at the request of any qualified investigator for purposes of replicating procedures and results.

Abbreviations

SCI: spinal cord injury
ML: machine-learning
ICU: intensive care unit
Esb-Clf: ensemble classifier
AUC: area under the curve
FMC: further medical care
CSV: comma-separated values
SQL: structured query language
BMI: body mass index
RBC: red blood cell
RDW: red blood cell distribution width
MCH: mean corpuscular hemoglobin
MCHC: mean corpuscular hemoglobin concentration
MCV: mean corpuscular volume
PT: prothrombin time
APTT: activated partial thromboplastin time
INR: international normalized ratio
BE: base excess
BUN: blood urea nitrogen
MIC: maximal information coefficient
RFE: recursive feature elimination
LSVC: linear support vector classifier
LR: logistic regressor
RF: random forest
mRMR: minimal-redundancy-maximal-relevance
LDA: linear discriminant analysis
SVM: support vector machine
KNN: K-Nearest Neighbor
NB: Gaussian Naïve Bayes
AdaBoost: adaptive boosting
GBDT: gradient boosting decision tree
lightGBM: light gradient boosting machine
XGBoost: extreme gradient boosting
MLP: multilayer perceptron
DNN: deep neural network
los: length of stay
ROC: receiver operating characteristic

The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.