Abstract
Objectives Prognostication of spinal cord injury (SCI) is vital, especially for critically ill patients who need intensive care. This study aims to develop machine-learning (ML) classifiers that predict the discharge destination of SCI patients in the intensive care unit (ICU).
Methods Clinical data of patients diagnosed with SCI were extracted from publicly available ICU databases. A total of 105 ML classifiers were initially developed to predict the discharge destination (dead, further medical care, home), and the 3 best-performing classifiers were then stacked into an ensemble classifier (Esb-Clf). To balance accuracy and feasibility, the complete Esb-Clf was then reduced to its top 10 features (simplified Esb-Clf). The micro-average area under the curve (AUC) was used to compare the prediction performance of the different ML classifiers with the manual predictions of 6 doctors.
Results A total of 1485 SCI patients were included in the early and the recent prediction of discharge destination. In the early prediction, the micro-average AUCs of the Esb-Clf and the simplified Esb-Clf were 0.846 and 0.835, respectively, in independent testing. In the recent prediction, the corresponding micro-average AUCs were 0.898 and 0.892. Both the Esb-Clf and the simplified Esb-Clf outperformed the doctors in the early and the recent prediction.
Conclusions ML classifiers can discriminate the discharge destination of SCI patients with high accuracy, feasibility and interpretability. Whether the simplified Esb-Clf as an online predictive tool is applicable to guiding clinical management needs further verification.
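As an illustration of the micro-average AUC named in the Methods, the following is a minimal sketch in plain Python, not the authors' implementation. It assumes the usual definition: each patient's three-class outcome is expanded into three one-vs-rest binary decisions, all decisions are pooled, and a single AUC is computed over the pool. The function names, the class ordering, and any data passed in are hypothetical.

```python
from typing import Sequence

# Class labels taken from the abstract; the ordering here is an assumption.
CLASSES = ("dead", "further medical care", "home")


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random
    negative, with ties counted as one half (Mann-Whitney formulation)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def micro_average_auc(
    y_true: Sequence[str],
    y_prob: Sequence[Sequence[float]],
    classes: Sequence[str] = CLASSES,
) -> float:
    """Pool every (sample, class) pair into one binary problem, then
    compute a single AUC over the pooled labels and scores."""
    flat_labels = []
    flat_scores = []
    for true_class, probs in zip(y_true, y_prob):
        for k, cls in enumerate(classes):
            flat_labels.append(1 if true_class == cls else 0)
            flat_scores.append(probs[k])
    return binary_auc(flat_labels, flat_scores)
```

Because the pooling weights every individual decision equally, frequent discharge destinations influence the micro-average more than rare ones, which is the usual reason for reporting it on imbalanced outcomes.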
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was funded by the National Key Research and Development Program of China (Grant no. 2017YFA0105403), the Key Research and Development Program of Guangdong Province (Grant no. 2019B020236002), the Clinical innovation Research Program of Guangzhou Regenerative Medicine and Health Guangdong Laboratory (Grant no. 2018GZR0201006) and the Guangzhou Health Care Cooperative Innovation Major Project (Grant no. 201704020221), granted to LR; by the China Postdoctoral Science Foundation (Grant no. 2019M663261) and the Guangdong Basic and Applied Basic Research Foundation (Grant no. 2019A1515111171), granted to GF; and by the Guangzhou Science and Technology Project (Grant no. 202102080212) and the Medical Scientific Research Foundation of Guangdong Province (Grant no. A2018547), granted to MP. The funders had no role in study design, data collection, data analysis, interpretation, writing of this report, or the decision to submit the paper for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study protocol, which relies on two deidentified public databases, was deemed exempt by the Institutional Review Board of Sun Yat-sen University.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The online predictive tool is presented at https://colab.research.google.com/github/Huatsing-Lau/Prognosis_Prediction_Tool_of_SCI_Patients/blob/main/demo.ipynb, and its code is available from https://github.com/Huatsing-Lau/Prognosis_Prediction_Tool_of_SCI_Patients. Original data can be obtained from the MIMIC database (https://mimic.physionet.org/ for MIMIC-III and https://mimic-iv.mit.edu/ for MIMIC-IV) and the eICU database (https://eicu-crd.mit.edu/). Data not provided in the article because of space limitations may be shared (anonymized) at the request of any qualified investigator for purposes of replicating procedures and results.
Abbreviations
- SCI: spinal cord injury
- ML: machine-learning
- ICU: intensive care unit
- Esb-Clf: ensemble classifier
- AUC: area under the curve
- FMC: further medical care
- CSV: comma-separated values
- SQL: structured query language
- BMI: body mass index
- RBC: red blood cell
- RDW: red blood cell distribution width
- MCH: mean corpuscular hemoglobin
- MCHC: mean corpuscular hemoglobin concentration
- MCV: mean corpuscular volume
- PT: prothrombin time
- APTT: activated partial thromboplastin time
- INR: international normalized ratio
- BE: base excess
- BUN: blood urea nitrogen
- MIC: maximal information coefficient
- RFE: recursive feature elimination
- LSVC: linear support vector classifier
- LR: logistic regressor
- RF: random forest
- mRMR: minimal-redundancy-maximal-relevance
- LDA: linear discriminant analysis
- SVM: support vector machine
- KNN: K-Nearest Neighbor
- NB: Gaussian Naïve Bayes
- AdaBoost: adaptive boosting
- GBDT: gradient boosting decision tree
- lightGBM: light gradient boosting machine
- XGBoost: extreme gradient boosting
- MLP: multilayer perceptron
- DNN: deep neural network
- los: length of stay
- ROC: receiver operating characteristic