Abstract
Rationale: Given the growing number of COVID-19 cases and the potential for further waves of infection, there is an urgent need for early prediction of disease severity in intensive care unit (ICU) patients to optimize treatment strategies.
Objectives: To predict mortality early using machine learning, based on routine laboratory results and clinical data recorded on the day of ICU admission.
Methods: We retrospectively studied 263 COVID-19 ICU patients. Kolmogorov-Smirnov and Pearson chi-squared tests were used to identify the parameters with the highest predictive value. Logistic regression and random forest (RF) algorithms were used to build classification models. The impact of each marker on the RF model's predictions was examined with the local interpretable model-agnostic explanation submodular-pick technique (LIME-SP).
Results: Among 66 documented parameters, the 15 factors with the highest predictive value were gender, age, blood urea nitrogen (BUN), creatinine, international normalized ratio (INR), albumin, mean corpuscular volume, white blood cell count, segmented neutrophil count, lymphocyte count, red cell distribution width (RDW), and mean corpuscular hemoglobin, along with a history of neurological, cardiovascular, and respiratory disorders. Our RF model can predict patient outcomes with a sensitivity of 70% and a specificity of 75%.
Conclusions: The most decisive variables in our model were increased BUN, decreased albumin, and increased creatinine, INR, and RDW, along with gender and age. Complete blood count parameters were also crucial for some patients. Given the importance of early triage decisions, this model can be a useful tool in COVID-19 ICU decision-making.
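As a rough illustration of two quantities named in the abstract, the sketch below computes a two-sample Kolmogorov-Smirnov statistic (the kind of measure that could be used to screen a continuous marker such as BUN) and the sensitivity and specificity of a binary classifier. It is a minimal sketch, not the authors' pipeline: the function names, the BUN values, and the outcome labels are all hypothetical toy inputs, and the coding 1 = died, 0 = survived is an assumption.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between
    the empirical cumulative distribution functions of the two samples."""
    a, b = sorted(sample_a), sorted(sample_b)
    largest_gap = 0.0
    for x in a + b:  # the gap can only change at observed values
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels.

    Assumed coding (not from the source): 1 = died, 0 = survived.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy values, NOT data from the study.
survivors_bun = [12, 15, 18, 20, 22, 25]
nonsurvivors_bun = [24, 30, 35, 41, 48, 60]
print("KS statistic:", ks_statistic(survivors_bun, nonsurvivors_bun))

toy_true = [1, 1, 1, 1, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 0, 1]
print("sensitivity, specificity:", sensitivity_specificity(toy_true, toy_pred))
```

A larger KS statistic means the marker's distribution differs more between the two outcome groups. Under the assumed coding, sensitivity is the fraction of deaths the model flags, and specificity is the fraction of survivors it correctly identifies.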
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The authors received no financial support for the research, authorship, and/or publication of this article.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was performed after approval by the Iran University of Medical Sciences Ethics Committee (approval ID: IR.IUMS.REC.1399.595).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The data that support the findings of this study are available from the corresponding authors upon request.
Abbreviations list
- ACE2: angiotensin-converting enzyme 2
- AI: artificial intelligence
- BUN: blood urea nitrogen
- COVID-19: coronavirus disease of 2019
- CIC: clinical impact curve
- Cr: creatinine
- CRP: C-reactive protein
- DC: decision curve
- ICU: intensive care unit
- INR: international normalized ratio
- IFN: interferon
- IL-6: interleukin 6
- IQR: interquartile range
- KS: Kolmogorov-Smirnov
- LR: logistic regression
- LIME: local interpretable model-agnostic explanation
- LIME-SP: local interpretable model-agnostic explanation submodular-pick
- ML: machine learning
- MCH: mean corpuscular hemoglobin
- MCV: mean corpuscular volume
- RF: random forest
- RDW: red blood cell distribution width
- ROC: receiver operating characteristic curve
- RT-PCR: reverse transcription-polymerase chain reaction
- WBC: white blood cell count