Abstract
The clinical course of coronavirus disease 2019 (COVID-19) infection is highly variable with the vast majority recovering uneventfully but a small fraction progressing to severe disease and death. Appropriate and timely supportive care can reduce mortality and it is critical to evolve better patient risk stratification based on simple clinical data, so as to perform effective triage during strains on the healthcare infrastructure. This study presents risk stratification and mortality prediction models based on usual clinical data from 544 COVID-19 patients from New Delhi, India using machine learning methods. An XGboost classifier yielded the best performance on risk stratification (F1 score of 0.81). A logistic regression model yielded the best performance on mortality prediction (F1 score of 0.71). Significant biomarkers for predicting risk and mortality were identified. Examination of the data in comparison to a similar dataset with a Wuhan cohort of 375 patients was undertaken to understand the much lower mortality rates in India and the possible reasons thereof. The comparison indicated higher survival rate in the Delhi cohort even when patients had similar parameters as the Wuhan patients who died. Steroid administration was very frequent in Delhi patients, especially in surviving patients whose biomarkers indicated severe disease. This study helps in identifying the high-risk patient population and suggests treatment protocols that may be useful in countries with high mortality rates.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This COVID project was funded by Intel Corp as part of its Pandemic Response Technology Initiative (PRTI) [40]. The Project from IGIB side was funded by CSIR (MLP-2005) and Fondation Botnar (CLP-0031). Authors also acknowledge Dr. Mitali Mukerji for facilitating collaboration with the clinical partner. PKV and UDP also thank DST-SERB for support (CVD/2020/000343).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by the Max Super Speciality Hospital and CSIR IGIB Ethics Committees.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The datasets analyzed and generated during the current study would be made public once the manuscript is accepted.