PT - JOURNAL ARTICLE
AU - Vepa, Abhinav
AU - Saleem, Amer
AU - Rakhshan, Kambiz
AU - Omar, Amr
AU - Dharmaraj, Diana
AU - Sami, Junaid
AU - Parekh, Shital
AU - Ibrahim, Mohamed
AU - Raza, Mohammed
AU - Kapila, Poonam
AU - Chakrabarti, Prithwiraj
AU - Sedighi, Tabassom
AU - Chatrabgoun, Omid
AU - Daneshkhah, Alireza
TI - Predicting mortality, duration of treatment, pulmonary embolism and required ceiling of ventilatory support for COVID-19 inpatients: A Machine-Learning Approach
AID - 10.1101/2021.02.15.21251752
DP - 2021 Jan 01
TA - medRxiv
PG - 2021.02.15.21251752
4099 - http://medrxiv.org/content/early/2021/02/20/2021.02.15.21251752.short
4100 - http://medrxiv.org/content/early/2021/02/20/2021.02.15.21251752.full
AB - Introduction: Within the UK, COVID-19 has contributed to over 103,000 deaths. Multiple risk factors for COVID-19 have been identified, including various demographics, co-morbidities, biochemical parameters, and physical assessment findings. However, using this vast data to improve clinical care has proven challenging.
Aims: To develop a reliable, multivariable predictive model for COVID-19 in-patient outcomes, to aid risk-stratification and earlier clinical decision-making.
Methods: Anonymized data regarding 44 independent predictor variables of 355 adults diagnosed with COVID-19 at a UK hospital were manually extracted from electronic patient records for retrospective, case-controlled analysis. Primary outcomes included inpatient mortality, level of ventilatory support and oxygen therapy required, and duration of inpatient treatment. Secondary pulmonary embolism was the only secondary outcome. After balancing data, key variables were feature selected for each outcome using random forests.
Predictive models were created using Bayesian Networks and cross-validated.
Results: Our multivariable models were able to predict, using feature-selected risk factors, the probability of inpatient mortality (F1 score 83.7%, PPV 82%, NPV 67.9%); the level of ventilatory support required (F1 score varies from 55.8% for “High-flow Oxygen level” to 71.5% for “ITU-Admission level”); the duration of inpatient treatment (varies from 46.7% for “≥ 2 days but < 3 days” to 69.8% for “≤ 1 day”); and the risk of pulmonary embolism sequelae (F1 score 85.8%, PPV 83.7%, NPV 80.9%).
Conclusion: Overall, our findings demonstrate reliable, multivariable predictive models for 4 outcomes that utilize readily available clinical information for COVID-19 adult inpatients. Further research is required to externally validate our models and demonstrate their utility as clinical decision-making tools.
Highlights:
  Using COVID-19 risk-factor data to assist clinical decision making is a challenge
  Anonymous data from 355 COVID-19 inpatients was collected & balanced
  Key independent variables were feature selected for 4 different outcomes
  Accurate, multi-variable predictive models were computed, using Bayesian Networks
  Future research should externally validate our models & demonstrate clinical utility
Competing Interest Statement: The authors have declared no competing interest.
Clinical Trial: N/A
Funding Statement: No funding to declare.
Author Declarations:
  I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes
  The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Health Research Authority (HRA) provided authorization for this research (Project ID 284640) and waived the requirement for any Research Ethics Committee (REC) approval.
  All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes
  I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes
  I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes
Abbreviations:
  (OS) Oxygen Saturations
  (BPM) Respiratory Rate
  (UoB) CT imaging severity of COVID-19 related changes
  (CCX) COVID-19 related Chest X-Ray changes
  (MADA) Albumin
  (MDD) D-Dimer
  (CRP) C-Reactive Protein
  (MCRP1) CRP Day 1-2
  (MCRP3) CRP Day 3-4
  (MCRP5) CRP Day 5-6
  (MCRP7) CRP Day 7-8
  (MCRP11) CRP Day 11-12
  (IPD) Inpatient Mortality
  (MOoVS) Maximum Oxygen or Ventilatory Support
  (ADT) Duration of Treatment for COVID-19
  (NCPE) New confirmed diagnosis of pulmonary embolism
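
The abstract reports each binary outcome model by its F1 score, positive predictive value (PPV) and negative predictive value (NPV). As a reading aid, the sketch below shows how these three figures are conventionally derived from a 2x2 confusion matrix. It is not the authors' code: the function name `binary_metrics` and the example counts are hypothetical and are not taken from the study.

```python
def binary_metrics(tp, fp, tn, fn):
    """Return PPV, NPV, sensitivity and F1 from confusion-matrix counts.

    tp/fp/tn/fn are true/false positives/negatives. Any ratio whose
    denominator is zero is reported as 0.0 (an assumption of this sketch).
    """
    ppv = tp / (tp + fp) if (tp + fp) else 0.0          # precision
    npv = tn / (tn + fn) if (tn + fn) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # recall
    # F1 is the harmonic mean of PPV (precision) and sensitivity (recall).
    denom = ppv + sensitivity
    f1 = 2 * ppv * sensitivity / denom if denom else 0.0
    return {"ppv": ppv, "npv": npv, "sensitivity": sensitivity, "f1": f1}


# Hypothetical counts for illustration only (not data from the paper).
example = binary_metrics(tp=40, fp=10, tn=30, fn=20)
print(round(example["ppv"], 3), round(example["npv"], 3), round(example["f1"], 3))
# prints: 0.8 0.6 0.727
```

Note that F1 ignores true negatives, which is one reason a model can show a high F1 score alongside a noticeably lower NPV, as in the mortality figures above.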