RT Journal Article
SR Electronic
T1 Symptom Prediction and Mortality Risk Calculation for COVID-19 Using Machine Learning
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2021.02.04.21251143
DO 10.1101/2021.02.04.21251143
A1 Jamshidi, Elham
A1 Asgary, Amirhossein
A1 Tavakoli, Nader
A1 Zali, Alireza
A1 Dastan, Farzaneh
A1 Daaee, Amir
A1 Badakhshan, Mohammadtaghi
A1 Esmaily, Hadi
A1 Jamaldini, Seyed Hamid
A1 Safari, Saeid
A1 Bastanhagh, Ehsan
A1 Maher, Ali
A1 Babajani, Amirhesam
A1 Mehrazi, Maryam
A1 Kashi, Mohammad Ali Sendani
A1 Jamshidi, Masoud
A1 Sendani, Mohammad Hassan
A1 Rahi, Sahand Jamal
A1 Mansouri, Nahal
YR 2021
UL http://medrxiv.org/content/early/2021/02/06/2021.02.04.21251143.abstract
AB Background: Early prediction of symptoms and mortality risks for COVID-19 patients would improve healthcare outcomes, allow for the appropriate distribution of healthcare resources, reduce healthcare costs, aid in vaccine prioritization and self-isolation strategies, and thus reduce the prevalence of the disease. Such publicly accessible prediction models are lacking, however. Methods: Based on a comprehensive evaluation of existing machine learning (ML) methods, we created two models based solely on the age, gender, and medical histories of 23,749 hospital-confirmed COVID-19 patients from February to September 2020: a symptom prediction model (SPM) and a mortality prediction model (MPM). The SPM predicts 12 symptom groups for each patient: respiratory distress, consciousness disorders, chest pain, paresis or paralysis, cough, fever or chill, gastrointestinal symptoms, sore throat, headache, vertigo, loss of smell or taste, and muscular pain or fatigue. The MPM predicts the death of COVID-19-positive individuals. Results: The SPM yielded ROC-AUCs of 0.53-0.78 for symptoms. The most accurate prediction was for consciousness disorders at a sensitivity of 74% and a specificity of 70%. 2440 deaths were observed in the study population.
MPM had a ROC-AUC of 0.79 and could predict mortality with a sensitivity of 75% and a specificity of 70%. About 90% of deaths occurred in the top 21 percentile of risk groups. To allow patients and clinicians to use these models easily, we created a freely accessible online interface at www.aicovid.org. Conclusions: The ML models predict COVID-19-related symptoms and mortality using information that is readily available to patients as well as clinicians. Thus, both can rapidly estimate the severity of the disease, allowing shared and better healthcare decisions with regard to hospitalization, self-isolation strategy, and COVID-19 vaccine prioritization in the coming months. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: SJR thanks the Ecole polytechnique federale de Lausanne for generous support. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The study was performed after approval by Iran University of Medical Sciences Ethics Committee. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. The data that support the findings of this study are available from the corresponding authors upon request. Abbreviations: AI, artificial intelligence; COVID-19, coronavirus disease of 2019; ICU, intensive care unit; IQR, interquartile range; KS, Kolmogorov-Smirnov; LR, logistic regression; ML, machine learning; RF, random forest; RDW, red blood cell distribution width; ROC, receiver operating characteristic; HIS, hospital information system
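
The Results figures in the abstract (23,749 patients, 2440 deaths, MPM sensitivity 75% and specificity 70%) can be turned into rough confusion-matrix counts and predictive values. The sketch below is illustrative only and is not the authors' code. It assumes the reported sensitivity and specificity were measured on a population with the same death rate as the full cohort, which the abstract does not state; the function names are hypothetical.

```python
def implied_confusion_counts(n_total, n_positive, sensitivity, specificity):
    """Back out expected confusion-matrix counts from summary statistics.

    Assumption: sensitivity and specificity apply to a population with
    n_positive deaths out of n_total patients. Counts may be fractional
    because they are expectations, not observed tallies.
    """
    n_negative = n_total - n_positive
    tp = sensitivity * n_positive   # deaths flagged as high risk
    fn = n_positive - tp            # deaths missed
    tn = specificity * n_negative   # survivors flagged as low risk
    fp = n_negative - tn            # survivors flagged as high risk
    return {"tp": tp, "fn": fn, "tn": tn, "fp": fp}


def predictive_values(counts):
    """Positive and negative predictive value from confusion counts."""
    ppv = counts["tp"] / (counts["tp"] + counts["fp"])
    npv = counts["tn"] / (counts["tn"] + counts["fn"])
    return ppv, npv


# Figures taken from the abstract; applying them to the whole cohort
# is an assumption made for illustration.
counts = implied_confusion_counts(
    n_total=23_749, n_positive=2_440, sensitivity=0.75, specificity=0.70
)
ppv, npv = predictive_values(counts)
prevalence = 2_440 / 23_749

print(f"prevalence of death: {prevalence:.3f}")
print(f"implied PPV: {ppv:.3f}, implied NPV: {npv:.3f}")
```

Under that assumption, a death rate of roughly 10% means most patients flagged as high risk would still survive, while a low-risk prediction would be correct far more often. This is the usual behaviour of a screening-style classifier applied to a minority outcome.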