RT Journal Article
SR Electronic
T1 Predicting clinical outcomes in the Machine Learning era: The Piacenza score a purely data driven approach for mortality prediction in COVID-19 Pneumonia
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2021.03.16.21253752
DO 10.1101/2021.03.16.21253752
A1 Halasz, Geza
A1 Sperti, Michela
A1 Villani, Matteo
A1 Michelucci, Umberto
A1 Agostoni, Piergiuseppe
A1 Biagi, Andrea
A1 Rossi, Luca
A1 Botti, Andrea
A1 Mari, Chiara
A1 Maccarini, Marco
A1 Pura, Filippo
A1 Roveda, Loris
A1 Nardecchia, Alessia
A1 Mottola, Emanuele
A1 Nolli, Massimo
A1 Salvioni, Elisabetta
A1 Mapelli, Massimo
A1 Deriu, Marco Agostino
A1 Piga, Dario
A1 Piepoli, Massimo
YR 2021
UL http://medrxiv.org/content/early/2021/03/20/2021.03.16.21253752.abstract
AB Background Several models have been developed to predict mortality in patients with COVID-19 pneumonia, but only few have demonstrated enough discriminatory capacity. Machine-learning(ML) algorithms represent a novel approach for data-driven prediction of clinical outcomes with advantages over statistical modelling. We developed the Piacenza score, a ML-based score, to predict 30-day mortality in patients with COVID-19 pneumonia.Methods 852 patients (mean age 70years, 70%males) were enrolled from February to November 2020. The dataset was randomly splitted into derivation and test. The Piacenza score was obtained through the Naïve Bayes classifier and externally validated on 86 patients. Using a forward-search algorithm the following six features were identified: age; mean corpuscular haemoglobin concentration; PaO2 /FiO2 ratio; temperature; previous stroke; gender. In case one or more of the features are not available for a patient, the model can be re-trained using only the provided features.We also compared the Piacenza score with the 4C score and with a Naïve Bayes algorithm with 14 variables chosen a-priori.Results The Piacenza score showed an AUC of 0.78(95% CI 0.74-0.84, Brier-score 0.19) in the internal validation cohort and 0.79(95% CI 0.68-0.89, Brier-score 0.16) in the external validation cohort showing a comparable accuracy respect to the 4C score and to the Naïve Bayes model with a-priori chosen features, which achieved an AUC of 0.78(95% CI 0.73-0.83, Brier-score 0.26) and 0.80(95% CI 0.75-0.86, Brier-score 0.17) respectively.Conclusion A personalized ML-based score with a purely data driven features selection is feasible and effective to predict mortality in patients with COVID-19 pneumonia.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNoneAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:AUSL Piacenza ethics committeeAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesIf request, we agree to publicly share the study data and analysis source code.