PT - JOURNAL ARTICLE
AU - Villegas, Marta
AU - Gonzalez-Agirre, Aitor
AU - Gutiérrez-Fandiño, Asier
AU - Armengol-Estapé, Jordi
AU - Carrino, Casimiro Pio
AU - Fernández, David Pérez
AU - Soares, Felipe
AU - Serrano, Pablo
AU - Pedrera, Miguel
AU - García, Noelia
AU - Valencia, Alfonso
TI - Predicting the Evolution of COVID-19 Mortality Risk: a Recurrent Neural Network Approach
AID - 10.1101/2020.12.22.20244061
DP - 2020 Jan 01
TA - medRxiv
PG - 2020.12.22.20244061
4099 - http://medrxiv.org/content/early/2020/12/23/2020.12.22.20244061.short
4100 - http://medrxiv.org/content/early/2020/12/23/2020.12.22.20244061.full
AB - Background The propagation of COVID-19 in Spain prompted the declaration of the state of alarm on March 14, 2020. By 2 December 2020, the infection had been confirmed in 1,665,775 patients and had caused 45,784 deaths. This unprecedented health crisis challenged the ingenuity of all professionals involved. Decision support systems in clinical care and health services management were identified as crucial in the fight against the pandemic. Methods This study applies Deep Learning techniques for mortality prediction in COVID-19 patients. Two datasets with clinical information (medication, laboratory tests, vital signs, etc.) of 2,307 and 3,870 COVID-19 infected patients admitted to two Spanish hospital chains were used. First, we built a sequence of temporal events gathering all the clinical information for each patient. Next, we used the temporal sequences to train a Recurrent Neural Network (RNN) model with an attention mechanism to explore interpretability. We conducted extensive experiments and trained the RNNs in different settings, performing hyperparameter search and cross-validation. We ensembled the resulting RNNs to reduce variability and enhance sensitivity. Results We assessed the performance of our models using global metrics, obtained by averaging the performance across all the days in the sequences.
We also measured day-by-day metrics, starting from both the day of hospital admission and the outcome day, and evaluated the daily predictions. Regarding sensitivity, when compared to more traditional models, our best two RNN ensemble models outperform a Support Vector Classifier by 6 and 16 percentage points, and a Random Forest by 23 and 18 points. For the day-by-day predictions from the outcome date, the models also achieved better results than the baselines, showing the system's ability to make early predictions. Conclusions We have shown the feasibility of our approach to predict the clinical outcome (i.e. discharged alive or death) of patients infected with SARS-CoV-2. The result is a time series model that can support decision-making in healthcare systems and aims at interpretability. Despite the low-resource scenario, the results achieved are promising and suggest that more data will further increase the performance of the model. Competing Interest Statement The authors have declared no competing interest. Funding Statement This work has been funded by the State Secretariat for Digitalization and Artificial Intelligence (SEDIA) to carry out specialised technical support activities in supercomputing within the framework of the Plan TL 23 signed on 14 December 2018.
The tasks done by Hospital Universitario 12 de Octubre were supported by PI18/00981, funded by the Carlos III Health Institute from the Spanish National plan for Scientific and Technical Research and Innovation 2017-2020 and the European Regional Development Funds (FEDER). Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The secondary anonymized data from HM Hospitales were downloaded from https://www.hmhospitales.com/coronavirus/covid-data-save-lives/english-version which are subject to the CBE (Comité de Bioética de España) https://saib.es/wp-content/uploads/Informe-CBE-investigacion-COVID-19.pdf. The dataset required an application for access; the application was submitted, and we received approval and access details via email on May 7, 2020. Usage of anonymized data from the "Hospital Universitario 12 de Octubre" was approved by the hospital's IRB (CEIm number: 20/666) and shared with BSC under a Collaboration Agreement. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. HM data is available at the provided link.
H12O data is subject to legal restrictions and cannot be currently distributed. https://www.hmhospitales.com/coronavirus/covid-data-save-lives