RT Journal Article SR Electronic T1 Time-dependent prediction of mortality and cytomegalovirus reactivation after allogeneic hematopoietic cell transplantation using machine learning JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.09.14.21263446 DO 10.1101/2021.09.14.21263446 A1 Eisenberg, Lisa A1 the XplOit consortium A1 Brossette, Christian A1 Rauch, Jochen A1 Grandjean, Andrea A1 Ottinger, Hellmut A1 Rissland, Jürgen A1 Schwarz, Ulf A1 Graf, Norbert A1 Beelen, Dietrich W. A1 Kiefer, Stephan A1 Pfeifer, Nico A1 Turki, Amin T. YR 2022 UL http://medrxiv.org/content/early/2022/04/01/2021.09.14.21263446.abstract AB Allogeneic hematopoietic cell transplantation (HCT) effectively treats high-risk hematologic diseases but can entail HCT-specific complications, which may be minimized by appropriate patient management, supported by accurate, individual risk estimation. However, almost all HCT risk scores are limited to a single risk assessment before HCT without incorporation of additional data. We developed machine learning models which integrate both baseline patient data and time-dependent laboratory measurements to individually predict mortality and cytomegalovirus (CMV) reactivation after HCT at multiple time points per patient. These gradient boosting machine models provide well-calibrated, time-dependent risk predictions and achieved areas under the receiver-operating characteristic of 0.92 and 0.83 and areas under the precision-recall curve of 0.58 and 0.62 for prediction of mortality and CMV reactivation, respectively, in a 21-day time window. Both models were successfully validated in a prospective, non-interventional study and performed on par with expert hematologists in a pilot comparison.Competing Interest StatementATT received travel subsidies from Neovii Biotech and consulting fees from CSL Behring, MSD and JAZZ Pharmaceuticals, all outside the submitted work. DWB received travel subsidies from Medac, outside the submitted work. The other authors declare no competing financial interests.Clinical TrialDRKS00026643Funding StatementResearch of the XplOit consortium and the conduct of the prospective XplOit study were funded by the German Federal Ministry of Education and Research (BMBF) grants No. 031L0027A-F (NP, SK, NG, DWB). In part, the completion of this analysis was supported by the German Research Foundation (DFG)-UDE-UMEA grant No. FU 356/12-1 (ATT). NP is supported by the DFG Cluster of Excellence "Machine Learning - New Perspectives for Science", EXC 2064/1, project No. 390727645.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Study design and data acquisition was evaluated by the institutional review board (IRB) of the University Duisburg-Essen (Protocol No. 17-7576-BO) and by the IRB of the medical association of the Saarland (Protocol No. 33/17).I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data used in this article contains sensitive personal health information. Due to the high dimensionality and the inclusion of longitudinal data, it cannot be fully anonymized and published without the risk of re-identification. Requests for access to the data may be submitted to the University Hospital Essen and are subject to approval by data protection officer and ethics committee.