Abstract
Predicting the prognosis of cancer patients is important for disease management. We introduce DeepProg, a new computational framework that robustly predicts patient survival subtypes from multiple types of omic data, using an ensemble of deep-learning and machine-learning models. We apply DeepProg to 32 cancer datasets from TCGA and identify multiple cancer survival subtypes. Patient survival risk stratification based on DeepProg is significantly better (p-value = 7.9e-7, rank-sum test) than stratification based on Similarity Network Fusion multi-omics data integration in all cancer types. A comprehensive pan-cancer comparative analysis further unveils the genomic signatures common to all the poorest survival subtypes, with genes enriched in extracellular matrix modeling, immune deregulation, and mitosis processes. Furthermore, models built with DeepProg on closely related cancer types are predictive of the subtypes of some other cancers, demonstrating the utility of DeepProg for transfer learning.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was supported by grants K01ES025434, awarded by NIEHS through funds provided by the trans-NIH Big Data to Knowledge (BD2K) initiative (www.bd2k.nih.gov); P20 COBRE GM103457, awarded by NIH/NIGMS; R01 LM012373 and R01 LM012907, awarded by NLM; and R01 HD084633, awarded by NICHD, to L.X. Garmire.
Author Declarations
All relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.
Not Applicable
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Not Applicable
Any clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.
Not Applicable
I have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.
Not Applicable
Data Availability
Data are available upon request.