Abstract
Background Although surgical resection is the standard therapy for stage II/III colorectal cancer (CRC), recurrence rates exceed 30%. Circulating tumor DNA (ctDNA) emerged as a promising recurrence predictor, detecting molecular residual disease (MRD). However, spatial information about the tumor and its microenvironment is not directly measured by ctDNA. Deep Learning (DL) can predict prognosis directly from routine histopathology slides.
Methods We developed a DL pipeline utilizing vision transformers to predict disease-free survival (DFS) based on histological hematoxylin & eosin (H&E) stained whole slide images (WSIs) from patients with resectable stage II-IV CRC. This model was trained on the DACHS cohort (n=1766) and independently validated on the GALAXY cohort (n=1555). Patients were categorized into high- or low-risk groups based on the DL-prediction scores. In the GALAXY cohort, the DL-scores were combined with the four-weeks post-surgery MRD status measured by ctDNA for prognostic stratification.
Results In GALAXY, the DL-model categorized 307 patients as DL high-risk and 1248 patients as DL low-risk (p<0.001; HR 2.60, CI 95% 2.11-3.21). Combining the DL scores with the MRD status significantly stratified both the MRD-positive group into DL high-risk (n=81) and DL low-risk (n=160) (HR 1.58 (CI 95% 1.17-2.11; p=0.002) and the MRD-negative group into DL high-risk (n=226) and DL low-risk (n=1088) (HR 2.37 CI 95% 1.73-3.23; p<0.001). Moreover, MRD-negative patients had significantly longer DFS when predicted as DL high-risk and treated with ACT (HR 0.48, CI 95% 0.27-0.86; p= 0.01), compared to the MRD-negative patients predicted as DL low-risk (HR=1.14, CI 95% 0.8-1.63; p=0.48).
Conclusion DL-based spatial assessment of tumor histopathology slides significantly improves the risk stratification provided by MRD alone. Combining histologic information with ctDNA yields the most powerful predictor for disease recurrence to date, with the potential to improve follow-up, withhold adjuvant chemotherapy in low-risk patients and escalate adjuvant chemotherapy in high-risk patients.
Highlights
- This study combines MRD status measured by ctDNA with a DL-based risk assessment trained on histological image data to enhance recurrence prediction.
- DL-based spatial assessment of tumor histopathology slides significantly improves the risk stratification provided by MRD alone.
- MRD-negative patients with high DL-based risk had a significantly longer DFS if treated with ACT, compared to MRD-negative and DL low risk patients
- The DL model is fully open-source and publicly available.
Competing Interest Statement
CMLL reports honoraria from AstraZeneca. HB reports research funding from Ono Pharmaceutical and honoraria from Ono Pharmaceutical Eli Lilly Japan and Taiho Pharmaceutical. TM reports honoraria from Chugai AstraZeneca and Miyarisan. SM reports honoraria from Taiho Pharmaceutical Co. Ltd. Chugai Pharmaceutical Co. Ltd. and Eli Lilly CO Ltd. DK reports honoraria from Takeda Chugai Lilly MSD Ono Seagen Guardant Health Eisai Taiho Bristol Myers Squibb Daiichi-Sankyo Pfizer Merckbiopharma and Sysmex research funding from Ono MSD Novartis Servier Janssen IQVIA Syneoshealth CIMIC and Cimicshiftzero. HT reports speakers bureau from MSD K.K Merck Biopharma Takeda Taiho Lilly Japan Bristol-Myers Squibb Japan Chugai Pharmaceutical Ono Yakuhin Amgen research funding from Takeda Daiichi Sankyo. IT reports speakers bureau from Medtronic Johnson &Johnson Intuitive Medicaroid Eli Lilly and research funding from Medtronic sysmex. SF has received honoraria from MSD and BMS. TK reports nothing to declare. EO reports speakers bureau from Chugai Pharmaceutical Co. Ltd. Bristol Meyers Ono Pharmaceutical Co. Ltd. Eli Lilly Takeda Pharmaceutical Co. Ltd. research funding from Guardant Health Inc. advisory role from Glaxosmithkline plc. YN reports advisory role from Guardant Health Pte Ltd. NateraInc. Roche Ltd. SeagenInc. Premo Partners Inc. Daiichi Sankyo Co. Ltd. Takeda Pharmaceutical Co. Ltd. Exact Sciences Corporation and Gilead Sciences Inc. speakers bureau from Guardant Health Pte Ltd. MSD K.K. Eisai Co. Ltd. Zeria Pharmaceutical Co. Ltd. Miyarisan Pharmaceutical Co. Ltd. Merck Biopharma Co. Ltd. CareNetInc. Hisamitsu Pharmaceutical Co. Inc. Taiho Pharmaceutical Co. Ltd. Daiichi Sankyo Co. Ltd. Chugai Pharmaceutical Co. Ltd. and Becton Dickinson and Company Guardant Health Japan Corp research funding from Seagen Inc. Genomedia Inc. Guardant Health AMEA Inc. Guardant Health Inc. Tempus Labs Inc. Roche Diagnostics K.K. Daiichi Sankyo Co. Ltd. and Chugai Pharmaceutical Co. Ltd. TY reports honoraria from Taiho Chugai Eli Lilly Merck Bayer Yakuhin Ono and MSD and research funding from Ono Sanofi Daiichi Sankyo Parexel Pfizer Taiho MSD Amgen Genomedia Sysmex Chugai and Nippon Boehringer Ingelheim. S.S. The remaining authors declare no competing interests. JNK declares consulting services for Owkin France; DoMore Diagnostics Norway Panakeia UK Scailyte Switzerland Mindpeak Germany and MultiplexDx Slovakia. Furthermore he holds shares in StratifAI GmbH Germany has received a research grant by GSK and has received honoraria by AstraZeneca Bayer Eisai Janssen MSD BMS Roche Pfizer and Fresenius. All the other authors report nothing to declare.
Funding Statement
JNK is supported by the German Cancer Aid (DECADE 70115166) the German Federal Ministry of Education and Research (PEARL 01KD2104C CAMINO 01EO2101 SWAG 01KD2215A TRANSFORM LIVER 031L0312A TANGERINE 01KT2302 through ERA-NET Transcan) the German Academic Exchange Service (SECAI 57616814) the German Federal Joint Committee (TransplantKI 01VSF21048) the European Unions Horizon Europe and innovation programme (ODELIA 101057091 GENIAL 101096312) the European Research Council (ERC NADIR 101114631) and the National Institute for Health and Care Research (NIHR NIHR203331) Leeds Biomedical Research Centre. The views expressed are those of the author(s) and not necessarily those of the NHS the NIHR or the Department of Health and Social Care. This work was funded by the European Union. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union. Neither the European Union nor the granting authority can be held responsible for them. CIRCULATE-Japan receives financial support from the Japan Agency for Medical Research and Development (grant 19ck0106447h0002-TY). SF is supported by the German Federal Ministry of Education and Research (SWAG 01KD2215A) the German Cancer Aid (DECADE 70115166 and TargHet 70115995) and the German Research Foundation (504101714). The DACHS study (HB TY DW and MH) was supported by the German Research Council (BR 1704/6-1 BR 1704/6-3 BR 1704/6-4 CH 117/1-1 HO 5117/2-1 HO 5117/2-2 HE 5998/2-1 HE 5998/2-2 KL 2354/3-1 KL 2354/3-2 RO 2270/8-1 RO 2270/8-2 BR 1704/17-1 and BR 1704/17-2) the Interdisciplinary Research Program of the National Center for Tumor Diseases (NCT; Germany) and the German Federal Ministry of Education and Research (01KH0404 01ER0814 01ER0815 01ER1505A and 01ER1505B).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The experiments in this study were carried out according to the Declaration of Helsinki and the International Ethical Guidelines for Biomedical Research Involving Human Subjects by the Council for International Organizations of Medical Sciences (CIOMS). The present study also adheres to the Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement.20. The Ethics Board at the Medical Faculty of Technical University Dresden (BO-EK-444102022) and Institutional Review Board of the National Cancer Center Japan (2023-207) approved of the overall analysis in this study. The patient sample collection in each cohort was separately approved by the respective institutional ethics board.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
Our whole slide image preprocessing pipeline is available here: https://github.com/KatherLab/end2end-WSI-preprocessing. The code for the pretrained vision encoder UNI can be found under: https://github.com/mahmoodlab/uni. Our DL model codes are publicly available at https://github.com/KatherLab/marugoto/tree/survival-transformer/marugoto/survival. The respective study Principal Investigators provided the remaining data. For detailed data sharing policies, please refer to the original publications.
https://github.com/KatherLab/end2end-WSI-preprocessing
https://github.com/mahmoodlab/uni
https://github.com/KatherLab/marugoto/tree/survival-transformer/marugoto/survival