Abstract
Lung cancer survival prediction is a critical task in healthcare, where accurate and timely predictions can significantly affect patient outcomes. In hospital settings, new patient data are generated continuously, so predictive models must adapt without forgetting previously learned knowledge. This challenge is intensified by the need to integrate complex multimodal data, such as imaging, DNA, and patient records. Traditional Deep Learning (DL) models, while powerful, often suffer from catastrophic forgetting during incremental learning, which further complicates reliable survival prediction in dynamic environments. To address these challenges, we introduce a hybrid Continual Learning (CL) framework that integrates Elastic Weight Consolidation (EWC) with replay-based modules, including EWC Experience Replay (ER), Instance-Level Correlation Replay (EICR), and Class-Level Correlation Replay (ECCR). The ER module preserves knowledge by replaying representative samples from previous data, mitigating interference from new data. The EICR module retains fine-grained feature patterns by modeling inter-instance relationships, while the ECCR module consolidates global knowledge across tasks, using random triplet probabilities to preserve inter-class correlations. Together, these components form a robust framework that addresses catastrophic forgetting while enhancing adaptability for real-time survival prediction. A further challenge is the limitation of Convolutional Neural Networks (CNNs), which tend to miss ground-glass opacities or tiny tumor features in CT and PET images because they rely on datasets similar to their pretraining data. To overcome this, we propose a Swin Transformer (SwinT)-based method to extract critical features, addressing CNN shortcomings in such multimodal scenarios. 
Additionally, XLNet-permutation enriches multimodal analysis by effectively handling small DNA datasets and capturing latent patterns, while a Fully Connected Network (FCN) processes clinical features. A cross-attention fusion mechanism integrates clinical, CT, PET, and DNA data, producing a robust survival prediction model. The final prediction, guided by FCN and Cox Proportional Hazards (CoxPH) techniques, achieves state-of-the-art performance: a concordance index (C-Index) of 0.84 (a 7.7% improvement), a mean absolute error (MAE) reduced to 140 days, and forgetting reduced to 0.08. Ablation studies demonstrate the importance of the DNA modality, the cross-attention mechanism, and the CL strategies, advancing adaptive survival prediction and stability.
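For readers unfamiliar with two quantities named in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes the standard quadratic form of the EWC penalty, (lambda / 2) * sum_i F_i * (theta_i - theta*_i)^2, and Harrell's definition of the concordance index; the abstract does not state which variants are used. The function names `ewc_penalty` and `concordance_index` and all example values are hypothetical.

```python
from typing import Sequence


def ewc_penalty(theta: Sequence[float], theta_star: Sequence[float],
                fisher: Sequence[float], lam: float) -> float:
    """Standard EWC regularizer (assumed form, not taken from the paper).

    theta      -- current parameters while training on new data
    theta_star -- parameters saved after the previous task
    fisher     -- diagonal Fisher information, one importance weight per parameter
    lam        -- strength of the penalty
    """
    return 0.5 * lam * sum(
        f * (t - s) ** 2 for t, s, f in zip(theta, theta_star, fisher)
    )


def concordance_index(times: Sequence[float], events: Sequence[int],
                      risks: Sequence[float]) -> float:
    """Harrell's C-index: share of comparable pairs ranked correctly by risk.

    A pair (i, j) is comparable when patient i has the shorter follow-up time
    and an observed event (events[i] == 1). The pair is concordant when the
    model assigns patient i the higher risk; tied risks count as one half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if times[i] < times[j] and events[i] == 1:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy values, for illustration only.
penalty = ewc_penalty([1.0, 2.0], [0.0, 2.0], [2.0, 5.0], lam=4.0)
c_index = concordance_index(
    times=[100, 200, 300, 400],   # survival or censoring time in days
    events=[1, 1, 0, 1],          # 1 = death observed, 0 = censored
    risks=[0.9, 0.4, 0.6, 0.1],   # higher value = higher predicted risk
)
```

In a continual-learning loop the penalty would be added to the loss on new (and replayed) samples, so that parameters with large Fisher weights stay close to their earlier values. A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking.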
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The author(s) received no specific funding for this work.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
N/A: We used a publicly available dataset that does not contain sensitive or private information requiring ethical approval; therefore, no IRB (Institutional Review Board) approval is required.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The dataset is publicly accessible online. Link: https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=228