Abstract
Next-generation sequencing (NGS) of tumours is increasingly utilised in oncological practice; however, only a minority of patients harbour oncogenic driver mutations amenable to targeted therapy. Development of a drug response prediction (DRP) model based on available genomic data is important for the ‘untargetable’ majority of cases. Prior DRP models typically rely on whole transcriptome and whole exome sequencing (WES) data, which are often unavailable in clinical practice. We therefore aim to develop a DRP model for repurposing standard chemotherapy that requires only the information available in clinical-grade NGS (cNGS) panels of recurrently mutated genes in cancer. Such an approach is challenging due to the sparsity of data in a restricted gene set and the limited availability of patient samples with documented drug response. We first show that an existing DRP model performs equally well with whole exome data and with a cNGS subset comprising ∼300 genes. We then develop Drug IDentifier (DruID), a DRP model specific for restricted gene sets, using a novel transfer learning-based approach combining variant annotations, domain-invariant representation learning and multi-task learning. Evaluation of DruID on pan-cancer data (TCGA) showed significant improvements over state-of-the-art response prediction methods. Validation on two real-world clinical datasets, one colorectal and one ovarian cancer, showed robust response classification performance, suggesting that DruID is a significant step towards a clinically applicable DRP tool.
Competing Interest Statement
Robert Walsh reported serving on the advisory board of Pfizer and receiving honoraria from Pfizer, AstraZeneca and Merck (MSD) outside the submitted work. David SP Tan reports personal fees for advisory board membership from AstraZeneca, Bayer, Boehringer Ingelheim, Eisai, Genmab, GSK, MSD, and Roche; personal fees as an invited speaker from AstraZeneca, Eisai, GSK, Merck Serono, MSD, Roche, and Takeda; ownership of stocks/shares of Asian Microbiome Library (AMiLi); institutional research grants from AstraZeneca, Bayer, Karyopharm Therapeutics, and Roche; institutional funding as coordinating PI from AstraZeneca and Bergen Bio; institutional funding as local PI from Bayer, Byondis B.V. and Zeria Pharmaceutical Co Ltd; a previous non-remunerated role as Chair of the Asia-Pacific Gynecologic Oncology Trials Group (APGOT); a previous non-remunerated role as the Society President of the Gynecologic Cancer Group Singapore; non-remunerated membership of the Board of Directors of the GCIG; a non-remunerated role as Chair of the Cervical Cancer Research Network of the GCIG; a non-remunerated role as Protocol Committee Chair of APGOT; and product samples from AstraZeneca, Eisai, and MSD (non-financial interest). Ragunathan Mariappan and Vaibhav Rajan are co-founders of Spectrum Learning Analytics. ADJ has received consultancy fees from DKSH/Beigene, Roche, Gilead, Turbine Ltd, AstraZeneca, Antengene, Janssen, MSD and IQVIA; and research funding from Janssen and AstraZeneca.
Funding Statement
David SP Tan is supported by the National Medical Research Council, Singapore under its NMRC Clinician Scientist Award (MOH-001006) and has received charitable research funding from the Pangestu Family Foundation Gynaecological Cancer Research Fund. The ongoing IMAC study is supported by the National Research Foundation, Singapore and the National Medical Research Council, Singapore under its NMRC Centre Grant Programme (NMRC/CG/M005/2017_NCIS). Vaibhav Rajan acknowledges support from AI Singapore 100 Experiments Grant No. AISG-100E-2023-116 (PI: Vaibhav Rajan). Aishwarya Jayagopal is supported by the National University of Singapore Research Scholarship.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The National Healthcare Group Domain Specific Review Board gave ethical approval for the IMAC datasets.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes