Abstract
Immune checkpoint inhibitors (ICI) have become integral to treatment of non-small cell lung cancer (NSCLC). However, reliable biomarkers predictive of immunotherapy efficacy are limited. Here, we introduce HistoTME, a novel weakly supervised deep learning approach to infer the tumor microenvironment (TME) composition directly from histopathology images of NSCLC patients. We show that HistoTME accurately predicts the expression of 30 distinct cell type-specific molecular signatures directly from whole slide images, achieving an average Pearson correlation of 0.5 with the ground truth on independent tumor cohorts. Furthermore, we find that HistoTME-predicted microenvironment signatures and their underlying interactions improve prognostication of lung cancer patients receiving immunotherapy, achieving an AUROC of 0.75[95% CI: 0.61-0.88] for predicting treatment responses following first-line ICI treatment, utilizing an external clinical cohort of 652 patients. Collectively, HistoTME presents an effective approach for interrogating the TME and predicting ICI response, complementing PD-L1 expression, and bringing us closer to personalized immuno-oncology.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This project was supported by an award from Upstate Foundation's Hendricks Endowment. Data (digital images and clinical meta-data) from the institutional cohort was generated at the Pathology Research Core Lab using institutional resources and support.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee of SUNY Upstate gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
↵* co-first author
Data Availability
The TCGA Whole slide Imaging and bulk transcriptomics data is publicly available and can be downloaded from the GDC portal (https://portal.gdc.cancer.gov). The CPTAC lung cancer whole slide imaging data is publicly available at The Cancer Imaging Archive (https://www.cancerimagingarchive.net/collection/cptac-luad, https://www.cancerimagingarchive.net/collection/cptac-lscc), whereas the CPTAC lung cancer bulk transcriptomics data is available from the GDC portal. Whole slide imaging data from the SUNY Upstate cohort is currently not publicly available owing to restrictions of patient privacy. However, the data will be made available upon reasonable request. Full clinical metadata and HistoTME predictions for each cohort are available through Zenodo (https://zenodo.org/uploads/11490460).
https://portal.gdc.cancer.gov/
https://wiki.cancerimagingarchive.net/display/Public/CPTAC+Imaging+Proteomics