PT - JOURNAL ARTICLE AU - Wang, Pin AU - Jiang, Chengfei AU - Mao, April W. AU - Sun, Qi AU - Zhu, Hong AU - Inman, Jamie AU - Celniker, Susan AU - Snijders, Antoine M. AU - Threadgill, David W AU - Balmain, Allan AU - Hang, Bo AU - Fan, Jia AU - Mao, Jian-Hua AU - Wang, Lei AU - Chang, Hang TI - An AI-Powered tissue-agnostic cellular morphometrics biomarker for risk assessment in patients with pan-gastrointestinal precancerous lesions and cancers AID - 10.1101/2024.11.14.24317353 DP - 2024 Jan 01 TA - medRxiv PG - 2024.11.14.24317353 4099 - http://medrxiv.org/content/early/2024/11/15/2024.11.14.24317353.short 4100 - http://medrxiv.org/content/early/2024/11/15/2024.11.14.24317353.full AB - PURPOSE Tissue-agnostic biomarkers that capture the commonality in cancer biology, may provide a new avenue for treatment development and optimization across cancer types. Here, we aimed to evaluate and validate the clinical value of a tissue-agnostic cellular morphometrics biomarker (CMB) signature, which was discovered by artificial intelligence (AI) from H&E-stained whole-slide images (WSI) of diagnostic slides of colon cancers, in pan-gastrointestinal (pan-GI) pre-cancer lesions and cancers.METHODS We discovered CMBs from WSI using our well-established CMB-ML pipeline and established a CMB risk score (CMBRS) using multivariate regression models. Based on CMBRS, we assigned individual patients from The Cancer Genome Atlas Colon Adenocarcinoma Cohort (TCGA-COAD) (n=430) to CMB risk groups (CMBRG). We then extensively evaluated tissue-agnostic clinical value of CMB signature, CMBRS and CMBRG in multi-cohorts with different types of GI cancer (n=2,219) and risk assessment of precancerous lesions (n=1,016). We unraveled each CMB-related biological function using bulk RNA-sequencing, single-cell RNA-sequencing (scRNA-seq) and opal multiplex immunohistochemistry (IHC) techniques.RESULTS From the TCGA-COAD cohort, we developed a 13-CMB signature and constructed CMBRS/CMBRG that predict prognosis of colon cancer patients. Importantly, this 13-CMB signature proved prognostic and predictive values for TCGA patients with rectal, gastric and esophageal cancer independent of traditional clinical factors. These findings were independently validated using multiple cohorts from Drum Tower Hospital. Moreover, 13-CMB signature exhibited the power for risk stratification of colon adenoma and early esophageal neoplastic lesion patients for predicting cancer progression. In addition, we demonstrated and validated independent prognostic impacts of gene signatures and CMB signatures and a significant increase in predictive power by integration of CMB signature, gene signature and clinical factors. Correlations between CMBs and gene expression levels revealed the association of each CMB with biological functions including cell proliferation, epithelial-to-mesenchymal transition and immune microenvironment. The association of CMBs with the immune microenvironment was prospectively validated by scRNA-seq and was further confirmed by Opal multiplex IHC staining in colon cancer.CONCLUSION This study demonstrates the clinical value of tissue-agnostic AI-empowered CMB signature from WSI with defined biological functions, which can be used in clinical settings to assess risk, diagnose disease, and guide clinical interventions. Tissue-agnostic CMBs potentially provide a new avenue for a rapid, robust and cost-effective cross-cancer prediction that is essential for developing common treatment strategy for multiple cancers.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work has been supported by the National Natural Science Foundation of China (ID Number: 82272952), the Natural Science Foundation of Jiangsu Province for Excellent Young Scholars (ID Number: BK20220094), China Postdoctoral Science Foundation (ID Number: 2022M721579) and funds for Clinical Trials from the Affiliated Drum Tower Hospital, Medical School of Nanjing University (ID Number: 2021-LCYJ-PY-21).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The hospital validation study was approved by the Institutional Review Board (IRB) at the participating hospital and was independently carried out at Nanjing Drum Tower Hospital.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesWhole slide images and clinical data of the TCGA cohorts were downloaded from the TCGA GDC portal (https://portal.gdc.cancer.gov/). Processed data related to TCGA and Drum Tower cohorts have been provided with the manuscript. Raw data from the Nanjing Drum Tower Hospital is not currently permitted in public repositories because ethical and legal implications are still being discussed at an institutional level.H&Ehematoxylin and eosinAIartificial intelligenceGIgastrointestinalCMBcellular morphometrics biomarkerWSIwhole-slide imagesCMBRSCMB risk scoreCMBRGCMB risk groupOSoverall survivalscRNA-seqsingle cell RNA sequencingIHCimmunohistochemistryTCGA-COADThe Cancer Genome Atlas - Colon AdenocarcinomaTCGA-STADThe Cancer Genome Atlas - Stomach AdenocarcinomTCGA-READThe Cancer Genome Atlas - Rectum AdenocarcinomaTCGA-ESCAThe Cancer Genome Atlas - Esophageal CarcinomaLGINLow-Grade Intraepithelial NeoplasiaHGINHigh-Grade Intraepithelial NeoplasiaCAPColon Adenomatous PolypsEELEarly Esophageal LesionCRCColorectal CancerDTDrum Tower Hospital