RT Journal Article SR Electronic T1 A deep learning based graph-transformer for whole slide image classification JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.10.15.21265060 DO 10.1101/2021.10.15.21265060 A1 Zheng, Yi A1 Gindra, Rushin A1 Betke, Margrit A1 Beane, Jennifer E. A1 Kolachalama, Vijaya B. YR 2021 UL http://medrxiv.org/content/early/2021/10/18/2021.10.15.21265060.abstract AB Deep learning is a powerful tool for assessing pathology data obtained from digitized biopsy slides. In the context of supervised learning, most methods typically divide a whole slide image (WSI) into patches, aggregate convolutional neural network outcomes on them and estimate overall disease grade. However, patch-based methods introduce label noise in training by assuming that each patch is independent with the same label as the WSI and neglect the important contextual information that is significant in disease grading. Here we present a Graph-Transformer (GT) based framework for processing pathology data, called GTP, that interprets morphological and spatial information at the WSI-level to predict disease grade. To demonstrate the applicability of our approach, we selected 3,024 hematoxylin and eosin WSIs of lung tumors and with normal histology from the Clinical Proteomic Tumor Analysis Consortium, the National Lung Screening Trial, and The Cancer Genome Atlas, and used GTP to distinguish adenocarcinoma (LUAD) and squamous cell carcinoma (LSCC) from those that have normal histology. Our model achieved consistently high performance on binary (tumor versus normal: mean overall accuracy = 0.975 ± 0.013) as well as three-label (normal versus LUAD versus LSCC: mean accuracy = 0.932 ± 0.019) classification on held-out test data, underscoring the power of GT-based deep learning for WSI-level classification. We also introduced a graphbased saliency mapping technique, called GraphCAM, that captures regional as well as contextual information and allows our model to highlight WSI regions that are highly associated with the class label. Taken together, our findings demonstrate GTP as a novel interpretable and effective deep learning framework for WSI-level classification.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was supported by grants from the National Institutes of Heath, Johnson & Johnson Enterprise Innovation, Inc, a Hariri Research Award from the Hariri Institute for Computing and Computational Science & Engineering at Boston University, a Strategically Focused Research Network (SFRN) Center Grant from the American Heart Association, the Toffler Scholarship in Neuroscience from the Karen Toffler Charitable Trust, and the National Science Foundation. The authors thank the National Cancer Institute for access to NCIs data collected by the National Lung Screening Trial (NLST).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study involves only openly available human data, which can be obtained from the TCGA, NLST and CPTAC databases. Since all the data is publicly available, an IRB approval was not necessary.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesTCGA, NLST and CPTAC websites