ABSTRACT
Ulcerative colitis (UC) is a progressive disorder that elevates the risk of cancer development through a colitis-dysplasia-carcinoma sequence. Differential gene expression profiles of three UC clinical subtypes and healthy controls were generated from the GSE47908 microarray dataset [n = 15 (healthy controls), n = 20 (left-sided colitis), n = 19 (pancolitis), and n = 6 (colitis-associated dysplasia, CAD)] using the limma R package. Gene ontology (GO) enrichment analysis of differentially expressed genes (DEGs) revealed a shift in the transcriptome landscape as UC progressed from left-sided colitis to pancolitis to CAD, from being immune-centric to being cytoskeleton-dependent. Hippo signaling (via Yes-associated protein, YAP) and Ephrin receptor signaling were the top canonical pathways progressively altered in concert with the pathogenic progression of UC. Molecular interaction network analysis of DEGs in left-sided colitis, pancolitis, and CAD revealed one pairwise connection (edge) that was topologically important to the network structure. This edge was found to be highly enriched in actin-based processes, and death-associated protein kinase 3 (DAPK3) was a critical member and the sole protein kinase associated with this edge. DAPK3 is a regulator of actin-cytoskeleton reorganization that controls proliferation and apoptosis. Differential correlation analyses revealed a negative correlation for DAPK3-YAP in healthy controls, which flipped to positive in left-sided colitis. With UC progression to CAD, the DAPK3-YAP correlation grew progressively more positive. In summary, DAPK3 was identified as a candidate gene involved in UC progression to dysplasia.
1. INTRODUCTION
Ulcerative colitis (UC) is a chronic, inflammatory bowel disease (IBD) that is confined to the mucosal layer of the large bowel, most commonly the rectum, and may extend proximally in a continuous fashion (1). It is a heterogeneous and progressive disorder that has seen a substantial increase in global prevalence. The current framework of UC pathogenesis comprises environmental, genetic, immune, and microbiome factors that culminate in the perturbation of the mucosal barrier and a prolonged mucosal inflammatory response (2). Consequently, UC is a clinically, molecularly, and genetically heterogeneous disease with a varied disease course and mixed response to therapy (3). The genetic heterogeneity is highlighted in the analysis of 75,000 IBD cases and controls by Jostins and colleagues that identified 23 UC-specific risk loci with primary involvement in the regulation of epithelial barrier function and immune pathways (4). Genetic studies such as this promote the concept that UC disease susceptibility is a compilation of small effects/gene-alterations that are not shared by all patients.
UC patients are at greater risk of developing colitis-associated colorectal cancer (CAC) through an acceleration of the colitis-dysplasia-carcinoma sequence of cellular transformation (5). Mutational signature analysis of tissue and blood samples from CAC patients showed that the evolutionary trajectory of disease is initiated early in the colitis-dysplasia-carcinoma sequence (6). Furthermore, the extent of colitis has been recognized as an independent and the most significant risk factor for CAC (7). In support of this, Bjerrum and colleagues provided transcriptional profiles of colonic mucosa from patients with varying extent of UC (8). A gene expression dataset (GSE47908) was generated from mucosal biopsies sampled from the left colon of patients with left-sided colitis, pancolitis, or colitis-associated dysplasia (CAD) plus healthy controls. Using principal component analysis (PCA), the authors identified differential transcriptional profiles that aligned with UC extent (i.e., areas of involvement) and were not explained by potential covariates in the clinical data (i.e., age, years with disease, Mayo score, and medication). Their findings suggest that gene expression profiles of colitis-associated lesions obtained from patients with varied extent of UC can be mined to support the development of molecular panels that identify patients at high risk of developing dysplasia or CAC. However, additional network analyses and detailed interrogation of specific molecular participants were not provided in the Bjerrum study.
Elucidation of the mechanisms of colon carcinogenesis requires further investigation, and insight into the molecular events underpinning the progression of UC to colitis-associated colon cancer may be gained from the study of non-dysplastic colonic mucosa (6,8). The transcriptional dataset generated by Bjerrum and colleagues (8) provides an excellent resource to perform such an analysis. In this study, we performed differential expression analysis, pathway and network analyses, as well as differential correlation analysis on the GSE47908 dataset to determine how the extent of colitis impacts upon biological functions and regulatory pathways and to identify the key molecular factors that bridge the colitis-dysplasia progression. The results provide insight into the molecular events associated with colitis-dysplasia progression, which could be exploited for the development of biomarkers in non-dysplastic mucosa that identify the risk of dysplasia for UC patients.
2. METHODS
2.1. Data Processing
Data processing was completed using the R (v4.0.2) programming language, and all code used in this study aligns with the recommendations made by the authors of each R package in their respective user’s guides, which can be accessed at https://bioconductor.org.
2.2. Differential Gene Expression Analysis
Log transformed microarray expression data for GSE47908 and microarray platform data for GPL570 (HG-U133_Plus_2; Affymetrix Human Genome U133 Plus 2.0 Array) were retrieved from the GEO database available at https://www.ncbi.nlm.nih.gov/geo/ with the R package GEOquery (9). The limma workflow (10) was used to detect differentially expressed transcripts between the UC clinical subtypes [left-sided colitis (n=20), pancolitis (n=19), colitis-associated dysplasia, CAD (n=6)], and healthy control (HC) samples (n=15). Specifically, the function |lmFit| was used to generate a linear model fit to the data matrix containing log expression values for GSE47908. Next, function |contrasts.fit| was used to compute estimated coefficients and standard error for a given set of contrasts (e.g., pancolitis vs. HC). Finally, function |eBayes| was used to compute log-odds of differential expression by empirical Bayes moderation of the standard errors towards a global value. All functions were operated with default settings.
Log-fold changes calculated by function |eBayes| for 54,675 transcripts, along with their false discovery rate (FDR) and p-values, were uploaded to the Ingenuity Pathway Analysis (IPA) software (11). Some Affymetrix transcript identifiers could not be mapped by IPA; these transcripts were eliminated from the study, leaving 45,480 mapped transcripts. Due to redundancy of the GeneChip HG-U133 Plus 2.0 Array, this transcript pool included duplicate genes. To resolve duplicates, IPA Core Analyses were performed on the mapped transcripts, and transcript identifiers were consolidated by selecting, for each gene, the representative transcript with the maximum absolute log2FC. This returned 21,475 ‘analysis-ready’ genes. To distinguish the differentially expressed genes (DEGs), a stringent FDR threshold (q < 0.001) was applied in combination with a moderate fold change threshold (|log2FC| > 0.75). This reduced the number of false positives while maintaining the ‘ideal’ dataset size (200-3,000 genes) for subsequent IPA Core Analysis of gene expression data (11). Differential expression was then visualized via the EnhancedVolcano R package (12).
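The consolidation and thresholding steps described above can be sketched as follows. This is a minimal Python illustration rather than the IPA or R code used in the study; the `Probe` record, the function names, and the example values are hypothetical and serve only to show the logic (one representative transcript per gene by maximum absolute log2FC, followed by the combined q < 0.001 and |log2FC| > 0.75 filter).

```python
from typing import NamedTuple


class Probe(NamedTuple):
    """One microarray transcript record (hypothetical structure)."""
    probe_id: str
    gene: str
    log2fc: float
    fdr: float


def collapse_to_genes(probes):
    """Keep, for each gene, the transcript with the largest |log2FC|."""
    best = {}
    for p in probes:
        kept = best.get(p.gene)
        if kept is None or abs(p.log2fc) > abs(kept.log2fc):
            best[p.gene] = p
    return best


def select_degs(genes, fdr_cutoff=0.001, lfc_cutoff=0.75):
    """Apply the combined FDR and fold-change thresholds."""
    return {
        gene: p
        for gene, p in genes.items()
        if p.fdr < fdr_cutoff and abs(p.log2fc) > lfc_cutoff
    }


# Made-up values for illustration only; not taken from GSE47908.
probes = [
    Probe("p1", "GENE_A", -1.08, 0.0002),
    Probe("p2", "GENE_A", -0.40, 0.0300),  # duplicate gene, smaller |log2FC|
    Probe("p3", "GENE_B", 0.90, 0.0100),   # fails the FDR threshold
    Probe("p4", "GENE_C", 0.50, 0.0001),   # fails the fold-change threshold
]
genes = collapse_to_genes(probes)
degs = select_degs(genes)
print(sorted(degs))
```

In this toy input only GENE_A passes both thresholds, and it is represented by transcript p1.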
2.3. Functional Analysis of DEGs in UC disease subtypes
To compare DEGs and illustrate possible relationships between left-sided colitis, pancolitis, and CAD, a Venn diagram was first used to visualize the overlap of DEGs found in the three UC clinical subtypes. The lists of overlapping DEGs derived from the comparison of UC subtypes were then used as input data for gene ontology (GO) enrichment analysis, performed with the topGO R package (13). topGO was chosen owing to its ‘elim’ method that takes GO hierarchy into consideration when calculating enrichment. For the enrichment analysis, the background consisted of all genes assessed by the microarray platform GPL570, and annotation was completed with the R package org.Hs.eg.db (14); GO terms with <10 annotations were excluded for interpretability. The degree of enrichment was reported as the odds ratio (OR), where IA = # DEGs annotated with the GO term, IB = # background genes annotated with the GO term, and OR = (IA/size of DEG list) / (IB/size of background list). Statistical significance was defined with a Fisher’s Exact Test.
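The enrichment measure and significance test defined above can be illustrated with a short Python sketch. It assumes a plain one-sided (over-representation) Fisher’s Exact Test, which is equivalent to the upper tail of a hypergeometric distribution, and it does not reproduce the ‘elim’ decorrelation step of topGO. Function names and the example counts are hypothetical.

```python
from math import comb


def enrichment_ratio(ia, deg_size, ib, bg_size):
    """(IA / size of DEG list) / (IB / size of background list).

    Written as a single fraction, which is algebraically identical
    and avoids intermediate floating-point rounding.
    """
    return (ia * bg_size) / (deg_size * ib)


def fisher_right_tail(ia, deg_size, ib, bg_size):
    """P(X >= ia) for X ~ Hypergeometric(bg_size, ib, deg_size).

    Assumes the DEG list is a subset of the background list.
    """
    upper = min(ib, deg_size)
    total = comb(bg_size, deg_size)
    tail = sum(
        comb(ib, k) * comb(bg_size - ib, deg_size - k)
        for k in range(ia, upper + 1)
    )
    return tail / total


# Made-up counts: 3 of 10 DEGs carry a term annotated to 5 of 100 background genes.
ratio = enrichment_ratio(ia=3, deg_size=10, ib=5, bg_size=100)
p_value = fisher_right_tail(ia=3, deg_size=10, ib=5, bg_size=100)
print(ratio, p_value)
```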
2.4. Ingenuity Pathway Analyses
To analyze changes in biological states across the UC clinical subtypes, three sets of IPA Core Analyses were performed, followed by IPA Comparison Analysis. The IPA Comparison Analysis allows for side-by-side comparison of multiple Core Analyses, which facilitated the discovery of trends amongst the three datasets. Core Analyses were performed on the three datasets (i.e., left-sided colitis vs. HC, pancolitis vs. HC, and CAD vs. HC) to assess the canonical pathways, upstream regulators, molecular and cellular functions, and molecular interaction networks that were most likely to be perturbed based on the changes in gene expression. Within IPA, canonical pathways were built with reference to literature prior to DEG input and did not undergo structural changes upon DEG input. Instead, IPA computes a z-score that assesses the directionality within a gene set (i.e., the DEG input) to infer the activation state of each canonical pathway or molecular and cellular function. Upstream regulators were identified by the observed differential regulation of known downstream effector(s). The z-score determines the activation state of an upstream regulator by the regulation direction associated with the relationship from the upstream regulator to the effector(s). A negative z-score indicates inhibition, and a positive z-score indicates activation. Significance was calculated with the right-tailed Fisher’s Exact Test. Changes in activation state across UC subtypes were assessed with Comparison Analysis (sort method = trend + z-score). The correlation of activation state as the extent of disease progressed from left-sided colitis to pancolitis to CAD (the trend) was examined, and the findings were reported as trending towards activation or trending towards deactivation.
2.5. Mapping Molecular Interaction Networks
DEGs from the three datasets were mapped to their corresponding gene objects in the Ingenuity Knowledge Base (IKB), and those that interacted with other molecules in the IKB were designated focus molecules. Focus molecules were then assembled into networks by maximizing their interconnectedness with each other (relative to the non-focus molecules to which they are connected in the IKB). While non-focus molecules from the IKB may be used to merge smaller networks into a larger network, networks are scored based on the number of focus molecules they contain. Network size, the total number of focus molecules analyzed, and the total number of molecules in the IKB that could be included in the networks also contribute to the network scores. The score is a test of significance using the hypergeometric distribution and is calculated with the right-tailed Fisher’s Exact Test (Score = -log10(Fisher’s p-value); a score ≥ 2 corresponds to p ≤ 0.01). For this study, networks were limited to 70 molecules each, and a maximum of 25 networks per UC subtype were constructed. Individual networks (child networks, count = 75) were overlapped with one another to create a single parent network by virtue of common network molecules between child pairings; this was done with the RVenn R package. Finally, the edge betweenness parameter for the core network was computed via the NetworkAnalyzer app, included in Cytoscape 3.8.0 (15).
2.6. Differential Correlations
A ggplot2-based R package, ggpubr (https://rpkgs.datanovia.com/ggpubr/), was used to investigate the relationship between the expression profiles of two genes. Specifically, function |stat_cor| was used to calculate the Pearson correlation coefficient. The size of the concentration ellipse in normal probability was left at the default 0.95, which translates to a 95% confidence interval.
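The idea behind the differential correlation analysis, computing a Pearson coefficient for the same gene pair separately within each sample group, can be sketched in Python as below. This is an illustration only and not the ggpubr code used in the study; the function names and the expression values are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_by_group(samples):
    """samples: iterable of (group, expression_gene_1, expression_gene_2)."""
    grouped = {}
    for group, a, b in samples:
        xs, ys = grouped.setdefault(group, ([], []))
        xs.append(a)
        ys.append(b)
    return {group: pearson_r(xs, ys) for group, (xs, ys) in grouped.items()}


# Made-up expression values; group labels are placeholders.
samples = [
    ("HC", 1.0, 3.0), ("HC", 2.0, 2.0), ("HC", 3.0, 1.0),
    ("UC", 1.0, 1.0), ("UC", 2.0, 2.0), ("UC", 3.0, 3.5),
]
r_by_group = correlation_by_group(samples)
print(r_by_group)
```

A change in the sign or magnitude of the coefficient between groups is what the text refers to as a differential correlation.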
3. RESULTS
3.1. Analysis of Genes Differentially Expressed in UC Disease Subtypes
The distribution of gene expression ratios (log-transformed) calculated between UC subtypes and healthy controls (HC) is presented in Figure 1A. For the left-sided colitis vs. HC comparison, 1,016 genes had |log2FC| greater than 0.75 (q < 0.001); of these DEGs, 614 were upregulated, and 402 were downregulated. Pancolitis returned 2,858 DEGs, half of which were upregulated. Colitis-associated dysplasia (CAD) returned 1,842 DEGs; 393 were upregulated, and 1,449 were downregulated.
For functional predictions, a Venn diagram was first used to identify genes with expression regulation observed in multiple UC disease subtypes. As presented in Figure 1B, pancolitis overlapped both left-sided colitis and CAD in terms of common DEGs. Of the 2,858 pancolitis DEGs, 651 shared similar expression regulation as left-sided colitis DEGs (465 common upregulation plus 159 common downregulation), and 1,194 shared similar expression regulations as CAD DEGs (209 common upregulation plus 985 common downregulation). This feature was not observed when comparing left-sided colitis and CAD; less than 2% of the input DEGs appeared at the intersection of left-sided colitis ∩ pancolitis ∩ CAD. This analysis suggests pancolitis exists as the middle ground of UC subtypes that bridges the progression of UC from left-sided colitis to CAD. Thus, the DEGs located at the intersection of pancolitis ∩ left-sided colitis or pancolitis ∩ CAD were selected for functional analysis.
To identify the potential biological processes associated with UC clinical progression, two sets of DEGs at the intersection of 1) pancolitis ∩ left-sided colitis, or 2) pancolitis ∩ CAD were processed separately with the topGO R package for functional enrichment analysis. These enrichment analyses included both up-regulated and down-regulated genes. The top 10 terms from the ‘Biological Process’ (BP) category of the GO enrichment analysis are presented in Figure 2. The DEGs at the intersection of pancolitis and left-sided colitis were enriched in inflammatory processes while the pancolitis ∩ CAD overlapping DEGs were enriched in actin-based processes. Specifically, the pancolitis ∩ left-sided colitis DEGs showed significant enrichment for the regulation of IFN-γ mediated signaling pathways (GO:0060334), the antimicrobial humoral immune response mediated by antimicrobial peptide (GO:0061844), and the acute-phase response (GO:0006953) with 8.6-, 6.4-, and 6.0-fold enrichments, respectively. Across the pancolitis ∩ CAD DEGs, microvillus assembly (GO:0030033), Golgi to plasma membrane transport (GO:0006893), and actin cytoskeleton reorganization (GO:0031532) were found significantly over-represented with 6.3-, 3.1-, and 3.0-fold enrichments, respectively. Overall, the GO enrichment analysis suggests that as UC progressed from left-sided colitis to pancolitis to CAD, the transcriptome pattern also shifted from being immune-centric to having cytoskeleton-dependence. Presumably, the regulation of actin cytoskeleton organization plays an important role in colitis-dysplasia progression.
3.2. Association of Hippo Signaling Activation with Colitis-Dysplasia Progression
To identify canonical pathways that are most relevant to the observed shift in gene expression profile (Figure 2), a comparison analysis heat map for canonical pathways significantly altered by UC disease subtypes was constructed (Figure 3). This heat map displays the trend of either activation or deactivation as the gene expression profile shifted in response to UC extent, that is, from left-sided colitis to pancolitis to CAD. The heat map revealed Hippo signaling and Ephrin receptor signaling as the top canonical pathways that were progressively altered in concert with the pathogenic progression of UC. The Hippo signaling pathway displayed incremental activation, while the Ephrin receptor signaling pathway exhibited gradual deactivation as the UC extent broadened (Figure 3).
Trends of activation or deactivation were also probed for upstream regulators and biological functions to obtain a basic view of the molecular mechanisms underlying the extent of colitis. In terms of upstream regulators, IL10RA exhibited a trend of increased activity whereas interferon (IFN)-γ exhibited a trend of decreased activity. Affected biological functions included apoptosis and cell movement, which underwent gradual activation and deactivation, respectively, as disease extended from left-sided colitis to pancolitis to CAD (Figure 4).
3.3. Actin Reorganization as a Potential Key Determinant for Colitis Progression
Using the limma-derived DEGs, the IPA software generated 25 networks for each of the three UC subtypes. Molecular network intersections, assembled with the RVenn R package, revealed a single parent network that connected all 75 child networks via 757 intersections/edges (Figure 5A). Among the 75 child networks, approximately two-thirds (48/75) were composed of fewer than 60 focus molecules (<85% focus molecules). The 757 intersections/edges that connected the child networks each comprised one to 24 common molecules. Nevertheless, over half (57%) of the intersections/edges were enabled by just one common molecule, and 87% of the edges involved fewer than four common molecules (<5% overlap). To reduce noise in the parent network depicted in Figure 5A, the molecular network intersection was compiled once more, this time using only child networks that were built upon 60 or more focus molecules (>85% focus molecules) and preserving only connections that were maintained by four or more common molecules (>5% overlap). This produced one core network of networks that connected 21 child networks with 21 edges. One intersection segment, though strongly connecting pancolitis network 6 to CAD network 10 with 24 common molecules (34% overlap), stood apart from the core network of networks (Figure 5B) and so was excluded from subsequent analysis.
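The construction of the network of networks can be illustrated with a short Python sketch: child networks become nodes, and two nodes are joined by an edge when they share enough molecules. This is not the RVenn code used in the study; the function names, network labels, and molecule names are hypothetical, and the toy thresholds are scaled down from the 60 focus molecules and four common molecules described above.

```python
from itertools import combinations


def network_overlaps(networks, min_shared=1):
    """Map each pair of child networks to the set of molecules they share."""
    edges = {}
    for name_a, name_b in combinations(sorted(networks), 2):
        shared = networks[name_a] & networks[name_b]
        if len(shared) >= min_shared:
            edges[(name_a, name_b)] = shared
    return edges


def core_network(networks, focus_counts, min_focus=60, min_shared=4):
    """Drop sparse child networks, then keep only well-supported edges."""
    kept = {
        name: molecules
        for name, molecules in networks.items()
        if focus_counts[name] >= min_focus
    }
    return network_overlaps(kept, min_shared)


# Toy child networks with placeholder molecule names.
networks = {
    "LC_1": {"A", "B", "C"},
    "PC_1": {"B", "C", "D"},
    "CAD_1": {"C", "D", "E"},
    "CAD_2": {"C", "X"},
}
focus_counts = {"LC_1": 3, "PC_1": 3, "CAD_1": 3, "CAD_2": 1}

parent = network_overlaps(networks)  # every pair sharing >= 1 molecule
core = core_network(networks, focus_counts, min_focus=2, min_shared=2)
print(sorted(core))
```

In the toy data all six pairings share molecule C and therefore appear in the parent network, whereas only the two pairings that involve PC_1 survive the stricter filters.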
To pinpoint a network pairing that could be used to discern pathways or molecular processes bridging pancolitis to CAD, the ‘count of overlapping molecules’ and ‘value of edge betweenness’ measures were utilized for edge evaluation. Edge betweenness reflects the amount of control that an edge exerts over the interactions of other child networks in the parent network. The edge betweenness of e=(v,w) is defined as the number of shortest paths between two nodes that go through e divided by the total number of shortest paths between those two nodes (16,17). Edge betweenness does not consider the number of overlapping molecules that contribute to the intrinsic strength of each edge. Thus, this attribute was considered separately, post hoc. As shown in Figure 5B and Table 1, the edge connecting pancolitis network 15 to CAD network 1 stood out from the remaining network associations by virtue of its large number of overlapping molecules (20% overlap) and high value of edge betweenness. Taken together, the findings suggest that the removal of this edge may affect interactions between the remaining networks within the core network. Thus, the overlap between pancolitis network 15 and CAD network 1 was selected for further detailed examination.
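For readers unfamiliar with the measure, edge betweenness can be sketched in Python as follows. The sketch uses a standard breadth-first accumulation (Brandes-style) on an undirected, unweighted graph and returns unnormalized values summed over all node pairs; the NetworkAnalyzer implementation used in the study may scale its output differently. The function name and the toy graph are hypothetical.

```python
from collections import deque


def edge_betweenness(adjacency):
    """Unnormalized edge betweenness for an undirected, unweighted graph.

    adjacency maps each node to the set of its neighbours and must be
    symmetric (every neighbour is also a key).
    """
    scores = {}
    for u, neighbours in adjacency.items():
        for v in neighbours:
            scores[frozenset((u, v))] = 0.0

    for source in adjacency:
        # Breadth-first search: count shortest paths and record predecessors.
        sigma = {node: 0 for node in adjacency}
        sigma[source] = 1
        dist = {source: 0}
        preds = {node: [] for node in adjacency}
        order = []
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adjacency[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)

        # Walk back from the farthest nodes, sharing path fractions per edge.
        delta = {node: 0.0 for node in adjacency}
        for w in reversed(order):
            for v in preds[w]:
                share = sigma[v] / sigma[w] * (1.0 + delta[w])
                scores[frozenset((v, w))] += share
                delta[v] += share

    # Each unordered node pair was visited from both of its endpoints.
    return {edge: value / 2.0 for edge, value in scores.items()}


# Toy chain of four networks: A - B - C - D.
chain = {"A": {"B"}, "B": {"A", "C"}, "C": {"B", "D"}, "D": {"C"}}
betweenness = edge_betweenness(chain)
print(betweenness[frozenset(("B", "C"))])
```

In the toy chain the middle edge carries the shortest paths of four node pairs, whereas each outer edge carries three, so the middle edge has the highest betweenness.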
The intersection pancolitis 15 ∩ CAD 1 was formed by 14 gene products, which, when analyzed for GO term enrichment, were predominantly associated with actin-based processes (Figure 6). Specifically, the DEGs at this intersection showed significant enrichment for ruffle organization (GO:0031529), regulation of actin filament polymerization (GO:0030833), and myofibril assembly (GO:0030239) with 66-, 33-, and 33-fold enrichments, respectively.
3.4. Implication of DAPK3 as a Key Factor in the Colitis-Dysplasia Progression
Further dissection of pancolitis 15 and CAD 1 revealed two signal transduction gene products that reside in the overlap of these two networks (Table 2), namely, death-associated protein kinase 3 (DAPK3) as the sole protein kinase, and protein phosphatase PP1β catalytic subunit (PPP1CB) as the sole protein phosphatase. The remaining 12 gene products were categorized as either mechanochemical enzymes (i.e., motors) or scaffolding proteins involving the cytoskeleton. Both DAPK3 and PPP1CB were downregulated in pancolitis (log2FC: -1.08, DAPK3; -1.56, PPP1CB) and CAD (log2FC: -0.98, DAPK3; -1.52, PPP1CB) but showed no deregulation in left-sided colitis. Presumably, DAPK3 and PPP1CB act as key factors regulating the altered molecular pathways that bridge pancolitis to CAD.
3.5. Differential Correlation of DAPK3-YAP with UC Disease Progression
Given the association of Hippo signaling activation with UC extent (Figure 3), and the implication of DAPK3 and PPP1CB as key factors in colitis-dysplasia progression (Figure 6), differential correlation analyses for the pairings DAPK3-YAP and PPP1CB-YAP were conducted to assess potential differences in gene-gene regulatory relationships among the UC disease subtypes. As shown in Figure 7A, the DAPK3-YAP correlation in healthy controls was negative but flipped to positive in left-sided colitis. Moreover, as the UC disease extent progressed from left-sided colitis to pancolitis and then on to CAD, the DAPK3-YAP correlation grew progressively more positive. The general direction of differential correlation for the PPP1CB-YAP pairing mirrors that of the DAPK3-YAP pairing (Figure 7B). However, the differential correlation with UC disease extent was less apparent for the PPP1CB-YAP pairing. This result suggests that changes in the potential regulatory relationship between DAPK3 and YAP, conditioned on UC disease extent, may contribute to disease progression.
4. DISCUSSION
In UC, prolonged disease duration and extensive intestinal involvement (i.e., pancolitis) are associated with an increased risk for colorectal cancer (7). The progression of carcinogenesis in UC is thought to be driven by chronic inflammation and proceeds in a stepwise manner through a colitis-dysplasia-carcinoma sequence (18-21). As the area affected by UC grows, so too does the inflammatory load, which could in turn accelerate dysplasia and CAC tumorigenesis (20). While the impact that inflammation can have on colon carcinogenesis should not be discounted, evidence also suggests that non-inflammatory factors play a role in mediating the colitis-dysplasia-carcinoma progression (5,22,23). Among patients or animals with similar inflammatory status, some develop CAC while others do not (5). Additionally, whole-exome sequence analysis of IBD-associated colorectal cancer showed that, apart from rare cases of mutations in DNA proofreading or repair pathways, supra-IBD inflammation alone does not compel greater mutation rates when compared with sporadic CRC of non-inflammatory origin (23). This also supports a role for non-inflammatory factors in mediating colitis-dysplasia-carcinoma progression. As an example, Arthur and colleagues found that while azoxymethane (AOM)-treated Il10-/- mice infected with colitogenic E. coli NC101 or E. faecalis OG1RF exhibited a similar degree of colitis and comparable levels of immune infiltrate, tumors were observed in only 10% of E. faecalis-infected mice whereas 80% of E. coli-infected mice showed tumor development. The authors then associated the polyketide synthase (pks) genotoxic island, found in E. coli NC101 but not E. faecalis OG1RF, with the DNA damage and subsequent tumorigenesis in their E. coli-infected mice (24). Moreover, gene mutation and gene expression analyses have linked cytoskeleton remodeling to colitis-dysplasia-carcinoma progression (23,25,26).
Utilizing the Bjerrum dataset, our investigation verified pancolitis as a conduit for UC advancement from left-sided colitis to CAD and confirmed dysregulation of actin reorganization as a key determinant for the progression of UC from non-dysplastic to dysplastic UC.
From the Bjerrum dataset, 651 and 1,194 similarly dysregulated transcripts were identified at the intersection of pancolitis ∩ left-sided colitis and pancolitis ∩ CAD, respectively. GO term enrichment analysis revealed that the parallels between pancolitis and CAD were rooted in dysregulation of actin-based processes, whereas the similarities between pancolitis and left-sided colitis were rooted in dysregulation of inflammatory processes. This finding substantiates the results of a gene expression analysis of patients with CAC presented by Kanaan and colleagues, which identified actin cytoskeleton organization as the most significantly disrupted process in UC progression (25). Protein abundance analysis of UC progressors (UC patients with CAD or CAC) versus UC non-progressors has also identified enrichment of dysregulated cytoskeletal proteins in UC progressors (27). Drawbacks of the two aforementioned studies include the evaluation of UC progression in a binary fashion (presence or absence of CAD and/or CAC) as well as the small sample sizes (i.e., three unique patients in the study conducted by Kanaan and colleagues (25), and 15 unique patients in the study conducted by May and colleagues (27)). To provide insight into the molecular events associated with the stepwise progression of UC, it will be necessary to demonstrate that the dysregulations observed in CAD or CAC originated in some form of non-dysplastic UC.
Signal transduction pathways regulate many cellular functions that were found altered in UC carcinogenesis, such as proliferation, growth, differentiation, metabolism, and survival. Molecular investigations have previously associated STAT3, Wnt, TGF-β, and TLR4/NFκB signaling with the pathogenesis of colon carcinogenesis (28-32). However, the pathways most relevant to the gene expression changes observed during the early progression of UC, before any histological evidence of dysplasia/carcinoma, remain unknown. In this regard, Hippo signaling and Ephrin receptor signaling were uncovered as the top canonical pathways that were progressively altered in concert with the extent of UC progression (Figure 3). The Hippo pathway is a fundamental signaling cascade that negatively regulates the activity of YAP/TAZ to coordinate cell proliferation, apoptosis, and cell movement; as such, it is essential for tissue homeostasis, repair, and regeneration (33). Importantly, YAP/TAZ-mediated cell proliferation in epithelial monolayers is controlled by a cytoskeletal checkpoint, which in turn is monitored by actin-processing factors. The Ephrin pathway also controls intestinal homeostasis through cell proliferation and cell movement, in addition to cell attachment and repulsion (34-36). However, deciphering the functional outcomes of Ephrin pathway activation is circuitous due to the redundancy and idiosyncrasy of this pathway (37). The Ephrin receptors (Eph) comprise the largest family of receptor tyrosine kinases (RTK). Unlike most RTKs, for which ligands are generally soluble, the cognate ligands of Eph receptors, the ephrins, are membrane bound. This aspect of the Eph-ephrin receptor-ligand pairing consequently induces bidirectional signaling, where signaling through Eph is termed forward signaling and signaling through ephrin is termed reverse signaling. Furthermore, there is a plethora of Ephs and ephrins (i.e., 14 Ephs, 8 ephrins) with promiscuous pairing options. Finally, Eph/ephrin pairs also exhibit cis-interactions that inhibit forward signaling. Ephrin signaling is therefore a more convoluted pathway to interpret, relative to the Hippo pathway.
The Hippo pathway was previously identified as a key factor for the compensatory regeneration of intestinal epithelial cells (IECs) in response to tissue injury using the dextran sodium sulphate (DSS)-induced colitis mouse model (38). The recent emergence of YAP as a potential regulator of intestinal diseases involves elements beyond the canonical Hippo pathway. First, YAP can be sequestered at adherens junctions (AJs) via interactions with α-catenin (39), the abundance of which is significantly altered in active UC (40). Second, nuclear translocation of YAP may be brought about via stimulation of the gp130-associated Src family kinase Yes (41). Finally, YAP was found to be a crucial pivot point of cellular reprogramming during intestinal epithelial repair, coupling epithelial restitution to the proliferative phase of regeneration by way of FAK-Src signaling (42). In view of YAP as a mechanosensor and mechanotransducer amidst epithelial regeneration of injured tissue, a comprehensive understanding of the interplay between YAP and the actin cytoskeleton is needed to make rational selections of therapeutic targets for patients at high risk of UC neoplastic progression.
The identification of DAPK3 as a potential key factor in UC progression is particularly interesting, and the role for DAPK3 in UC pathogenesis was unknown prior to this study. However, DAPK1, the closely related family member and upstream regulator of DAPK3, was previously associated with UC severity (43) and gastrointestinal cancer pathogenesis (44). In addition, pharmacological inhibition of DAPK1 was reported to augment susceptibility to DSS-induced colitis in mice, with a concomitant increase in bacterial translocation ascribed to epithelial barrier defects (45). It is regrettable that the small molecule inhibitor of DAPK1 (i.e., DAPK6 or DI) applied in the study of tunicamycin (TM)-induced, ER-stress-dependent reduction of bacterial translocation potently cross-inhibits DAPK3 (IC50=225 nM (46)) and Rho-associated coiled-coil kinase (ROCK, Ki=132 nM (47)). Although the Lopes study included siRNA knockdown experiments to independently validate DAPK1 signaling involvement, the potential impact of the concurrent DAPK3 and ROCK inhibition caused by the DAPK6 inhibitor was not examined. Previously, Ito and colleagues showed that ROCK activity increased in response to TM, and that treatment with Y27632 (a ROCK inhibitor: Ki=220 nM for ROCK1, Ki=300 nM for ROCK2 (48,49)) completely reversed TM-induced ER-stress responses in the J774 macrophage cell line (50). Moreover, the involvement of DAPK3 in the ER-stress response was also demonstrated in human aortic vascular smooth muscle cells, where shRNA-mediated silencing of DAPK3 ablated the calcifying-media-induced increase of CCAAT-enhancer-binding protein homologous protein (CHOP), a multifunctional transcription factor in the ER-stress response (51). In the same study, treatment with DAPK6 attenuated vascular calcification in rats, alongside a significant reduction in CHOP protein abundance in the aorta.
It may be beneficial to learn whether DAPK3 and/or ROCK alter ER-stress-dependent autophagy in the context of DSS-induced colitis in mice.
The possibility that DAPK3 plays a role in the progression of the pathological changes of UC is notable. To substantiate the connection of DAPK3 to UC progression, the difference in correlation between DAPK3 and YAP was studied in all UC clinical subtypes plus healthy control samples. Differential co-expression operates on the level of gene pairs and is used as an alternative approach to identify disease-related genes (52,53). Results from the differential co-expression analyses demonstrate the correspondence of the DAPK3-YAP correlation with UC extent. This suggests that changes in the potential regulatory relationship between DAPK3 and YAP, conditioned on UC extent, may contribute to UC progression. Still, the driver(s) behind the differential DAPK3-YAP co-expression pattern remain unclear. A better understanding of the DAPK3-YAP relationship may enable the discovery of targeted therapy for the prevention of UC neoplastic progression.
Study Limitations
The progression of UC to CAD occurs through multiple mechanisms involving various cell types. The present analyses were completed on transcriptional profiles generated from mucosal biopsies (8), and genes may demonstrate diverse functions across different cell types. Hence, the gene sets identified from the averaged dataset will require re-examination in a cell type-specific way (e.g., single cell RNA-Seq) to precisely identify the susceptible cell types and convergent pathways among different cells.
Data Availability
The GSE47908 transcriptional microarray dataset is available to the public via the Gene Expression Omnibus.
6. AUTHOR CONTRIBUTIONS
H.-M.C. completed the data analysis, prepared figures and wrote the manuscript. J.A.M. conceived and coordinated the study, wrote the manuscript, supervised trainees and provided intellectual contributions to the project. All authors reviewed the results and approved the final version of the manuscript.
7. AUTHORS’ DECLARATION OF INTERESTS STATEMENT
J.A.M. is cofounder and has an equity position in Arch Biopartners Inc. All other authors declare no conflicts of interest.
5. ACKNOWLEDGEMENTS
This work was supported by a research grant from the Canadian Institutes of Health Research (MOP#97931 to J.A.M.). H.-M.C. was the recipient of CIHR Frederick Banting and Charles Best Canada and Alberta Graduate Excellence Scholarships.