Abstract
The current investigation aimed to mine therapeutic molecular targets that play an key part in the advancement of pancreatic ductal adenocarcinoma (PDAC). The expression profiling by high throughput sequencing dataset profile GSE133684 dataset was downloaded from the Gene Expression Omnibus (GEO) database. Limma package of R was used to identify differentially expressed genes (DEGs). Functional enrichment analysis of DEGs were performed. Protein-protein interaction (PPI) networks of the DEGs were constructed. An integrated gene regulatory network was built including DEGs, microRNAs (miRNAs), and transcription factors. Furthermore, consistent hub genes were further validated. Molecular docking experiment was conducted. A total of 463 DEGs (232 up regulated and 231 down regulated genes) were identified between very early PDAC and normal control samples. The results of Functional enrichment analysis revealed that the DEGs were significantly enriched in vesicle organization, secretory vesicle, protein dimerization activity, lymphocyte activation, cell surface, transferase activity, transferring phosphorus-containing groups, hemostasis and adaptive immune system. The PPI network and gene regulatory network of up regulated genes and down regulated genes were established, and hub genes were identified. The expression of hub genes (CCNB1, FHL2, HLA-DPA1 and TUBB1) were also validated to be differentially expressed among PDAC and normal control samples. Molecular docking experiment predicted the novel inhibitory molecules for CCNB1 and FHL2. The identification of hub genes in PDAC enhances our understanding of the molecular mechanisms underlying the progression of this disease. These genes may be potential diagnostic biomarkers and/or therapeutic molecular targets in patients with PDAC.
Introduction
Pancreatic ductal adenocarcinoma (PDAC) is one of the most prevalent cancers in the world and primary tumor of the pancreas [1]. PDAC is a global burden ranking 15th in terms of incidence and fourth in terms of mortality [2]. Despite new developments in multimodal therapy its overall 5-year survival rate remains less than 8% [3]. PDAC treatment commonly includes surgery, radiation, chemotherapy and immunotherapy [4]. However, PDAC remains common and malignant due to recurrence and metastasis, and ultimately key cause of PDAC associated death [5]. Therefore, there is a vital need to advance new diagnostic strategies and therapeutic agents to upgrade the prognosis of patients with PDAC.
The molecular mechanisms of PDAC tumorigenesis and development remain imprecise. It is therefore key to identify novel genes and pathways that are linked with PDAC tumorigenesis and patient prognosis, which may not only help to illuminate the underlying molecular mechanisms involved, but also to disclose novel diagnostic markers and therapeutic targets. Oji et al [6] demonstrated that the over expression of WT1 is linked with prognosis in patients with PDAC. A previous investigation reported that phosphoinositide 3-kinase signaling pathway is linked with development of PDAC [7]. Expression profiling by high throughput sequencing can promptly uncover gene expression on a global basis and are specially useful in identifying for differentially expressed genes (DEGs) [8]. A huge amount of data has been generated through the use of microarrays and the majority of such data has been deposited and saved in public databases. Previous investigations concerning PDAC gene expression profiling have diagnosed hundreds of DEGs [9].
The aim of this investigation was to identify hub genes and pathways in PDAC using bioinformatics methods. Our investigation contributes predictable biomarkers for early detection and prognosis, as well as effective drug targets for treating PDAC.
Materials and methods
Sequencing data
PDAC expression profiling by high throughput sequencing dataset in this investigation was downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/gds/) [10]. The DEGs were considered by 1 independent PDAC dataset, GSE133684 [11] with 284 PDAC and 117 normal samples. The GSE133684 expression profiling by high throughput sequencing data was based on the GPL20795 HiSeq X Ten (Homo sapiens) platform..
Identification of DEGs
The Limma package of R language were used to normalize and convert the raw data to expression profiles [12]. The limma package of R language was used for DEGs between PDAC and normal control samples [12]. The P-value was adjusted by the Benjamini-Hochberg method [13]. An adjusted P-value <0.05 and |log2 fold change (FC) | >1 were considered as threshold values for DEGs identification. The ggplot2 package and gplots package of R language was used to generate volcano plot and heat map. The identified DEGs were preserved for further bioinformatics analysis.
GO analysis and pathway enrichment analysis of DEGs
The GO repository (http://geneontology.org/) [14] consists of a massive set of annotation terms and is generally used for annotating genes and identifying the distinctive biological aspects for expression profiling by high throughput sequencing data. The REACTOME database (https://reactome.org/) [15] contains data on known genes and their biochemical functions and is used for identifying functional and metabolic pathways. By performing the GO and REACTOME enrichment analysis at the functional level, we can boost a better understanding of the roles of these DEGs in the induction and in the advancement of PDAC. The ToppGene (ToppFun) (https://toppgene.cchmc.org/enrichment.jsp) [16] is an online resource that add tools for functional annotation and bioinformatics analysis. Both GO categories and REACTOME pathway enrichment analysis were implemented using ToppGene to inform the functions of these DEGs. P<0.05 was considered to indicate a statistically significant difference.
Protein-protein interaction (PPI) network construction and module analysis
The online database IID interactome (http://iid.ophid.utoronto.ca//)[17] was used to construct a PPI network of the proteins encoded by DEGs. Then, Cytoscape software (Version 3.8.1) [18] was applied to perform protein interaction association network analysis and analyze the interaction correlation of the candidate proteins encoded by the DEGs in PDAC. Next, the Network Analyzer Cytoscape plug-in was applied to calculate node degree [19], betweenness centrality [20], stress centrality [21], closeness centrality [22]. Finally, the PEWCC1 (http://apps.cytoscape.org/apps/PEWCC1) [23] module for Cytoscape was used to collect the significant modules in the PPI network complex.
Construction of miRNA-DEG regulatory network
The miRNet database (https://www.mirnet.ca/) [24] is a database, containing miRNAs involved in various diseases. The miRNAs related to PDAC were searched from miRNet database. Through getting the intersection of the miRNAs and the DEGs, the miRNA-DEG regulatory relationships were selected. Finally, miRNA-DEG regulatory network was built using Cytoscape software.
Construction of TF-DEG regulatory network
The NetworkAnalyst database (https://www.networkanalyst.ca/) [25] is a database, containing TFs involved in various diseases. The TFs related to PDAC were searched from TF database. Through getting the intersection of the TFs s and the DEGs, the TF-DEG regulatory relationships were selected. Finally, TFs-DEG regulatory network was built using Cytoscape software.
Hub genes validation
After hub genes identified from expression profiling by high throughput sequencing dataset, UALCAN (http://ualcan.path.uab.edu/analysis.html) [26] was used to validate the selected up regulated and down regulated hub genes. UALCAN is an online tool for gene expression analysis between PDAC and normal data from The Cancer Genome Atlas (TCGA). It adds data such as gene expression, tumor staging, and survival period for PDAC. cBioPortal is an online platform (http://www.cbioportal.org) [27] for gene alteration of hub genes analysis from TCGA. Human protein atlas is an online database (HPA, www.proteinatlas.org) [28] for protein expression analysis between PDAC and normal data from TCGA. TIMER is an online platform (https://cistrome.shinyapps.io/timer/) [29] for immune infiltration analysis from TCGA. To explore diagnostic biomarkers of PDAC, we used the above hub genes as candidates to find their diagnostic value based on generalized linear model (GLM) [30]. The pROC in R was used for Receiver operating characteristic (ROC) curve analysis [30]. In brief, half of the samples (PDAC = 142, controls = 59) were aimlessly distributed as the training set and remaining data were used as the test set, which were used to set up a model. An ROC curve analysis was tested to calculate the specificity and sensitivity of the GLM prediction model. The area under the curve (AUC) was figure out to determine the diagnostic efficiency of the classifier.
RT-PCR analysis
TRI Reagent® (Sigma, USA) was used to extract total RNA from the culture cells of PDAC (CRL-2549™) and normal (CRL-2989™) according to the manufacturer’s protocol. Reverse transcription cDNA kit (Thermo Fisher Scientific, Waltham, MA, USA) and random primers were used to synthesize cDNA. Quantitative real-time PCR (qRT-PCR) was conducted on the 7 Flex real-time PCR system (Thermo Fisher Scientific, Waltham, MA, USA). The reaction guideline included a denaturation program (5 min at 95 °C), followed by an amplification and quantification program over 40 cycles (15 s at 95°C and 45 s at 65°C). Each sample was tested in triplicates. Table 1 depicts the primer sequences of hub genes. The expression level was resolved as a ratio between the hub genes and the internal control β-actin in the same mRNA sample, and determined by the comparative CT method [31]. Levels of CCNB1, FHL2, HLA-DPA1 and TUBB1 expression were determined by the 2−ΔΔCt method.
Molecular docking experiment
The module SYBYL-X 2.0 perpetual software were used for Surflex-Docking of the designed molecules. The molecules were sketched by using ChemDraw Software and imported and saved in sdf. format using openbabel free software. The protein structures of CyclinB1 (CCNB1) its co-crystallisedprotein of PDB code 4Y72, 5H0V and Four and half LIM domains 2 (FHL2) its NMR structure of proteins 2D8Z and 2EHE was retrieved from Protein Data Bank [32-34]. Together with the TRIPOS force field, GasteigerHuckel (GH) charges were added to all designed derivatives for the structure optimization process.In addition, energy minimization was carried outusing MMFF94s and MMFF94 algorithm process. Protein processing was carried out after the incorporation of protein.The co-crystallized ligand and all water molecules were removed from the crystal structure; more hydrogens were added and the side chain was set. TRIPOS force field was used for the minimization of structure. The compounds’ interaction efficiency with the receptor was represented by the Surflex-Dock score in kcal / mol units. The interaction between the protein and the ligand, the best pose was incorporated into the molecular area. The visualisation of ligand interaction with receptor is done by using discovery studio visualizer.
Results
Identification of DEGs
We analyzed the DEGs of GSE133684 by using the limma package. We used p□<□0.05 and |logFC|□≥□1 as the cutoff criteria. We screened 463 DEGs, including 232 up regulated genes and 231 down regulated genes in PDAC samples compared with normal control samples and are listed in Table 2. We identified all the DEGs which were shown in the above volcano map according to the value of |logFC| is shown in Fig. 1 and then displayed the DEGs on a heatmap is shown in Fig. 2.
GO analysis and pathway enrichment analysis of DEGs
To symbolize the function of the DEGs and to identify important candidate pathways, GO functional enrichment analysis and REACTOME pathway enrichment analysis were performed. The results of GO categories analysis including biological processes (BP), cellular components (CC) and molecular functions (MF) are listed in Table 3. Firstly, the up regulated genes were annotated with the BP category, including vesicle organization and secretion, whereas the down regulated genes were annotated with the GO terms, including lymphocyte activation and regulation of cell death. Secondly, the up regulated genes were annotated with the GO terms of the CC category, namely secretory vesicle and whole membrane, whereas the down regulated genes were annotated with the GO terms, including cell surface and intrinsic component of plasma membrane.
Thirdly, the up regulated genes were annotated with the GO terms of the MF category, such as protein dimerization activity and signaling receptor binding, whereas the down regulated genes were annotated with the GO terms, including transferase activity, transferring phosphorus-containing groups and drug binding. As shown in Table 4, the significantly enriched REACTOME pathways of the up regulated genes with P<0.05 were hemostasis and cell cycle, whereas down regulated genes with P<0.05 were adaptive immune system and transmembrane transport of small molecules.
Protein-protein interaction (PPI) network construction and module analysis
After all the DEGs were uploaded to the online IID interactome database, the PPI network with 6188 nodes and 13153 edges was constructed using the Cytoscape software (Fig. 3A). Hub DEGs with the node degree, betweenness centrality, stress centrality and closeness centrality were obtained and are listed in Table 5. Among them, CCNB1 and FHL2 were the major up regulated genes, while HLA-DPA1 and TUBB1 were the major down regulated genes. Then, two significant module that fulfilled the cut-off criteria, namely, PEWCC1 scores >3 and number of nodes >5, was screened (Fig. 3B and Fig. 3C). The FGB, FGA, FGG, EEF1A1, RPL13A, ITGA4, RPL27A, RPL23A and RPL10 genes were identified in these modules. GO analysis of these genes showed that they were annotated with vesicle organization, regulation of cell death and lymphocyte activation. In addition, the REACTOME enrichment analysis suggested that these genes were mainly involved in hemostasis, innate immune system, disease and adaptive immune system.
Construction of miRNA-DEG regulatory network
The regulatory network of miRNA-DEG and predicted targets is presented in Fig. 4A. Notably, MAP1B targeted 202 miRNAs, including hsa-mir-4461; CCNB1 targeted 94 miRNAs, including hsa-mir-3928-3p; AHNAK targeted 256 miRNAs, including hsa-mir-2682-5p; KMT2D targeted 209 miRNAs, including hsa-mir-1202 and top 20 are listed in Table 6. As a group, a total of 257 of the 463 DEGs were contained in the miRNA-DEG regulatory network.
Construction of TF-DEG regulatory network
The regulatory network of TF-DEG and predicted targets is presented in Fig. 4B. Notably, EZH2 targeted 45 TFs, including SOX2; TPM1 targeted 40 TFs, including MYC; AHNAK targeted 58 TFs, including KLF4, TXNIP targeted 51 TFs, including TP63 and top 20 are listed in Table 6. As a group, a total of 259 of the 463 DEGs were contained in the TF-DEG regulatory network.
Hub genes validation
All of the hub genes were validated in TCGA data. Hub genes contributed to the survival period in patients with PDAC, we analyzed the overall survival (OS) for each hub gene by UALCAN (Fig. 5). The results showed that the high expression of CCNB1 and FHL2 mRNA level were associated with the worse OS in patients with PDAC, while low expression of HLA-DPA1 and TUBB1 mRNA level were associated with the worse OS in patients with PDAC. As shown in Fig. 6, the expression of the up regulated hub genes CCNB1 and FHL2 in PDAC were significantly elevated compared with normal, while expression of the down regulated hub genes HLA-DPA1 and TUBB1 in PDAC were significantly decreased compared with normal. The expression of each hub gene in PDAC patients was analyzed according to the individual cancer stage. As shown in Fig. 7, the expression of CCNB1 and FHL2 were higher in patients with all individual cancer stages than that in normal, which revealed that these up regulated hub genes might be associated with tumor progression positively, where as the expression of HLA-DPA1 and TUBB1 were lower in patients with all individual cancer stages than that in normal, which revealed that these down regulated hub genes might be associated with tumor progression positively. We used cBioportal tool to explore the specific mutation of hub genes in PDAC dataset with 184 samples. From the OncoPrint, percentages of alterations in CCNB1, FHL2, HLA-DPA1 and TUBB1 genes among lung cancer ranged from 0% to 2.3% in individual genes (CCNB1, 0%; FHL2, 0.6%; HLA-DPA1, 2.3%; TUBB1, 2.3%) and is shown in Fig. 8. In addition, we used the ‘HPA’ to examine the protein expression levels of CCNB1 and FHL2, and observed that the protein expression levels of the these hub genes were noticeably up regulated in PDAC compared with normal tissues, whereas protein expression levels of HLA-DPA1 and TUBB1, and observed that the protein expression levels of the these hub genes were noticeably down regulated in PDAC compared with normal tissues (Fig. 9). The association of CCNB1, FHL2, HLA-DPA1 and TUBB1 expression level with immune infiltration abundance in PDAC was evaluated using TIMER database. CCNB1 and FHL2 expression were negatively correlated with infiltration degree of B cells, CD8+ T cells, macrophage, neutrophil, and dendritic cells, where as HLA-DPA1 and TUBB1 were positively correlated with infiltration degree of B cells, CD8+ T cells, macrophage, neutrophil, and dendritic cells and is shown in Fig. 10. As these 4 genes are prominently expressed in PDAC, we performed a ROC curve analysis to evaluate their sensitivity and specificity for the diagnosis of PDAC. As shown in Fig. 11, CCNB1, FHL2, HLA-DPA1 and TUBB1 achieved an AUC value of >0.70, demonstrating that these genes have high sensitivity and specificity for PDAC diagnosis. The results suggested that CCNB1, FHL2, HLA-DPA1 and TUBB1 can be used as biomarkers for the diagnosis of PDAC.
RT-PCR analysis
Next, in order to verify the results of previous bioinformatics analysis, the gene expression levels of CCNB1, FHL2, HLA-DPA1 and TUBB1 were detected by RT-PCR. As shown in Fig 12, CCNB1 and FHL2 mRNA expression levels were significantly up regulated in the PDAC compared to normal, and HLA-DPA1 and TUBB1 mRNA level were down regulated compared to normal, which was consistent with the results of bioinformatics analysis.
Molecular docking experiment
The docking simulation was performed in the present study to recognize the active site conformation andsignificant interactions, which are responsible for complex stability with ligand receptor. Novel molecules containing alkylating group and purine heterocyclic ring were designed and performed docking studies using Sybyl X 2.1 drug design software. Molecules containing alkylating group is designed due to non-specific alkylation of physiologically important groupings and purine heterocyclic ring is incorporated due to structural similarity of purine derivatives and to compete for the synthesis of proteins. The proteins which are over expressed in pancreatic duct adenocarcinoma are selected for docking studies. The two proteins of each over expressedcyclin B1 (CCNB1)its co-crystallised protein of PDB code 4Y72, 5H0V and Four and half LIM domains 2 (FHL2) of NMR structure of proteins 2D8Z and 2EHE were selected for docking. The investigation of designed molecules was performed to identify the potential molecule. The most of the designed molecules obtained C-score greater than 5 and are active having the c-score greater than 5 are said to be an active, among total of 48 designed molecules few molecules have excellent good binding energy (C-score) greater than 8 respectively. Few of the designed molecules IM 11 & PU 42, shown good binding score of 7.83& 8.57 and the molecules IM 13, TZ 23, TZ 27, TZ 37, PU 41, PU 43 & PU 49 have good binding score 8.013, 8.523, 8.235, 8.800, 10.338, 10.891& 9.411 with CCNB1 PDB code 1H0V and 4Y72 respectively and are shown in Fig. 13. Molecules of IM 09, IM 10, IM & 18 shown good binding score of 7.14, 7.75 & 7.80 and the molecules IM 8 and TZ 24 with binding score 6.24 and 6.32 with FHL2 of PDB code 2D8Z and 2EQQ respectively, the values are depicted in Table 7. The molecule IM 8has highest binding score its interaction with protein 1H0V and hydrogen bonding and other bonding interactions with amino acids are depicted by 3D and 2D shown in Fig. 14 and Fig. 15.
Discussion
Due to the high heterogeneity of PDAC, PDAC was still a disease with high rates of pervasiveness and fatality. With surgery as the main, the other treatments including radiotherapy, chemotherapy, targeted therapy, and gene therapy as a additive to the finite treatment measures of PDAC, the 5-year survival rate was still less than 8% [35]. Therefore, the early diagnosis and effective treatment of PDAC is crucially required, which may be achieved via the identification of the DEGs between PDAC and normal control, and by considerate the underlying molecular mechanism. Microarray and high throughput sequencing analysis can screen a massive number of genes in the human genome for farther functional analysis, and can be extensively used to screen biomarkers for early diagnosis and unique therapeutic targets. Therefore, they may help the diagnosis of PDAC in the early stages and the advancement of targeted treatment, thus developing prognosis.
The current investigation systematically applied integrated bioinformatics methods to identify novel biomarkers that serve roles in the advancement PDAC. The data extracted from the GEO dataset contained 31 pairs of lung cancer and normal samples. A total of 232 up regulated and 231 down regulated genes in PDAC, when compared with normal control samples, were identified using bioinformatics analysis, indicating that the incidence and advancement of PDAC. The results of the DEGs may provide potential biomarkers for the diagnosis of PDAC. DAP (death associated protein) [36], KRT8 [37], IGFBP2 [38], KRT19 [39], CD44 [40], AHNAK (AHNAK nucleoprotein) [41] and BTG1 [42] are a noticeable factors in the PDAC progression. Wang et al [43] reported that KIF2C induces proliferation, migration, and invasion in gastric cancer patients through the MAPK signaling pathway, but this gene might be associated with development of PDAC. DBN1 expression was significantly increased in breast cancer [44], but this gene might be liable for development of PDAC. MAP1B was reported to lung cancer progression [45], but this gene may be key for advancement of PDAC. BNIP3L down regulation was required to develop breast and ovarian cancer [46], but this down regulation of gene might be involved in progression of PDAC. Yen et al [47] reported that ITGA4 was expressed in oral cancer, but this gene might be novel biomarker for PDAC. Tomsic et al. [48] showed that mutation in SRRM2 was associated with progression of thyroid carcinoma, but alteration in this gene might be key factor for advancement of PDAC. Recent studies have shown that down regulation of IL7R is associated with progression of esophageal squamous cell carcinoma [49], but this gene might be involved in pathogenesis of PDAC. Lee et al [50] found that reduced expression of the HLA-DRA is key factor for development of colorectal cancer, but this gene might be linked with advancement of PDAC. Liu et al. [51] reported that the absence of SESN3 linked with development of hepatocellular carcinoma, but this gene might be associated with progression of PDAC.
Then, GO and REACTOME pathway analyses were used to investigate the interactions of these DEGs. Increasing evidence shows that LAPTM4B [52], CEACAM6 [53], SERPINE2 [54] and VNN1 [55], SPHK1 [56], HRG (histidine rich glycoprotein) [57], VEGFC (vascular endothelial growth factor C) [58], ANXA3 [59], APOA2 [60], LCN2 [61], TIMP1 [62], CD63 [63], CD151 [64], MAL2 [65], ARNTL2 [66], PKD2 [67], E2F1 [68], MMP1 [69], CCR7 [70], NOTCH2 [71], BTLA (B and T lymphocyte associated) [72], TFRC (transferrin receptor) [73], CD4 [74], ATM (ATM serine/threonine kinase) [75], LEF1 [76], CSF1R [77], CTSB (cathepsin B) [78], DUSP2 [79] and NR4A1 [80] are closely associated with progression of PDAC. PTGER3 [81] and MAGI2 [82] are linked with angiogenesis, chemoresistance, cell proliferation and migration in ovary cancer, but these genes might be liable for growth PDAC. Hoagland et al [83] demonstrated that HP (haptoglobin) expression was responsible for progression of lung cancer, but this gene might be involved in PDAC progression. FGA (fibrinogen alpha chain) was demonstrated to be a lung cancer susceptibility gene through activation of integrin–AKT signaling pathway [84], but this gene might be liable for progression of PDAC. Repetto et al [85] investigated the importance of FGB (fibrinogen beta chain) in the pathogenesis of gastric carcinoma, but this gene might be responsible for progression of PDAC. PLA2G4A [86], FGG (fibrinogen gamma chain) [87] and TYMS (thymidylatesynthetase) [88] have been demonstrated to be up regulated in cancer, but these genes might be liable for progression of PDAC. RAB32 [89], SEPTIN4 [90], TPM2 [91], ACOT7 [92], PRTFDC1 [93], CABLES1 [94], HLA-DMB [95], PTPRC (protein tyrosine phosphatase receptor type C) [96], CD5 [97], CD6 [97], MS4A1 [98], CD22 [99], CD27 [100], MRC2 [101], CLEC2D [102], EEF1A1 [103] and APOB (apolipoprotein B) [104] played a predominant role in the cancer progression, but these genes might be associated with development of PDAC. Jung et al [105] found that SMPD1 stimulates the drug resistance in colorectal cancer, but this gene might be linked with development of PDAC. Liu et al. [106], Yang et al [107], Song et al [108], Seachrist et al [109], Zhu et al [110], Wu et al [111], Wang et al [112], Yi et al [113], Lan et al [114] and Appert-Collin et al [115] revealed that PADI4, MAOB (monoamine oxidase B), TRPC6, BCL11A, CXCR5, TCF7, POU2F2, SLC4A1, STK17B and LRP1 were associated with cancer cell invasion, but these genes might be liable for progression of PDAC. Kairouz et al [116], Diez-Bello et al [117], Xue et al [118], Abo-Elfadl et al [119], Li et al [120] and Zhao et al [121] reported that GRB14, TRPC6, ZFPM2, TNFRSF13B, ADAM19 and PIK3IP1enhance the cancer cell proliferation, but this gene might be involved in advancement of PDAC. Leite et al [122], Feng et al [123], Wang et al [124], Zhong et al [125], Yokoyama-Mashima et al [126], Guo et al [127], Lawson et al [128] and Wang et al [129] demonstrated that low levels of HLA-DPA1, FGL2, CBLB (Cbl proto-oncogene B), NCKAP1L, DYRK2, OGT (O-linked N-acetylglucosamine (GlcNAc) transferase), CAMK1D and RNF213 are linked with progression of cancer, but these genes might be essential for progression of PDAC. Polymorphic genes such as RORA (RAR related orphan receptor A) [130], IGF2R [131] and ZBTB20 [132] are contribute to progression of cancer, but these genes might be crucial for advancement of PDAC. Hope et al. [133] identified that the VCAN (versican) was central role in immune cell infiltration in cancer, but this gene may be associated with immune cell infiltration in PDAC.
Construction of PPI network and modules of DEGs may be helpful for understanding the relationship of developmental PDAC. Bai et al [134] reported that CCNB1 plays a positive role in proliferation of cancer cells, but this gene might be involved in development of PDAC. FHL2 [135] and RPL10 [136] are associated with progression of PDAC. Further investigation is required in order to clarify the underlying biological mechanisms of novel biomarkers HLA-DPA1, TUBB1, RPL13A, RPL27A and RPL23A on PDAC.
The miRNA-DEG regulatory network and TF-DEG regulatory network were constructed to explore the molecular mechanism of PDAC. The EZH2 [137], KMT2D [138], TXNIP (thioredoxin interacting protein) [139], TP63 [140], SOX2 [141], MYC [142] and KLF4 [143] genes are associated with PDAC. TPM1 [144] and hsa-mir-1202 [145] have been associated with cancer risk, but these genes and miRNAs might be responsible for progression of PDAC. Hsa-mir-4461, hsa-mir-3928-3p and hsa-mir-2682-5p might be considered as novel biomarkers for progression of PDAC.
In conclusion, we aim to identify DEGs by bioinformatics analysis to find the potential biomarkers which may be involved in the advancement of PDAC. The investigation contributes a set of useful DEGs for future studies into molecular mechanisms and biomarkers of PDAC. And the application of data mining and integration is accessible for prediction of PDAC advancement. Nevertheless, further molecular biological analyses are recommended to certify the function of the DEGs in PDAC.
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
No informed consent because this study does not contain human or animals participants.
Availability of data and materials
The datasets supporting the conclusions of this article are available in the GEO (Gene Expression Omnibus) (https://www.ncbi.nlm.nih.gov/geo/) repository. [(GSE133684) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE133684]
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Data Availability
The datasets supporting the conclusions of this article are available in the GEO (Gene Expression Omnibus) (https://www.ncbi.nlm.nih.gov/geo/) repository. [(GSE133684) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE133684]
Author Contributions
Basavaraj Vastrad-Writing original draft, and review and editing
Anandkumar Tengli - Writing original draft and investigation
Chanabasayya Vastrad - Software and investigation
Acknowledgement
I thank Shenglin Huang, Fudan University, Shanghai, China, very much, the author who deposited their microarray dataset, GSE133684, into the public GEO database.
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.↵
- 95.↵
- 96.↵
- 97.↵
- 98.↵
- 99.↵
- 100.↵
- 101.↵
- 102.↵
- 103.↵
- 104.↵
- 105.↵
- 106.↵
- 107.↵
- 108.↵
- 109.↵
- 110.↵
- 111.↵
- 112.↵
- 113.↵
- 114.↵
- 115.↵
- 116.↵
- 117.↵
- 118.↵
- 119.↵
- 120.↵
- 121.↵
- 122.↵
- 123.↵
- 124.↵
- 125.↵
- 126.↵
- 127.↵
- 128.↵
- 129.↵
- 130.↵
- 131.↵
- 132.↵
- 133.↵
- 134.↵
- 135.↵
- 136.↵
- 137.↵
- 138.↵
- 139.↵
- 140.↵
- 141.↵
- 142.↵
- 143.↵
- 144.↵
- 145.↵