Abstract
Objective To prioritize genes that were pleiotropically or potentially causally associated with periodontitis.
Methods We applied the summary data-based Mendelian randomization (SMR) method integrating genome-wide association study (GWAS) for periodontitis and expression quantitative trait loci (eQTL) data to identify genes that were pleiotropically associated with periodontitis. We performed separate SMR analysis using CAGE eQTL data and GTEx eQTL data. SMR analysis were done for participants of European and East Asian ancestries, separately.
Results We identified multiple genes showing pleiotropic association with periodontitis in participants of European ancestry and participants of East Asian ancestry. PDCD2 (corresponding probe: ILMN_1758915) was the top hit showing pleotropic association with periodontitis in participants of European ancestry, and BX093763 (corresponding probe: ILMN_1899903) and AC104135.3 (corresponding probe: ENSG00000204792.2) were the top hits in participants of East Asian ancestry using CAGE eQTL data and GTEx eQTL data, respectively.
Conclusion We identified multiple genes that may be involved in the pathogenesis of periodontitis in participants of European ancestry and participants of East Asian ancestry. Our findings provided important leads to a better understanding of the mechanisms underlying periodontitis and revealed potential therapeutic targets for the effective treatment of periodontitis.
Introduction
Periodontitis is a common disease characterized by an inflammatory response to commensal and pathogenic oral bacteria1. The primary clinical features of periodontitis include periodontal pocketing, alveolar bone loss (BL), clinical attachment loss (CAL), and gingival inflammation2. Based on the 2009lJ2014 National Health and Nutrition Examination Surveys data, it was estimated that periodontitis affected about 42% of US adults aged 30 to 79 years3. Periodontitis is considered as the main cause of tooth loss in adults. Moreover, it is also associated with various systemic conditions such as coronary heart disease4, diabetes5 and pre-term birth6. Periodontitis not only affects a patient’s life, it also brings tremendous economic burden to the society, with an estimated global productivity loss due to untreated severe periodontitis being around $38.85 billion in 20157.
Periodontitis is a complex, multi-factorial infectious disease with possible contributions from multiple factors, including immunological response8, oral bacterial infections9, lifestyle factors such as smoking10 and alcohol consumption11, psychological factors such as stress12 and depression13, and systematic diseases such as diabetes14. Previous studies also suggested that genetics plays an important role in the pathogenesis of periodontitis. For example, genetically identical monozygotic twins have more than a twofold increased risk of early onset periodontitis, compared with dizygotic twins15. Another population-based twin study estimated that the heritability of periodontitis was approximately 50%16. Moreover, many GWAS and candidate gene studies have identified a number of genetic loci associated with the susceptibility of periodontitis17-22. However, the biological mechanisms of these findings remain largely unclear, and more studies are needed to explore genes that are potentially causally associated with periodontitis to better understand the pathogenesis of periodontitis.
Mendelian randomization (MR) uses genetic variants as the proxy to randomization and is a promising tool to search for pleotropic/potentially causal effect of an exposure (e.g., gene expression) on the outcome (e.g., periodontitis) without the need of conducting conventional randomized clinical trials (RCTs)23. Confounding and reverse causation, which are commonly encountered in traditional association studies, can be greatly reduced by using MR. MR has been successful in identifying gene expression probes or DNA methylation loci that are pleiotropically/potentially causally associated with various phenotypes, such as neuropathologies of Alzheimer’s disease and severity of COVID-1924,25.
In this study, we applied the summary data-based MR (SMR) method integrating summarized GWAS data for periodontitis and cis-eQTL (expression quantitative trait loci) data to prioritize genes that are pleiotropically/potentially causally associated with periodontitis.
Methods
Data sources
eQTL data
In the SMR analysis, cis-eQTL genetic variants were used as the instrumental variables (IVs) for gene expression. We performed SMR analysis using gene expression data in blood due to the unavailability of eQTL data in the eye. Specifically, we used the CAGE eQTL summarized data26, which included 2,765 participants, and the V7 release of the GTEx eQTL summarized data27, which included 338 participants. The eQTL data can be downloaded at https://cnsgenomics.com/data/SMR/#eQTLsummarydata.
GWAS data for periodontitis
The GWAS summarized data were provided by a recent genome-wide association meta-analysis of periodontitis28. The results were based on meta-analyses of 1000 genomes phase 1 version 2/3 imputed GWASs on periodontitis, with a total of nine cohorts from the Gene-Lifestyle Interactions in Dental Endpoints (GLIDE) consortium29. Specifically, the meta-analysis for participants of European ancestry included seven cohorts with a total sample size of 45,563 (17,353 cases and 28,210 controls), and the meta-analysis for participants of East Asian ancestry included two cohorts with a total sample size of 17,350 (1,680 cases and 15,670 controls). All participating studies assumed an additive genetic model, adjusting for age, age-squared and other study-specific covariates. The GWAS summarized data can be downloaded at https://data.bris.ac.uk/data/dataset/2j2rqgzedxlq02oqbb4vmycnc2.
SMR analysis
We conducted the SMR analysis with cis-eQTL as the IV, gene expression as the exposure, and periodontitis as the outcome. The analysis was done using the method as implemented in the software SMR. Detailed information regarding the SMR method was reported in a previous publication30. In brief, SMR applies the principles of MR to jointly analyze GWAS and eQTL summary statistics in order to test for pleotropic association between gene expression and a trait due to a shared and potentially causal variant at a locus. We also conducted the heterogeneity in dependent instruments (HEIDI) test to evaluate the existence of linkage in the observed association. A PHEIDI of less than 0.05 indicates that the observed association could be due to two distinct genetic variants in high linkage disequilibrium with each other. We adopted the default settings in SMR (e.g., minor allele frequency [MAF] > 0.01, removing SNPs in very strong linkage disequilibrium [LD, r2 > 0.9] with the top associated eQTL, and removing SNPs in low LD or not in LD [r2 <0.05] with the top associated eQTL) except relaxing the threshold of eQTL P-value (PeQTL <10−4) due to the exploratory nature of this study, and used false discovery rate (FDR) to adjust for multiple testing. We performed SMR analysis for participants of European and East Asian ancestries, separately, using CAGE and GTEx eQTL data, respectively, comprising a total of four SMR analyses.
We used Affymetrix exon array S1.0 platforms to annotate the transcripts. We conducted functional enrichment analysis using the functional annotation tool “Metascape” for the top tagged genes to functionally annotate putative transcripts31. Gene symbols corresponding to the ten top hit genes were used as the input of the gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis.
Data cleaning and statistical/bioinformatical analysis was performed using R version 4.0.3 (https://www.r-project.org/), PLINK 1.9 (https://www.cog-genomics.org/plink/1.9/) and SMR (https://cnsgenomics.com/software/smr/).
Results
Basic information of the summarized data
The number of participants of the CAGE eQTL data is much larger than that of the GTEx eQTL data, so is the number of eligible probes. The sample size of the GWAS data for the European ancestry is much large than that for the East Asian ancestry, so is the number of eligible genetic variants. The detailed information was shown in Table 1.
SMR analysis in participants of European ancestry
In participants of European ancestry, we identified two genes showing pleiotropic association with periodontitis after correction for multiple testing using FDR (Table 2). Specifically, using the CAGE eQTL data, our SMR analysis identified two genes that were pleiotropically/potentially causally associated with periodontitis, including PDCD2 (ILMN_1758915; PSMR=3.77×10−5; Figure 1) and D4S234E (i.e., NSG1, ILMN_1772627; PSMR=9.08×10−4; Figure 2). GO enrichment analysis of biological process and molecular function showed that the significant genes were involved in two GO terms, including positive regulation of cysteine-type endopeptidase activity (GO:2001056) and positive regulation of defense response (GO:0031349; Figure S1A). Concept network analysis of the ten top hit genes also revealed multiple domains related with endopeptidase activity (Figure S1B). More information could be found in Table S1.
Using the GTEx eQTL data, we did not identify any genes that were pleiotropically/potentially causally associated with periodontitis, after correction for multiple testing using FDR (Table 2). However, we found that two genes, NSG1 (CAGE eQTL: ILMN_1772627, PSMR=9.08×10−4; GETx eQTL: ENSG00000168824.10, PSMR=1.35×10−3) and S100A12 (CAGE eQTL: ILMN_1748915, PSMR=1.58×10−3; GETxeQTL: ENSG00000163221.7, PSMR=1.90×10−3) were among the top hits in both SMR analyses. GO enrichment analysis of biological process and molecular function showed that the ten top hit genes were involved in two MAP kinase-related GO terms (GO:0043405 and GO:0043406; Figure S1C). Concept network analysis of the genes revealed multiple domains related with MAP kinase activity and inflammation (Figure S1D). More information could be found in Table S2.
SMR analysis in participants of East Asian ancestry
In participants of East Asian ancestry, we identified two genes showing significant pleiotropic association with periodontitis after correction for multiple testing using FDR (Table 3). Specifically, using the CAGE eQTL data, our SMR analysis identified one gene, BX093763 (ILMN_1899903, PSMR=2.33×10−4). GO enrichment analysis of biological process and molecular function showed that the ten top hit genes were involved in one GO terms, axon development (GO:0061564; Figure S2A). Concept network analysis of the genes revealed multiple domains related with inflammation (Figure S2B). More information could be found in Table S3. Using the GTEx eQTL data, our SMR analysis identified one gene, AC104135.3, that was pleiotropically/potentially causally associated with periodontitis, after correction for multiple testing using FDR (ENSG00000204792.2, PSMR=7.46×10−4; Figure 3). GO enrichment analysis of biological process and molecular function did not find any significant GO terms. Concept network analysis of the genes revealed multiple domains related with endogenous peptide antigen (Figure S2C). More information could be found in Table S4.
We found that two genes were among the top hits in both SMR analyses, including PSD4 (CAGE eQTL: ILMN_2154115, PSMR=1.79×10−3; GTEx: ENSG00000125637.11, PSMR=1.84×10−3) and GFRA2 (CAGE eQTL: ILMN_1656300, PSMR=3.55×10−3; GTEx: ENSG00000168546.6, PSMR=7.07×10−3). More information could be found in Table S3-4.
Discussion
In the present study, we explored putative genes that showed pleiotropic/potentially causal association with periodontitis by integrating GWAS and eQTL data in the SMR analysis. We identified multiple genes, some of which represented novel genes, that might be involved in the pathogenesis of periodontitis in participants of European ancestry and participants of East Asian ancestry. Our findings provided helpful leads to a better understanding of the mechanisms underlying periodontitis and suggested potential therapeutic targets for the treatment of periodontitis.
A recent study investigated molecular biomarker candidates and biological pathways of chronic periodontitis using pooled datasets in the Gene Expression Omnibus (GEO) database, and identified 123 common differently expressed genes (DEGs), including 81 upregulated genes and 42 downregulated genes32. Several of the identified genes were also among the top hits in our SMR analysis. For example, the gene NSG1 (Neuronal Vesicle Trafficking Associated 1) was found to be downregulated in persons with chronic periodontitis. It also showed significant pleiotropic association with periodontitis in our study of participants of European ancestry (Table 2). NSG1, also known as D4S234E or NEEP21, is located on 4p16.3 in human and is a member of the neuron-specific gene family. It is the most important in regulating receptor recycling and synaptic transmission33. p53, an important tumor suppressor gene, binds to the promoter region of NSG1 and regulates its expression in response to DNA damage. Inhibition of NSG1 expression suppressed apoptosis34. The exact role of NSG1 in the pathogenesis of periodontitis is unclear and warrants further research.
Another research integrating GWAS and eQTL data identified 10 genes whose expression might influence periodontitis35. Of them, the gene S100A12 (S calcium-binding protein A12) also appeared among the top hits in participants of European ancestry in the SMR analysis using CAGE and GTEx eQTL data (Table 2). S100A12, located on 1q21.3, is a member of the S100 family of EF-hand calcium-binding proteins36. Previous studies indicated that it played a prominent role in the regulation of inflammatory processes and immune response37. It was reported that the levels of S100A12 were higher in participants with high periodontal inflammatory burden and were associated with the percentage of bleeding on probing38. In gingival crevicular fluid and serum, the levels of S100A12 increased with the inflammation of periodontium39. These findings, together with ours, demonstrated the important role of S100A12 in influencing periodontitis and highlighted the potential of this gene as a promising target for the prevention and treatment of periodontitis.
Our study was different from the previous study integrating GWAS and eQTL data35. We used the GWAS summarized data for both European and East Asian ancestries, while the previous research only analyzed GWAS data of European ancestry. Similarly, we used both CAGE and GTEx eQTL data, while the previous research used only GTEx data. Moreover, we undertook a SMR analytic framework which focused on exploring genes showing pleiotropic association/potentially causal association with periodontitis while the previous research adopted a Sherlock approach which is a Bayesian statistical framework aiming to identify genes whose expression was associated with periodontitis susceptibility40.
Our study was also very different from another MR research on periodontitis29. Although both studies aimed to explore potentially causal factors for periodontitis, the previous research focused examining the causal role of total adiposity in the pathogenesis of periodontitis, while our study attempted to identify genes that were pleiotropically/potentially causally associated with periodontitis. The analytic approaches were also different: the previous research used genetic risk scores as the instrumental variables calculated from three genes (FTO, MC4R and TMEM18) by summing the number of BMI increasing alleles, while in our study, we used all the genetic variants from GWAS summarized data as the potential instrumental variables.
Our study has some limitations. The number of probes used in our SMR analysis was limited, especially in the SMR analysis of participants of East Asian ancestry. As a result, we may have missed some genes which played important roles in the pathogenesis of periodontitis. The HEIDI test was significant for some of the identified genes (Table 2-3). Therefore, we could not rule out the possibility of horizontal pleiotropy, i.e., the identified association might be due to two distinct genetic variants in high linkage disequilibrium with each other. In addition, we only performed SMR analysis for participants of European and East Asian ancestries, and our findings might not be generalized to other populations. More studies are needed to validate our findings in independent populations. Due to the exploratory nature of study, we adopted correction for multiple testing to reduce false positive rate; however, we may have missed important SNPs or genes. Finally, we could not quantify the changes in gene expression in subjects with periodontitis in comparison with the control due to the unavailability of individual eQTL data.
Conclusions
In conclusion, our SMR analysis revealed multiple genes that were potentially pleiotropically associated with periodontitis. More studies are needed to explore the underlying physiological mechanisms in the etiology of periodontitis.
Data Availability
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Author contribution
DL, JY and GZ designed the study. FW, DL, BF and WL analyzed data and performed data interpretation. FW, DL, YZ and JY wrote the initial draft and BF, WL, and GZ contributed to writing subsequent versions of the manuscript. All authors reviewed the study findings and read and approved the final version before submission.
Availability of data and materials
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed by the authors.
Acknowledgements
The study was supported by NIH/NIA grants P30AG10161, R01AG15819, R01AG17917, R01AG36042, U01AG61356 and 1RF1AG064312-01. Di Liu was supported by China Scholarship Council (CSC 201908110339). The authors confirmed that all authors have reviewed the contents of the article being submitted, approved its contents, and validated the accuracy of the data.