Differential Causal Effects of Type 1 and Type 2 Diabetes on Osteomyelitis Risk: Insights from Mendelian Randomization Analysis =============================================================================================================================== * YanHui Li * Chuanyang Zhou * Liming Yang * Lei Tan ## Abstract **Background** Osteomyelitis (OM) poses a significant clinical challenge, especially among individuals with diabetes mellitus (DM). While both type 1 diabetes mellitus (T1DM) and type 2 diabetes mellitus (T2DM) have been linked to an elevated risk of OM, the precise causal relationships remain uncertain. **Methods** We conducted Mendelian randomization (MR) analyses using summary statistics from genome-wide association studies (GWAS) to explore the causal effects of T1DM, T2DM, their complications, and glycemic traits on OM risk. The study utilized the inverse variance weighted (IVW) method, along with weighted median and MR-Egger for causal estimation, and performed various sensitivity analyses to ensure robustness. Multivariable MR (MVMR) analysis assessed direct effects, while two-step mediation MR analyses investigated the mediating role of DM between rheumatoid arthritis (RA) and OM. **Results** The MR analysis unveiled distinct causal effects of T1DM and T2DM on OM risk. Genetically determined T2DM, rather than its complications, significantly increased OM risk (primary dataset: IVW: OR = 1.13, 95% CI 1.056–1.209, *p* = 4E-04; validation dataset: IVW: OR = 1.317, 95% CI 1.14–1.522, *p* =2E-04; Meta-analysis: OR=1.206; 95% CI 1.037–1.402; *p*=0.014), with no observable heterogeneity or horizontal pleiotropy. MVMR analysis confirmed the robustness of the causal association between T2DM and OM, even after adjusting for potential confounders such as body mass index. Conversely, T1DM and its complications showed no significant causal link with OM in either the primary dataset (IVW: *p* = 0.071), the validation dataset (IVW: *p* = 0.276), or the meta-analysis (IVW: *p* = 0.242). Additionally, there was no robust evidence supporting the causal risk of glycemic traits on OM. Mediation MR analysis underscored T2DM as a pivotal contributor to the differential effects of RA on OM. **Conclusions** Mendelian randomization analysis provides compelling evidence of a significant causal relationship between genetically determined T2DM and increased OM risk, while T1DM exhibits distinct causal effects. Additionally, our findings highlight the role of T2DM in mediating the association between RA and OM. Further research is warranted to elucidate the underlying mechanisms and guide targeted interventions for OM prevention and management in diabetic populations. ## Introduction Osteomyelitis (OM), a bone infection, can occur via contiguous spread from surrounding tissues, direct bone trauma, or hematogenous dissemination. It poses a considerable healthcare burden, with a prevalence of 22 cases per 100,000 person-years in the United States, rising over time, especially among the elderly and those with diabetes mellitus (DM)1. Challenges in treatment include pathogen identification, bone destruction and repair complexities, and disease recurrence, resulting in prolonged treatment and poorer prognoses2. Diabetes mellitus, affecting nearly 500 million individuals worldwide and projected to increase by 51% by 2045, poses a significant global health challenge3. Type 1 diabetes mellitus (T1DM) and type 2 diabetes mellitus (T2DM) are the primary forms, each with distinct pathophysiological mechanisms. While T1DM stems from autoimmune destruction of pancreatic beta cells, T2DM involves insulin resistance and impaired insulin secretion4. Although observational studies have linked DM to a higher risk of OM5,6, uncertainties persist regarding causal relationships and potential mediators due to confounding factors and biases in existing research. Moreover, limited studies have explored the differing impacts of diabetes subtypes on OM risk. Clinical studies have established a link between Rheumatoid Arthritis (RA) and heightened susceptibility to OM, attributed to chronic inflammation7,8. RA is also associated with diabetes due to a vicious circle perpetuated by glucose derangement and inflammatory mediators.9,10. Given the shared risk profile of RA with both diabetes and OM, we employed mediation MR analysis to investigate the mediating role of DM between RA and OM. Randomized controlled trials (RCTs) are ideal for understanding causality, but their implementation is impractical due to ethical and complex relations between OM and diabetes. MR offers an alternative using genetic variation as a proxy for exposure, mitigating bias11. This method mirrors RCTs, validating causal relationships while reducing biases. Moreover, MR elucidates independent causal pathways and potential mediators linking DM and OM. In this study, we utilized univariable MR (UVMR) analyses to evaluate the impact of T1DM, T2DM, and their complications on OM risk, as well as to explore the association between glycemic traits and OM. Multivariable MR (MVMR) analysis was conducted to assess direct effects by adjusting for potential confounders. Additionally, two-step mediation MR analyses were employed to explore DM’s mediating role between RA, and OM. ## Methods ### 1. Study design and data sources In this study, we employed UVMR,UVMR and two-step mediation MR analyses to examine the causal effects of T1DM, T2DM, their complications and glycemic traits on OM risk, and to explore DM’s mediating role between RA and OM. Throughout the study, we rigorously adhered to three core assumptions ensured the validity of our results: (1) establishing a reliable association between genetic variants and the risk factor; (2) confirming no association between genetic variants and confounders; and (3) ensuring genetic variants solely influence the outcome through the risk factors. A comprehensive study design flowchart is provided in Figure 1. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/10/2024.04.08.24305482/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/04/10/2024.04.08.24305482/F1) Figure 1. Flowchart of the study design. GWAS, genome-wide association studies; T1DM, type 1 diabetes mellitus; T2DM, type 2 diabetes mellitus; RA, rheumatoid arthritis; OM, osteomyelitis; UVMR, Univariable MR; MVMR, multivariable mendelian randomization; BMI, body mass index; UKBB, United Kingdom Biobank; IVs, instrument variables. Our study utilized the largest and most recent publicly available summary statistics from multiple genome-wide association studies (GWAS) sources, including the FinnGen database, the UK Biobank (UKBB) database, and other large consortia. To prevent overlap, we carefully selected exposure factors, designating T1DM and T2DM data from FinnGen, and OM data from the UKBB as the primary dataset. DM data from non-FinnGen sources and additional data from FinnGen were designated as the validation dataset. To ensure robustness, we conducted a meta-analysis of results from these datasets. Adherence to the STROBE-MR (Strengthening the Reporting of Mendelian Randomization Studies) guidelines was maintained throughout the study12. Detailed information on the GWAS data used in this study is provided in Supplementary Tables 1. ### 2. Genetic association datasets #### GWAS data for T1DM and T1DM with complications Type 1 Diabetes (T1DM) genetic instruments were selected from two GWAS Studies. THE Non-UKBB One is based on 5,928 cases and 183,185 controls of European ancestry from the FinnGen database as the primary source. Validation (Non-FinnGen) was conducted meta-analysis using data from Onengut-Gumuscu et al.’s study, which included 6,683 T1DM cases from the UK Genetic Resource Investigating Diabetes cohort and control samples (N=12,173) from four additional cohorts, all reporting European ancestry13. T1DM with complications dataset obtained from FinnGen database14.Detailed information on single nucleotide polymorphisms (SNPs) used in this study is presented in Supplementary Tables S4 and S6. #### GWAS data for T2DM and T2DM with complications The genetic instruments for T2DM were derived from FinnGen database comprising 32,469 cases and 183,185 controls of European ancestry as the primary source (Non-UKBB). For validation (Non-FinnGen), we utilized data from the GWAS meta-analysis study by Xue A et al., which included a sample of 61,714 cases and 593,952 controls with T2DM. This study combined three GWAS datasets of European ancestry, including UKBB, representing the vast majority (99.4%) of individuals of European ancestry15. T2DM with complications dataset obtained from FinnGen database14. Detailed information regarding the SNPs used can be found in Supplementary Tables S6 and S10. #### GWAS data for Hyperglycemia traits SNPs were chosen from a GWAS meta-analysis by Chen et al., involving 151,013 European individuals for fasting insulin, up to 200,622 participants for fasting glucose 16. The GWAS summary statistics for HbA1c levels were obtained from Within family GWAS consortium, which included 45,734 participants of European ancestry. All participants had European ancestry, and there was no overlap with the outcome data. Detailed SNP information is available in Supplementary Tables S12, 14 and 16. #### GWAS data for Body mass index (BMI) The GWAS summary statistics for MVMR (BMI) were sourced from Within family GWAS consortium, involving 99,998 European participants. There was no overlap between this dataset and other exposures and outcomes considered in our study. #### GWAS data for RA The GWAS summary statistics for RA were obtained from Eyre S et al., which included 13,838 cases and 33,742 controls of European ancestry17. There was no overlap between this dataset and all exposures and outcomes considered in our study. Supplementary Table 18 provides details of the traits involved in this analysis. #### GWAS data for Outcomes The study outcome is OM, defined as inflammation of bone and its structures due to pyogenic bacterial infection. We analyzed the associations between selected instruments and OM using summary GWAS data from the UK Biobank (4,836 cases, 481,648 controls) as the primary source. Validation was performed using FinnGen data (842 cases, 209,575 controls), all of European ancestry. ### 3. Genetic Instrument Selection For genetic instrument selection, we established a genome-wide significance threshold. SNPs with a P-value less than 5×10−8 were considered significant for T1DM, T2DM, glycemic traits, BMI and RA. Since only few SNPs were identified for part of complications of DM when they were as the exposure, a higher cutoff (p□<□1e-6) was chosen. Variants meeting these criteria were then clumped for linkage disequilibrium (LD) using a distance window of 10,000 kB and an r2 < 0.01. To avoid the risk of weak instrumental bias, the F statistic was performed to evaluate the strength of the IV. When F□>□10, the association between the IV and exposures was deemed to be sufficiently robust, thereby safeguarding the results of the MR analysis against potential weak instrumental bias. The PhenoScanner ([http://www.phenoscanner.medschl.cam.ac.uk/](http://www.phenoscanner.medschl.cam.ac.uk/)) was introduced to identify and remove SNPs with potential associations with confounding factors that might violate the independence assumption18. After several rounds of rigorous filtering, a set of eligible instrumental variables for the subsequent MR analysis were obtained. Summary of the instrument variables used in this study is presented in Supplementary Tables S2. ### 4. Statistical analysis We employed the “TwoSampleMR” 22, “MendelianRandomization” 23 and “MR-PRESSO”24 packages for UVMR, MVMR, and Mediation MR analyses, including sensitivity tests. Causal estimates were expressed as odds ratios (ORs) with 95% confidence intervals (CIs). Statistical analyses were conducted using R software version 4.3.2 (The R Foundation for Statistical Computing). #### UVMR analysis Causal effects were estimated using the random-effects inverse variance weighted (IVW) method 19. To ensure unbiased estimates, MR analyses were also conducted using four alternative methods (MR Egger, Simple mode, Weighted median, and Weighted mode). A causal effect was considered suggested if the IVW p-value was less than 0.05. Moreover, a causal effect was deemed significant if the IVW p-value fell below the Bonferroni-corrected threshold ((p□<□0.05/21□=□0.002) for primary datasets and (p□<□0.05/7□=□0.007) for validation datasets, coupled with consistent directionality in the weighted median and MR-Egger results. #### MVMR The DM and BMI shared genetic risk factors20. To mitigate the confounding influence of BMI on DM, MVMR analysis adjusting for BMI was performed. For the significant causal associations in the univariable MR analysis, the MVMR analysis was performed using the MVMR-IVW method, aiming to adjust for potential confounding factors BMI21. Mediation MR analysis Clinical studies indicate that RA is associated with an increased risk of OM9. To explore whether DM mediates this association, we conducted two-step mediation MR analysis, assessing three key estimates: (i) the total effect of RA on OM (β_all); (ii) the direct effect of RA on DM (β1); and (iii) the direct effect of DM on OM (β2). Significance was determined by IVW p-values (*p*□<□0.05), with mediation effect proportions estimated using the delta method. The mediation effect is calculated as β1 * β2. The proportion of the mediation effect was estimated as the total causal effect of β_all divided by the mediation effect. We performed two-step mediation MR analyses in both primary and validation datasets to ensure result reliability and compared them for consistency. #### Sensitivity analyses Sensitivity analyses were performed to address horizontal pleiotropy and heterogeneity. We utilized weighted median, MR-Egger, and MR-PRESSO methods to verify assumptions and assess robustness, identifying potential horizontal pleiotropy. The weighted median model provides consistent estimates when over half of the weights are from valid IVs25. MR-Egger regression detects horizontal pleiotropy and corrects for it, with its intercept term indicating unbalanced directional pleiotropy (*p* < 0.05)26. MR-PRESSO identifies and corrects outliers, providing outlier-corrected estimates. The MR-PRESSO distortion test compares estimation differences before and after outlier removal. Cochran’s Q test evaluates heterogeneity among SNPs for exposure and confirms consistency between MR assumptions and analyses (*p* < 0.05)24. Credible causal inference requires consistent directionality across the three methods and the absence of horizontal pleiotropic effects. ## Results ### 1. NO Causal effects of T1DM on OM The causal effect of T1DM on OM was not supported in either the primary dataset (IVW: *p*= 0.071), the validation dataset (IVW: *p =* 0.276), or the meta-analysis (IVW: *p* = 0.242). Sensitivity analyses revealed consistent results with no evidence of horizontal pleiotropy (MR-Egger intercept *p* = 0.521; *p* = 0.967) and no heterogeneity (Cochran’s Q statistic: *p* = 0.089; *p* = 0.244) for the primary and validation datasets, respectively. No outliers were found in the MR-PRESSO test. To further address the potential influence of confounding factors and level pleiotropy, we conducted MVMR. After controlling for BMI, no statistical significance remained between T1DM and OM (Figure 2; Supplementary Table S3). Leave-one-out and scatter plots are provided in Supplementary Figures 1-4. ![Figure. 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/10/2024.04.08.24305482/F2.medium.gif) [Figure. 2.](http://medrxiv.org/content/early/2024/04/10/2024.04.08.24305482/F2) Figure. 2. Genetically predicted type 1 diabetes and its complications: associations with osteomyelitis. T1DM, type 1 diabetes mellitus; OM, osteomyelitis; UVMR, Univariable MR; MVMR, multivariable mendelian randomization; OR, odds ratio; CI, confidence interval; UKBB, United Kingdom Biobank; IVW, Inverse Variance Weighted; H, Heterogeneity; P, Pleiotropy. p-values (IVW)□<□0.05 was considered suggested different. The Bonferroni-corrected results of the p-values (IVW) in each group remained consistent with the uncorrected results. To explore the relationship between different T1DM subgroups and OM, we analyzed data from the comprehensive FinnGen database, known for its coverage of T1DM complications. Our analysis revealed no causal correlations between either T1DM without complications or T1DM with complications and OM (IVW: *p*L=L0.478; *p*L=L0.777, respectively). Further subgroup analyses within T1DM with complications showed no causal links with OM. Specifically, IVW -P values for OM were 0.885 for T1DM with coma, 0.843 for T1DM with ketoacidosis, 0.770 for T1DM with ophthalmic complications, 0.978 for T1DM with renal complications, 0.197 for T1DM with peripheral circulatory complications, and 0.345 for T1DM with neurological complications. No evidence of horizontal pleiotropy or heterogeneity was found in these MR analyses. (Figure 2; Supplementary Table S5). ### 2. Causal effects of T2DM on OM Using IVs for T2DM, we found evidence linking genetically predicted T2DM to increased OM risk (primary dataset: IVW: OR = 1.13, 95% CI 1.056–1.209, *p =* 4.20E-04; validation dataset: IVW: OR = 1.317, 95% CI 1.14–1.522, *p =* 0.007; Meta-analysis: OR 1.203; 95% CI 1.038–1.395; p=1.85E-04) (Figure 2). Furthermore, after Bonferroni correction, the results remained statistically significant. The three MR methods showed consistent directions. After MVMR analysis controlling for BMI, statistical significance remained between T2DM and OM (Figure 3; Supplementary Table S7). No outliers were detected by MR-PRESSO, and no heterogeneity was observed by Cochran’s Q test. MR-Egger intercept tests found no horizontal pleiotropy. Leave-one-out analysis showed consistent T2DM results (Fig. 2). Leave-one-out and scatter plots are provided in Supplementary Figures 5-8. ![Figure. 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/04/10/2024.04.08.24305482/F3.medium.gif) [Figure. 3.](http://medrxiv.org/content/early/2024/04/10/2024.04.08.24305482/F3) Figure. 3. Genetically predicted type 2 diabetes and its complications: associations with osteomyelitis. T2DM, type 2 diabetes mellitus; OM, osteomyelitis; UVMR, Univariable MR; MVMR, multivariable mendelian randomization; OR, odds ratio; CI, confidence interval; UKBB, United Kingdom Biobank; IVW, Inverse Variance Weighted; H, Heterogeneity; P, Pleiotropy. The IVW p-value (