Abstract
Smoking, inflammation and depression commonly co-occur and may be mechanistically linked. However, key questions remain around the direction of association and the influence of residual confounding. We aimed to characterize the association between lifetime smoking and depression, as well as to assess the role that genetically-predicted C-reactive protein (CRP) level, an archetypal inflammatory marker, as a potential mediator for this association. We performed inverse variance weighted Mendelian randomization (MR) analyses using recently published summary-level GWAS data for lifetime smoking index, CRP levels, and depression. A subset of inflammatory-related genetic variants from the lifetime smoking GWAS were also used to assess the potential inflammatory causal pathways between smoking and depression. The analysis indicated significant reciprocal relationships between lifetime smoking and both depression (ORSmk-Dep = 2.01, 95% CI 1.71-2.37, p < 0.001; O RDep-Smk = 1.09, 95% CI 1.06-1.13, p < 0.001) and CRP levels (ORSmk-CRP = 1.40, 95% CI 1.21-1.55, p < 0.001; ORCRP-Smk = 1.03, 95% CI 1.02-1.05, p < 0.001). These significant and positive associations were also supported by the majority of the robust MR methods performed. The reciprocal relationships between CRP levels (using >500 genetic instruments for CRP) and depression were not significant (ORCRP-Dep = 1.01, 95% CI 0.99-1.04; ORDep-CRP = 1.03, 95% CI 0.99-1.07). We observed little variation in the IVW estimates between smoking and depression when we limited the genetic variants assessed to those related to inflammation or when we adjusted the analysis by CRP-levels in multivariable analysis. Our study supports potential causal associations between lifetime smoking and depression, as well as between lifetime smoking and CRP levels, but not between CRP and depression. No evidence was found that CRP mediates the relationship between smoking and depression.
Introduction
Increasing evidence indicates a role for inflammation in the pathogenesis of mental health disorders, particularly depression1,2. Even though, historically, the central nervous system (CNS) has been considered an immuno-privileged region in the human body, research has shown that microglia in the CNS produce inflammatory cytokines and inflammatory processes outside of the CNS can result inflammatory responses within the CNS 3–8. Furthermore, diseases associated with inflammation, such as rheumatoid arthritis, diabetes mellitus, coronary heart disease, stroke have been associated with depression9,10.
Modifiable exposures, such as smoking behavior, have been associated with both depression and inflammation. A 2014 study in the US estimated that the smoking prevalence among participants 16% and 40% in patients without a psychiatric and those who had a diagnosis of MDD during the previous year, respectively11. Furthermore, observational studies seem to support a systemic elevation of serum inflammatory markers in smokers12, including significant increases in C-reactive protein (CRP) levels13–15. However, the evidence regarding direction of association and causality is unclear due to the biases inherent to observational studies16,17.
Mendelian randomization (MR) is an epidemiological approach that uses genetic variants as instruments to untangle the problems of reverse causation (genetic variants are fixed at conception; hence, genetically predicted levels of risk factors must precede any event) and unmeasured confounding (genetic variants show considerably less conventional confounding than phenotypic variables)18. If genetically-predicted values of a risk factor are associated with a specific disease outcome, then it is likely that the association between the risk factor and outcome has a causal basis19–22. In this manner, MR studies act like natural randomized control trials and overcome some of the biases of observational studies23.
Previous studies have shown through MR analyses that smoking and depression may have a bidirectional causal relationship24, although the mechanism through which smoking causes depression is not known. MR studies using CRP as a proxy for systemic inflammation have shown conflicting evidence regarding the association between higher CRP levels and risk for depression20,25,26. Recently, statistically better powered genome wide association studies (GWAS) for depression27 and for CRP28 have become available, and provide an opportunity to use improved genetic instruments, which explain a larger proportion of the variable’s variance, for causal inference to resolve prior ambiguities.
In this study, first to examine potential causality and direction of association, we conducted univariable and bidirectional MR analysis testing associations of genetically-predicted smoking behavior with CRP levels and risk of depression, and vice versa. Second, to examine for a potential mediating role of inflammation between smoking behavior and risk of depression, we conducted univariable MR analyses limiting the smoking exposure genetic variants to those which have been previously associated with inflammatory traits. Finally, we conducted a multivariable Mendelian randomization (MVMR) analysis to examine the associations between genetically-predicted smoking behavior and risk of depression after adjusting for genetically-predicted CRP levels using CRP levels to adjust the effect of the smoking genetic variants on depression. We used the latest GWAS data to develop statistically better powered genetic instruments compared to previous studies and used this to investigate if inflammation mediates the potential causal effect of smoking on depression.
Methods
Data Sources
This study used publicly available summary level data obtained from previously published genome-wide association studies (GWAS) for all analyses. The source and description of the GWAS summary statistics used in this report for lifetime smoking, depression, and C-reactive protein (CRP) are presented in Table 1.
Lifetime Smoking Index
Lifetime smoking index was selected as the variable to represent the smoking exposure for all analyses. Wootton et al. (2019) generated this lifetime smoking index, encompassing information regarding smoking heaviness, duration, and smoking initiation and cessation. One standard deviation increase in lifetime smoking score is equivalent to an individual smoking 20 cigarettes a day for 15 years and stopping 17 years ago or an individual smoking 60 cigarettes a day for 13 years and stopping 22 years ago24.
Depression
Depression summary statistics were obtained from the Psychiatric Genetic Consortium (PGC). These data combined the meta-analysis efforts of two previously published articles27,29. The resulting summary statistics include a broad definition of depression, as they encompass the diagnostic criteria used as case definitions in the different cohorts included (Table 2).
C-reactive Protein
C-reactive protein (CRP) was selected as the proxy for systemic inflammation6,14,15,31,32. Han et al. (2019) generated the summary statistics used for this study from 418,642 individuals of British ancestry in the UK biobank for whom CRP levels were available and whose levels were lower than 10mg/L28.
Selection of Genetic Instrumental Variable
In Mendelian randomization, for a genetic variant to be a valid IV, it must meet three assumptions: (i) the variant is associated with the exposure, (ii) the variant is not associated with any confounder of the exposure-outcome association,(iii) the variant does not affect the outcome, except possibly via its association with the exposure33. The process for IV selection from the GWAS summary statistics for each of the variables studied was performed following a primarily statistical approach34. However, in some cases, the IV selection was further refined to include biological factors, as described below.
Statistical selection
SNPs were considered significantly associated to the GWAS variable of interest if the GWAS p-value reported on the summary statistics was smaller than 5 × 10−8 33. Using multiple correlated variants representing the same effect would decrease the efficiency of the analyses and increase the risk of weak instrument bias in the estimates obtained without increasing the power of the study35,36. Consequently, absence of LD and independence of the final IVs selected was ascertained using the ld_clump() function from the ieugwasr R package (clumping window kb = 10000, r2 = 0.001, p = 0.99)37.
If an IV selected did not have a match in the outcome GWAS statistics, a proxy IV (in linkage disequilibrium with the original IV; r2 >0.8), was used instead. Proxy IVs were obtained using the LDlink R package38.
Biological selection
Smoking inflammatory SNPs
Out of the lifetime smoking IVs determined significant and in linkage disequilibrium, those SNPs that had been associated with inflammatory traits (p < 5 × 10−8), including those relating to cytokines, acute phase proteins, and immune cells, in previously published GWAS were subselected using the Phenoscanner R package39. These inflammatory-related IVs were used to further assess the role of inflammation in the association between smoking and depression.
CRP-cis SNPs
When assessing the role of CRP, four variants – rs1205, rs3093077, rs1130864 and rs1800947 – were selected as cis-variants, which are those located in the CRP gene region20. Limiting the analysis to cis-variants allows for more reliable conclusions due to the biological relevance of the variants used34. Clumping was performed using the ld_clump() function from the ieugwasr R package (clumping window kb = 10000, r2 = 0.001, p = 0.99)37 and the variant rs3093077 was selected as the lead CRP variant. Wald ratio MR analyses were performed to assess the effect of this variant on depression and smoking.
Univariable reciprocal Mendelian randomization analysis
The Inverse-variance weighted method was selected as the main method to calculate the combined effect of the selected instrumental variables in all univariable MR analyses. The IVW method is similar to a weighted regression of the effect of each specific IV on the outcome on the effect of the same IV on the exposure, restricting the intercept to zero40,41. All of the univariable MR analyses are represented in Figure 1 and in Table 3. The direction of the relationships was confirmed using Steiger filtering using the steiger_filtering() function from the TwoSampleMR R package42–44.
Heterogeneity and sensitivity analysis
For all of the analyses performed, F-statistic were calculated to assess the strength of the instruments45. The heterogeneity between the estimates of the different IVs used was assessed using Cochran’s Q, leave-one-out analyses, and visualizing the data with funnel, scatter, and forest plots44,46,47. Steiger filtering was used to assess the directionality of the relationship in the associations that were significant42.
The robustness of the overall estimate obtained with the IVW method was assessed using a combination of robust MR methods from different classes and working under a wide range of assumptions42. Four additional MR methods were used: weighted median, MR-PRESSO, MR-Egger, and contamination mixture (Table 4). Weighted median is a median based estimate that provides valid estimate even if up to 50% of the instruments are invalid. MR-PRESSO accounts for pleiotropy by detecting and removing outliers. MR-Egger accounts for pleiotropy by including an intercept term in the IVW model. Contamination mixture accounts for heterogeneity in causal mechanisms by identifying genetic instruments with similar causal estimates.
Multivariable Mendelian randomization analysis
(MVMR) methods allow for the estimation of the proportion of the effect of smoking directly acting on depression and the proportion potentially being mediated by CRP levels – representing systemic inflammation. IV coefficients for smoking were regressed on the SNP coefficients for depression, followed by a second regression of the coefficients of the CRP on the previously obtained coefficients45. The global estimate is then produced using the IVW method.
Results
Instrumental variable selection
The results from the IV selection process, including all of the significant SNPs in each GWAS and the final IVs selected for each variable are shown in Table 5.
IVW MR Analyses Testing Association of Smoking with Depression and CRP
The IVW method supported significant associations of genetically-predicted lifetime smoking index with risk of depression (ORSmk-Dep = 2.01, 95% CI : 1.71−2.37, p < 0.001), and with CRP levels CRP (ORSmk-CRP = 1.40, 95% CI : 1.27−1.55, p < 0.001). The IVW method also identified a significant association between depression and lifetime smoking index (ORDep-Smk = 1.09, 95% CI : 1.06−1.13, p < 0.001), and between genetically-predicted CRP levels and lifetime smoking index (ORCRP-Smk = 1.03, 95% CI : 1.02−1.05, p < 0.001) (Table 6). However, MR associations between CRP and depression were not statistically significant.
MR Analysis using inflammation-related genetic variants for smoking as IVs
Further analysis using the smoking-related genetic variants that are also associated with inflammation as IVs, the associations between smoking and depression remained significant for both inflammation-related and unrelated genetic variant sets (Table 6).
MR Analysis using cis variants for CRP as IVs
No association was observed between the lead CRP-cis variant, rs3093077, and depression or smoking (Table 6).
Visual assessment of MR estimates
Visual assessment of the individual IV estimates using scatter, funnel, and forest plots indicated the presence of moderate heterogeneity among these effect estimates (Figure 2 & Supplemental Figures 1-3). However, the symmetrical distribution of individual estimates around the overall estimates suggests that the pleiotropy present is likely balanced.
Results for sensitivity analyses
Influential IVs
Among the significant associations, every global estimate obtained through leave-one-out analyses remained significant (Supplemental Figure 2). Similarly, in non-significant associations, all of the global estimates obtained through leave-one-out analyses remained non-significant. This suggests that none of the IVs used throughout the analyses acted as an influential point.
Robust analyses
In the smoking-CRP, smoking-depression, and depression-smoking association, all of the global estimates calculated using robust MR methods, excluding the MR-Egger estimate, showed a similarly significant increase in the odds of the outcome and higher levels of the exposure. I2GX values calculated indicate that the MR-Egger is likely to provide biased estimates; furthermore, the QR values obtained do not support a better fit of the MR-Egger data to the model when compared to the IVW model (table 9).
Abbreviations: Smoking_Infl indicates smoking IVs previously associated with inflammation. Smoking_nonInfl indicates smoking IVs not previously associated with inflammation. CI_low lower limit of 95% confidence interval. CI_high upper limit of 95% confidence interval.
Multivariable MR
Using the IVW method, the MVMR analysis, using lifetime smoking as the exposure, estimated a significant OR of 2.08 for depression (95% CI : 1.71−2.54; p < 0.001), and a non-significant OR of 0.92 for CRP levels (95% CI : 0.71−1.19; p = 0.542) (Table 8). The similarity between this estimate for smoking-depression and the unadjusted one, as well as the lack of significance for the CRP term in the model, does not provide evidence to support a role for CRP as a mediator the relationship between smoking and depression.
Inflammation-related smoking IVs
Secondary MVMR analyses were conducted with the subset of inflammatory and non-inflammatory smoking IVs being adjusted by CRP. These two analyses provided similar estimates to the ones obtained without adjusting by CRP levels (ORSmk−Infl = 2.12, 95%CI : 1.32−3.43, p=0.002; ORSmk−nonInfl =2.15, 95%CI : 1.73−2.67, p<0.001) (Table 8). Again, these results do not support a role for CRP levels as a mediator for the relationship between smoking and depression.
Assessment of bias and reliability of MR-Egger results
As MR-Egger tends to suffer from low statistical power and is particularly susceptible to bias from weak instruments47, we present assessment of reliability of the MR-Egger results (Table 9). I2GX for the data used in the different analyses (Table 9), suggested that the results from the MR-Egger analyses may be strongly biased and are not reliable; therefore, the IVW estimate is preferred48,49.
Discussion
Summary of findings
In this study, we present comprehensive Mendelian randomization analyses testing the direction and potential causality of association between smoking, CRP, and depression. Our results suggest potentially causal bi-directional associations of smoking with depression and CRP levels. However, there was no evidence for a potentially causal association between CRP levels and depression, or for a mediating role of inflammation for the association between smoking and depression. Results from multivariable Mendelian randomization analyses also suggest no evidence for the relationship between smoking and depression being mediated by CRP levels.
The results obtained were consistent through the different sensitivity and robust analyses performed. Overall, the various robust methods performed relying on different assumptions, provide strong evidence to support the potential causal role of lifetime smoking on depression.
Smoking and depression
For the relationship between smoking and depression, our results, using improved genetic instruments compared to prior studies, are consistent with previous evidence supporting a reciprocal association between these variables. Observational evidence has shown that depression can trigger smoking commencement and make its cessation more challenging50–54. Moreover, smoking has been shown to lead to depression, and smoking cessation has been associated with improved depressive outcomes and decreased depression symptomatology54–56. Wootton et al. (2019), using IVW methods and using data partially overlapping with that used in this study, obtained an OR of 1.99 (95% CI : 1.71−2.32; p < 0.001) for the causal role of lifetime smoking on depression, and an OR of 1.10 (95% CI : 1.02−1.17; p < 0.001) for the reciprocal relationship between depression and smoking24. The main differences between this analysis and that in Wootton et al. (2019) is that the Howard et al (2019) depression GWAS that informed our study had a larger sample size and incorporated many cases with a broader depression definition, including broad depression (help-seeking for problems with nerves, anxiety, tension, or depression) and probable major depressive disorder. This broader definition allows for an increase in in statistical power resulting from the larger number of cases encompassed may result in a loss of specificity57. Nevertheless, the results obtained are very similar to those from Wootton et al. (2019) (we found an odds ratio of 2.01 for effect of smoking on depression compared to Wootton et al odds ratio of 1.99), suggesting that a despite the lack of a formal MDD diagnosis in all of the cases used, the broader depression definition used remains relevant for MR analysis27,29. The association between smoking and depression has important implications; it strengthens the case for primary prevention of smoking and stop-smoking initiatives, and raises the question, for future research, of whether smoking cessation initiatives are effective treatments for depression.
Smoking and CRP
Evidence supporting a causal relationship between smoking and elevated CRP levels has been extensively documented in observational studies: with higher CRP levels in smokers13–15,58–71, and decreased CRP levels after smoking cessation14,15,58,63,65–68,70,71. This study provides new evidence to support a causal relationship between smoking behavior and CRP levels using Mendelian randomization techniques. Furthermore, this study supported a very small, but significant, effect (odds ratio of 1.03) of CRP levels on smoking lifetime index, which may be clinically negligible. Further study would be required to investigate which aspects of lifetime smoking CRP is causally associated with (as lifetime smoking is a composite measure reflecting both initiation and persistence of smoking).
CRP and Depression
Our results did not support a potentially causal relationship between CRP and depression or vice versa, though several observational studies have demonstrated that patients with depression have significantly higher CRP levels when compared to those without depression26,31,72–75. Of previously published MR studies assessing this relationship20,25,26, our prior study (Khandaker et al. (2020)) found evidence for a potentially causal association between CRP levels and depression using data from the UK Biobank cohort. More recently, using the MR approach Kappelmann and colleagues have reported that inflammatory markers like CRP and IL-6 are associated with specific symptoms of depression, such as suicidality76. Using symptom-level data from the UK Biobank and Dutch NESDA cohorts we have reported observational and MR associations for CRP and IL-6 with somatic/neurovegetative symptoms of depression such as fatigue and sleeping difficulties77. Taken together, current evidence from epidemiological and genetic MR studies is consistent with inflammation being potentially causally related to certain symptoms of depression, namely somatic/neurovegetative symptoms, though MR evidence for a potentially causal role of CRP on the syndrome of depression as outcome is mixed.
Inflammation as a mediator
The lack of difference in results of the analyses using inflammatory-related and non-inflammatory related smoking IVs, suggest that there is no clear distinct CRP mediated inflammatory causal pathway mediating the causal relationship detected between smoking and depression. Furthermore, MVMR analyses showed that adjusting the effect of smoking on depression for CRP levels did not result in a significant estimate for the effect of CRP levels.
Overall, these results do not support a role for CRP-indexed inflammation in the development of depression. This is null effect contrasts with a prior finding, using the same CRP GWAS and a similar methodology, that serum CRP is causally associated with another multifactorial etiology phenotype – namely age-related macular degeneration – indicating that the genetic instruments we used are capable of revealing positive causal relationships between serum CRP and disease. Randomized control trials have shown that using of anti-inflammatory agents, such as celecoxib or infliximab, can successfully improve depression outcomes in patients with elevated CRP levels and patients with treatment-resistant depression78, strongly suggesting that inflammation has a crucial role in the pathogenesis of at least certain types of depression78,79. If inflammation is only causally relevant in certain subtypes of depression with their own distinct etiology and pathogenesis, our approach would not necessarily detect this. However, our results remain consistent with the possibility that inflammation may cause depression, considered as a unitary entity, via non-CRP mediated processes. Our research strongly suggests that future studies examining the effect of inflammatory cytokines on depression should broaden their scope beyond CRP.
Limitations
Lifetime smoking index and depression are both behavioral variables, with complex etiologies and hard to measure phenotypes. Furthermore, the statistical approach to IV selection use increases the risk of bias from horizontal pleiotropy80. However, the robust methods used, including MR-Egger, MR-PRESSO, and MR contamination mixture, allow MR analyses to be conducted in the presence of horizontal pleiotropy. The various sensitivity analyses we performed provide supporting evidence for the robustness of the findings.
It is important to note that all of the GWAS used for this study were obtained from the UK Biobank, resulting in a one-sample MR study. In one-sample MR studies, the exaggerated effect of the IVs on the exposure may lead to an overestimation of the causal effect between the exposure and the outcome assessed. This bias is called “winner’s curse” and could result in a strong bias in the obtained results81,82. Nevertheless, previous studies performed in non-overlapping samples have demonstrated the same direction of the effect between smoking and depression24. Furthermore, we used a smaller (47 independent IVs), non-overlapping CRP GWAS83, which showed a similar reciprocal associations between smoking and CRP, and CRP and depression (supplemental table x).
Conclusion
The results from this study add on to the growing body of evidence supporting the bidirectionality of the causal relationship between smoking and depression. Furthermore, this study strengthens the evidence for a causal role of smoking on CRP levels. However, we do not find evidence for a potentially causal role for CRP on depression or for potentially mediating role for inflammation on the association between smoking and depression. Further research is needed to understand potential mechanisms for bidirectional association between smoking and depression.
Data Availability
All data is publicly available from the GWAS catalog or Psychiatric Genomics Consortium website
Author Contribution/Credit Statement
Diego Galán: Conceptualization, Methodology, Formal Analysis, Writing-Original Draft. Ben Perry: Resources, Methodology, Writing-Review & Editing. Varun Warrier: Conceptualization, Methodology, Writing-Review & Editing. Golam Khandaker: Methodology, Writing-Review & Editing. Douglas Easton: Writing-Review & Editing, Supervision. Graham Murray: Conceptualization, Methodology, Writing-Review & Editing, Supervision.
Funding
This work was supported by a Wellcome Trust award to GMK (201486/Z/16/Z), and a National Institute for Health Research award to BIP (DRF-2018-11-ST2-018). VW is supported by the Bowring Research Fellowship (St. Catharine’s College, Cambridge) and a Wellcome Trust Collaborative Grant.
This work was supported by a Wellcome Trust award to GMK (201486 Z 16 Z) and a National Institute for Health Research award to BIP (DRF 2018 11 ST2 018). VW is supported by the Bowring Research Fellowship (St. Catharine’s College Cambridge) and a Wellcome Trust Collaborative Grant
This work was supported by a Wellcome Trust award to GMK (201486 Z 16 Z) and a National Institute for Health Research award to BIP (DRF 2018 11 ST2 018). VW is supported by the Bowring Research Fellowship (St. Catharine’s College Cambridge) and a Wellcome Trust Collaborative Grant
Conflicts of Interest
All authors declare no conflicts of interest.