ABSTRACT
Objective Observational studies on cognition found that vitamin D supplementation is associated with improved cognition. Further, recent Mendelian randomization (MR) studies have shown that increased vitamin D levels, as measured by 25-hydroxyvitamin D [25(OH)D], may protect against Alzheimer’s disease. Thus, it is possible that 25OHD may protect against Alzheimer’s disease by improving cognition.
Methods To evaluate this hypothesis, we began by performing an observational study by testing the association of 25OHD levels with 5 cognitive outcomes related to memory and executive function, adjusting for age, sex, BMI, smoking status and nutritional risk index in 26,787 older individuals (median age 62). Since such observational analyses can be biased by confounding, we next performed two-sample mendelian randomization (MR) analyses testing the effect of 25(OH)D on cognitive outcomes, since this method helps to protect against bias from confounding.
Results Observational studies suggested a strong protective association between 25(OH)D and cognitive measures. However, in MR analyses, we found no estimated effect of 25(OH)D on these outcomes.
Conclusion These findings suggest that associations between 25(OH)D levels and cognition are likely to be biased by confounding and that the relationship between 25(OH)D levels and Alzheimer’s disease is not due to the effect of 25(OH)D on cognition.
INTRODUCTION
According to the Canadian census bureau, the number of 85 years old and older will triple over the next 25 years to 2.5 million1. The rate of dementia-related diseases increases exponentially for individuals 90 years and older2. Therefore, identifying therapeutic targets and enhancing our understanding of age-related neurodegeneration is necessary.
Some studies have demonstrated an increased prevalence of low 25-hydroxyvitamin D [25(OH)D] levels in the elderly population3,4. Further, cognitive decline and loss of autonomy in the elderly, which are key drivers of Alzheimer’s disease (AD) have been observed in individuals with hypovitaminosis D5. There is a wide range of pathways through which vitamin D may influence brain health, including inflammation, thrombosis, and growth factors6,7. Several observational studies have suggested a role of vitamin D in modulating neuronal development, increasing senescence, reducing sensory and social acuity8,9,10. Meta-analyses of the association of vitamin D and stroke incidence and dementia risk found a dose-response relationship between vitamin D and these complications11,12. Since vitamin D levels can be increased with relative safety using available supplements, increasing vitamin D levels could potentially help to protect against cognitive decline.
The preclinical Alzheimer’s cognitive composite cognition score13 (PACC) tests multiple brain function parameters. The meta-analysis from which the score was derived demonstrated a strong association of the score with the gene encoding for the apolipoprotein E ε4 polymorphism14. These genetic variants are the most significant risk factors for Alzheimer’s disease, which suggests that the PACC score is a helpful score to measure Alzheimer’s disease-related cognitive impairment, since it shares biological determinants with Alzheimer’s disease.
Understanding the causal effect of vitamin D on cognitive function would therefore be important to better understand widespread findings from observational study data demonstrating an association of 25(OH)D with cognitive function. However, such studies may be biased by confounding and previous work has shown that several observed associations between vitamin D levels and disease outcomes cannot be corrected by vitamin D supplementation in large-scale randomized trials15,16,17,18,19,20,21,22.. Mendelian randomization (MR) is a genetic epidemiology causal inference method that uses instrumental variables to reduce bias from confounding and reverse causation23,24. Contrasting the existing observational studies mentioned earlier, a one-sample MR study in a meta-analysis on the impact of vitamin D on cognition found no evidence for 25(OH)D levels as a causal factor for cognitive performance25.. Additionally, different ways to measure cognition (including memory and global cognition scores) amongst the different cohorts made the cognition measurements heterogeneous. Finally, the study employed just two SNPs (CYP2R1, DHCR7) associated with 25(OH)D, thus reducing the variance explained in 25(OH)D levels and, consequently statistical power. On the other hand, recent MR studies have demonstrated that increased 25(OH)D levels protect against Alzheimer’s disease26. Since cognition is an important risk factor for Alzheimer’s disease, it would be important to better understand if increased 25(OH)D levels were protective for cognition and if these results depended upon age.
Therefore, to better understand the relationship between 25(OH)D and cognition (determined using the PACC score), we undertook a large-scale observational study, testing the association of 25(OH)D with cognitive performance in 26,787 individuals in the Canadian Longitudinal Study of Aging (CLSA). Next, we used MR to better understand if these associations were influenced by confounding using genome-wide significant SNPs from the largest (N=443,734) GWAS of 25OHD levels27 as genetic instruments. We then queried their effect on the PACC score in 17,166 samples from the Canadian longitudinal study on aging (CLSA). The results provide insights into the effect of 25(OH)D levels of cognition in humans.
METHODS
Observational associations
Using linear regression, we tested the association of log transformed 25(OH)D with each individual cognition score (AFT, AFT2, MAT, REYI, REYII, COLTIME) and with the composite PACC score. Since 25(OH)D is confounded by various factors28 (anthropometric variables, lifestyle habits, and more), we identified covariates available in the CLSA cohort to add to our regression model to partially control for this potential bias. Cognition scores were adjusted for age, sex, smoking status (answered on a questionnaire with possible answers: Yes (I currently smoke)), No (I don’t smoke and I never have), Former (I don’t smoke now but I have in the past)), high nutritional risk indicator (answered on a questionnaire with possible answers: Not at high nutritional risk, and High nutritional risk), and BMI classification for adults aged 18 and over (the classification includes underweight, normal weight, overweight, obese – Class I, obese – Class II, obese – Class III). Cognition scores were inverse normal transformed. All measures were at baseline and included at most 25,357 individuals. To control for multiple testing, we considered Bonferroni significant those associations with P < 7.1 × 10-3 (0.05/7 (where 7 is the number of cognitive scores tested)).
Instrumental variables for 25(OH)D
Instrumental variables were defined as independent genome-wide significant SNPs (P < 6.6 × 10-9) for 25(OH)D levels in 443,734 individuals. Identification of independent loci excluded high collinearity (R2 > 0.9) SNPs within a 20,000 kilobase window. Details on the conditionally independent approach, imputation quality control, missingness rates excluded individuals and software tools employed in the GWAS are provided in great detail in our original paper27. The approach yielded 138 independent SNPs. In order to ensure that we are able to recover all the variants from the outcome GWAS, we restricted our study to SNPs with a MAF > 5%. When a SNP was not present in the results for the GWAS of the PACC, AFT, COLTIME, REYI, REYI, and MAT score, we alternatively looked up a proxy SNP in linkage disequilibrium, with an R2 ≥ 0.8 using the 1000G European reference panel29 (Supplementary Table2). Therefore, we limited our analysis to 80 SNPs for which we estimated the variance explained in 25(OH)D. We computed the variance explained using the following formula : r2 = 2β 2 ƒ (1 –ƒ), where β and ƒ denote the effect of the SNP on 25OHD level and the MAF, respectively. Which we then used to calculate the F-statistic (formula F: r2 x (N-2)/(1-r2 ) where r2 is the variance explained and N denotes the sample size.
Study outcomes
The PACC score is a composite score which synthesizes tests on episodic memory, timed executive function, and global cognition. The composite measure can precisely recognize early signs of cognitive changes prior to mild cognitive impairment (MCI)13. In terms of the definition of the individual scores, The Rey auditory verbal learning test – trial 1 and five-minute delayed recall (respectively) REYI, and REYII score represents the number of words correctly recalled in 90 seconds. The first one assesses verbal episodic memory, whereas the second one assesses delayed recall. The mental alteration test (MAT) score assesses the number of correct consecutive numeric and alphabetical alternations in 30 seconds. The time to say the ink colour of each colour name (COLTIME measured in seconds) ascertains the executive and the set switching cognitive domains. Finally, animal fluency (AFT & AFT2) assesses the different animals recited in 60 seconds. Table 1 summarizes the individual cognitive scores in CLSA.
To ascertain the association between 25(OH)D levels and the PACC13 score, we retrieved the effects of these 80 variants on the PACC, REYI, REYII, MAT, AFT and AFT2 scores, which is a GWAS of the aforementioned score in the CLSA cohort described elsewhere30. Since 92.6% of CLSA participants are of Europeans ancestry30, our analysis focused on individuals with European ancestry only with PACC score data (N=18,706). Population structure was identified through principal component analysis as detailed elsewhere30. We employed FastGWA as implemented in GCTA31 and PLINK232 to perform association tests, which we adjusted for sex, age, and the first 20 principal components. We then stratified the population by age testing the entire cohort and individuals at least 60 years of age (N=9,489) to compare the effect of age on the cognitive scores (Supplementary Table1).
Mendelian randomization
Units
The causal associations were evaluated using odds ratios (ORs) for the susceptibility to multiple sclerosis, which are expressed according to a standard deviation (SD) increase in 1SD of genetically predicted natural log of 25(OH)D (nmol/l). For the cognition scores, one standard deviation increase in cognitive scores are expressed according to one standard deviation increase in genetically predicted natural log of 25(OH)D (nmol/l).
Two-sample MR
We performed the two-sample mendelian randomization analyses using multiplicative random effect inverse-variance weighted (referred to as IVW(re)) to estimate the effect of a 1 SD increase in standardized natural log-transformed 25(OH)D on 1 a SD of PACC, AFT, REYI, REYII, MAT, COLTIME scores and odds of multiple sclerosis. We additionally stratified our analysis by age (individuals above 60 years of age versus all individuals) when estimating the effect on cognitive scores. Our analyses we performed with the package TwoSampleMR (v 0.5.6)33 in R(v4.2.0). We considered significant MR results with P-value < 6.3 × 10-3 (Bonferroni threshold accounting for multiple testing for the seven cognition scores and the risk of multiple sclerosis: 0.05/8).
MR Assumptions
Three assumptions are required in MR analyses. The first MR assumption requires that the genetic instrument must be strongly associated with the exposure. We selected conditionally independent genome-wide significant common (P-value < 6.6 × 10−9) variants to satisfy this assumption. Of 138 conditionally independent variants, 80 remained (Supplementary Table1).
The second MR assumption stipulates that genetic instruments should not be correlated to confounders linking exposure and outcome34.
The third MR assumption requires the instruments to influence the outcome only via their effects on the exposure (the exclusion restriction assumption), i.e. that horizontal pleiotropy does not influence the results. We therefore investigated for the presence of directional horizontal pleiotropy in our study. First, we calculated MR Egger regression slope and intercept, in addition to IVW median and IVW mode. MR-Egger is a weighted linear regression which allows the estimation of an intercept as a measure of the average pleiotropic effect and produces a slope coefficient that is robust to directional horizontal pleiotropy. MR-Egger relaxes the exclusion restriction assumption and is valid under the Instrument Strength Independent of Direct Effect (InSIDE) assumption wherein associations of the genetic variants with the 25(OH)D are independent of direct effects of the genetic variants on the cognitive score. Additionally, we performed a weighted median analysis which weights MR estimates by the effect size/standard error. This approach leverages the fact that causal estimates from non-pleiotropic instrumental variable are more probable to converge toward the median. This approach provides reliable causal estimates when less than 50% of the total weight is derived from variants with pleiotropic effects. Additionally, we assessed Cochran’s Q test. The test yields an I2 index which gives an estimate of the amount of heterogeneity amongst IVs (I2 index ≤ 25%, is considered low heterogeneity, and a I2 index > 50% is considered high heterogeneity). Finally, we employed MR-PRESSO global test, which can detect the presence of pleiotropic outlier SNPs. By undertaking this variety of sensitivity analyses, and comparing results using these approaches, each one with different underlying assumptions, we could assess, to some degree, whether our findings were biased by horizontal pleiotropy. We considered results as significant when P was < 0.05 for all horizontal pleiotropy tests (MR-Egger intercept test, Cochran’s Q test, MR-PRESSO global test).
This study was conducted in accordance with the STROBE-MR guideline29 (STROBE-MR checklist is provided in Supplementary Material.
RESULTS
Observational associations
All the associations (Table 2) between log transformed 25(OH)D and the seven cognitive scores (the individual scores and the composite score) adjusted for age, sex, smoking status, nutritional index, and BMI classification were Bonferonni significant (P-value < 7.1 × 10-3). While the direction of effect between 25(OH)D and the cognition scores was positive, we observed a negative effect between the vitamin and the executive and set switching tests measuring how much time it takes to name ink colours (COLTIME). Thus increased 25(OH)D was consistently associated with improved cognition in observational associations.
Instrumental variables for 25(OH)D
We used 80 common (MAF > 5%) independent SNPs (S1 Table) explaining 3.8 % of the variance in 25(OH)D levels, with an F-statistic of 214.75 as instruments in our MR study. 5 SNPs were absent from the PACC score GWAS and were replaced by proxies in high LD (r2 > 0.8). TwoSampleMR identified rs10832289, rs11182550, and rs13336347 as palindromic and were therefore removed from further analyses.
GWAS of PACC score
We performed a genome-wide study of the PACC score in European individuals from the CLSA (N=17,166). When we restricted our analysis to individuals aged 60 and older, we found a genome-wide significant association rs429358 (P=1.8 × 10−8) at apolipoprotein E (APOE) (Figure 3 C&D). This is a well-characterized locus for dementia and Alzheimer’s disease35,36,37.
PACC score and REYI, REYI, AFT, AFT2, MAT, COLTIME score outcome
For the PACC score, MR showed that the genetically predicted increase per 1 SD in natural log of 25(OH)D had no effect on PACC score estimated in the general population (BetaPACC - All = -0.005, 95% CI -0.058 – 0.048, P = 0.85. Results were consistent in individuals over 60 (BetaPACC – Age >60 = 0.0009, 95% CI -0.07 – 0.07, P = 0.98) (Figure 1 & Table2).
For REYI, REYII, MAT, and COLTIME, the MR results were consistent with those from the PACC score. There was no discernible effect of genetically predicted increase per 1 SD in the natural log of 25(OH)D on any of these scores (Table 2).
For MR for the AFT, and AFT2 outcomes, we did observe a nominally significant association, which did not survive Bonferroni correction, in the population over 60 years of age. For AFT, the genetically predicted increase per 1 SD in natural log of 25(OH)D has no observed effect on the overall population (BetaAFT - All = 0.083, 95% CI -0.0057 – 0.17, P = 0.07), but did show an effect in individuals over 60 (BetaAFT2 – Age >60 = 0.13, 95% CI 0.02 – 0.25, P = 0.02). The trend was consistent for AFT2, BetaAFT2 - All = 0.069, 95% CI -0.022 – 0.16, P = 0.14, but there was an uncertain effect in individuals over 60, BetaAFT2 – Age >60 = 0.12, 95% CI 0.00061 – 0.23, P = 0.049.
Sensitivity analysis
We computed MR-PRESSO, Egger’s regression, and Cochran’s Q test to assess the robustness of our results. Our sensitivity analyses compiled in Table 3 for the PACC score and the other cognitive scores show that, by and large, there is low to moderate heterogeneity since I2 values are below 25%. The only moderate heterogeneity value (I2 = 47%; I2 Pvalue = 0.005) was for the REYI score in individuals above 60 years. In fact, the MR-PRESSO global test, which can detect horizontal pleiotropic outlier SNPs, found a pleiotropic SNP for the Rey verbal learning test I. Assessing at the distortion of the causal estimate before and after outlier correction, the MR results remained the same. We found that MR-Egger intercept test, Cochran’s Q test, MR-PRESSO global test and distortion test did not detect horizontal pleiotropy (P-values > 0.05). We therefore did not identify evidence of directional horizontal pleiotropy in the MR-results.
DISCUSSION
In this study, we carried out an observational study followed by a two-sample MR to ascertain the effect of 25(OH)D supplementation on cognitive outcome test scores. First, we computed the linear regression adjusting for age, sex, BMI, smoking status, and nutritional risk index, between 25(OH)D and cognition test scores. Higher 25(OH)D levels were associated with improvements in all cognition measures. We then used MR, which reduces bias from reverse causation and confounding, thus enabling us to test for causal effects. The results of the MR study did not support a causal effect between a genetically determined change in 25(OH)D levels and the PACC score and the cognitive score. These results suggest that observational associations between 25(OH)D levels and improved cognition are likely due to confounding.
Vitamin D supplementation has been proposed for treatment of many common diseases. While supplementation could be beneficial in certain contexts15 (i.e. prevention of multiple sclerosis, hypovitaminosis D and Alzheimer’s disease), results from observational studies need to be considered carefully. For instance, an observational study published in 201438 with a diverse and small cohort (N=382; 41.4% individuals from European ancestry, 29.6% individuals from African American ancestry, 25.1% individuals from Latino/Hispanic ancestry) showed that low vitamin D level is associated with an increased rate of cognitive decline. However, this was not supported by an MR study in a cohort of 153,187 individuals of European ancestry. The study looked at the effect of only two 25(OH)D canonical variants rs12785878 (vitamin D-decreasing allele, G), located near gene coding 7-dehydrocholesterol reductase (DHCR7), and rs12794714 (vitamin D-decreasing allele, A) near 25-hydroxylase (CYP2R1) on global cognition score (Beta: - 0.00; 95% CI: (−0.01, 0.01); P-value = 1.0), and memory cognitive score (Beta: -0.00; 95% CI: (−0.01, 0.00); P-value = 0.6). Both the previously published observational studies and MR studies on the effect of 25(OH)D on cognitive scores are congruent with our study25, 38.
How 25(OH)D may protect against Alzheimer’s disease in human is unknown. MR has shown a protective effect of 25(OH)D on Alzheimer’s disease in humans26,39,40. Additionally, previous and novel evidence demonstrated that cognition is protective for Alzheimer’s disease40, 41. Therefore, we hypothesized that 25(OH)D may protect against Alzheimer’s disease in humans by increasing cognition. However, our results suggest that the observed MR effects of 25(OH)D on Alzheimer’s disease are explained by improved cognition.
We note several strengths in our study. We leveraged the strengths of the MR design to lessen bias from confounders and reverse causation, therefore enabling more reliable estimate causal effects provided adherence to MR assumptions. To maximize statistical power, we utilized cohorts with large sample sizes.
Our study also contains several limitations. Since MR depends mainly on three causal inference assumptions (relevance, independence, and exclusion restriction) any violation of these could invalidate our results. To address these assumptions, we performed several sensitivity analyses investigating horizontal pleiotropy and heterogeneity (MR-Egger, MR-PRESSO, Cochran’s Q test). Our causal estimate between 25(OH)D and REYI score in older individuals was considered to have a moderate amount of heterogeneity. After the removal of outlying SNP through MR-PRESSO, the causal estimate remained the same. This suggests that the MR results are not likely to be strongly biased by directional horizontal pleiotropy. Further, our study focuses on European ancestry individuals, therefore, conclusions and interpretation of the results are not generalizable to other ancestries. This once again emphasizes the need for large-scale studies assessing the genetic determinants of disease in individuals of non-European ancestry.
In conclusion, our MR study found no evidence for a causal effect between 25(OH)D concentrations and cognitive performance in European ancestry individuals.
Data Availability
All data produced are available online at the GWAS catalog
http://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90239001-GCST90240000//GCST90239732/
http://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90239001-GCST90240000//GCST90239733/
http://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST010001-GCST011000/GCST010144
FUNDING STATEMENT
The Richards research group is supported by the Canadian Institutes of Health Research (CIHR: 365825; 409511, 100558, 169303), the McGill Interdisciplinary Initiative in Infection and Immunity (MI4), the Lady Davis Institute of the Jewish General Hospital, the Jewish General Hospital Foundation, the Canadian Foundation for Innovation, the NIH Foundation, Cancer Research UK, Genome Québec, the Public Health Agency of Canada, McGill University, Cancer Research UK [grant number C18281/A29019] and the Fonds de Recherche Québec Santé (FRQS). JBR is supported by a FRQS Mérite Clinical Research Scholarship. Support from Calcul Québec and Compute Canada is acknowledged. TwinsUK is funded by the Welcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. These funding agencies had no role in the design, implementation or interpretation of this study.
CONFLICT OF INTEREST
JBR’s institution has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline and Biogen for projects unrelated to this research. He is the CEO of 5 Prime Sciences (www.5primesciences.com), which provides research services for biotech, pharma and venture capital companies for projects unrelated to this research.
ETHICS STATEMENT
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. The patients/participants provided their written informed consent to participate in this study.
DATA AVAILABILITY
All GWAS summary statistics used in this study are made publicly available through the GWAS catalog. 25(OH)D and PACC score GWAS summary statistics (All ages and individuals 60 and older) can be downloaded through the following accession: 25(OH)D: GCST010144; PACC_GWAS: GCST90239732; PACC_GWAS_old: GCST90239733. Further inquiry can be directed to the corresponding author.
ACKNOWLEDGMENTS
We thank all the participants, the UK Biobank coordinating team, and the CLSA (Project# 1906026 and 190208) coordinating team for their approval, valuable time, generosity, and contributions to the data collection.