Abstract
Psychiatric disorders are complex clinical conditions with large heterogeneity and overlap in symptoms, genetic liability and brain imaging abnormalities. Building on a dimensional conceptualization of mental health, previous studies have reported genetic overlap between psychiatric disorders and population-level mental health, and between psychiatric disorders and brain functional connectivity. Here, in 30,701 participants aged 45-82 from the UK Biobank, we map the genetic associations between self-reported mental health and resting-state fMRI-based measures of brain network function. The Multivariate Omnibus Statistical Test (MOSTest) revealed 10 genetic loci associated with population-level mental symptoms. Next, the conjunctional false discovery rate (conjFDR) identified 23 shared genetic loci between these symptom profiles and fMRI-based brain network measures. Functional annotation implicated genes involved in brain structure and function, in particular synaptic processes. These findings provide further genetic evidence of an association between brain function and mental health traits in the population.
Introduction
Psychiatric disorders are complex, with a polygenic architecture and a large degree of overlap in symptoms and risk factors. Both imaging and genetics studies have reported numerous but small associations between brain phenotypes, genetics, and psychiatric disorders such as schizophrenia (SCZ)1–3, bipolar disorder (BIP)4,5, major depressive disorder (MDD)6,7, and anxiety disorder (ANX)8–10. Interactions between various brain phenotypes and genetics have been reported across structural11,12 and functional2,7,10 imaging modalities.
We have recently deployed a multivariate analysis to study the genetic architecture of brain functional connectivity, revealing genetic variants associated with functional brain connectivity as well as variance in brain activity over time13. The results showed meaningful overlap with psychiatric disorders, pointing at synapse-related pathways among the biological processes shared between disorders and brain function13.
Previous studies have shown widespread phenotypic and genetic overlap between psychiatric disorders14–18. In addition, patients within a diagnostic category can display a wide variety of symptoms. This heterogeneity complicates both diagnosis and therapeutic response due to overlapping symptoms and generally low specificity of diagnostic features19,20. While the mental health of any individual in the population varies over the course of a lifetime, most will not meet diagnostic criteria for a psychiatric disorder21,22. To capture the variance in psychiatric symptoms that is lacking in traditional case-control studies, one can use population-level mental health questionnaires as implemented in the UK Biobank23. Their continuous scales allow data-driven clustering methods to extract distinct profiles, each capturing a separate domain relevant to mental health, in a sample of individuals without a psychiatric diagnosis, while taking advantage of larger sample sizes. Using independent component analysis (ICA), we have previously derived 13 mental health profiles from UK Biobank data and showed that, although phenotypically independent (by design), they nonetheless share genetic underpinnings24.
Here, we aimed to uncover the genetic architecture of mental symptoms and identify genetic loci shared with neurobiological processes related to brain function. Using multivariate analysis25, we generated multivariate genome-wide association statistics across our previously identified 13 population-level mental health profiles24. This allowed us to identify new genetic variants associated with mental health symptoms and traits such as psychosis, depression, and anxiety in the UK Biobank sample that were not captured in a univariate analysis. Further, we combined this multivariate genetic profile of mental health with GWAS summary statistics of 7 psychiatric disorders and with our previously identified multivariate profiles of functional brain connectivity and variance in brain activity over time13. This research aims to provide insight into the biological underpinnings of mental health symptoms.
Methods
Sample and exclusion criteria
We utilized data from the UK Biobank26 with permission no. 27412. All participants provided signed informed consent before inclusion in the study. The UK Biobank was approved by the National Health Service National Research Ethics Service (ref. 11/NW/0382). We previously used data from the online follow-up questionnaire on mental health to define 13 phenotypically independent profiles relevant for mental health24. In addition, we utilized imaging data provided by the UK Biobank; the procedure for processing and analyzing the imaging data is described in a previous paper13. For this study we used the summary statistics from these previous studies.
Image acquisition and pre-processing
The processing pipeline for imaging data used for the multivariate GWAS is described in Roelfs et al.13. In short, images were acquired using 3T Siemens Magnetom Skyra scanners with a 32-channel head coil (Siemens Healthcare GmbH, Erlangen, Germany) at four different sites in the UK. The fMRI data were recorded using a gradient-echo echo planar imaging sequence with ×8 multislice acceleration (TR: 0.735 s, TE: 39 ms, FOV: 88×88×64 matrix, FA: 52°) with a voxel size of 2.4×2.4×2.4 mm. Data were processed by the UK Biobank team following the protocol described in Alfaro-Almagro et al.27.
Multivariate Genome-Wide Analysis
In this study we applied the Multivariate Omnibus Statistical Test (MOSTest) to the phenotypic data from the ICA decomposition described in Roelfs et al.24. MOSTest takes the univariate test statistics for each SNP and computes a multivariate test statistic through single random permutations of the genotype vector. This method is described in van der Meer et al.25. We performed positional gene mapping using Functional Mapping and Annotation (FUMA)28. We followed up these gene-mapping analyses with MAGMA, as implemented in FUMA, to connect the identified genes with tissue types29. We analyzed gene sets using the Reactome toolbox30 to identify biological processes associated with the genes that FUMA mapped from the summary statistics.
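To illustrate how per-phenotype test statistics can be combined into one multivariate statistic, the sketch below computes a Mahalanobis-type quantity, z' R^-1 z, for a single SNP, where z holds the univariate z-scores across phenotypes and R is the correlation matrix of z-scores under the null (for example, estimated from permuted genotypes). This is a simplified sketch under the assumption that the statistic takes this form; it is not the MOSTest implementation. The function names (`solve_linear`, `mostest_statistic`) and the toy values are hypothetical, and the conversion of the statistic to a p-value is omitted.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left untouched.
    m = [list(map(float, row)) + [float(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Pick the row with the largest absolute entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for row in range(col + 1, n):
            factor = m[row][col] / m[col][col]
            for k in range(col, n + 1):
                m[row][k] -= factor * m[col][k]
    # Back-substitution on the upper-triangular system.
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(m[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (m[row][n] - tail) / m[row][row]
    return x


def mostest_statistic(z, r):
    """Return z' R^-1 z for one SNP (z: z-scores, r: null correlation matrix)."""
    x = solve_linear(r, z)  # x = R^-1 z without forming the inverse
    return sum(zi * xi for zi, xi in zip(z, x))


# Toy example with three phenotypes; all numbers are hypothetical.
r_null = [[1.0, 0.2, 0.0],
          [0.2, 1.0, 0.1],
          [0.0, 0.1, 1.0]]
z_snp = [2.1, -1.4, 3.0]
stat = mostest_statistic(z_snp, r_null)
```

When the phenotypes are uncorrelated under the null (R is the identity matrix), the statistic reduces to the sum of squared z-scores, which is one way to sanity-check the sketch.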
Pleiotropy-informed conjunctional false discovery rate
In order to quantify the degree of genetic overlap and identify shared genetic loci between the mental health profiles and the imaging features we deployed the pleiotropy-informed conjunctional false discovery rate (conjFDR) through the pleioFDR toolbox31. One of the advantages of conjFDR is that it can identify shared genetic loci regardless of effect direction and effect size, a feature that is useful when working with multivariate measures where effect direction might be lacking.
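The conjunction step can be sketched as follows. The conditional FDR of a SNP for one trait given a second trait is approximated here as its p-value divided by the empirical conditional distribution function among SNPs that are at least as significant in the second trait; the conjFDR is then the larger of the two conditional FDRs. This is a naive, quadratic-time sketch for illustration only and not the pleioFDR toolbox implementation; the function names and the toy p-values are hypothetical. Because only p-values enter the computation, effect direction plays no role.

```python
def cond_fdr(p_primary, p_conditional):
    """Empirical conditional FDR of the primary trait given the conditional trait."""
    n = len(p_primary)
    out = []
    for i in range(n):
        # SNPs at least as significant as SNP i in the conditional trait.
        subset = [p_primary[j] for j in range(n)
                  if p_conditional[j] <= p_conditional[i]]
        # Empirical conditional CDF of the primary p-value within that subset.
        # SNP i is always in the subset, so the CDF is never zero.
        cdf = sum(1 for p in subset if p <= p_primary[i]) / len(subset)
        out.append(min(1.0, p_primary[i] / cdf))
    return out


def conj_fdr(p_a, p_b):
    """Conjunctional FDR: the maximum of the two conditional FDRs per SNP."""
    a_given_b = cond_fdr(p_a, p_b)
    b_given_a = cond_fdr(p_b, p_a)
    return [max(x, y) for x, y in zip(a_given_b, b_given_a)]


# Toy p-values for four SNPs in two traits (hypothetical numbers).
p_trait_a = [1e-6, 0.5, 0.03, 0.8]
p_trait_b = [1e-4, 0.7, 0.2, 0.01]
conj = conj_fdr(p_trait_a, p_trait_b)
```

Taking the maximum is a conservative choice: a SNP receives a low conjFDR only if it has a low conditional FDR in both directions.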
Results
MOSTest revealed 10 significant loci across the 13 previously identified profiles of mental health (Figure 1). Positional mapping in FUMA revealed 48 genes associated with these loci (see Suppl. Table 1), which MAGMA linked to a number of brain structures such as the cerebellum and amygdala (Suppl. Figure 1). Among the identified genes, those mapped from the strongest GWAS loci were ADH1B and ADH5 (chromosome 4) and CRHR1 (chromosome 17).
Figure 1. Manhattan plot of the multivariate genome-wide association statistics for mental health. We identified 10 loci associated with the multivariate measure of mental health.
In order to identify shared genomic loci between the mental health profiles and psychiatric disorders, we used GWAS summary statistics from prior case-control studies of schizophrenia (SCZ)1, bipolar disorder (BIP)4, major depression (MD)6, attention-deficit hyperactivity disorder (ADHD)32, autism spectrum disorder (ASD)33, post-traumatic stress disorder (PTSD)34, and anxiety (ANX)8 (see also Suppl. Table 2). First, we compared the gene set from the multivariate mental health genome-wide association statistics with the gene set from each of the psychiatric disorders. Here we found 35 overlapping genes: 29 with SCZ, 7 with BIP, and 1 with ADHD (see Suppl. Table 3). It is important to note that the 7 overlapping genes with BIP were mapped from only 2 separate loci. Next, we extracted the loci from each case-control GWAS (202 in total) and assessed whether each locus was also significant in the multivariate genome-wide association statistics for mental health profiles. Of the 202 loci significant in any of the disorders, one reached genome-wide significance (P < 5e-8) in the multivariate genome-wide association statistics for mental health profiles and 122 reached only nominal significance (P < 0.05), potentially indicating some shared but small effects.
Next, we explored the genetic overlap between the mental health profiles and the psychiatric disorders through the conjunctional false discovery rate (conjFDR)31,35, which leverages pleiotropy between two phenotypes to estimate shared genetic determinants. ConjFDR allows for the discovery of shared genetic determinants even when those loci are not genome-wide significant in either of the traits in the analysis. Through conjFDR we identified 35 overlapping loci in total between the multivariate genome-wide association statistics for mental health profiles and psychiatric disorders: 10 with BIP, 8 each with MD and ADHD, 5 with SCZ, and 4 with ASD (see Suppl. Figure 2). FUMA identified 89 genes associated with these loci (see Suppl. Table 3). We found no overlapping loci between the mental health profiles and ANX or PTSD, which may be related to the limited power of these GWASs (see Suppl. Table 2).
Figure 2. Association strength per locus is depicted as the q-value from the conjunctional FDR. Values for FC and node variance are shown in the same figure in separate colors.
We then calculated the number of shared genetic loci between the multivariate genome-wide association statistics for mental health profiles and the two multivariate measures of the brain functional connectome using conjFDR. We used the GWAS summary statistics from our previous study of brain function13. In contrast to the prior study, in which we identified genetic overlap between brain function and psychiatric disorders, we here investigated overlap with the multivariate genome-wide association statistics for mental health to test whether this approach captures associations not revealed through case-control GWAS approaches. Figure 2 shows two Manhattan plots of the conjunctional FDR analyses of functional connectivity (FC) and node variance, each paired with the multivariate genome-wide association statistics for mental health profiles. The genetic signal was sufficient for these analyses (see Suppl. Figure 3). The multivariate genome-wide association statistics for mental health profiles shared 18 loci with functional connectivity and 5 with node variance. A full list of genes associated with the (in total) 23 unique shared loci between the multivariate summary statistics and FC or node variance is presented in Suppl. Table 5. The number of overlapping loci between the brain functional connectome and the multivariate genome-wide association statistics for mental health profiles (18 for FC, 5 for node variance) was generally larger than the number of shared loci between the brain functional connectome and the psychiatric disorders (with the exception of SCZ) identified in our previous study13. When we mapped the genes from these loci using FUMA and tested for enrichment in gene sets using the Reactome toolbox30, we found that the genes associated with these shared loci are involved in a number of neurobiologically relevant processes such as axonal growth regulation (NGFR and RHOA) and MECP2-related regulation of transcription factors (MEF2C; see Suppl. Table 6).
Discussion
In this study we identified a number of loci associated with multivariate genome-wide association statistics for mental health profiles and found overlapping loci with the measures of brain function and psychiatric disorders. Using MOSTest we were able to leverage the phenotypic overlap between different mental health profiles to identify new loci associated with a multivariate measure of mental health. Genes associated with these loci showed regional expression in different parts of the brain (e.g. cerebellum, amygdala).
Our analysis using conjFDR revealed a number of shared loci and genes between the multivariate genome-wide association statistics and the psychiatric disorders. This demonstrates shared genetics between psychiatric symptoms regardless of clinical diagnosis, emphasizes the utility of population-level phenotypes for investigating variance in mental health profiles, and highlights the advantage of leveraging pleiotropy between complex phenotypes to boost discovery. We found shared genes with all but two case-control GWASs (ANX and PTSD). These two also had the smallest sample sizes, so the null finding may reflect insufficient power to detect an effect36, or it may indicate the absence of shared effects with those disorders. The largest overlap was with SCZ, which shared 29 genes in the gene set with the multivariate measure. Future increases in the sample sizes of case-control GWASs may reveal shared genetics with other complex traits, including population-based mental health phenotypes and brain imaging features.
We also identified a number of overlapping loci between mental health profiles and fMRI measures of brain function, including 18 shared loci with functional connectivity and 5 shared loci with node variance. The higher number of shared loci for functional connectivity might be partially explained by the number of phenotypes in each composite measure. While the functional connectivity GWAS comprises 210 measures, i.e. partial correlations between 21 brain nodes, the node variance GWAS encompasses only the temporal variance in each node. It is possible that the number of phenotypes included in the multivariate analysis affects discovery25. Both the functional connectivity and node variance summary statistics had the same sample size (N = 30,701). For our analyses this means that the difference in their overlap with the multivariate genome-wide association statistics for mental health is due either to the discrepancy in the number of features contained within the composite measure or to different biological processes underlying the two measures. The measures differ in that functional connectivity refers to the correlation between brain networks (edge strength), which is possibly governed by different processes than the temporal variance in activity within brain networks. Overall, we found a number of genes associated with the shared loci that are involved in biologically relevant processes such as axonal growth and energy transport (see Suppl. Table 6). Although more thorough functional analysis is necessary, this could suggest that axonal growth processes are a shared feature of brain connectivity and mental disorders, which would be in line with previous evidence linking axonal growth with each of them independently37–39.
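The difference in feature counts between the two composite measures follows from simple combinatorics: 21 nodes yield 21 × 20 / 2 = 210 unique node pairs (edges), but only one temporal variance per node. A minimal sketch of this arithmetic, with illustrative variable names that are not taken from the analysis code:

```python
from itertools import combinations

n_nodes = 21  # number of brain nodes used for both measures

# Each unordered pair of distinct nodes defines one partial-correlation edge.
edges = list(combinations(range(n_nodes), 2))
n_fc_features = len(edges)        # n_nodes * (n_nodes - 1) / 2
n_variance_features = n_nodes     # one temporal variance per node

# Ratio of features between the two composite measures.
feature_ratio = n_fc_features / n_variance_features
```

The functional connectivity measure therefore contains ten times as many features as the node variance measure, which is the discrepancy discussed above.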
The two conjunctional analyses with fMRI measures and the multivariate genome-wide association statistics for mental health each showed a number of overlapping loci. Not all shared loci were unique, which can be partially explained by the definition of the brain networks in our analyses. The functional connectivity and the node variance measures use the same 21 nodes, and, ultimately, the two measures reflect different properties of the same time series. We found that the number of overlapping loci between the multivariate genome-wide association statistics for mental health and the brain functional connectome was generally larger than in our previous study of shared genetic loci between psychiatric disorders and the brain functional connectome. This may partly be due to the larger sample size of the multivariate measure of mental health, but it could also reflect that the multivariate genome-wide association statistics capture genetic variance more generally related to the brain functional connectome. We found little direct overlap between the loci of these two GWASs when considered separately, which highlights the discovery boost that conjFDR provides for phenotypes with generally low heritability.
The main implication of our findings is that we can identify shared genetic variants between a multifactorial measure of mental health in an undiagnosed population sample and fMRI-based measures of brain functional connectivity. Several limitations should be considered. First, the data were obtained from a middle-aged and older White British population, which limits the generalizability of the findings. Further, the mental health questionnaires are self-administered, so the data are vulnerable to various response and self-selection biases40. We excluded individuals with a psychiatric diagnosis from our independent component analysis in order to maximize the population variance and to mitigate the influence of a smaller number of individuals with a (diagnosed) psychiatric condition. This results in a healthier sample, lower variance in a number of severe symptom domains, and possible survivor biases24. This bias also extends to the imaging data, where participants are reportedly healthier than the general population41. Further, MOSTest currently lacks effect direction. This complicates further analyses such as genetic correlations that require reliable effect directions. Nonetheless, FUMA and MAGMA revealed brain structures associated with these mapped genes, such as the cerebellum, amygdala, and various parts of the cortex, which have been linked to psychiatric disorders and symptoms. To what degree these shared genes can explain shared clinical characteristics such as symptoms is an important and relevant question that needs to be answered in future studies.
In conclusion, our multivariate GWAS on 13 mental health symptom profiles showed a number of shared genetic loci with two fMRI-measures reflecting brain function and connectivity. This provides further genetic evidence of an association between brain function and mental health traits in the population.
Data Availability
All data used in this study are part of the publicly available UK Biobank initiative (https://www.ukbiobank.ac.uk/). Summary statistics for the disorders are publicly available through their respective consortia. The summary statistics for the multivariate analyses will be shared on GitHub upon acceptance.
Conflicts of interest
D.R., D.A., O.F., D.vd.M., O.B.S., L.T.W. and T.K. declare no conflicts of interest. O.A.A. is a consultant to HealthLytix and has received speaker's honoraria from Lundbeck.
Author contributions
D.R. and T.K. conceived the study; D.R. analyzed the data with contributions from T.K.; all authors contributed conceptual input on methods and/or interpretation of results; D.R. and T.K. wrote the first draft of the paper and all authors contributed to the final manuscript.
Code availability
Code will be made publicly available via GitHub (https://www.github.com/norment/open-science) upon acceptance of the manuscript.
Supplementary Figures
List of biological structures mapped by MAGMA implemented through FUMA. Separate facets for up-regulated (top), down-regulated (middle), and both directions (bottom). Alongside some tangentially related structures (e.g. sexual organs, esophagus), a large number of items at the higher end of the spectrum are brain structures (e.g. cerebellum, amygdala, cortex).
Manhattan plots illustrating the shared genetic determinants between a number of psychiatric disorders and the multivariate genome-wide association statistics for mental health. The largest number of shared loci with the multivariate summary statistics was found for BIP (10 loci), followed by ADHD and MD (both 8 loci), SCZ (5 loci), and ASD (4 loci).
Figure showing no inflation of signal in the conjFDR at different thresholds. Genetic signal overall was not very strong, but sufficient for the analyses in this pipeline.
Supplementary Tables
Genes mapped by FUMA from the multivariate summary statistics for mental health. We found in total 48 genes associated with the genetic signal of the multivariate measure.
List of GWASs we included in the genetic analyses in this paper and their metadata. Sample size in the table and in the summary statistics used in the final analyses may differ since we excluded individuals already included in the UK Biobank and included only individuals with White European ancestry.
Table showing the number of overlapping genes identified through overlap in gene sets, the number of genes from the disorder GWAS alone, and the number of shared genes identified through conjFDR.
Genes associated with conjFDR output from diagnosis and multivariate genome-wide association statistics for mental health. This list only includes psychiatric disorders for which we identified significant loci. The number of mapped genes can be greater than the number of discovered loci since loci can be associated with more than one gene.
Genes identified through FUMA associated with the shared loci between the multivariate genome-wide association statistics for mental health and FC or node variance. The number of genes in this gene set can be larger than the number of loci since a locus can be associated with more than one gene.
Biological processes associated with the shared genetic determinants between the multivariate genome-wide association statistics for mental health and the brain functional connectome as identified by the Reactome toolbox.
Acknowledgements
The authors were funded by the Research Council of Norway (#276082 LifespanHealth, #223273 NORMENT, #283798 ERA-NET Neuron SYNSCHIZ, #249795), the South-East Norway Regional Health Authority (2019101, 2019107, and 2020086), and the European Research Council under the European Union’s Horizon2020 Research and Innovation program (ERC Starting Grant #802998), as well as the Horizon2020 Research and Innovation Action Grant CoMorMent (#847776). This research has been conducted using the UK Biobank Resource (access code 27412, https://www.ukbiobank.ac.uk/). E.T. has been supported by the Foundation “De Drie Lichten” and The Simons Foundation Fund in The Netherlands. This work was performed on the TSD (Tjenester for Sensitive Data) facilities, owned by the University of Oslo, operated and developed by the TSD service group at the University of Oslo, IT-Department (USIT). Computations were also performed on resources provided by UNINETT Sigma2 - the National Infrastructure for High Performance Computing and Data Storage in Norway.