Abstract
Psychiatric and cardio-metabolic conditions are commonly comorbid, being a leading cause of disability. Early adversity is a risk factor for both conditions, however, biological pathways remain unknown. The dopamine (DA) system is sensitive to early adversity and influences the development of comorbidities. We hypothesized that early life adversity would functionally link these conditions with the mesocorticolimbic dopamine system as a critical moderator pathway. We computed a co-expression based polygenic score (ePRS) reflecting variations in the function of the dopamine transporter (DAT) gene network in the prefrontal cortex and striatum, the final targets of the mesocorticolimbic pathway. We explored the interaction effects of the ePRS with a score of early life adversity on presence of psychiatric and cardio-metabolic comorbidities in adults (UK Biobank, N= 60016) and adolescents (ALSPAC, N= 910). In adults we also explored genetic and environment effects on gray matter density variations. As predicted, the mesocorticolimbic DAT1 ePRS significantly moderated the impact of early life adversity on the risk for both psychiatric (schizophrenia, neuroticism, mood and substance use disorders) and cardio-metabolic (type 2 diabetes, atherosclerosis, cardiovascular disease) comorbidities in adults and adolescents. Brain gray matter densities in the insula and prefrontal cortex were significantly associated with SNPs from the DAT1 ePRS implicating these regions as critical dopaminergic targets for psychiatric/cardio-metabolic comorbidities. These results reveal that psychiatric and cardio-metabolic comorbidities share common developmental pathways and underlying biological mechanisms.
Introduction
The co-occurrence of more than one chronic disease1 has high prevalence in primary care settings2, inflating health care utilization and functional disability3. Psychiatric and cardio-metabolic disorders, which are highly comorbid, rank amongst the leading global causes of disability-adjusted life years worldwide4, 5. Prospective studies show a bi-directional relationship between psychiatric and cardio-metabolic conditions6. Meta-analytic evidence from longitudinal studies indicates that diabetes increases the risk for depression by approximately 25% and that depression increases the risk for type 2 diabetes by 40-60%5, 7. The odds for depression also increases with one or more non-psychiatric coexisting chronic conditions, especially coronary artery disease, chronic arthritis, and stroke8. Among adult patients with schizophrenia, the prevalence of diabetes averages a prevalence of 15% and this association persists even after controlling for obesity and use of antipsychotic drugs9.
The underlying mechanism for these co-morbidities remains unknown, but an emerging explanation is that psychiatric and cardio-metabolic disorders share common developmental pathways. Indeed early life adversity increases the risk for a range of psychiatric disorders such as depression10, addiction11, attention deficit hyperactivity disorder12 and schizophrenia13 and for metabolic diseases such as type II diabetes14 and cardiovascular disease15. An obvious question concerns the biological mechanisms that underlie such a developmental trajectory.
The central dopamine system is highly sensitive to early adversity16 and proposed as a mechanism for developmental pathways to multiple psychiatric and metabolic comorbidities17, 18. Dysfunction of the mesocorticolimbic dopamine system associates with a range of mood disorders and psychosis19, 20, as well as dysregulated food intake and altered energy homeostasis21-23. Dopamine signaling is influenced by core hormones linked to the metabolic syndrome, such as leptin and insulin, through actions on the dopamine transporter (DAT)21, 24. We hypothesized that the mesocorticolimbic DAT1 network underlies the association between early life adversity and the comorbidity of psychiatric and cardio-metabolic disorders. We created a mesocorticolimbic co-expression-based polygenic score (DAT1 ePRS) and investigated if the DAT1 ePRS would moderate the association between early adversity and psychiatric/cardio-metabolic comorbidity in adults using the UK Biobank and in adolescents using the Avon Longitudinal Study of Parents and Children (ALSPAC).
Materials and Methods
Detailed description is provided in the Supplementary Materials and Methods.
Participants
We used genomic and phenotypic data from two cohorts, one of adults and one of adolescents: 1) the UK Biobank and 2) Avon Longitudinal Study on Parents and Children (ALSPAC). Informed consent was obtained from each participant, and the projects have been approved by: 1) The North West Multicentre Research 580 Ethics Committee (REC reference 11/NW/0382), the National Information Governance Board for Health and Social Care, and the Community Health Index Advisory Group for UK Biobank; 2) the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees.
Adult cohort
UK Biobank, a large population-based study from the United Kingdom, was used as the adult human cohort25. Participants, aged 37-73, were recruited between 2006 and 2010 resulting in 502,543 participants. After all exclusions and inclusion criteria, the number of subjects that remained for the analysis was 60,016 (mean age of 54.51, SD=7.60). For the brain imaging analysis, a subsample of these 60,016 subjects was considered (N=9,291, mean age = 54, SD = 7.371). Detailed description of each inclusion and exclusion criteria and the corresponding sample size at each step can be found in Supplementary Figure 1.
Adolescent cohort
To test our hypothesis in a earlier developmental time point with the aim to explore the developmental origins of psychiatric and cardiometabolic comorbidities, we used data from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort26-28. This is a transgenerational prospective observational cohort that recruited 14,451 pregnant women residents in Avon County, UK. Additional recruitment (N=913) was done during later phases, bringing the total sample size to 15,454. The data from the adolescent offspring aged between 15.5 and 17.5 was used in this study. Only subjects with available phenotypic data of interest, early life adversity scores and genotyping data were considered for the analyses (N=910). Detailed description of each inclusion and exclusion criteria and the corresponding sample size at each step can be found in Supplementary Figure 2.
Identification of the mesocorticolimbic DAT1co-expression gene network and ePRS calculation
Figure 1 shows the steps involved in the identification of the gene networks and the calculation of the ePRS score. The ePRS was calculated considering genes co-expressed with the DAT1 gene in the PFC and Striatum, the two brain regions representing the final targets of the mesocorticolimbic pathway structure. We aimed to identify DAT1 co-expression networks within each brain region, with the final ePRS being a joint representation of the functional co-expression networks in these two mesocorticolimbic regions. As described previously29-33, we began by using brain region-specific RNA sequencing data from mouse available at GeneNetwork (http://genenetwork.org/)34 to identify DAT1 gene co-expression networks (absolute value of co-expression correlation with DAT1 gene greater or equal to 0.5), which were then converted to human orthologs by using the biomaRt package35. GeneNetwork was used to obtain gene expression from rodents since our previous findings demonstrated an effect of early life adversity on dopaminergic mesocorticolimbic system in rodents22, 23, 36. Next, since we were interested in gene networks that were active during the developmental period in which adversity occurred (early life) and active when the brain is still undergoing core maturational processes in humans, we used BrainSpan to select autosomal transcripts expressed at least 1.5-fold more during fetal and child development (0–60 months after birth) in comparison to adulthood (20–40 years of age). Only common genes between the GeneNetwork human orthologs and BrainSpan derived lists were retained. These processes resulted in two lists of DAT1 co-expressed genes (210 genes for the PFC and 51 genes for the striatum). No gene overlap was observed between the two lists. Our final list included 261 genes (Supplementary Table 1). We then mapped SNPs in the human ortholog genes comprising the mesocorticolimbic DAT1 gene network using the GRCh37.p13 assembly of the NCBI to source chromosome and start/ end position, and to gather all the gene-SNP pairs from the GTEx dataset in human PFC and Striatum. These lists were merged with the genotyping data from UK Biobank and ALSPAC cohort, retaining only the common SNPs and subjecting the final SNP list to linkage disequilibrium clumping (r2 < 0.2) within 500kb radious, which resulted in 7,940 and 1,330 independent functional SNPs retained for the PFC and striatum lists respectively in the UK Biobank cohort.
To calculate the mesocorticolimbic DAT1 ePRS, the number of effect alleles (genotype information from the study samples) at a given SNP was weighted using the estimated brain-region-specific effect of the genotype on gene expression from the GTEx data37. We also accounted for the direction of the co-expression of each gene with DAT1 by multiplying the weight by -1 in case the expression of a gene was negatively correlated with the expression of the DAT1 gene. The sum of the weighted values from all SNPs for each individual in the cohorts resulted in the region-specific PFC and striatum scores, that were than summed in one score. The mesocorticolimbic DAT1 expression-based polygenic score (ePRS)30, 32, 33 reflects the variation of gene expression of the genes co-expressed with the DAT1 gene in the PFC and striatum in a combined manner. The mesocorticolimbic DAT1 ePRS score was used as a continuous variable in this study.
Validation/specificity of the findings
Using the same steps described above, control ePRS were created to test the validity and specificity of the mesocorticolimbic DAT1 ePRS to moderate the effect of early life adversity on the study outcomes. The PFC DAT1 ePRS and the striatum DAT1 ePRS were used separately in the gene by environment model to test if the summed effect of these two brain regions is more informative than the individual scores. We also tested a VTA DAT1 ePRS (number of genes = 83, number of SNPs = 3.182) based on the high density of dopaminergic neurons on this region38, to further test the specificity of the brain regions chosen for the mesocorticolimbic DAT1 ePRS. For this score, the substantia nigra gene expression information from GTEx was used to weight the ePRS score as there is no available data for the VTA brain region in GTEx.
Functional enrichment analysis
To characterize the biological functions associated with the mesocorticolimbic DAT1 co-expression gene network, functional enrichment analysis was performed using MetaCore® software from Clarivate Analytics (https://portal.genego.com). SNPs that compose the mesocorticolimbic DAT1 ePRS were used in the analysis. The significance was considered for the false discovery rate (FDR) adjusted p-value <0.05. To investigate network centrality measures, protein-protein (PPI) network interactions were identified using the STRING database (https://string-db.org)39 and the mesocorticolimbic DAT1 ePRS genes. These elements interaction were then visualized using the Cytoscape® software40. The nodes are the elements of a network and edges are the connection between these elements. Bottleneck genes are defined as having high betweenness, hub genes are defined as genes with high degree (nodes with more connections). To analyze the topological properties associated with this gene network, the CentiScaPe app in Cytoscape® was used to calculate the degrees and betweenness (genes that act as ‘bridges’ between nodes in a network). We used this information to define the “hub genes” within the network, characterized as nodes with degrees higher than +1SD above the mean; and the “bottlenecks” characterized as nodes with betweenness higher than +1SD above the mean. Genes that are both bottlenecks and hubs were considered the central nodes of the network41. In order to investigate if the genes that composed the mesocorticolimbic DAT1 ePRS have similar patterns of gene co-expression during different life periods in humans, we used the gene expression data from human samples extracted from the BrainSpan database42. Genetic correlations between the SNPs included in the score and other genome-wide association studies, accounting for linkage disequilibrium between SNPs was done using LD Hub43 (http://ldsc.broadinstitute.org/ldhub/). We also used FUMA for GWAS enrichment analysis 44 (Supplementary Materials and Methods).
Early Life Adversity Score
A cumulative measure combining multiple indicators of early life adversity according to the methods demonstrated previously33 was calculated for UK Biobank and ALSPAC participants. (Table 1 and Supplementary Materials and Methods).
Outcome measures
Adult cohort
diagnostic terms from across all participants hospital inpatient records, coded according to the International Classification of Diseases version 10 (ICD-10)45 were used. Selected diagnostic terms from chapter V Mental and behavioural disorders, chapter IV Endocrine, nutritional and metabolic diseases and chapter IX Diseases of the circulatory system were included. The presence of at least one cardio-metabolic diagnosis and at least one psychiatric diagnosis was considered a “co-morbidity” case. Comorbidity was coded as a binary variable (1=“yes” or 0=“no”). T1 structural brain MRI pre-processed imaging data were generated by an image-processing pipeline developed and run on behalf of the UK Biobank46 (Supplementary Materials and Methods).
Adolescent cohort
Presence of psychiatric and metabolic disorders are less common at early stages of development47. Since no diagnoses for described above cardio-metabolic and psychiatric disorders were available in ASLPAC cohort and to maintain consistency with the outcome tested in UK Biobank adult cohort, we created a comorbidity risk variable by cluster analysis, considering known indicators of risk to develop psychiatric and metabolic disorders later in life. Total difficulties score measured by the Strengths and Difficulties Questionnaire, depression and anxiety scores measured by Computerized Interview Schedule – Revised (CIS-R), Homeostatic Model Assessment of Insulin Resistance (HOMA-IR), and waist circumference (cm) were considered. The resulting cluster membership, which represented comorbidity risk, was coded as a binary variable (1= “yes comorbidity” or 0= “no comorbidity”). (Supplementary Materials and Methods, Supplementary Table 2, Supplementary Figure 3).
Statistical Analysis
Statistical analysis were performed using R48. For the descriptive statistics, the ePRS groups were defined by median split, and a comparison between low and high ePRS groups was done using student t test for continuous variables and a chi-square test for categorical variables (Table 2 and Table 3). Significance levels for all measures were set at p < 0.05. The comorbidity risk variable by cluster analysis was performed using the mclust package49. This algorithm uses a model-based classification and density estimation of the z-standardized variables based on finite Gaussian mixture modelling. The method assumes that inputted variables can be explained by an underlying latent categorical cluster variable that represents distinct profiles within the sample, both in a qualitative and quantitative manner. A cluster size solution of two was retained, representing two distinct profiles within the sample associated with the inputted variables. All variables were z-standardized prior to entering the clustering procedure. Regression analysis was carried out to test if the means for each variable were different between the two identified cluster groups indicating two distinct profiles (Supplementary Table 2). Sex and social economic status were used as covariates, because of their know influence on the variables included in the cluster analysis50-52.
The gene by environment (GxE) interaction was explored by logistic regression analysis. The early life adversity score, the mesocorticolimbic DAT1 ePRS and the interaction term between them were used as predictors for both cohorts. In the UK Biobank analysis were adjusted by sex, age, the first forty genetic population structure principal components, genotyping array, and assessment center (Table 2). In the ALSPAC analysis, adjusted by sex, the first ten genetic population structure principal components and breastfeeding duration for being significant at baseline comparisons (Table 3). The analysis was also adjusted for zBMI at 15.5 years of age to diminish confounding factors while analyzing metabolic variables (as the ones considered in the comorbidity risk variable). In case of significant gene by environment (GxE) interactions, post hoc simple slope analysis was performed to investigate how the environment effect varies as a function of the genetic background53.
Gray matter density from the UK Biobank participants was analyzed as an outcome in a multivariate parallel independent component analysis (pICA). This analysis was applied to identify the effect of early life adversity on the relationship between two different data modalities in a data-driven manner54. This analysis estimates the maximum independent components within each data modality separately while also maximizing the association between modalities using an entropy term based on information theory54. Each SNP that composes the mesocorticolimbic DAT1 ePRS weighted by GTEx (genotype * GTEx gene expression slope for each SNP) and whole brain voxel based gray matter density were used in the analysis. Weighted SNPs were adjusted for the most significant principal components from population stratification (ancestry). The subjects were split in two groups according to the early life adversity score (yes or no adversity present). A significant effect of age was observed on the gray matter density (r=-0.59, p<2.2e-16), suggesting that as age progresses, a significant loss of gray matter density is seen in the UK Biobank participants (Supplementary Figure 4). To account for the age effect, the participants were separated in blocs of 5 years of age (from 40 to 70 years old), resulting in 6 blocs, as the pICA analysis does not allow inclusion of co-variates. The Fusion ICA Toolbox (http://mialab.mrn.org/software/fit/) within MATLAB® R2019 was used to run the analysis. The number of independent components was estimated using minimum description length criteria54 for the MRI modality and SNP dimensionality inside the toolbox for the genetic modality. Components for both modalities were converted to z-scores and a threshold at |Z| > 2.5 was used to identify significant brain regions and SNPs that contributed the most for the component overall pattern. Significant brain regions were further analyzed in terms of frequency across the 6 groups analyzed, to narrow down important brain regions. Loading coefficients, which describe the presence of the identified component across subjects54 were extracted for each component, modality and subject. The mean subject-specific loading coefficients of these components between subjects from high and low prenatal adversity groups was compared using Student’s t-test. To identify the anatomical classification of brain areas included in the significant MRI components, Talairach coordinates were used. The significant SNPs (|Z| > 2.5) from the significant genetic components were analyzed using MetaCore®, to identify associated underlying biological pathways and gene ontology processes (Figure 3).
Results
The mesocorticolimbic DAT1 gene network is co-expressed in humans
To investigate if mouse-generated DAT1 gene network was co-expressed in humans, we explored the gene co-expression patterns of the mesocorticolimbic DAT1 gene network throughout human development using gene expression data from human postmortem samples42. A high co-expression was expected in childhood, as the mesocorticolimbic DAT1 gene network was enriched for genes over expressed in this life period (see Figure 1A and Supplementary Materials and Methods,). However, prominent gene co-expression clusters were also seen in adolescents and adults (Figure 1B and 1C). When visualizing and exploring the network properties (Figure 1D), 12 genes were considered central (Figure 1E), and NPM1 was the central node. PIK3CA was among the most important genes serving as a bridge between other nodes. Among the hub genes, HNRNPA1 was the gene with the highest degree. The main pathways associated with the network include epigenetic regulation, activation of the PI3K pathway and regulation of the mTOR transcription factor, comprising regulation of translation, cell proliferation and oxidative stress (Supplementary Figure 5).
Mesocorticolimbic DAT1 ePRS moderates the association of early life adversity with the risk for psychiatric/cardio-metabolic comorbidities in adults
In agreement with the literature, our early adversity score was strongly associated with the presence of co-morbidities in both UK Biobank (b= 0.442, Odds ratio (OR)= 1.556, 95% confidence interval (CI):1.492– 1.620, P <0.001) and ALSPAC (b= 0.201, OR= 1.222, 95% (CI): 1.087 –1.358, P =0.003). For the UK Biobank there were no significant main association of the ePRS with co-morbidity risk (b= 0.022, OR= 1.021, 95% (CI): 0.939 – 1.102, P= 0.603). In contrast, and consistent with our anticipated hypothesis, there was a significant interaction between the mesocorticolimbic DAT1 ePRS and early life adversity on the presence of psychiatric/cardio-metabolic co-morbidities in UK Biobank adults (b= -0,069, OR = 0.933, 95% (CI): 0.868 – 0.997, P = 0.036, n = 60016) (Figure 2A).
See Supplementary Materials and Methods for details description of the genotyping procedure for each cohort.
In an earlier developmental time point (ALSPAC adolescents), the GxE model revealed a significant interaction between the mesocorticolimbic DAT1 ePRS and early life adversity on the presence of psychiatric/metabolic comorbidity risk (b = 0.180, OR= 1.198, 95% (CI): 1.060– 1.335, P= 0.009, n=910) (Figure 2B). Similar to the findings in adults, there was no significant main association of the ePRS with the comorbidity risk (b= 0.105, OR= 1.110, 95% (CI): 0.945 – 1.275, P= 0.150). These results indicate a developmental trajectory, in which early indicators of risk to develop psychiatric and metabolic comorbidities in adulthood can be seen in adolescents as a function of the mesocorticolimbic DAT1 co-expression gene network and early life environmental conditions.
To further validate and test the specificity of our findings, the PFC DAT1 ePRS and the striatum DAT1 ePRS were used separately in the GxE model to test if the summed effect of these two brain regions is more informative than the individual scores. No significant interaction was found between early life adversity and the PFC DAT1 ePRS on the probability of having psychiatric and cardio-metabolic comorbidities in adults (b= -0.023, OR= 0.977, 95% (CI): 0.913 – 1.041, P= 0.477). The interaction model using the striatum DAT1 ePRS and early life adversity showed a non-significant trend (b= -0.023, OR= 0.937, 95% (CI): 0.872 – 1.002, P= 0.054) in predicting presence of psychiatric and cardio-metabolic comorbidities in the UK Biobank cohort. We also used the ventral tegmental area (VTA) DAT1 ePRS as another control score, as this is a brain region dense in dopaminergic neurons38. No significant interaction was found in adults (b= -0.020, OR= 0.980, 95% (CI):0.921 – 1.045, P= 0.549). As a whole, we conclude that the results from the GxE model using the mesocorticolimbic DAT1 ePRS are specific to this gene network and to the combination of the PFC and striatum gene network in adulthood.
In the adolescent ALSPAC cohort, the PFC DAT1 ePRS (b= 0.071, OR= 1.074, 95% (CI): 0.830 – 1.111, P= 0.282) and the VTA DAT1 ePRS (b= -0.076, OR=0.926, 95% (CI): 1.190 – 0.890, P= 0.257) interactions with early adversity were not significant for the presence of co-morbidity risk. However, the striatal DAT1 ePRS showed a significant interaction with early adversity in predicting the presence of a psychiatric/cardio-metabolic comorbidity risk in adolescence (b = 0.165, OR = 1.179, 95% (CI): 0.597 – 1.246, P= 0.021). This suggests that the striatal DAT1 gene network may also have an independent role in the emergence of these co-morbidities in response to early life adversity during adolescence before the prefrontal dopaminergic connections are fully established55.
Mesocorticolimbic DAT1 gene network SNPs are enriched in genome-wide association studies for both psychiatric and metabolic conditions
To address whether the genes included in the network were relevant for psychiatric and metabolic illnesses we compared our network with genes identified in GWAS of psychiatric and metabolic disorders. We observed that the mesocorticolimbic DAT1 ePRS was significantly enriched for MDD GWAS (ρ = 0.23, p = 0.021). When examining genetic correlations between SNPs from the mesocorticolimbic DAT1 ePRS and SNPs from GWASes from the LD Hub catalog43, we observed genetic correlations with neuroticism (rg= 0.353; P= 0.0021), free cholesterol in small VLDL (rg= 0.328; P= 0.042) and triglycerides in very large VLDL (rg=0.248; P= 0.050). This analysis confirms that the genetic variation associated with the mesocorticolimbic DAT1 co-expression network is relevant for both psychiatric and metabolic phenotypes.
SNPs from the mesocorticolimbic DAT1 ePRS are related to gray matter variations in the insula and prefrontal cortex
To establish the anatomo-functional relevance of our mesocorticolimbic DAT1 gene network, we investigated correlations between the SNPs from the mesocorticolimbic DAT1 ePRS and voxel-based gray matter density in UK Biobank participants using a multivariate parallel independent component analysis (pICA)54 (Figure 3 and Supplementary Materials and Methods). Since a significant correlation of age was observed in gray matter density (Supplementary Figure 4), analyses were carried out in 6 groups of participants separated in blocks of 5 years of age (from 40 to 70 years old). Only the first significantly linked pair of components with higher correlation index values was selected for each age block to be further explored (Supplementary Table 3). A significant association with adversity was observed in components at different ages. In these cases, the relation between SNPs and gray matter were moderated by the early life adversity score (see Supplementary Table 4 for a full list of significant SNPs per age band). These subsets of significant SNPs are related to variations in gray matter density in the insula, inferior and medial frontal gyrus, striatum (pallidum, putamen and caudate), parahippocampal gyrus and cingulate gyrus across ages (see Supplementary Table 5 for a full list of significant brain regions from all age bands). Enrichment analysis of these significant SNPs using MetaCore® (Supplementary Table 6) showed that the most common pathway maps identified are involved in synaptic plasticity and transmission through the action of Ephrin-B and the cell cycle. Gene ontology enrichment revealed central nervous system and neuron development, neurogenesis, regulation of glutamate receptor signaling pathway, response to monoamine and catecholamine and regulation of insulin secretion involved in the cellular response to glucose (see Supplementary Table 6 for a full list).
Discussion
Our study suggests that exposure to early life adversity increases the risk for later co-morbidities between cardio-metabolic and psychiatric conditions in adulthood 10-14, and our novel functional genomics approach provides evidence for the mesocorticolimbic DAT1 gene network as a salient mechanism moderating this association. This finding is aligned with the critical role of the dopaminergic system in environmental responsivity30, 56.
Our enrichment analysis results showed that the mesocorticolimbic DAT1 gene network is highly co-expressed across childhood, adolescence and adulthood. This finding is consistent with our finding of a moderating correlation for the DAT1 ePRS in both adolescents and adults. The difference in simple slopes seen between adolescents and adults (Figure 2) suggests that the mesocorticolimbic DAT1 ePRS leads to a differential timing for the emergence of the co-morbidity phenotype in response to early adversity, which can be important to inform specific vulnerabilities for prevention measures.
We also identified candidate biological mechanisms underlying the moderating correlation of the mesocorticolimbic DAT1 ePRS on cardio-metabolic – psychiatric comorbidities, for example mTOR and PI3K, which are involved in intracellular insulin signaling. The mTOR (mammalian target of rapamycin) pathway is activated in insulin-induced long-term depression of DA neurons in the VTA57. A central gene in our network, PIK3C, operates in both the PI3K and mTOR pathways58. Insulin-induced activation of the PI3K/Akt/mTOR pathway is an important protective factor for cardiomyocytes apoptosis, implicated in myocardial injury associated with metabolic disturbances59. Insulin resistance is a risk factor for cardio-metabolic and brain-based disorders, including type II diabetes, cardiovascular disease, Alzheimer disease, and major depressive disorder60, 61. Insulin modulates mesocorticolimbic DA neurotransmission through different mechanisms, one of which is to increase the capacity of DA reuptake by DAT by activating the phosphatidylinositol (PI) 3-kinase24, 62. Insulin also reduces DA release in rodent nucleus accumbens and medial prefrontal cortex slices63. Taken together, the evidence suggests that insulin modulation of dopamine function may be an underlying mechanism by which the mesocorticolimbic DAT1 gene network moderates the risk for developing psychiatric and cardio-metabolic co-morbidities in response to early life adversity.
Interestingly, the subset of SNPs from the DAT1 ePRS that was related to gray matter density in all age bands in our anatomical-functional correlation analysis is also associated with the regulation of insulin secretion and a cellular response to glucose and insulin stimulus. This subset of insulin-related SNPs from our DAT1 ePRS was correlated with gray matter density variations in the insula, medial and inferior frontal gyrus, striatum (lentiform nucleus and caudate) and parahippocampal gyrus. The insula is involved in emotional recognition and processing, cognition and motivation64. Altered insular gray matter volumes occur in patients with type II diabetes and are associated with increased anxiety and depression65 as well as cognitive impairment66. Importantly, insulin resistance moderates the relationship between childhood abuse and insula-precuneus connectivity in depressed and overweight youth67.. Reduced gray matter volume in the parahippocampal gyrus is a hallmark of schizophrenia68 and functional connectivity between the parahippocampal gyrus and prefrontal areas is positively correlated with self-reported childhood adversity in adults69, further implicating these regions as a critical site for adversity-related cardio-metabolic – psychiatric conditions.
Specificity analysis revealed that the GxE results in the UK Biobank cohort were apparent only for the mesocorticolimbic DAT1 gene network, in comparison to other important sites for dopamine transmission. However, in adolescents, the striatum DAT1 ePRS also showed a significant interaction with early adversity, suggesting an independent role in the emergence of co-morbidities in response to early life adversity. This agrees with evidence of prolonged mesocorticolimbic dopamine axons growth (from the striatum to the PFC) during adolescence55. Due to this late maturation, striatal dopaminergic axons are especially vulnerable to environmental effects during development70. The Netrin-1 receptor, DCC, is one of the main molecules responsible for axonal guiding, determining the axons’ extent of growth and final target in the PFC71. The DCC gene is indeed present in our mesocorticolimbic DAT1 gene network (Supplementary Table 1).
Given that DAT1 gene is expressed in dendrites and cell bodies of dopaminergic neurons and DAT concentration is higher in the VTA and substantia nigra38, one might expect significant associations with a VTA DAT1 gene co-expression network. In our study, however, the terminal gene networks (PFC and striatum) were strong moderators of the association between early adversity and the co-morbidities, while the VTA co-expression network showed no significant associations. mRNA axonal transport and protein synthesis at the terminal is an important mechanism for regulation of neurotransmitter synthesis and reuptake72, and could explain the presence of DAT1 mRNA at the terminals, consistently detected in numerous human post-mortem studies38, 73-75. Indeed, one of the central nodes of our network is HNRNPA1 (Figure 1E), a gene associated with mRNA transport and synthesis76, suggesting that DAT1 mRNA transport to terminals may be a key mechanistic feature of our DAT1 mesocorticolimbic gene network. DAT1 protein vesicular traffic has a limited contribution to DAT1 concentration in synapses77, hence other forms of regulation of DAT1 availability in terminals – for instance via mRNA axonal transport and terminal protein synthesis – may be in place, in agreement to our findings.
Our ePRS does not consider intronic regions, potentially ignoring important regulatory elements. Moreover, our developmental results are based on cross sectional studies, and further longitudinal data are needed to better describe this trajectory.
We observed that the association between environmental and genetic factors can place individuals at risk for adult co-morbid chronic conditions from an early age, and that dopaminergic neurotransmission has a central role in moderating the association of the early environment with the risk for these diseases. These findings open opportunities for designing more effective interventions and prevention strategies, especially with the growing support from precision medicine.
Data Availability
For UK Biobank, data can be purchased; the study website contains details of all the data that is available at https://biobank.ndph.ox.ac.uk/showcase/. For ALSPAC, data can be purchased; the study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool at http://www.bristol.ac.uk/ alspac/researchers/our-data/.
Conflict of Interest
BB, DMA, EJMF, RML, SP, OK, ZW, IP, CP, AKP, CD, MJM and PPS report no biomedical financial interests or potential conflicts of interest.
Acknowledgments
We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC and UK Biobank teams, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. Funding: The UK Medical Research Council and Wellcome (Grant ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). The research using ALSPAC was specifically funded by the Wellcome Trust and MRC (Grant ref: 076467/Z/05/Z) and Wellcome Trust (Grant ref: 08426812/Z/07/Z). GWAS data was generated by Sample Logistics and Genotyping Facilities at Wellcome Sanger Institute and LabCorp (Laboratory Corporation of America) using support from 23andMe.
This project was supported by The JPB Foundation through a grant to The JPB Research Network on Toxic Stress: A Project of the Center on the Developing Child at Harvard University. This work was also funded by the Fonds de recherche du Québec – Santé and Canadian Institutes of Health Research (CIHR, PJT-166066, PI Silveira PP). This publication is the work of the authors and Barbara Barth and Patricia P. Silveira will serve as guarantors for the contents of this paper.