ABSTRACT
DNA methylation variations are prevalent in human obesity, but evidence of a causative role in disease pathogenesis is limited. In this study, we combine epigenome-wide association and integrative genomics to investigate the impact of subcutaneous and visceral adipocyte DNA methylation variations in extreme human obesity. We identify extensive DNA methylation changes that are robustly associated with extreme obesity in combined discovery and replication analyses (N=190 samples, 691 loci in subcutaneous and 173 loci in visceral adipocytes, P<1×10-7). Using functional interaction maps and methylation-expression association testing in human adipocytes, we connect extreme obesity-associated methylation variations to transcriptomic changes at >500 target genes. We find that disease-associated methylation variations localise to active genomic regions and transcription factor binding sites, at which DNA methylation influences transcription factor-target gene co-expression relationships. In Mendelian Randomisation analyses, we infer causal effects of DNA methylation on human obesity and obesity-induced metabolic disturbances, under genetic control, at 28 independent loci. Silencing of two target genes of causal DNA methylation variations, the PRRC2A and LIMD2 genes, further reveals novel metabolic effects in adipocytes. Our results indicate DNA methylation is an important determinant of human obesity and its metabolic complications, and reveal genomic and molecular mechanisms through which altered DNA methylation may impact adipocyte cellular functions.
INTRODUCTION
Obesity is a disease of excess adipose tissue that impairs health1. Worldwide there are more than 650 million people affected by obesity2. These individuals are at high risk of developing obesity-induced inflammatory and metabolic disturbances, and subsequent type 2 diabetes (T2D)3–5. Existing treatments for obesity and T2D have major limitations6, and new therapeutic targets derived from increased understanding of disease mechanisms are a global health priority.
DNA methylation, the first layer of epigenetic regulation, is causally implicated in human obesity and T2D through diverse aetiological pathways7–9. These pathways include in utero programming of disease risk, long-term effects of diet and lifestyle, mediation of causal genetic variations, age-related susceptibility and even inter/trans-generational inheritance10–16. However, previous efforts to identify robust causal associations between DNA methylation and these common harmful human conditions have been hindered by issues with tissue selection, cell specificity and assigning causation17–21. Obesity-associated DNA methylation changes in human blood are primarily a consequence rather than a cause of disease22, 23. Methylation variations in human metabolic tissues cannot be confidently linked to phenotype due to cellular and therefore epigenetic heterogeneity18, 19. Very few studies have investigated genome-wide DNA methylation in clinically relevant human cell-types due to the substantial challenges in collecting and then isolating cells from human tissues24–26.
Adipocytes are the major cell-type in adipose tissues. These specialist metabolic cells have important roles in local energy storage and expenditure, whole-body energy and glucose homeostasis, and obesity and T2D pathogenesis27, 28. Interestingly, adipocytes in distinct anatomical locations have variable DNA methylation and transcriptomic profiles, molecular functions and impacts on metabolic health, with visceral adipocytes considered more harmful than subcutaneous adipocytes25, 29, 30. Recent experimental studies demonstrate that manipulation of DNA methylation enzymes in adipocytes can induce or prevent obesity and T2D, through cellular effects on energy expenditure and insulin sensitivity31, 32. These proof-of-principle studies provide strong rationale for exploratory epigenomic studies in human adipocytes.
Here, we addressed key limitations of previous human epigenome-wide association studies (EWAS) in subcutaneous and visceral adipocytes with the aim of identifying novel epigenetic mechanisms underlying obesity or its adverse metabolic consequences. We used an integrated genomic strategy to: i. identify robust alterations in human adipocyte DNA methylation associated with extreme obesity; ii. predict the potential effector transcripts (cis-target genes) of these DNA methylation changes; and iii. infer mechanisms underlying these DNA methylation-gene expression relationships. At a subset of methylation sites and target genes, we used genetic association and adipocyte genetic manipulation to provide complementary evidence of causation. Our results highlight the importance of studying cell type-specific epigenomic variations and the power of extreme trait sampling. We provide mechanistic insights into the role of DNA methylation in human obesity and T2D, and deliver novel targets for detailed functional characterisation and potential clinical translation.
RESULTS
Genome-wide alterations in adipocyte DNA methylation in people with extreme obesity
To identify obesity-associated alterations in human adipocyte DNA methylation (5-methylcytosine, 5mC) we collected subcutaneous and visceral adipose tissue samples intraoperatively from people with extreme obesity and healthy controls, and isolated populations of adipocytes from these tissues. We then characterised genome-wide DNA methylation in 190 subcutaneous and visceral adipocyte samples from obese cases and controls in separate discovery and replication cohorts (Illumina 450K and EPIC Beadchips, Supplementary Figure 1). The mean difference in body mass index (BMI) between obese cases and controls from both discovery and replication cohorts was large (∼20kg/m2) (Supplementary Table 1; age (±3.5-yrs), sex and ethnicity matched).
In subcutaneous adipocytes, we discovered 4485 5mC sites associated with extreme obesity at a false discovery rate (FDR) of <1%33, 34. We then replicated the association of 5mC with obesity at 905 of these sites in an independent subcutaneous adipocyte sample (at FDR<1% in the replication sample, and at epigenome-wide significance P<1×10-7 in the combined discovery and replication samples, Fig. 1A). For subsequent genomic and functional analyses, we annotated the sentinel 5mC site at each replicated genomic locus (691 subcutaneous sentinels, lowest combined discovery and replication P value, ≥5-kb apart which is equivalent to reported CG island (CGI) widths, Supplementary Table 2). Subcutaneous adipocyte sentinels had a median 5.8% (range 1.1-17.9%) difference in methylation between obese cases and controls, and were systematically hypomethylated in obese cases (binomial sign test P<6.4×10-33, Fig. 1B, consistent with previous reports35). More sentinels had intermediate (0.2-0.8) than low (<0.2) or high (>0.8) methylation levels (Fig. 1B). 24 loci had ≥5 differentially methylated 5mC sites immediately flanking and within ±5-kb of the sentinel 5mC site (consistent association direction, P<0.05, Bonferroni adjusted), indicating extended regions of differential methylation (Supplementary Table 2). The largest differentially methylated regions were found at the RUNX3 (19 sites, 1174-bp), TBX5 (17 sites, 3187-bp) and ISLR2 (14 sites, 4050-bp) loci. Overall, 4363 of the 4485 5mC sites identified at FDR<1% in the discovery sample showed directional consistency for association with extreme obesity in the replication sample (binomial sign test P<1×10-300), suggesting our findings represent only the strongest signals among a larger number of obesity-associated DNA methylation changes in subcutaneous adipocytes.
In visceral adipocytes, we identified 445 5mC sites associated with obesity at FDR<1%, markedly fewer sites than in subcutaneous adipocytes. 220 of these 5mC sites replicated in an independent visceral adipocyte sample (replication FDR<1%, combined discovery and replication P<1×10-7, Fig. 1A). The 173 sentinel sites (visceral sentinels, lowest P-value, ≥5-kb apart) had median methylation difference of 7.9% (range 2.9%-21.5%) between obese cases and controls (Fig. 1B, Supplementary Table 3). Visceral adipocyte sentinels were also preferentially hypomethylated in obesity (P=3.8×10-7), and most sentinels had intermediate (0.2-0.8) methylation levels (Fig. 1B). 2 loci showed extended regions of differential methylation (≥5 significantly differentially methylated 5mC sites within ±5-kb of the sentinel site, Supplementary Table 3); at NFIA (5 sites, 2081-bp) and near ATG5 (5 sites, 831-bp). Again, overall directions of effect in visceral adipocytes were highly concordant between the discovery and replication samples (440 of 445 5mC sites, P<2×10-123).
Disease-associated alterations in DNA methylation are frequently the result of underlying differences in genetic polymorphisms or cellular (and therefore epigenetic) heterogeneity between individuals18, 19. In sensitivity analyses, we found that the majority of sentinels remained associated with obesity after correction for genetic effects (Fig. 1C, Supplementary Figure 2). Consistent with this, <1% of sentinels showed methylation distributions fitting with underlying SNP effects36, indicating that the identified methylation differences are predominantly environmentally driven. Similarly, we found that potential cellular heterogeneity (arising from impurity) and other unmeasured confounding exposures did not systematically alter our findings (Fig. 1C, Supplementary Figures 3 and 4).
As adipose tissues from different depots have varying impacts on metabolic health29, we examined whether 5mC changes associated with obesity were specific to subcutaneous or visceral adipocytes. Only 23 subcutaneous adipocyte sentinels were robustly associated with obesity in visceral adipocytes, while a further 11 visceral sentinels replicated in subcutaneous adipocytes (consistent direction and P<0.05, Bonferroni corrected for the number of sentinels, Fig. 1D, Supplementary Table 4). Similarly, we found only weak concordance when we compared overall directions of effect between subcutaneous and visceral adipocyte sentinels (374 of 671, P=0.003 subcutaneous in visceral, and 105 of 173 P=0.006 visceral in subcutaneous).
Together, these genome-wide discovery and replication analyses quantify extensive alterations in DNA methylation in adipocytes from people with extreme obesity. Surprisingly, the majority of extreme obesity-associated 5mC changes are adipose depot specific, raising the possibility of tissue intrinsic biological origins and functions.
Enrichment of extreme obesity-associated 5mC sites in active genomic regions
The genomic location of DNA methylation influences its gene regulatory potential and its mechanism of action9, 37, 38. To identify 5mC sites with active gene regulatory potential and evaluate underlying mechanisms, we mapped obesity-associated subcutaneous and visceral adipocyte sentinels to the human reference genome, human CpG island (CGI) annotations, and human adipose/adipocyte functional genomic maps.
Differentially methylated subcutaneous adipocyte sentinels were significantly enriched in adipose tissue and adipocyte enhancers (Roadmap chromatin states, Reg2Map, Fantom539–41) and enhancers predicted from multifaceted datasets (Genehancer42, Fig. 1E, Supplementary Figure 5). Enrichment in enhancers was explained by both hypo-(lower methylation in obese adipocytes) and hyper-(higher methylation in obese adipocytes) methylated subcutaneous sentinels. Hypo-methylated sentinels were also (weakly) enriched in regions flanking active transcription start sites (TSS), bivalent enhancers and repressed polycomb, whereas hyper-methylated sentinels were generally underrepresented in these regions. In contrast, hyper-methylated sentinels were enriched and hypo-methylated sentinels were diminished in actively transcribed genic regions. In addition, subcutaneous sentinels were under-represented at promoter CGIs but not at intra-or inter-genic CGIs (Fig. 1E, Supplementary Figure 5). Differentially methylated visceral adipocyte sentinels showed similar trends in CGIs and multifaceted enhancer datasets but were not significantly enriched in these or in adipose tissue/adipocyte chromatin states (which notably are derived from the subcutaneous adipose depot, Supplementary Figure 5). Our findings are consistent with previous studies localising variably methylated regions to cis-regulatory regions involved in transcriptional control and phenotypic variation43–46, rather than CG dense DNA sequences in the promoters of core housekeeping genes.
We also examined whether the genomic regions flanking our 5mC sentinels contained DNA sequence variants associated with human obesity phenotypes in GWAS. Although we found no evidence of enrichment, 11 loci contained genetic variants associated with obesity (BMI47), 8 central adiposity (waist-hip-ratio adjusted for BMI, WHR48) and 4 type-2 diabetes (T2D49) at genome-wide significance (P<5×10-8, Supplementary Table 5); these loci included the BMI and adiposity locus NRXN350, 51, the WHR locus TBX1552, 53 and the T2D locus TCF7L254.
Predicted target genes of extreme obesity-associated 5mC sites
To identify genes that might be responsible for the effects of DNA methylation on phenotype (target effector genes), we carried out RNA sequencing in obese and lean subcutaneous and visceral adipocytes with paired DNA methylation results (replication cohort, N=89 samples, median 49M assigned reads). We split our 5mC sentinels into those in gene promoters, 5/3’UTRs and exons (N=389 subcutaneous and N=92 visceral sentinels) with unambiguous target genes, and those in intronic and distal intergenic regions (N=302 subcutaneous and N=81 visceral sentinels) without defined target genes.
At promoters, 5/3’UTRs and exons, we examined the relationship between change in methylation at the 5mC sentinel site and change in expression of the overlapping/directly flanking cis-genes (mixed-effects model linear regression, combined subcutaneous and visceral adipocyte samples). Methylation at 121 subcutaneous adipocyte sentinels was robustly associated with expression of 126 unique cis-genes at FDR<0.01 (P range 9.1×10-3 to 1.9×10-24, Fig. 2A and 2D, Supplementary Table 6). In addition, methylation at 29 visceral adipocyte sentinels was associated with expression of 32 unique cis-genes at FDR<0.01 (P range 4.6×10-3 to 1.6×10-12, Fig. 2D, Supplementary Table 7). Methylation changes at promoters, 5/3’UTRs and exons had predominantly negative effects on gene transcription (Binomial test P=0.004, Fig. 2E). Subcutaneous sentinels associated with gene transcriptional changes were enriched in the genomic regions flanking active TSS rather than at active TSS (2.4-fold, Fisher’s exact P=0.0005 in adipocytes and 2.4-fold, P=7.4×10-5 in adipose, Fig. 2E, Supplementary Figure 6).
At intergenic and intronic sites, we used human adipocyte chromosomal interaction maps (promoter capture Hi-C55) and enhancer-promoter inference datasets (GeneHancer42) to functionally assign 5mC sentinels to specific distal target genes, then tested sentinel 5mC sites for association with target gene expression. 84 subcutaneous and 14 visceral sentinels were associated with 135 and 16 unique distal target genes respectively at FDR<0.01 (P range 1.1×10-2 to 8.6×10-18, Fig. 2B and 2D, Supplementary Tables 8 and 9). For intergenic and intronic sentinels not assigned to target genes through direct functional interactions (N=161 in subcutaneous and N=56 in visceral) we localised the 5mC site to a human adipocyte topologically associated gene regulatory domain (TAD56), and defined all genes within the domain as potential targets (median 7, range 1-46 target genes per sentinel). 91 subcutaneous sentinels were associated with 177 cis-genes and 21 visceral sentinels with 34 cis-genes in the same TAD at FDR<0.01 (P range 2.7×10-3 to 3.5×10-20, Fig. 2C and 2D, Supplementary Tables 10 and 11). Most associations occurred in unique tads, though 8 TADs had ≥5 sentinel methylation-target gene associations (Fig. 2D). As expected, sentinels associated with functionally assigned target genes were enriched in enhancers (subcutaneous 1.8-fold P=8.4×10-4 in adipocytes and 1.5-fold P=0.03 in adipose; visceral 3.1-fold P=0.05 in adipocytes and 3.3-fold P=0.01 in adipose, Fig. 2E and Supplementary Figure 6). By contrast, sentinels associated with genes in a shared TAD were enriched in polycomb repressed regions (subcutaneous 1.5-fold P=0.007 in adipocytes, 1.8-fold P=0.0002 in adipose, Fig. 2E), suggesting that genomic context-specific regulatory mechanisms underlie the observed methylation-expression associations.
Many of the genes associated with altered DNA methylation have important roles in insulin signalling/sensitivity, adipogenesis, fatty acid metabolism and adipocytokine signalling (Supplementary Tables 6 to 11). As examples, the IRS2, ADIPOR2, PLIN5, FABP3, SOCS3, RBP4 and AKT3 genes in subcutaneous adipocytes57–63, and the KLF6, TULP3, DOC2B, ACOT11, PRKD2, SLC22A3 and KIF5C genes in visceral adipocytes64–70. Interestingly, several methylation-expression associations involved genes that control neurite formation, axonal guidance and synaptic plasticity (NRN1, SEMA6B, SEMA4B and NRXN371–73), raising the possibility of epigenetic effects on neural inputs to expanding adipose tissue. Subcutaneous sentinels were associated with important genes involved in browning/beigeing of white adipocytes including the master regulator transcription factors PRDM16 and EBF274, 75, and the signalling molecules FGF9 and IL10RA76, 77, though directions of effect on browning were inconsistent. We also discovered a novel association between expression of a cluster of micro-RNAs implicated in adipogenesis, MIR23A, MIR24-2 and MIR27A78–83, and methylation at its shared promoter in subcutaneous adipocytes. Nonetheless, most potential effector genes of obesity-associated 5mC changes have unknown functions in adipocytes.
Functional annotation of target genes linked to extreme obesity-associated 5mC changes
We used gene set enrichment analyses to systematically evaluate the molecular and functional significance of cis-genes linked to our 5mC sentinels either by genomic location (nearest cis-gene) or by association (methylation-expression FDR<0.01). The nearest cis-genes to subcutaneous sentinel 5mC sites were enriched in growth and development, inflammatory, metabolic and apoptosis pathways (FDR<0.01 Empirical, Fig. 2F, Supplementary Table 12). Notable pathways included TLR4 signalling, a key trigger of the obesity-induced inflammatory response84, sphingolipid binding which is implicated in cell stress and metabolic dysfunction85, and Vitamin D metabolism which is linked to adipogenesis, lipid storage and adipocytokine production86. The genes nearest to visceral sentinels were over-represented in endoderm development, body pattern formation and cell motility pathways (FDR<0.01 Empirical, Fig. 2F, Supplementary Table 12). Cis-genes associated with change in methylation at subcutaneous sentinels were enriched in a cluster of intersecting gene sets involved in transcriptional control and cell/tissue development (FDR<0.01 Hypergeometric, gProfiler87). Relevant sub-clusters included cell differentiation and muscle development gene sets (Supplementary Figure 7 and Table 13). Similarly, cis-genes associated with visceral sentinels were enriched in intersecting embryonic and tissue development gene sets, and in transmembrane drug transporter genes (FDR<0.01 Hypergeometric, gProfiler, Supplementary Figure 7 and Table 13). Taken together, these pathway analyses link obesity-associated 5mC sites in subcutaneous and visceral adipocytes to genes involved in cell lineage and fate determination, tissue development and remodelling, inflammation and metabolic function/dysfunction.
Evidence of mechanistic interactions between transcription factors and extreme obesity-associated 5mC sites
Recent studies suggest that DNA methylation in enhancers and other active cis-regulatory regions may systematically regulate gene transcription by altering the binding of methylation-sensitive transcription factors (TFs88–90). Alternatively, TF binding can alter the methylation status of flanking DNA in these regions89, 90. To investigate putative mechanistic interactions between obesity-associated 5mC changes and TFs, we mapped human TF binding motifs within ±150-bp of each sentinel 5mC site using the Homer database91. We then identified the TFs with the strongest potential to bind to each motif, and examined the relationship between expression of these TFs, change in sentinel methylation and change in sentinel target gene transcription in adipocytes.
The genomic regions flanking the 671 subcutaneous sentinels were enriched for 7 distinct TF binding motifs (P range 1×10-9 to 1×10-13); 5 motifs at hypo-(lower in obesity) and 2 motifs at hyper-methylated (higher in obesity) sentinels (median 28 range 16-86 sentinels per motif, Fig. 3A, Supplementary Table 14). Motif 1 was preferentially located at tissue-specific enhancers, Motif 4 at regions flanking active TSS, and Motif 5 at both enhancers and active TSS flanking regions (Fig. 3B, Supplementary Figure 8). Overall, 49 TFs with potential to bind at these 7 motifs were expressed in human subcutaneous adipocytes (median 8 range 4-11 TFs per motif, Supplementary Table 15). In visceral adipocytes, we identified several putative motifs at obesity-associated 5mC sites (Supplementary Table 16) but none reached the stringent significance threshold required for robust de novo motif discovery.
Consistent with evidence that TF binding activity can alter DNA methylation levels, we found correlation between TF expression and sentinel methylation levels at each motif (Fig. 3C). The strongest correlations involved the HOXA11 and HOXA13 TFs (Motif 7, Fig. 3C), with weaker correlations between the ATF3, FOSL1 and JUN (Motif 1), SNAI1 (Motif 2), KLF3, KLF4 and KLF5 (Motif 3), ELF1 and ELK1 (Motif 4), RUNX1, RUNX3 and SPI1 (Motif 5), and SOX4, SOX6 and SOX10 (Motif 6) TFs and subsets of their sentinels (Fig. 3C). Motifs 2 and 4 contained CG sites within their binding sites, raising the possibility that the presence of DNA methylation might directly alter TF binding affinity at loci linked to these motifs (Fig. 3A, Supplementary Table 14). To evaluate this in more detail, we examined whether these motifs were enriched for differentially methylated sites (sentinel and flanking sites) associated with obesity. No enrichment was observed but array coverage of potential CG sites was sparse (Median 22% IQR 14-33% of genomic CG sites within ±150bp of a Motif, Supplementary Figure 8). As DNA methylation levels at adjacent CG sites are correlated92, we examined motifs for enrichment of genomic CG sites not covered in our dataset at which 5mC could impact TF-DNA binding. Motif 4 was strongly enriched for genomic CG sites relative to the flanking DNA sequences (±150-bp, Fig. 3D). A cluster of genomic CGs was also present immediately upstream of Motif 2, though other genomic CG clusters were observed in the DNA sequences flanking Motif 2. We therefore examined whether sentinel methylation levels at Motifs 2 and 4 influenced co-expression between TFs and their predicted target genes (the predicted target genes of each sentinel site corresponding to that TF-Motif pair). At both motifs, levels of methylation impacted TF-target gene relationships, suggesting potential mediation of transcriptional regulation by methylation (Fig. 3E, Supplementary Figure 8). The largest effects of methylation on TF-target gene relationships were observed at the ELK1, ELK3 and ELF4 TFs (Fig. 3E), of which ELK1 and ELF4 have been shown to be methylation-sensitive in vitro93. Importantly, the TFs implicated in these reciprocal relationships with DNA methylation are widely involved in adipocyte biology, adiposity and metabolic dysfunction (Supplementary Table 1553, 64, 94–103).
Genetic association analyses to infer disease causation
To distinguish individual 5mC sites with potential causal effects on obesity or obesity-induced metabolic disturbances, we carried out two sample Mendelian Randomisation analyses (MR). We used whole adipose tissue (N=588 samples, Twins UK104–107) rather than adipocytes to increase power to identify independent cis-SNPs associated with each sentinel 5mC site (within +/500-kb and pairwise r2<0.01) and selected a significance threshold of 0.05 (Bonferroni corrected) to reduce weak instrument bias. We then used the identified cis-SNPs as instrumental variables (IV) to infer causal effects of DNA methylation on obesity, central adiposity, T2D and glycaemic traits linked to T2D among large-scale human GWAS (Fig 4A47–49, 108).
Only 191 subcutaneous and 34 visceral sentinels had significantly associated cis-SNPs, meaning we were unable to assess causation at a large fraction of sites. Nevertheless, we found genetic evidence to support causal effects of 5mC on obesity (measured using BMI) at 10 loci in subcutaneous adipocytes and 4 loci in visceral adipocytes (MR single instrument Wald Ratio or multiple instrument IVW random effects FDR<0.01, and Steiger directionality test FDR<0.01, Fig. 4B, Supplementary Table 17109, 110). We also identified potential causal effects of DNA methylation on central adiposity at 12 loci (10 subcutaneous, 2 visceral, measured using WHR), T2D at 4 loci (3 subcutaneous, 1 visceral), and fasting glucose, insulin and HbA1c at 3 loci (all subcutaneous, Fig. 4B, Supplementary Table 17). Methylation at 3 loci was causally linked to multiple obesity phenotypes: cg26906217 (BMI and WHR; near the PIK3C2A gene); cg15809217 (WHR, T2D and FI; associated with PRRC2A expression in subcutaneous adipocytes); and cg05941027 (BMI, WHR and T2D; associated with LIMD2 expression in visceral adipocytes). Another notable locus was the 5mC site cg21681030, causally associated with BMI through MR, and respectively associated with expression of RHOQ, a glucose receptor trafficking gene111, 112, in our human adipocyte samples.
As our MR analyses were predominantly based on single SNPs as IVs, we repeated them using a larger number of correlated cis-SNPs (pairwise r2<0.8, SNP-methylation P<0.05, Bonferroni corrected) as IVs to evaluate for horizontal pleiotropy (Fig. 4B). 18 loci had at least 3 correlated IV SNPs enabling sensitivity testing at these loci. 15 loci replicated using MR IVW regression and 4 replicated using the less powerful MR Egger regression (P<0.05, Supplementary Table 17). Multiple loci showed evidence of heterogeneity between IVs, but only 4 loci violated the MR assumption of no horizontal pleiotropy (MR Egger intersect P<0.05, Supplementary Table 17). We then evaluated the relevancy of our MR findings to adipocytes by replication testing the SNP-5mC associations identified in whole adipose tissue in subcutaneous and visceral adipocytes. Despite the small sample size, 8 of 31 cis-SNPs replicated in adipocytes at P<0.05 (Binomial Sign Test P=0.0001) and 24 of 31 cis-SNPs had consistent directions of effect (Binomial Sign Test P=0.003, Supplementary Table 18). Thus, through genetic association, we infer causal effects of adipocyte DNA methylation on obesity or obesity-induced metabolic disturbances at up to 28 independent genomic loci.
Adipocyte functional studies
Finally, we selected two target genes of 5mC sites implicated in disease pathogenesis through MR, but without known metabolic functions, for functional screening in adipocytes. We focused on the PRRC2A gene which was linked to central adiposity, insulin resistance and T2D, and the LIMD2 gene which was linked to obesity, central adiposity and T2D. We evaluated the effect of silencing each target gene on a cellular model of adipogenesis, because adipogenesis has an important role in adipose tissue expansion and insulin sensitivity. We found that knockdown of both genes significantly reduced lipid accumulation during adipocyte differentiation, with a more marked effect observed after PRRC2A than LIMD2 knockdown (Fig. 4C and 4D, Supplementary Figure 9). As adipocyte differentiation models are prone to variability, we independently replicated our lipid accumulation studies and observed concordant results (Supplementary Figure 9). We also examined the effects of PRRC2A and LIMD2 gene silencing on key adipogenesis, lipid metabolism and insulin signalling genes. PRRC2A knockdown led to a consistent reduction in expression of PPARG, the master regulator adipogenic transcription factor, and concordant reductions in lipid metabolism and insulin signalling genes that are regulated by PPARG (Fig. 4E, Supplementary Figure 9). In contrast, LIMD2 gene silencing did not alter PPARG expression (Fig. 4E, Supplementary Figure 9). Instead, LIMD2 silencing led to increased expression of the fat mobilising genes FABP4 and HSL, and reduction in expression of the lipid synthesis gene SCD1, though findings were inconsistent across replicates (Fig. 4E, Supplementary Figure 9). Overall, our integrative genomic and adipocyte functional studies are consistent with lower methylation levels at cg15809217 in obese subcutaneous adipocytes promoting insulin resistance and T2D, through reduced PRRC2A expression and impaired adipogenesis. They also suggest that reduced methylation at cg05941027 in obese visceral adipocytes may increase LIMD2 expression and thereby increase lipid storage. PRRC2A is recently reported as a reader of N6-methyladenosine (m6A), the most abundant internal modification to mRNA, offering a novel mechanism through which its metabolic effects may be mediated113.
DISCUSSION
Epigenetic processes are tissue and cell specific which has made investigating their roles in complex human diseases a major challenge. In this study, we combine integrated functional genomics with extreme trait sampling in human adipocytes from functionally distinct adipose depots to elucidate epigenetic mechanisms underlying human obesity and obesity-induced metabolic disturbances. We discover extensive changes in DNA methylation associated with extreme obesity at epigenome-wide significance. Surprisingly, these methylation changes are largely adipose depot-specific with many more obesity-associated 5mC sites occurring in subcutaneous than visceral adipocytes. We surmise this depot specificity may be due to the local tissue microenvironment in the absence of technical, genetic or other known confounding factors.
By integrating our DNA methylation findings with adipocyte-specific transcriptomic and chromosomal interaction datasets, and cross-tissue enhancer-promoter catalogues, we statistically and functionally connect extreme obesity-associated 5mC sites to transcriptomic changes at >500 genes. These putative effector genes of obesity-associated methylation changes cluster in developmental, metabolic and inflammatory pathways, and encode proteins with key roles in adipogenesis, browning/beigeing of white adipocytes and insulin signalling. Of particular interest, we associate lower levels of DNA methylation in obese subcutaneous adipocytes with increased expression of a micro-RNA cluster, comprising MIR23A, MIR24-2 and MIR27A, whose members have been shown to inhibit PPARG signalling, suppress adipogenesis and induce insulin resistance78–83.
Our analyses co-localise extreme obesity-associated 5mC variations to functionally active genomic regions and transcription factor binding sites. They further suggest that TF activity may alter methylation status and that methylation status may impact TF activity at differentially methylated sites associated with obesity. These findings require experimental validation but are supported by existing evidence of reciprocal relationships between DNA methylation and TFs, and methylation-sensitivity of the TFs we identify88–90, 93. At a cellular level, many of the TF families linked to obesity-associated DNA methylation variations – AP1, KLF, SOX and ETS – have established or emergent roles in adipocyte biology and metabolism64, 94–97, 102, connecting disease-related DNA methylation variations to potential pathogenic pathways.
Contrary to most previous human obesity EWAS22–24, 114–129, we causally implicate up to 13% of extreme-obesity associated 5mC sites in obesity and obesity-induced metabolic disease susceptibility. Our MR findings should be regarded as causal estimates of the effects of methylation on phenotype under genetic control130. They do not indicate absence of causation at environmentally determined loci, the majority of our dataset. Complementary functional screens of target genes of causative methylation variations demonstrate novel effects on adipogenesis, PPARG signalling and adipocyte lipid handling, offering cellular mechanisms by which DNA methylation may promote obesity and its consequences.
New technologies for profiling DNA methylation and transcriptional regulation at single cell resolution will enable future studies to address the importance of cellular epigenetic heterogeneity, methylation and transcriptional dynamics, and spacial microenvironmental interactions, in obese adipose tissue remodelling131, 132. Nevertheless, we show that existing technologies that facilitate the study of epigenomic variations in larger numbers of individuals, and thus better capture human phenotypic diversity, remain a valuable tool for de-convoluting the epigenetic basis of human obesity and T2D. Combining these high-throughput approaches with precision epigenome tools133–136 will clarify the impact of DNA methylation, including at the sites we identify, on disease pathogenesis.
Taken together, our exploratory studies in human adipocytes begin to reveal genomic mechanisms and molecular signalling pathways through which DNA methylation may impact human obesity and its metabolic consequences. We provide new evidence of causation at a sizeable fraction of extreme obesity-associated 5mC sites, and a resource of epigenomic variations and genes for furthering our understanding of the human epigenome and its role in obesity and its metabolic complications.
METHODS
Study design
Case-control DNA methylation analyses were carried out in 95 subcutaneous and 95 visceral adipocyte samples from people with extreme-obesity and healthy controls in separate discovery and replication cohorts (Supplementary Figure 1). 44 participants in the discovery cohort and 42 participants in the replication cohort provided both subcutaneous and visceral adipocyte samples. The remaining subcutaneous and visceral adipocyte samples came from distinct individuals. RNA sequencing was carried out in adipocytes from the replication cohort.
Participant selection and sample processing
Adipose tissue samples were obtained intraoperatively from morbidly obese individuals (mean (sd) BMI 44.8 (7.2) kg/m2) undergoing laparoscopic bariatric surgery and healthy controls (mean (sd) BMI 24.9 (3.3) kg/m2) undergoing non-bariatric laparoscopic abdominal surgery (Supplementary Table 1). Subcutaneous tissue was collected from abdominal surgical incision sites and visceral tissue from the omentum. Participants were unrelated, between 20-70 years of age, from a multi-ethnic background, and free from systemic illnesses not related to obesity. Controls and cases were well-matched for age (within 3-yrs), sex and ethnicity (Supplementary Table 1). People with treated T2D were excluded because of potential effects of hypoglycaemic medications on adipose tissue metabolism. All participants gave informed consent. The study was approved by the London -City Road and Hampstead Research Ethics Committee, United Kingdom (reference 13/LO/0477).
Whole tissue samples were processed immediately to isolate populations of primary adipocytes using established protocols137, 138. Polypropylene plastics were used to minimise adipocyte cell lysis. Tissues were cut into 1-2-mm3 pieces and washed in Hank’s buffered salt solution (HBSS), before digestion using type 1 collagenase (1mg/ml, Worthington) in a water bath at 37C shaking at 100-rpm for ∼30-min. Digested samples were filtered through a 300-micron nylon mesh to remove debris, and the filtered solution was centrifuged at low speed (500-g; 5-min; 4C). After removal of the oil layer, floating mature adipocytes were collected by pipette, washed in ∼5x volume of HBSS and recentrifuged. After 3 washes the clean adipocyte cell suspension was collected for snap freezing and storage at −80C.
Quantification of DNA methylation
Genomic DNA and RNA were extracted in parallel from isolated adipocytes using the Qiagen AllPrep DNA/RNA/miRNA Universal Kit according to the manufacturer’s protocol for lipid-rich samples. Genome-wide DNA methylation was assayed using Illumina Infinium HumanMethylation450 (96 discovery samples) and EPIC (96 replication samples) beadchips. In both cohorts, all samples were randomised and processed in single batches. 0.2-0.5mcg of genomic DNA was bisulphite converted using EZ DNA Methylation-Direct Kits (Zymo Research, Irvine, CA). Bisulfite-treated DNA was denatured, neutralised and subjected to an overnight whole-genome amplification reaction. Amplified DNA was enzymatically fragmented, precipitated and resuspended for hybridisation to respective HumanMethylation450K and EPIC beadchips. After hybridization, beadchips were processed through a primer-extension protocol, stained, coated and then imaged using the HiScan System (Illumina).
Raw signal intensities were retrieved using the readIDAT function of the R package minfi (version 1.36.0, Bioconductor139), followed by background correction with the function bgcorrect.illumina from the same R package. Detection P values were derived using the function detectionP as the probability of the total signal (methylation + unmethylated) being detected above the background signal level, as estimated from negative control probes. Signals with detection P values ≥0.01 were removed. >99% of CG sites in both cohorts passed this quality control threshold. One visceral (discovery) and one subcutaneous (replication) sample with less than 95% of CG sites providing a detected signal were excluded. To reduce non-biological variability between observations, data were quantile normalized with the function normalizeQuantiles of the R package limma (version 2.12.0, Bioconductor). DNA methylation was quantified on a scale of 0-1, where 1 represents 100% methylation.
Separate principal component analyses (PCA) were carried out on HumanMethylation450K and EPIC positive control probe signal intensities. These probes assess multiple steps in the laboratory processing and the resulting principal components (PCs), which capture technical variability in the experiment, were included as covariates in our discovery and replication models to remove technical biases140.
RNA sequencing
RNA sequencing was carried out in the 96 samples with paired DNA methylation results from the replication cohort. Sample order was randomised for library preparation and sequencing. Total RNA was quantified using the Qubit Fluorometer (RNA HS Assay) and aliquots of 10-ng were used for library preparation. RNA integrity numbers (RINs, Aligent 2100 Bioanalyzer) ranged from 2.5 to 7.9 (median 5.8), an indication of suboptimal transcript quality in isolated human adipocytes (stored at −80C). Sequencing libraries were therefore generated using the SMARTer Stranded Total RNA-Seq Kits (v2 Pico input) which uses Switching Mechanism at the 5’ end of RNA Template technology and random priming to make cDNA from low-quality, low-input RNA volumes. Takara Bio/Clontech recommendations were followed at each library preparation stage (fragmentation, first-strand cDNA synthesis, addition of adapters and indexing, AMPure bead purification, library amplification, AMPure bead repeat purification). Final libraries were validated using the Aligent BioAnalyzer (DNA1000 and High Sensitivity chips) and 2 failed libraries were removed. Sample-specific libraries were pooled at equimolar concentrations (4nM) to avoid batch effects and sequenced according to the manufacturer’s protocols using the HiSeq 4000 (16 lanes). Each sample was sequenced to >80 million 100-bp paired-end reads.
Raw sequencing data was demultiplexed using Bcl2Fastq (version 2.20) and read quality was checked using fastQC (version 0.11.5). Reads were adapter trimmed (Illumina generic adapters) and mapped against the Ensembl human reference genome GRCh37 v87 using the splice aware aligner STAR (version 2.6.0, options: -outFilterMultimapNmax 20 -outFilterMismatchNoverReadLmax 0.04). Lane level bams were merged, sorted and indexed using Samtools (version 1.9) to create library level bams for each sample. Mapping rates and quality control measures were evaluated level using Picard (version 2.18.12) and RseQC (version 2.6.4), and summarised using MultiQC (version 1.9). Raw counts per gene were generated using FeatureCounts (subread version 1.6.2); multimapped reads were included (options: -M -fraction) to capture short reads resulting from lower quality transcripts. 2 samples with low number of aligned reads (<15M) reflecting low-quality libraries were excluded. 3 other outlying samples with low median transcript integrity (TIN) or evidence of GC bias were removed (Supplementary Figure 10A and 10B). After exclusions, gene expression results were available for 43 subcutaneous and 46 visceral adipocyte samples and 24,187 genes (counts ≥5 in ≥20% of samples). As a final QC step, surrogate variable analysis (SVA141) was carried out on variance stabilising transformed counts (generated using DESeq2, version 1.28.0142) for each sample to capture unwanted variation in our RNA sequencing data, particularly that arising due to differences in transcript and sequencing quality (Supplementary Figure 10C). The resulting surrogate variables (SV) were included as covariates in 5mC regression models to remove associations driven by sample and sequencing biases.
Epigenome-wide association analyses
All methylation-phenotype analyses were carried out separately in subcutaneous and visceral adipocytes. 44,101 probes were removed because of known cross hybridisation143, 144 or presence of a common genetic variant (SNP, indels, or structural variation, 1000G European phase 3 dataset, MAF>1%) within the probe sequence. After QC and removal of problem probes, only the 401,595 single CG sites present on both the Illumina HumanMethylation450 and EPIC beadchips were investigated. Single markers passing quality control were tested for association with extreme obesity using linear regression and an established analytic strategy to reduce batch and other technical confounding effects145. Obesity status was used as the predictor with methylation as the outcome variable to generate %-methylation differences between cases and controls. Covariate adjustments were made for potential biological (age, sex and ethnicity) and technical confounding variables (the first 4 control probe PCs which explained >95% of control probe technical variation). Association results from the discovery and replication cohorts were combined using inverse variance weighted (IVW) meta-analysis and evaluated for heterogeneity.
Statistical significance was inferred at FDR<0.0133, 34 in the discovery sample to provide an inclusive set of biologically relevant results. A more stringent cut off of (i) FDR<0.01 in the replication sample (with consistent direction of effect) and (ii) combined P<1×10-7 (epigenome-wide significance, EWS140) in the combined discovery and replication samples was then used to define significant associations. Markers associated with obesity at P<1×10-7 within ±5000-bp of each other were considered as a single genomic locus. A ±5000-bp distance was selected to take into account the known sizes of discrete regions of DNA methylation, for example CGIs146. At each independent locus the CG site with lowest P value for association with obesity was defined as the sentinel marker. An exact binomial test (R function binom.test) was used to test whether consist directions of effect between discovery and replication, and between subcutaneous and visceral, were observed more often than expected by chance.
Genetic confounding and adipocyte purity
Genotype data for each participant was generated from whole blood using Illumina Infinium OmniExpress-24 v1.2 beadchips. We removed directly genotyped SNPs with call rates <90%, minor allele frequency <0.01, Hardy-Weinberg Equilibrium P<1×10-6, SNPs on sex chromosomes and duplicated SNPs. After quality control, 649,007 SNPs were taken forward for imputation. SHAPEIT147 was used to infer haplotypes, and imputation was carried out in IMPUTE2 (version 2.3.2148) using the 1000 genome reference panel Phase 3 (all ancestries). Each chromosome was divided into 5Mb chunks for imputation and merged at the end. A random seed was supplied automatically. Effective population size (Ne) reflecting genetic diversity was 20,000 as recommended when using a multi-population reference panel. After imputation, genotype data was available for 81,656,368 SNPs. Sensitivity analyses were carried out in combined discovery and replication results (IVW meta-analysis). Multivariate regression models with and without genetic confounding factors were compared (Supplementary Figures 2A and 2B). First, PCA was performed on participant GWAS data and the first 5 PCs explaining >95% of inter-individual variation were included as covariates to adjust for population stratification. Second, the effects of cis-SNPs (within 500-kb) on methylation-phenotype associations were examined by including (i) the cis-SNP most strongly associated with DNA methylation and (ii) all independent cis-SNPs associated with DNA methylation at FDR<0.01 (pairwise LD R2<0.8) in multivariate regression models. Gaphunting from the Minfi package was also used to identify and flag methylation sites at which the distribution of methylation was consistent with an underlying SNP driven effect (version 1.36.036, 139 Supplementary Tables 2A and 2B).
Purity of the isolated adipocytes was evaluated using two approaches. First, in the replication cohort, RNA sequencing results for immune and stromovascular cell-specific genes were used to evaluate associations driven by potential contamination (broad immune cell – CD45/PTPRC; monocyte/macrophage – CD68, CD11b/ITGAM, CD14; B/T cell – TNFRSF8, CD19, CD4, CD8A, CD2; NK cell – NCAM1; endothelial cell – CD31; preadipocyte – DLK1/PREF1, CD34). Replication models without and with variance stabilised transformed gene expression counts (DESeq2, version 1.28.0) were compared for each immune-or stromovascular cell-specific gene (Supplementary Figures 3A and 3B). PCA analysis was also carried out to summarise the variance across the potential contaminating cell genes, and models without and with PCs were compared (Supplementary Figures 3A and 3B). Second, in the combined discovery and replication cohorts, SVA analysis was performed to capture unexplained variation due to cellular heterogeneity (resulting from contamination) and other potential confounding factors in our high-dimensional DNA methylation data141, 149–151. The resulting SV were then included as covariates in regression models to test whether the observed associations were driven by impurity or other unknown confounding factors (Supplementary Figure 4).
Methylation-expression analyses
Individual sentinel 5mC sites were assigned to potential target genes using three approaches. First, sentinels in exons, 5/3’ UTRs or within 5-kb of a promoter were assigned to overlapping genes. A 5-kb cut-off to define a promoter region was selected based on an observed drop off in the number of 5mC sentinels beyond this distance from their respective TSS (using sequential 1-kb bins). Second, for 5mC sites in intronic/intergenic regions, we overlapped the sentinel methylation site with distal chromosomal intervals connected to proximal gene transcription start sites from: i. Adipocyte Capture Hi-C results; and ii. the GeneHancer enhancer-promoter inference database42. Human adipocyte Capture Hi–C data was available at GEO (Accession ID: GSE11061955) as pre-processed and pre-called interactions. GeneHancer interactions were taken from the combined standard and elite functional interaction sets, with removal of standard interactions defined by proximity alone. Third, for those intronic/intergenic sentinels not assigned to a functional targeted gene, we intersected the sentinel sites with adipocyte-specific TADs, and took all genes within intersected sentinel-TAD pairs as potential targets. Adipocyte TADs were called from available Hi-C data generated at day 3 of human adipocyte differentiation in vitro (GEO, Accession ID: GSE10992456). TADs were called from bed files, using Arrowhead at 25-kb resolution with default parameters, and merged, removing hierarchal structures by retaining largest domains to maximise TAD coverage, for a final set of 5323 domains of median 425-kb, range 125-kb to 4,025-kb intervals.
Sentinel 5mC sites were tested for association with expression of each of their functionally assigned target genes using linear regression. Initially, we carried out association testing separately in subcutaneous and visceral adipocytes, but had limited power to detect methylation-expression associations. Thus, for our final analyses we used the combined subcutaneous and visceral adipocyte datasets to identify methylation-expression relationships, and mixed-effect modelling to control for sample relatedness (i.e. subcutaneous and visceral adipocytes from the same individuals). Mixed effect models were implemented using Dream analysis from the Bioconductor package variancePartition (version 1.18.0) in R; methylation betas were used as the predictor and logCPM transformed gene expression counts (voomWithDreamWeights) as the outcome. Study participant ID was included as a random effect. Two methylation control probe PCs, two SVs generated using SVA of the gene expression counts, age, sex, and ethnicity, and RNA integrity numbers (RINs) were included as fixed effects to adjust for technical, biological and sample-related biases. Mixed model results from Dream were compared with those from the nlme R package (version 3.1-149, using variance stabilised transformed counts from DESeq2, version 1.28.0) as a further sensitivity test. Statistical significance was inferred at FDR<0.01 (qvalue version 2.20.033) based on the number of methylation-target gene pairs analysed.
Functional enrichment analyses
All functional enrichment analyses were carried out using permutation testing as Illumina methylation arrays preferentially evaluate pre-selected genomic sites (e.g. CG islands) and well-annotated genes. For each sentinel CG site, we identified a permutation set of 1000 unique CG sites with equivalent methylation levels and variability in their respective subcutaneous or visceral adipocyte samples, using the following criteria: i. difference in mean between sentinel and permutation; ii. difference in standard deviation (sd) between sentinel and permutation; and iii. >5-kb distance between sentinel and permutation (i.e. not at the same genomic locus). For each sentinel, difference in mean and sd were based on a sliding scale starting at mean 0.025 and sd 0.0025 and increasing incrementally by mean 0.025 and sd 0.0025 until >1000 independent permutated CG sites were achieved. This approach was selected because a low fixed mean/sd was too stringent to generate 1000 permutations at some sentinels, while a higher fixed mean/sd was too permissive at other sentinels.
For genomic enrichment analyses, we compared the number of sentinels located in a genomic feature (observed frequency) with that in each of the 1000 permutation sets (expected frequency). For human obesity and metabolic disease GWAS enrichment analyses, we identified methylation sites and GWAS SNPs (P<5×10-8) in shared same Adipocyte Roadmap Chromatin State (E025), and compared the frequencies between the sentinels and the permuted background. We used the same permutation-based approach for our nearest gene pathway analyses, but limited our analyses to one sentinel per gene and one permutation per gene, to avoid recounting genes. The nearest gene to each sentinel was identified using the ChIPseeker annotation package (version 1.28.3152), which prioritises overlaps in promoters over other features (ranking: promoter, 5/3’UTR, Exon, Intron, Intergenic). Nearest genes were cross-referenced with the Molecular Signatures Database (MsigDB, hallmark, curated and ontology gene sets153, 154). Differences in observed and expected frequencies were calculated using the Fishers Exact test and Empirical P values. Gene set enrichment analyses of genes associated with methylation in adipocytes were carried out in gProfiler87, using a background of the nearest gene to each of the 1000 permuted CG sites, limited to genes expressed in our human adipocyte RNA sequencing data.
Transcription factor binding site analyses
De novo transcription factor binding motif enrichment analysis was performed using the script fingMotifGenome.pl from the Homer program package (version 4.11.1). Subcutaneous and visceral sentinels were investigated separately. Regions of interest were selected by extending each sentinel site for ±150-bp on each side. The enrichment analysis was done using two different backgrounds as controls: i. the ±150-bp regions around the 1000 permutation sites specific to each sentinel; and ii. genomic regions with GC% matching those of sentinel regions. At each motif, we identified the 10 transcription factors most likely to interact with that motif (best match between known motifs and the discovered motif) to provide a sizeable but manageable number of TFs. We then restricted these TFs to those expressed in our human RNA sequencing data (counts ≥5 in ≥20% of samples).
Positions in the region around the sentinel of respective motifs for analysis were inferred with the annotatePeaks.pl script from the Homer program package, with the options -rmrevopp to account for palindromic motifs. Genomic CpG sites relative to motifs were retrieved using the Bioconductor package seqPattern (version 1.20.0) in R. Correlation analyses of TF expression and methylation of their corresponding sentinels were carried out in the depot in which the sentinel was identified. Relationships between TFs, their respective sentinels and the predicted target genes of each sentinel (assigned sequentially first by promoter/exon/UTR overlap, then by functional interaction, then by shared TAD) were studied in combined subcutaneous and visceral adipocyte samples to increase power. Associations between TF and target gene expression were examined (i) without and (ii) with adjustment for sentinel DNA methylation level to explore the effects of sentinel methylation sites on TF-target gene relationships. Mixed effects models were carried out in the package nlme package (version 3.1_143) to adjust for sample relatedness; Age, Sex, Ethnicity, two methylation control probe PCs and two gene expression SVs were included as fixed effect covariates to adjust for potential confounding variables. For all TF expression analyses, variance stabilising transformed counts (DESeq2, version 1.28.0) were used.
Genetic association analyses
Two sample Mendelian Randomisation (MR) analyses were carried out to investigate causal relationships between individual sentinel 5mC sites and human obesity phenotypes using: i. 588 whole subcutaneous adipose tissue (WSAT) samples from the Twins UK cohort; and ii. summary results from recent large-scale human GWAS.
The Twins UK cohort is a nationwide registry of healthy volunteer twins in the United Kingdom, with about 14,000 registered twins since 1992, predominately Caucasian female (84%) and equal number of monozygotic and same-sex dizygotic twins. Twins UK phenotypic measurements, adipose tissue biopsies, genome-wide SNP and DNA methylation assays were performed as previously described104–106. Briefly, samples were genotyped using HumanHap300, HumanHap610Q, HumanHap1M Duo, and HumanHap1.2M Duo 1M arrays. Haplotypes from IMPUTE2 (without a reference panel) were used for fast imputation to the 1000 Genomes phase 1 dataset. Imputed SNPs were excluded based on Hardy Weinberg equilibrium (P<1e-6), allele frequency cut-offs (MAF<0.01), missingness (>5%) and imputation quality (info score<0.8). Individuals with mis-assigned sex or ancestry outliers were removed. Ancestry outliers (>7 SD) were obtained from PLINK 2.0 (unrelated) and GENESIS (related participants). Related individuals with IBS > 0.125 (PLINK 2.0) were also excluded. DNA methylation profiles in adipose tissue biopsies were obtained as described previously106. Methylation results were available for 588 out of 596 twins after further quality control analyses107. All individuals were female (mean (sd) age 59.1 (9.4)).
Human GWAS comprised: BMI as a measure of obesity (GIANT 2018, transethnic47); WHR adjusted for BMI as a measure of central adiposity (GIANT 2018, transethnic48); fasting glucose and insulin (MAGIC, Europeans, unpublished); HbA1c (MAGIC 2017108); T2D and T2D adjusted from BMI (DIAGRAM 2018, Europeans49).
Cis-SNPs within ±500-kb of each subcutaneous and visceral sentinel were tested for association with change in sentinel DNA methylation level in WSAT using linear regression. DNA methylation levels were adjusted for technical covariates, age, predicted smoking, family relatedness, genetic principal components (PCs) and non-genetic DNA methylation PCs. Methylation-genotype associations were evaluated in the MatrixEQTL package in R (version 2.1.0) using linear models, with the adjusted methylation values as the dependent variable and the dosage of alternative allele the independent variable. Ambiguous palindromic cis-SNPs with MAF>0.42 were removed. For each sentinel, cis-SNPs were clumped (linkage disequilibrium (LD) R2<0.01) and independent methylation quantitative trait locus (mQTL) SNPs associated with DNA methylation at P<0.05 (Bonferroni corrected for the number of SNPs) were selected. Primary MR analyses of these mQTL SNPs and human GWAS phenotypes were implemented in R using the TwoSampleMR package (version 0.5.1109, 110). Causal relationships were tested using the most powerful MR method (Wald Test for single mQTL SNPs, and Inverse Variance Weighted method for multiple mQTL SNPs). Cause-consequence directions of effect were evaluated using the Steiger directionality test, which compares SNP-methylation and SNP-phenotype R2 values. Potential causal effects of methylation on phenotype inferred if both the MR and Steiger tests passed a significance threshold of FDR<0.01.
MR sensitivity testing was carried out using 2 approaches. First, we evaluated MR assumptions by repeating our MR analyses using correlated cis-SNPs (within ±500-kb, clumped at LD R2 >0.8, and associated with methylation at P<0.05 Bonferroni corrected) in the R package MendelianRandomization (version 0.4.1); correlated cis-SNPs were used as no sentinels had ≥3 uncorrelated cis-SNPs for such analyses. MR IVW and MR Egger regression were used to test for: i. replication; ii. heterogeneity between SNPs; and iii. evidence of horizontal pleiotropy; at each sentinel with ≥3 correlated cis-mQTLs. Second, we replication tested WSAT mQTLs implicated in positive MR results amongst our subcutaneous and visceral adipocyte samples using SNP as the predictor, methylation beta as the outcome, and adjusting for biological and technical confounders (age, sex, ethnicity, control probe PCs 1-4). For WSAT mQTL SNPs not present in our adipocyte dataset, we identified a proxy SNP present in adipocytes (the cis-SNP (±500-kb) with the greatest pairwise LD with the mQTL SNP and minimum R2>0.8) and used the correlated allele to evaluate for association with methylation and concordance of directions of effect.
In vitro studies
The 3T3-L1 pre-adipocyte mouse cell line (ATCC-CL-173) was obtained commercially (LGC). Pre-adipocytes were grown in Dulbecco’s Modified Eagles Media with 10% newborn calf serum (NCS) and 1% penicillin/ streptomycin (P/S). Two days post-confluence (Day 0) differentiation was induced by switching cells to DMEM supplemented with 10% foetal bovine serum (FBS), 10-µg/ml insulin, 0.5-mM IBMX, 1-µM dexamethasone and 2-µM rosiglitazone. On day 2, cell media was refreshed with insulin media (DMEM containing 10-µg/ml insulin). Cells were maintained at 37C and 5% CO2 and were differentiated in 10-cm dishes until undergoing siRNA reverse transfection.
Early differentiation 3T3-L1 adipocytes were reverse transfected at Day 2 of differentiation with Silencer Select siRNA’s (Ambion) as described below, and similar to that described previously155. Two different siRNAs against each target were used in concert to enhance target gene knockdown. Cells were transfected with siRNA’s against either Limd2 (ID; s85672; s85674), Prrc2a (ID; s79278; s79279) or a non-silencing (NS) (ID; Negative control #1) siRNA. RNAiMax lipofectamine transfection reagent (Life Technologies) and siRNAs were diluted separately in Opti-MEM media (Gibco), mixed together, added to empty wells and incubated for 20-mins before cell suspension was seeded. To each well of a 6-well and 12-well plate, 50-pmol of siRNA (25-pmol each siRNA) and 30-pmol siRNA (15-pmol each siRNA) were added respectively. For NS control, a single siRNA was used at the same total pmol quantity as for targets. Differentiating 3T3-L1 adipocytes in 10-cm dishes were detached at Day 2 of differentiation by treating with Trypl Express (Gibco) for 10-mins. Cells were counted, resuspended at 450,000 cells/ml in DMEM insulin media (insulin, 10-µg/ml), and added to 12-well (1ml; 450,000 cells/well) and 6-well plates (2ml; 900,000 cells/well) containing the pre-incubated siRNA transfection mix. The next day cells were refreshed with new insulin media (10-µg/ml insulin). On day 6, siRNA treated differentiated cells were harvested for RNA (6-well plates) or assayed for lipid accumulation (12-well plate; Oil Red O).
Oil red O (ORO) staining was performed to assess lipid accumulation in mature 3T3-L1 cells (Day 6) that were reverse transfected with siRNA at early differentiation (Day 2). The protocol used was similar to that described previously156 with modifications. Cells were washed with PBS and fixed with 10% neutral formalin for 1-hr. After formalin removal, cells were washed with sterile water then exposed to 60% isopropanol for 3-mins. After removal of 60% isopropanol, cells were stained with ORO solution (Sigma) for 10-mins, and then washed with water until all extracellular ORO was completely removed. At this point images of ORO staining at 4X and 10X magnification were acquired using the Evos m7000 microscope (Thermo Scientific). Cells were treated with 100% isopropanol for 10-mins to extract ORO stain from lipid in cells for quantification by measuring elute absorbance at 500-nM using SpectrumMax 340PC plate reader (Molecular Devices). Samples were added to 96-well plate in quadruplicate along with known ORO quantities (µg/ml) to make a standard curve to calculate µg of ORO eluted. Following ORO elution, cells were washed 2x with water to allow crystal violet (CV) nuclear staining for relative cell number normalisation. Cells were stained with 0.05% CV for 10-mins, followed by 4x 10-min washes with water to remove all extracellular CV. SDS (1%) was added to cell plates and incubated for 10-mins with constant orbital agitation at 150rpm to lyse cells and allow CV absorbance in the lysate to be measured at 560-nM using the SpectrumMax 340PC plate reader. CV samples were added to 96-well plates in quadruplicate along with known CV quantities (µg/ml) to generate a standard curve to calculate µg of CV and to thus normalise ORO data to relative cell number.
Total RNA was isolated from siRNA transfected 3T3-L1 adipocytes at Day 6 using Qiazol reagent and the RNeasy mini kit (Qiagen) according to manufacturers’ instructions, with on-column DNase (Qiagen) treatment performed during RNA isolation. The High-Capacity RNA-to-cDNA kit was used to generate cDNA by reverse transcription of 1-µg total RNA. RT-qPCR gene expression analysis was performed using the CFX384 Touch Real-Time PCR Detection System (BioRad), SSO advanced Universal SYBR Green Supermix mix, gene-specific primers (500-nM final concentration) and cDNA in a 10-µl total reaction volume. qPCR conditions were: 3-mins at 95C, then 40 cycles of 95C for 10-secs, 60C for 30-secs and followed by melting curve analysis from 65-95C in 0.5C steps at 5 secs/step. Sequences of primers used in qPCR analysis are listed in Supplementary Table 19. Gene expression was quantified using the delta-delta Ct (2-ΔΔCT) method and is shown relative to the non-silencing group. Two housekeeping genes Nono and Ywhaz were utilised, with their geometric mean expression being used for normalisation. Effects of knockdown on genes involved in adipocyte differentiation (Pparg), insulin signalling (Glut4, Irs1), lipid uptake (Lpl), lipid storage (Fasn, Acaca, Scd1) and lipid mobilisation (Fabp4, Hsl) were evaluated157.
GraphPad Prism was used to perform Student’s t-tests used in analyses of gene expression and Oil Red O. All data shown relative to non-silencing group and presented as means ± SEM.
Data Availability
All data produced in the present study are available upon reasonable request to the authors, and will be made openly available after critical review.
ACKNOWLEDGEMENTS
This work was funded by the Medical Research Council UK (MR/K002414/1), the Wellcome Trust (219602/Z/19/Z) and the National Institute for Health Research (NIHR) Imperial Biomedical Research Centre (BRC). The Illumina HumanMethylation450 Beadchip data used in this research was generated by the High-Throughput Genomics Group at the Wellcome Trust Centre for Human Genetics (funded by Wellcome Trust grant reference 090532/Z/09/Z). The Illumina EPIC Beadchip data were generated in collaboration with RS. The NIHR Imperial BRC Genomics Facility provided resources and support that contributed to the reported RNA sequencing results. The TwinsUK research components were funded by JPI ERA-HDHL DIMENSION via the Biotechnology and Biological Sciences Research Council (BB/S020845/1). TwinsUK is funded by the Wellcome Trust, Medical Research Council, European Union, Chronic Disease Research Foundation (CDRF), Zoe Global Ltd and the NIHR-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust in partnership with King’s College London. We thank the participants and research staff who made this possible.
REFERENCES
- 1.↵
- 2.↵
- 3.↵
- 4.
- 5.↵
- 6.↵
- 7.↵
- 8.
- 9.↵
- 10.↵
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.↵
- 17.↵
- 18.↵
- 19.↵
- 20.
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.
- 41.↵
- 42.↵
- 43.↵
- 44.
- 45.
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.↵
- 58.
- 59.
- 60.
- 61.
- 62.
- 63.↵
- 64.↵
- 65.
- 66.
- 67.
- 68.
- 69.
- 70.↵
- 71.↵
- 72.
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.
- 80.
- 81.
- 82.
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.
- 95.
- 96.
- 97.↵
- 98.
- 99.
- 100.
- 101.
- 102.↵
- 103.↵
- 104.↵
- 105.
- 106.↵
- 107.↵
- 108.↵
- 109.↵
- 110.↵
- 111.↵
- 112.↵
- 113.↵
- 114.↵
- 115.
- 116.
- 117.
- 118.
- 119.
- 120.
- 121.
- 122.
- 123.
- 124.
- 125.
- 126.
- 127.
- 128.
- 129.↵
- 130.↵
- 131.↵
- 132.↵
- 133.↵
- 134.
- 135.
- 136.↵
- 137.↵
- 138.↵
- 139.↵
- 140.↵
- 141.↵
- 142.↵
- 143.↵
- 144.↵
- 145.↵
- 146.↵
- 147.↵
- 148.↵
- 149.↵
- 150.
- 151.↵
- 152.↵
- 153.↵
- 154.↵
- 155.↵
- 156.↵
- 157.↵