Abstract
Background Maternal obesity is a health concern that may predispose newborns to a high risk of medical problems later in life. To understand the intergenerational effect of maternal obesity, we hypothesized that the maternal obesity effect is mediated by epigenetic changes in the CD34+/CD38−/Lin− hematopoietic stem cells (uHSCs) in the offspring. Towards this, we conducted a DNA methylation centric multi-omics study. We measured the DNA methylation and gene expression in the CD34+/CD38−/Lin− uHSCs and metabolomics of the cord blood, all from a multi-ethnic cohort (n=72) from Kapiolani Medical Center for Women and Children in Honolulu, Hawaii (collected between 2016 and 2018).
Results Differential methylation (DM) analysis unveiled a global hypermethylation pattern in the maternal pre-pregnancy obese group (BH adjusted p<0.05), after adjusting for major clinical confounders. KEGG pathway enrichment, WGCNA, and PPI analyses revealed hypermethylated CpG sites were involved in critical biological processes, including cell cycle, protein synthesis, immune signaling, and lipid metabolism. Utilizing Shannon entropy on uHSCs methylation, we discerned notably higher quiescence of uHSCs impacted by maternal obesity. Additionally, the integration of multi-omics data-including methylation, gene expression, and metabolomics-provided further evidence of dysfunctions in adipogenesis, erythropoietin production, cell differentiation, and DNA repair, aligning with the findings at the epigenetic level. Furthermore, we trained a random forest classifier using the CpG sites in the genes of the top pathways associated with maternal obesity, and applied it to predict cancer vs. adjacent normal labels from samples in 14 Cancer Genome Atlas (TCGA) cancer types. Five of 14 cancers showed balanced accuracy of 0.6 or higher: LUSC (0.87), PAAD (0.83), KIRC (0.71), KIRP (0.63) and BRCA (0.60).
Conclusions This study revealed the significant correlation between pre-pregnancy maternal obesity and multi-omics level molecular changes in the uHSCs of offspring, particularly in DNA methylation. Moreover, these maternal obesity epigenetic markers in uHSCs may predispose offspring to higher risks in certain cancers.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was supported by grants R01 LM012373 and LM012907 awarded by NLM, and R01 HD084633 awarded by NICHD to L.X. Garmire, as well as in part by the NCI Cancer Center Support Grant (CCSG) number P30 CA071789 awarded to Genomics and Bioinformatics Shared Resource (RRID:SCR_019085). This research was supported in part by training funding provided by the NIH grant T32 GM141746 and Advanced Proteogenomics of Cancer (T32 CA140044).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Western IRB of the University of Hawaii gave ethical approval for this work (WIRB Protocol #20151223). All participants involved in this study provided written informed consent before the collection of cord blood samples.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
-additional quality control steps with detailed description (mismatch, genomics inflation check, qq-plot, cross-hybridization CpG site removal etc) -re-do the classification modeling using obesity samples, and re-do the prediction on TCGA samples -comparison with previous DNA methylation results on maternal obesity -address the limitation of this study (sample size etc) in the Discussion.
Data availability statement
DNA methylation data and bulk RNA-seq data generated in this study have been submitted and will be available through the National Institutes of Health Gene Expression Omnibus (GEO) with the accession number GSE273075 (GEO reviewer token: upmtkoygrlwtrmb). Other datasets used in this project for the analysis and validation purpose are publicly available. The placenta datasets used in this article are available in the GEO repository with accession numbers GSE31781, GSE36829, GSE59274, GSE44667, GSE74738, GSE49343, GSE69502, and GSE98224. Cord blood metabolomics data used in this article is available in metabolomics workbench with study ID ST001114. Cancer methylation datasets for BLCA, BRCA, COAD, ESCA, HNSC, KIRC, KIRP, LIHC, LUAD, LUSC, PAAD, PRAD, THCA, UCEC are available in The Cancer Genome Atlas (TCGA data portal: https://portal.gdc.cancer.gov/).
Abbreviation
- AA
- Amino Acid
- BH
- Benjamini-Hochberg
- BMI
- Body mass index
- C
- Acylcarnitines
- DE
- Differential expression
- DIABLO
- Data Integration Analysis for Biomarker discovery using Latent cOmponents
- DM
- Differential methylation
- DMR
- Differentially methylated regions
- DOHaD
- Developmental Origins of Health and Disease
- EWAS
- Epigenome-wide association studies
- FC
- Fold Change
- FDR
- False positive results
- KEGG
- Kyoto Encyclopedia of Genes and Genomes
- LOG
- Logistic regression
- MDS
- Multi-dimensional Scaling
- NHPI
- Native Hawaiian and Pacific Islander
- PANDA
- Preferential Attachment-based common Neighbor Distribution derived Associations
- PC aa
- Diacyl phosphatidylcholines
- PC ae
- Acyl-alkylphosphatidylcholines
- PCC
- Pearson correlation coefficients
- PPI
- Protein-Protein Interaction
- RF
- Random Forest
- SOV
- Source of variance
- SVD
- Singular value decomposition
- SVA
- Surrogate variable analysis
- TSS
- Transcription start site
- TCGA
- The Cancer Genome Atlas
- uHSCs
- Umbilical cord blood hematopoietic stem cells
- UMAP
- Uniform Manifold Approximation and Projection
- VSN
- Variance Stabilization Normalization
- WGCNA
- Weighted Gene Co-expression Network Analysis