Abstract
Background Understanding how genetics and environmental factors shape human metabolic profiles is crucial for advancing metabolic health. Variability in metabolic profiles, influenced by genetic makeup, lifestyle, and environmental exposures, plays a critical role in disease susceptibility and progression.
Methods We conducted a two-year longitudinal study involving 101 clinically healthy individuals aged 50 to 65, integrating genomics, metabolomics, lipidomics, proteomics, clinical measurements, and lifestyle questionnaire data from repeat sampling. We evaluated the influence of both external and internal factors, including genetic predispositions, lifestyle factors, and physiological conditions, on individual metabolic profiles. Additionally, we developed an integrative metabolite-protein network to analyze protein-metabolite associations under both genetic and environmental regulations.
Results Our findings highlighted the significant role of genetics in determining metabolic variability, identifying 22 plasma metabolites as genetically predetermined. Environmental factors such as seasonal variation, weight management, smoking, and stress also significantly influenced metabolite levels. The integrative metabolite-protein network comprised 5,649 significant protein-metabolite pairs and identified 87 causal metabolite-protein associations under genetic regulation, validated by showing a high replication rate in an independent cohort. This network revealed stable and unique protein-metabolite profiles for each individual, emphasizing metabolic individuality. Notably, our results demonstrated the importance of plasma proteins in capturing individualized metabolic variabilities. Key proteins representing individual metabolic profiles were identified and validated in the UK Biobank, showing great potential for predicting metabolic diseases and metabolic risk assessment.
Conclusions Our study provides longitudinal insights into how genetic and environmental factors shape human metabolic profiles, revealing unique and stable individual metabolic profiles. Plasma proteins emerged as key indicators for capturing the variability in human metabolism and assessing metabolic risks. These findings offer valuable tools for personalized medicine and the development of diagnostics for metabolic diseases.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by the SciLifeLab & Wallenberg Data Driven Life Science Program (grant: KAW 2020.0239) and the Swedish Research Council (#2022-01562).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study has been approved by the Ethical Review Board of Göteborg, Sweden (registration number 407-15), and all participants provided written informed consent.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
We updated the figure 7 and Table S1.
Data Availability
This study utilized participant-level datasets that have been securely deposited at the Swedish National Data Service (SND), a repository certified by the Core Trust Seal (URL: https://snd.gu.se/). The GC and LC-MS datasets are available as part of the full SW3P Wellness multi-omics dataset with the doi 10.5878/rdys-mz27. In compliance with patient consent and confidentiality agreements, these datasets are accessible for validation purposes only. Requests for access can be directed to SND via email (snd{at}snd.gu.se). The evaluation of such requests will be conducted in accordance with relevant Swedish legislation. Additionally, for inquiries specifically on research within the scope of the S3WP program, interested parties are encouraged to contact the corresponding author directly. The code can be made available upon request to the corresponding author.
Abbreviations
- 1-AG
- 1-Arachidonoylglycerol
- ACADS
- Acyl-CoA dehydrogenase short chain
- ALAT
- Alanine aminotransferase
- ANGPTL4
- Angiopoietin-like protein 4
- ANOVA
- Analysis of Variance
- ApoA1
- Apolipoprotein A1
- ApoB
- Apolipoprotein B
- AUC
- Area under curve
- BMI
- Body mass index
- CCA
- Canonical correspondence analysis
- CRP
- C-reactive protein
- CV
- Coefficient of variance
- CYP3A7
- Cytochrome P450 family 3 subfamily A member 7
- FDR
- False discovery rate
- GCG
- Glucagon
- GC-MS
- Gas chromatography-mass spectrometry
- GGT
- Gamma glutamyltransferase
- Gluc
- Fasting glucose
- GWAS
- Genome-wide association studies
- HDL
- High density lipoprotein
- HMDB
- Human Metabolomics Database
- IL10
- Interleukin 10
- IV
- Instrumental variable
- KNN
- K-nearest neighbor
- LC-MS
- Liquid chromatography-mass spectrometry
- LD
- Linkage Disequilibrium
- LDL
- Low density lipoprotein
- LEP
- Leptin
- LMM
- Linear mixed modeling
- LPL
- Lipoprotein lipase
- MAD
- Median absolute deviation
- MAF
- Minor allele frequency
- MDGA1
- MAM domain containing glycosylphosphatidylinositol anchor 1
- mQTL
- Metabolite quantitative trait loci
- MR
- Mendelian randomization
- NAD+
- Nicotinamide adenine dinucleotide+
- NPPC
- Natriuretic peptide
- NPX
- Normalized Protein eXpression
- NTproBNP
- N-Terminal pro-brain natriuretic peptide
- pQTL
- Protein quantitative trait locus
- QC
- Quality control
- Q-TOF
- Quadrupole Time-of-Flight
- ROC
- Receiver operating characteristic
- S3WP
- Swedish SciLifeLab SCAPIS Wellness Profiling
- SBP
- Systolic blood pressure
- SCAD
- Short-chain acyl-CoA dehydrogenase
- SCAPIS
- Swedish CArdioPulmonary bioImage Study
- siRNA
- short interfering RNA
- SMPDB
- Small Molecule Pathway Database
- SNN
- Shared nearest neighbor
- SNP
- Single-nucleotide polymorphism
- T2D
- Type 2 diabetes
- TG
- Triglyceride
- TNT
- Troponin T
- UMAP
- Uniform manifold approximation and projection
- WBC
- White blood cells count