Robust metabolomic age prediction based on a wide selection of metabolites

Tariq Faquih; Astrid van Hylckama Vlieg; Praveen Surendran; Adam S. Butterworth; Ruifang Li-Gao; Renée de Mutsert; Frits R. Rosendaal; Raymond Noordam; Diana van Heemst; Ko Willems van Dijk; Dennis O. Mook-Kanamori

doi:10.1101/2023.06.03.23290933

Abstract

Chronological age is a major risk factor for numerous diseases. However, chronological age does not capture the complex biological aging process. Biological aging can occur at a different pace in individuals of the same chronological age. Therefore, the difference between the chronological age and biologically driven aging could be more informative in reflecting health status. Metabolite levels are thought to reflect the integrated effects of both genetic and environmental factors on the rate of aging, and may thus provide a stronger signature for biological age than those previously developed using methylation and proteomics. Here, we set out to develop a metabolomic age prediction model by applying ridge regression and bootstrapping with 826 metabolites (of which 678 endogenous and 148 xenobiotics) measured by an untargeted high-performance liquid chromatography mass spectrometry platform (Metabolon) in 11,977 individuals (50.2% men) from the INTERVAL study (Cambridge, UK). Participants of the INTERVAL study are relatively healthy blood donors aged 18-75 years. After internal validation using bootstrapping, the models demonstrated high performance with an adjusted R² of 0.82 using the endogenous metabolites only and an adjusted R² of 0.83 when using the full set of 826 metabolites with age as outcome. The latter model performance could be indicative of xenobiotics predicting frailty. In summary, we developed robust models for predicting metabolomic age in a large relatively healthy population with a wide age range.

Introduction

Chronological age is a major risk factor for a multitude of diseases (1, 2). The biology of aging comprises complex and multifactorial processes that are influenced by genetic, lifestyle and environmental factors (3-5). Evidently, the rate of biological aging varies between individuals, as in contrast to other individuals of the same age group, wherein, some individuals are able to live to an older chronological age without age-related diseases and disability (3). This suggests that chronological age does not align with the biological aging process. For this reason, several studies have aimed to capture the signature of biological changes due to aging process by predicting its rate based on biological factors using, for example, DNA methylation (6) and proteins (7).

Metabolomic profiling aims to identify small molecules that are mostly substrates or products of metabolism (metabolites). The number of metabolomics studies has increased in recent years due to major technological advances and the availability of (validated and high throughput) commercial and noncommercial analyses platforms. In addition, the current platforms have improved their capability to detect and quantify large numbers of endogenous and xenobiotic metabolites(8, 9). Since individual metabolomic profiles reflect the influences of both genetic and acquired factors, they are thought to provide a more holistic representation of biological processes, such as aging (8, 9). Furthermore, as metabolomic profiles are strongly affected by chronological age (4, 10) and sex (11, 12), they have been used to develop prediction models of chronological age (i.e., the metabolomic age) (13-16). However, predicting metabolomic age using metabolomics had limited success or faced methodological limitations due to several reasons. First, some studies used a relatively small sample size to predict age using hundreds or even thousands of metabolite predictors (10, 17). The inclusion of a larger number of predictors than the number of samples may cause overfitting and bias. Second, previous metabolomic age studies used targeted platforms, such as the Nightingale platform, that specifically measure a small number of lipoproteins and some low-molecular metabolites. This limits the variety of the metabolites and the information used for the prediction of age. Third, studies may be limited by the age distribution of the cohort study. This may restrict the model to a specific age range which affects the generalizability of the model in other studies and other age groups (10). Fourth, generalizability may also be reduced if the model is developed in cohorts with an oversampling of individuals with specific disease outcomes or specific population characteristics (7, 13). Fifth, statistical methodology such as stepwise selection, have been reported to cause overfitting of the prediction model (18). Finally, it is challenging to examine the model’s validity and generalizability in external studies or different populations particularly due to the dynamic nature of metabolite levels (19-21).

In this study, we aimed to develop a model to predict metabolomic age in a large healthy population with a widespread age range, using a single untargeted metabolomics platform. To attain this goal, we developed a prediction model using data from the INTERVAL study (22) (University of Cambridge, Cambridge, UK). Metabolomic profiles were available in 11,977 participants as measured using Metabolon’s (Durham, North Carolina, USA) untargeted metabolomics platform. These measurements included a broad range of endogenous and xenobiotic metabolites (n=1,363) from various biochemical pathways, thereby enabling the capture of metabolites related to a vast range of ageing effects.

Methods

INTERVAL Study

The INTERVAL study is a prospective cohort study of approximately 50,000 participants nested within a pragmatic randomized sample of blood donors (22). Between 2012 and 2014, blood donors, aged 18 years and older, were consented and recruited from 25 National Health Service Blood and Transplant (NHSBT) static donor centers across the UK. The INTERVAL study was approved by The National Research Ethics Service (11/EE/0538). Individuals with major disease (myocardial infarction, stroke, cancer etc.) as well as those who reported being unwell or having had recent illness or infection or did not fulfill the other criteria required for blood donation (22, 23) were ineligible for the study. Therefore, participants included in the study were predominantly healthy. Participants completed online questionnaires addressing basic lifestyle and health-related information, including self-reported height and weight, ethnicity, smoking status, alcohol consumption, doctor-diagnosed anemia, use of medications (hormone replacement therapy, iron supplements) and menopausal status (22). Untargeted metabolomic data were available in 11,979 individuals (age range 18 -75). Two individuals had incorrect or missing height/weight values and were therefore excluded from the study. Thus, the final sample size for the current study was 11,977.

Untargeted metabolomic measurements

Untargeted metabolomic measurements were quantified at Metabolon Inc. (Durham, North Carolina, USA) using Metabolon™ Discovery HD4 platform. In brief, this process involves four independent ultra-high-performance liquid chromatography mass spectrometry (UHPLC-MS/MS) platforms (24, 25). Of these platforms, two used positive ionization reverse phase chromatography, one used negative ionization reverse phase chromatography, and one used hydrophilic interaction liquid chromatography negative ionization (25). Known metabolites were annotated at Metabolon Inc. with chemical names, super pathways, sub pathways, biochemical properties, and compound identifiers from various metabolite databases. Metabolomic measurements in the INTERVAL study were conducted in three batches (n=4087, 4566, and 3326). Subsequent harmonization and quality checks were performed between the batches by Metabolon. All metabolite measurements were scaled to a median of 1.

Selection of Predictors

We aimed to select metabolites consistently and reliably measured by the Metabolon platform. In total 1,411 metabolites were measured in the INTERVAL study. First, we removed metabolites completely missing in at least one of the batches (n=175). Second, we excluded metabolites without annotation (unnamed metabolites) (n=258), keeping only endogenous and xenobiotic metabolites. These metabolites were excluded as they are inconsistently measured by the platform and are highly variable between batches and studies. Moreover, the lack of full annotation increases the uncertainty that we are using the same metabolite across batches and studies and leaves no secondary information for further verification. Third, we excluded metabolites measured in less than 100 individuals (n=30). Fourth, to ensure that the metabolites that are used for our model are not unique for the INTERVAL study, we cross-referenced our dataset with an independent study with Metabolon measurements. For this we used the Netherlands Epidemiology of Obesity (NEO) study, a population-based (n=599) cohort study of individuals aged 45–65 years (26, 27). Based on this comparison, we additionally excluded 122 metabolites that were not detected in the NEO study. The final set of metabolites included 826 metabolites, with 678 endogenous and 148 xenobiotic metabolites that were confirmed to be regularly measured by the Metabolon platform (Figure 1).

Figure 1:

Flowchart of the selection of predictor metabolites and the development steps for the metabolomic age prediction model in the INTERVAL study

Missing value imputation

Missing values were imputed using the pipeline as described in our previous work (27). In brief, endogenous metabolites were imputed by multiple imputation using chained equations method to generate five imputed datasets (m=5). For each metabolite with missing values, we used the outcome variable (i.e., age), 5-10 highly correlated related metabolites, body mass index (BMI), center number where the blood samples were collected, and the batch number to impute the missing values. Xenobiotic metabolites were imputed to zero to account for true missingness.

Model Development

For the development of the model, we used ridge regression (28) to reduce potential overfitting. Cross validation was performed (n=10) in each imputed dataset to calculate the optimal shrinkage term (lambda). Subsequently the mean of the lambda values was used to develop the model on the stacked imputed datasets (i.e., all 5 datasets combined as one). Accordingly, the weight of the observations in the stacked dataset was set to 1/m = 1/5 = 0.2 (20). As the outcome (age) is a continuous variable, we assessed the fit of the model by deriving the R². As an additional sensitivity test, we used generalized additive model (GAM) to examine and calculate the R² for the nonlinear correlation. Internal validation was performed using bootstrapping (b=100). Bootstrapping results were used for the optimization of R² and the calculation of the mean squared error (MSE) and the mean absolute error (MAE) of the model. Two models were developed using two sets of the selected metabolite predictors. First, we used the full set of endogenous and xenobiotic metabolites (n=826) to develop “model A”. Second, we used only the endogenous metabolites (n=678) to develop “model B”. As previous studies found that metabolomic profiles (12, 29) and aging (30-32) are influenced by the sex of individuals, we included sex as an additional predictor in both models.

Sample size considerations

The primary database used to create the prediction model was the INTERVAL study (n= 11,977) with 826 predictors. We used the formulas described by Riley et al. (33) to confirm that our sample size (n) and number of predictor (p) are sufficient to minimize overfitting and provide high prediction precision. First, we calculated Copas global shrinkage factor for our data (34, 35) to see if it is above the recommended 0.9 threshold (33). Based on this calculation the estimated shrinkage factor was 0.95 if the adjusted R² of the model was assumed to be 0.7. Second, we calculated the sample size required to ensure a small difference between the R² and the adjusted R² for the development model. Assuming the adjusted R² was 0.7 again and a small desired R² difference (R²_diff=0.025), then the sample size required to achieve this should be at least n=9913. Third, we checked the sample size required for precise residual standard deviation of the model. Accordingly, we found the multiplicative margin of error (MMOE) to be less than 10% (MMOE =1.3%) using our n and p in the INTERVAL study. Finally, we checked the precision of the mean predicted outcome value (predicted age) of the model. We used n and p for the INTERVAL study and assumed that predicted age would have a mean of 45 and a variance of 35. Accordingly, the upper and lower bounds were approximately 45.34 and 44.65 respectively. Thus, the MMOE for the mean predicted outcome was less 1% (MMOE= 45.34 /45 = 1.007 = 0.7%). Therefore, the sample size of the INTERVAL study was optimal to minimize overfitting, optimism, and provide a precise estimation of the residual standard deviation and mean predicted values.

Results

Population characteristics

Characteristics of the total INTERVAL study population and age subgroups are summarized in Table 1. In total 11,977 individuals were included and had a normal age distribution with a mean age of 45 years and a range of 18 – 75 years. The number of men and women was approximately equal (50.2% were men). BMI was largely in the recommended range of 18.5-24.9 kg/m² (36) with a mean BMI of 22.8 kg/m² and was similar in all age groups.

View this table:

Table 1. Characteristics of the INTERVAL study population.

Metabolomic Age Prediction

Prediction models A (endogenous plus xenobiotic metabolites) and B (endogenous metabolites only) were developed in the INTERVAL study using ridge regression. The workflow including metabolite selection, missing value imputation, and analyses are summarized in Figure 1. Internal validation using bootstrapping (resampling n = 100) and optimization provided an R² of 0.83 (MSE=31, MAE=4.4) for model A, and 0.82 (MSE=33.7, MAE=4.6) for model B. The R² for the generalized additional model (GAM) were slightly higher for both models, 0.85 for model A and 0.84 for model B (Figure 2, Supplementary Table 1, Supplementary Figure 1). Full tables with the intercept, sex and metabolite coefficients for model A and B are provided in Supplementary Table 2. This table also contains the mean values for the metabolites from the INTERVAL study.

Figure 2:

Correlation plots of the Metabolomic age (predicted age) on the horizontal axis and the chronological age on the vertical axis for model A (A) and model B (B). The data used is the stacked imputed datasets in the INTERVAL study. Abbreviations: GAM, generalized additive model.

Discussion

In this study, we developed a prediction model for metabolomic age based on metabolite measurements, including a wide range of endogenous and xenobiotic metabolites belonging to a variety of biochemical pathways. The metabolomic measurements were performed in a single study using the same metabolomic platform and harmonized for within study and between batch variation. Importantly, we used multiple imputation, ridge regression, and bootstrapping (20) to develop and internally validate the metabolomic age prediction models. We developed two models, with the first model A using both endogenous and xenobiotic metabolites (n=826), and the second model B using the endogenous metabolites only (n=678). Both models had high adjusted R² (model A = 0.83; Model B = 0.82), however, model B had slightly higher MSE and MAE, indicating higher error for the predicted values.

Ridge Regression for Metabolomic Age

Our metabolomic age model is based on ridge regression which generates shrinkage factors that shrink metabolite coefficients with weak influence on the model to small values. Unlike methods such as LASSO and elastic net regression, this shrinkage never causes the coefficients to reach zero. Therefore, unlike those methods, ridge regression does not perform selection of predictors, in this case metabolites, due to the shrinkage term. Thus, the model consistently includes the same metabolites during the cross validation and internal validation by bootstrapping (20). Ridge regression thus ensures rigorousness and consideration of all the selected metabolite predictors. In addition, we found in our model that nonlinear GAM R² to be slightly higher than linear R² in both models. Figure 2 also shows that the GAM line curves at the edges of the age ranges. Therefore, predicting metabolomic age from this model in a nonlinear form, by adding a quadratic term or using a spline function, may further expand the model flexibility.

Strengths and Limitations

A major strength of our current study is the development of robust metabolomic age prediction models based on a large number of metabolites measured in a large cohort with a wide age distribution. This sample size was confirmed to fit the required criteria for developing a model with low overfitting (33). Furthermore, the INTERVAL study included relatively healthy blood donors as participants. Blood donors are prescreened for a number of common diseases, making it a suitable study to develop a metabolomic age without being affected by specific disease-related effects on metabolites. Unlike previous metabolomic age predictors (alternatively referred to as metabolomic “clocks”), we developed our model using the latest Metabolon metabolomics platform that measures a large selection of metabolites. An advantage of this platform is the inclusion of xenobiotics, such as those derived from medication or pollution as well as endogenous metabolites. Thus, we were able to include metabolites originating from various internal and external sources. The xenobiotic metabolites represent some of the acquired environmental and lifestyle exposures of individuals that could play a role in biological aging. For example, medication metabolites could be a predictor of frailty. However, our study has two limitations. First, because the INTERVAL study does not have data for health-related outcomes, we could not assess the effects of different disease outcomes and health phenotypes on the metabolomic age. Second, as the INTERVAL study used here was cross sectional, we could not examine the model overtime or examine the ability of metabolomic age to predict disease outcomes or mortality in a future time point.

External Validation for Metabolomic Age and Future Applications

Prediction models may suffer from overfitting due to selection bias, small sample size, methodology limitations, and lack of internal validation and calibration during development. However, another common issue regarding prediction modelling, such as the case with metabolomic age, is the challenge to externally validate them. This a common issue with prediction models in general. Indeed, few studies perform external validation of prediction models (19, 37). The reasons for the lack of external validation include the difficulty of applying and reproducing the prediction model method, lack of the full prediction variables to develop the model in the new dataset, or a lack of an appropriate sample size for external validation. Without external validation, the quality of the models cannot be properly assessed, and a model could still be overfitted despite presenting good results during internal validation(20, 21, 38). We took advantage of the sample size and wide age range to use a stringent ridge regression and internal validation method to develop the metabolomic age model. The resulting model demonstrated a high R² for the metabolomic age. The high R² from our models is unlikely to be due be overfitting as we considered and calculated the power and sensitivity of the model based on the sample size and number predictors (33), in addition to the rigorous prediction modelling methodology. The metabolomic age was able to predict chronological age well but not perfectly. The missing 0.2 from R² is likely a reflection the biological aging process captured by the metabolomic age. However, future robust external validation, using weak and moderate calibrations in other populations (38) would be valuable for the metabolomic age models presented in this paper.

Several age prediction models have been developed that utilize different biological measurements such as targeted metabolomics (7, 14), proteomics (7), and epigenetics (DNA methylation)(6). These studies have reported that the metabolites were a strong predictor of body mass index, metabolic syndrome, cancer (7), type 2 diabetes, and cardiovascular disease (14). Furthermore, other biological clocks using proteomics and epigenetics were reported to be associated with depression and other outcomes (7). Therefore, it will be valuable to apply the prediction models presented in this paper in large longitudinal cohorts and examine the metabolomic age in relation to calendar age with the aforementioned phenotypes and other health related outcomes. Assessing the association between the residual between the chronological and metabolomic age with disease and health phenotypes overtime will provide clinically relevant prediction. The prediction strength will indicate the robustness of our model and provide the means to for external validation and re-calibration of the model.

In addition to our aim of addressing primary issues of the development of metabolomic age models, the Metabolon platform is expanded on the range of metabolites that can potentially be a better predictor of age. For example, previous metabolomic age studies have not measured or included xenobiotic metabolites. In our study, we were able to include this additional group of metabolites in model A. Furthermore, Model A may also be used in tandem with metabolomic age from targeted platforms, and biological clocks of different biological molecules measurements, similar to the work by Jansen et al. (7), to possibly improve or compare their predictive performance and their ability of capturing the effects of health-related phenotypes.

Conclusions

We developed metabolomic age prediction models in a large relatively healthy population using a wide array of endogenous and xenobiotic metabolites. In model A with the endogenous and xenobiotic metabolites and in model B with endogenous metabolites only, the R² of the linear fit was 0.82 and 0.83, respectively. We provided the full list of metabolites and their coefficients for both models. This data can enable other researchers to replicate our metabolomic age prediction model, externally validate it in their own studies with different disease outcomes and combine them with other age prediction models.

Data Availability

Data is only available upon request.

Funding

The INTERVAL study was funded by NHSBT and the NIHR Blood and Transplant Research Unit in Donor Health and Genomics (NIHR BTRU-2014-10024). The trial’s coordinating centre at the Department of Public Health and Primary Care at the University of Cambridge, Cambridge, UK, has received core support from the UK Medical Research Council (G0800270), British Heart Foundation (SP/09/002), and the NIHR Cambridge Biomedical Research Centre. Investigators at the University of Oxford, Oxford, UK, have been supported by the Research and Development Programme of NHSBT, the NHSBT Howard Ostin Trust Fund, and the NIHR Oxford Biomedical Research Centre through the programme grant NIHR-RP-PG-0310-1004. We thank the blood donors who participated in the trial and NHSBT’s operational staff.

The NEO study is supported by the participating Departments, the Division, and the Board of Directors of the Leiden University Medical Centre, and by the Leiden University, Research Profile Area ‘Vascular and Regenerative Medicine’. The analyses of metabolites are funded by the VENI grant (ZonMW-VENI Grant 916.14.023) of D.O.M.-K., D.v.H. and R.N. were supported by a grant of the VELUX Stiftung [grant number 1156]. T.O.F. was supported by the King Abdullah Scholarship Program and King Faisal Specialist Hospital & Research Center [No. 1012879283].

Conflicts of Interest

P.S. is an associate director of applied and statistical genetics at GlaxoSmithKline plc. A.S.B. reports institutional grants from AstraZeneca, Bayer, Biogen, BioMarin, Bioverativ, Novartis, Regeneron and Sanofi. R.L.-G. is a part-time clinical research consultant for Metabolon, Inc. All other co-authors have no conflicts of interest to declare.

Author Contributions

T.O.F.- conceptualization, data curation, formal analysis, investigation, methodology, software, visualization, writing-original draft. R.L.-G.- validation, writing – review & editing. P.S. – supervision, conceptualization, project administration, resources, funding acquisition, writing – review & editing - A.S.B – project administration, resources, funding acquisition, writing – review & editing. R.d.M.- project administration, resources, funding acquisition, writing – review & editing. R.N. and D.V.H.- funding acquisition, writing – review & editing. F.R.R.- funding acquisition. A.v.H.V. and K.W.v.D- conceptualization, supervision, writing – review & editing. D.O.M.-K.- conceptualization, supervision, funding acquisition, writing – review & editing.

Acknowledgments and Disclosures

The authors of the NEO study thank all participants, all participating general practitioners for inviting eligible participants, all research nurses for data collection, and the NEO study group: Pat van Beelen, Petra Noordijk, and Ingeborg de Jonge for coordination, laboratory, and data management. The NEO study was approved by the medical ethical committee of the Leiden University Medical Centre (LUMC) and all participants provided written informed consent. The INTERVAL study was approved by The National Research Ethics Service (11/EE/0538) and all eligible donors provided a trial consent form before giving a blood donation.

Footnotes

Minor formatting and spelling corrections

References

1.↵
North BJ, Sinclair DA. The intersection between aging and cardiovascular disease. Circ Res. 2012;110(8):1097–108. doi:10.1161/CIRCRESAHA.111.246876
OpenUrl Abstract/FREE Full Text
2.↵
Broglio SP, Eckner JT, Paulson HL, Kutcher JS. Cognitive decline and aging: the role of concussive and subconcussive impacts. 2012;40(3):138.
OpenUrl
3.↵
Lopez-Otin C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013;153(6):1194–217. doi:10.1016/j.cell.2013.05.039
OpenUrl CrossRef PubMed Web of Science
4.↵
Hoffman JM, Lyu Y, Pletcher SD, Promislow DEL. Proteomics and metabolomics in ageing research: from biomarkers to systems biology. Essays Biochem. 2017;61(3):379–88. doi:10.1042/EBC20160083
OpenUrl Abstract/FREE Full Text
5.↵
Brooks-Wilson AR. Genetics of healthy aging and longevity. 2013;132(12):1323–38.
OpenUrl
6.↵
Horvath S. DNA methylation age of human tissues and cell types. Genome Biology. 2013;14(10):3156. doi:10.1186/gb-2013-14-10-r115
OpenUrl CrossRef
7.↵
Jansen R, Han LK, Verhoeven JE, et al. An integrative study of five biological clocks in somatic and mental health. Elife. 2021;10. doi:10.7554/eLife.59479
OpenUrl CrossRef
8.↵
Alonso A, Marsal S, Julia A. Analytical methods in untargeted metabolomics: state of the art in 2015. Front Bioeng Biotechnol. 2015;3:23. doi:10.3389/fbioe.2015.00023
OpenUrl CrossRef
9.↵
Rattray NJW, Deziel NC, Wallach JD, et al. Beyond genomics: understanding exposotypes through metabolomics. Hum Genomics. 2018;12(1):4. doi:10.1186/s40246-018-0134-x
OpenUrl CrossRef
10.↵
Rutledge J, Oh H, Wyss-Coray T. Measuring biological age using omics data. Nature Reviews Genetics. 2022. doi:10.1038/s41576-022-00511-7
OpenUrl CrossRef
11.↵
Martin FJ, Montoliu I, Kussmann M. Metabonomics of ageing - Towards understanding metabolism of a long and healthy life. Mech Ageing Dev. 2017;165(Pt B):171–9. doi:10.1016/j.mad.2016.12.009
OpenUrl CrossRef
12.↵
Yu Z, Zhai G, Singmann P, et al. Human serum metabolic profiles are age dependent. Aging Cell. 2012;11(6):960–7. doi:10.1111/j.1474-9726.2012.00865.x
OpenUrl CrossRef PubMed
13.↵
Macdonald-Dunlop E, Taba N, Klarić L, et al. A catalogue of omics biological ageing clocks reveals substantial commonality and associations with disease risk. Aging. 2022;14(2):623–59. doi:10.18632/aging.203847
OpenUrl CrossRef
14.↵
van den Akker EB, Trompet S, Barkey Wolf JJH, et al. Metabolic Age Based on the BBMRI-NL (1)H-NMR Metabolomics Repository as Biomarker of Age-related Disease. Circulation. Genomic and precision medicine. 2020;13(5):541–7. doi:10.1161/circgen.119.002610
OpenUrl CrossRef
15.
Hertel J, Friedrich N, Wittfeld K, et al. Measuring Biological Age via Metabonomics: The Metabolic Age Score. J Proteome Res. 2016;15(2):400–10. doi:10.1021/acs.jproteome.5b00561
OpenUrl CrossRef PubMed
16.↵
Rist MJ, Roth A, Frommherz L, et al. Metabolite patterns predicting sex and age in participants of the Karlsruhe Metabolomics and Nutrition (KarMeN) study. PLoS One. 2017;12(8):e0183228. doi:10.1371/journal.pone.0183228
OpenUrl CrossRef
17.↵
Hwangbo N, Zhang X, Raftery D, et al. A Metabolomic Aging Clock Using Human Cerebrospinal Fluid. The Journals of Gerontology: Series A. 2021;77(4):744–54. doi:10.1093/gerona/glab212 %J The Journals of Gerontology: Series A
OpenUrl CrossRef
18.↵
Smith G. Step away from stepwise. Journal of Big Data. 2018;5(1):32. doi:10.1186/s40537-018-0143-6
OpenUrl CrossRef
19.↵
Ramspek CL, Jager KJ, Dekker FW, Zoccali C, van Diepen M. External validation of prognostic models: what, why, how, when and where? Clinical Kidney Journal. 2020;14(1):49–58. doi:10.1093/ckj/sfaa188 %J Clinical Kidney Journal
OpenUrl CrossRef
20.↵
Steyerberg EW. Clinical Prediction Models. 2nd ed. Cham, Switzerland: Springer International Publishing; 2019.
21.↵
Bleeker SE, Moll HA, Steyerberg EW, et al. External validation is necessary in prediction research:: A clinical example. Journal of Clinical Epidemiology. 2003;56(9):826–32. doi:10.1016/S0895-4356(03)00207-5
OpenUrl CrossRef PubMed Web of Science
22.↵
Moore C, Sambrook J, Walker M, et al. The INTERVAL trial to determine whether intervals between blood donations can be safely and acceptably decreased to optimise blood supply: study protocol for a randomised controlled trial. Trials. 2014;15:363–. doi:10.1186/1745-6215-15-363
OpenUrl CrossRef PubMed
23.↵
NHS Blood Donation Who can give blood. 2022. https://www.blood.co.uk/who-can-give-blood.
24.↵
Evans A, Bridgewater B, Liu Q, et al. High resolution mass spectrometry improves data quantity and quality as compared to unit mass resolution mass spectrometry in high-throughput profiling metabolomics. 2014;4(2):1.
OpenUrl
25.↵
Rhee EP, Waikar SS, Rebholz CM, et al. Variability of Two Metabolomic Platforms in CKD. Clinical Journal of the American Society of Nephrology. 2019;14(1):40. doi:10.2215/CJN.07070618
OpenUrl Abstract/FREE Full Text
26.↵
de Mutsert R, den Heijer M, Rabelink TJ, et al. The Netherlands Epidemiology of Obesity (NEO) study: study design and data collection. Eur J Epidemiol. 2013;28(6):513–23. doi:10.1007/s10654-013-9801-3
OpenUrl CrossRef PubMed
27.↵
Faquih T, van Smeden M, Luo J, et al. A Workflow for Missing Values Imputation of Untargeted Metabolomics Data. Metabolites. 2020;10(12). doi:10.3390/metabo10120486
OpenUrl CrossRef
28.↵
Hoerl AE, Kennard RW. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics. 1970;12(1):55–67. doi:10.1080/00401706.1970.10488634
OpenUrl CrossRef
29.↵
Saner C, Harcourt BE, Pandey A, et al. Sex and puberty-related differences in metabolomic profiles associated with adiposity measures in youth with obesity. Metabolomics. 2019;15(5):75. doi:10.1007/s11306-019-1537-y
OpenUrl CrossRef
30.↵
Hägg S, Jylhävä J. Sex differences in biological aging with a focus on human studies. eLife. 2021;10:e63425. doi:10.7554/eLife.63425
OpenUrl CrossRef
31.
Nakamura E, Miyao KJTJoGSABS, Sciences M. Sex differences in human biological aging. 2008;63(9):936–44.
OpenUrl
32.↵
McCrory C, Fiorito G, McLoughlin S, et al. Epigenetic clocks and allostatic load reveal potential sex-specific drivers of biological aging. 2020;75(3):495–503.
OpenUrl
33.↵
Riley RD, Snell KIE, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: Part I - Continuous outcomes. Statistics in medicine. 2019;38(7):1262–75. doi:10.1002/sim.7993
OpenUrl CrossRef PubMed
34.↵
Copas JB. Regression, prediction and shrinkage. Journal of the Royal Statistical Society: Series B. 1983;45(3):311–35.
OpenUrl
35.↵
Copas JB. Using regression models for prediction: shrinkage and regression to the mean. Statistical methods in medical research. 1997;6(2):167–83. doi:10.1177/096228029700600206
OpenUrl CrossRef PubMed
36.↵
WHO WHO. A healthy lifestyle - WHO recommendations. World Health Organization: WHO. 2010.
37.↵
Siontis GCM, Tzoulaki I, Castaldi PJ, Ioannidis JPA. External validation of new risk prediction models is infrequent and reveals worse prognostic discrimination. Journal of Clinical Epidemiology. 2015;68(1):25–34. doi:10.1016/j.jclinepi.2014.09.007
OpenUrl CrossRef PubMed
38.↵
Van Calster B, McLernon DJ, van Smeden M, et al. Calibration: the Achilles heel of predictive analytics. BMC Medicine. 2019;17(1):230. doi:10.1186/s12916-019-1466-7
OpenUrl CrossRef PubMed

View the discussion thread.

Posted November 09, 2023.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (405)
Allergy and Immunology (714)
Anesthesia (209)
Cardiovascular Medicine (2986)
Dentistry and Oral Medicine (337)
Dermatology (254)
Emergency Medicine (446)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1056)
Epidemiology (12847)
Forensic Medicine (12)
Gastroenterology (839)
Genetic and Genomic Medicine (4652)
Geriatric Medicine (428)
Health Economics (735)
Health Informatics (2960)
Health Policy (1076)
Health Systems and Quality Improvement (1097)
Hematology (394)
HIV/AIDS (938)
Infectious Diseases (except HIV/AIDS) (14173)
Intensive Care and Critical Care Medicine (859)
Medical Education (430)
Medical Ethics (116)
Nephrology (478)
Neurology (4441)
Nursing (239)
Nutrition (653)
Obstetrics and Gynecology (820)
Occupational and Environmental Health (741)
Oncology (2310)
Ophthalmology (655)
Orthopedics (260)
Otolaryngology (328)
Pain Medicine (285)
Palliative Medicine (85)
Pathology (504)
Pediatrics (1205)
Pharmacology and Therapeutics (511)
Primary Care Research (506)
Psychiatry and Clinical Psychology (3824)
Public and Global Health (7039)
Radiology and Imaging (1557)
Rehabilitation Medicine and Physical Therapy (931)
Respiratory Medicine (927)
Rheumatology (447)
Sexual and Reproductive Health (450)
Sports Medicine (387)
Surgery (494)
Toxicology (60)
Transplantation (213)
Urology (186)

[1] 1.↵
North BJ, Sinclair DA. The intersection between aging and cardiovascular disease. Circ Res. 2012;110(8):1097–108. doi:10.1161/CIRCRESAHA.111.246876
OpenUrl Abstract/FREE Full Text

[2] 2.↵
Broglio SP, Eckner JT, Paulson HL, Kutcher JS. Cognitive decline and aging: the role of concussive and subconcussive impacts. 2012;40(3):138.
OpenUrl

[3] 3.↵
Lopez-Otin C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013;153(6):1194–217. doi:10.1016/j.cell.2013.05.039
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Hoffman JM, Lyu Y, Pletcher SD, Promislow DEL. Proteomics and metabolomics in ageing research: from biomarkers to systems biology. Essays Biochem. 2017;61(3):379–88. doi:10.1042/EBC20160083
OpenUrl Abstract/FREE Full Text

[5] 5.↵
Brooks-Wilson AR. Genetics of healthy aging and longevity. 2013;132(12):1323–38.
OpenUrl

[6] 6.↵
Horvath S. DNA methylation age of human tissues and cell types. Genome Biology. 2013;14(10):3156. doi:10.1186/gb-2013-14-10-r115
OpenUrl CrossRef

[7] 7.↵
Jansen R, Han LK, Verhoeven JE, et al. An integrative study of five biological clocks in somatic and mental health. Elife. 2021;10. doi:10.7554/eLife.59479
OpenUrl CrossRef

[8] 8.↵
Alonso A, Marsal S, Julia A. Analytical methods in untargeted metabolomics: state of the art in 2015. Front Bioeng Biotechnol. 2015;3:23. doi:10.3389/fbioe.2015.00023
OpenUrl CrossRef

[9] 9.↵
Rattray NJW, Deziel NC, Wallach JD, et al. Beyond genomics: understanding exposotypes through metabolomics. Hum Genomics. 2018;12(1):4. doi:10.1186/s40246-018-0134-x
OpenUrl CrossRef

[10] 10.↵
Rutledge J, Oh H, Wyss-Coray T. Measuring biological age using omics data. Nature Reviews Genetics. 2022. doi:10.1038/s41576-022-00511-7
OpenUrl CrossRef

[11] 11.↵
Martin FJ, Montoliu I, Kussmann M. Metabonomics of ageing - Towards understanding metabolism of a long and healthy life. Mech Ageing Dev. 2017;165(Pt B):171–9. doi:10.1016/j.mad.2016.12.009
OpenUrl CrossRef

[12] 12.↵
Yu Z, Zhai G, Singmann P, et al. Human serum metabolic profiles are age dependent. Aging Cell. 2012;11(6):960–7. doi:10.1111/j.1474-9726.2012.00865.x
OpenUrl CrossRef PubMed

[13] 13.↵
Macdonald-Dunlop E, Taba N, Klarić L, et al. A catalogue of omics biological ageing clocks reveals substantial commonality and associations with disease risk. Aging. 2022;14(2):623–59. doi:10.18632/aging.203847
OpenUrl CrossRef

[14] 14.↵
van den Akker EB, Trompet S, Barkey Wolf JJH, et al. Metabolic Age Based on the BBMRI-NL (1)H-NMR Metabolomics Repository as Biomarker of Age-related Disease. Circulation. Genomic and precision medicine. 2020;13(5):541–7. doi:10.1161/circgen.119.002610
OpenUrl CrossRef

[15] 15.
Hertel J, Friedrich N, Wittfeld K, et al. Measuring Biological Age via Metabonomics: The Metabolic Age Score. J Proteome Res. 2016;15(2):400–10. doi:10.1021/acs.jproteome.5b00561
OpenUrl CrossRef PubMed

[16] 16.↵
Rist MJ, Roth A, Frommherz L, et al. Metabolite patterns predicting sex and age in participants of the Karlsruhe Metabolomics and Nutrition (KarMeN) study. PLoS One. 2017;12(8):e0183228. doi:10.1371/journal.pone.0183228
OpenUrl CrossRef

[17] 17.↵
Hwangbo N, Zhang X, Raftery D, et al. A Metabolomic Aging Clock Using Human Cerebrospinal Fluid. The Journals of Gerontology: Series A. 2021;77(4):744–54. doi:10.1093/gerona/glab212 %J The Journals of Gerontology: Series A
OpenUrl CrossRef

[18] 18.↵
Smith G. Step away from stepwise. Journal of Big Data. 2018;5(1):32. doi:10.1186/s40537-018-0143-6
OpenUrl CrossRef

[19] 19.↵
Ramspek CL, Jager KJ, Dekker FW, Zoccali C, van Diepen M. External validation of prognostic models: what, why, how, when and where? Clinical Kidney Journal. 2020;14(1):49–58. doi:10.1093/ckj/sfaa188 %J Clinical Kidney Journal
OpenUrl CrossRef

[20] 20.↵
Steyerberg EW. Clinical Prediction Models. 2nd ed. Cham, Switzerland: Springer International Publishing; 2019.

[21] 21.↵
Bleeker SE, Moll HA, Steyerberg EW, et al. External validation is necessary in prediction research:: A clinical example. Journal of Clinical Epidemiology. 2003;56(9):826–32. doi:10.1016/S0895-4356(03)00207-5
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Moore C, Sambrook J, Walker M, et al. The INTERVAL trial to determine whether intervals between blood donations can be safely and acceptably decreased to optimise blood supply: study protocol for a randomised controlled trial. Trials. 2014;15:363–. doi:10.1186/1745-6215-15-363
OpenUrl CrossRef PubMed

[23] 23.↵
NHS Blood Donation Who can give blood. 2022. https://www.blood.co.uk/who-can-give-blood.

[24] 24.↵
Evans A, Bridgewater B, Liu Q, et al. High resolution mass spectrometry improves data quantity and quality as compared to unit mass resolution mass spectrometry in high-throughput profiling metabolomics. 2014;4(2):1.
OpenUrl

[25] 25.↵
Rhee EP, Waikar SS, Rebholz CM, et al. Variability of Two Metabolomic Platforms in CKD. Clinical Journal of the American Society of Nephrology. 2019;14(1):40. doi:10.2215/CJN.07070618
OpenUrl Abstract/FREE Full Text

[26] 26.↵
de Mutsert R, den Heijer M, Rabelink TJ, et al. The Netherlands Epidemiology of Obesity (NEO) study: study design and data collection. Eur J Epidemiol. 2013;28(6):513–23. doi:10.1007/s10654-013-9801-3
OpenUrl CrossRef PubMed

[27] 27.↵
Faquih T, van Smeden M, Luo J, et al. A Workflow for Missing Values Imputation of Untargeted Metabolomics Data. Metabolites. 2020;10(12). doi:10.3390/metabo10120486
OpenUrl CrossRef

[28] 28.↵
Hoerl AE, Kennard RW. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics. 1970;12(1):55–67. doi:10.1080/00401706.1970.10488634
OpenUrl CrossRef

[29] 29.↵
Saner C, Harcourt BE, Pandey A, et al. Sex and puberty-related differences in metabolomic profiles associated with adiposity measures in youth with obesity. Metabolomics. 2019;15(5):75. doi:10.1007/s11306-019-1537-y
OpenUrl CrossRef

[30] 30.↵
Hägg S, Jylhävä J. Sex differences in biological aging with a focus on human studies. eLife. 2021;10:e63425. doi:10.7554/eLife.63425
OpenUrl CrossRef

[31] 31.
Nakamura E, Miyao KJTJoGSABS, Sciences M. Sex differences in human biological aging. 2008;63(9):936–44.
OpenUrl

[32] 32.↵
McCrory C, Fiorito G, McLoughlin S, et al. Epigenetic clocks and allostatic load reveal potential sex-specific drivers of biological aging. 2020;75(3):495–503.
OpenUrl

[33] 33.↵
Riley RD, Snell KIE, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: Part I - Continuous outcomes. Statistics in medicine. 2019;38(7):1262–75. doi:10.1002/sim.7993
OpenUrl CrossRef PubMed

[34] 34.↵
Copas JB. Regression, prediction and shrinkage. Journal of the Royal Statistical Society: Series B. 1983;45(3):311–35.
OpenUrl

[35] 35.↵
Copas JB. Using regression models for prediction: shrinkage and regression to the mean. Statistical methods in medical research. 1997;6(2):167–83. doi:10.1177/096228029700600206
OpenUrl CrossRef PubMed

[36] 36.↵
WHO WHO. A healthy lifestyle - WHO recommendations. World Health Organization: WHO. 2010.

[37] 37.↵
Siontis GCM, Tzoulaki I, Castaldi PJ, Ioannidis JPA. External validation of new risk prediction models is infrequent and reveals worse prognostic discrimination. Journal of Clinical Epidemiology. 2015;68(1):25–34. doi:10.1016/j.jclinepi.2014.09.007
OpenUrl CrossRef PubMed

[38] 38.↵
Van Calster B, McLernon DJ, van Smeden M, et al. Calibration: the Achilles heel of predictive analytics. BMC Medicine. 2019;17(1):230. doi:10.1186/s12916-019-1466-7
OpenUrl CrossRef PubMed