Identifying Metabolomic and Proteomic Biomarkers for Age-Related Morbidity in a Population-Based Cohort - the Cooperative Health Research in South Tyrol (CHRIS) study
======================================================================================================================================================================

* Essi Hantikainen
* Christian X. Weichenberger
* Nikola Dordevic
* Vinicius Verri Hernandes
* Luisa Foco
* Martin Gögele
* Roberto Melotti
* Cristian Pattaro
* Markus Ralser
* Fatma Amari
* Vadim Farztdinov
* Michael Mülleder
* Peter P. Pramstaller
* Johannes Rainer
* Francisco S. Domingues

## Abstract

Identifying biomarkers able to discriminate individuals on different health trajectories is crucial to understand the molecular basis of age-related morbidity. We investigated multi-omics signatures of general health and organ-specific morbidity, as well as their interconnectivity. We examined cross-sectional metabolome and proteome data from 3,142 adults of the Cooperative Health Research in South Tyrol (CHRIS) study, an Alpine population study designed to investigate how human biology, environment, and lifestyle factors contribute to people’s health over time. We had 174 metabolites and 148 proteins quantified from fasting serum and plasma samples. We used the Cumulative Illness Rating Scale (CIRS) Comorbidity Index (CMI), which considers morbidity in 14 organ systems, to assess health status (any morbidity vs. healthy). Omics-signatures for health status were identified using random forest (RF) classifiers. Linear regression models were fitted to assess directionality of omics markers and health status associations, as well as to identify omics markers related to organ-specific morbidity.

Next to age, we identified 21 metabolites and 10 proteins as relevant predictors of health status and results confirmed associations for serotonin and glutamate to be age-independent. Considering organ-specific morbidity, several metabolites and proteins were jointly related to endocrine, cardiovascular, and renal morbidity. To conclude, circulating serotonin was identified as a potential novel predictor for overall morbidity.

Key words
*   Health Status
*   Metabolomics
*   Proteomics
*   Comorbidity
*   Aging
*   CHRIS study

## 1. Introduction

Non-communicable diseases (NCDs) are the leading cause of morbidity and premature mortality globally1. Age itself is the leading predictor for most NCDs: NCD prevalence increases with age and multiple diseases tend to cluster among older individuals2. Common biological processes triggered by molecular damage and modified by cellular and systemic responses drive biological aging and modify risks for multiple diseases in a tissue-, organ- and system-specific manner3. In turn, these health outcomes feed back into the underlying biological processes impacting the rate of aging and enhancing the risk for further disease4. It is therefore important to enhance our understanding of the molecular basis of age-related diseases to improve measures for disease prevention and general health5. Omics-based biomarkers provide insights into the molecular processes driving functional decline, they also help monitoring health trajectories, age-related physiological decline and disease onset6. Such biomarkers can also support the development of prevention strategies targeting those processes and provide surrogate endpoints in intervention studies5. The goal is early identification of individuals at higher risk of diseases, who will benefit most from such preventive interventions7. Protein biomarkers have the advantage of being direct biological effectors of the underlying genomic background7. Serum metabolomics on the other hand provides a snapshot of general physiological state of an organism, and is influenced by genetic, epigenetic, and environmental factors8,9. For instance, Tanaka et al. (2020)7 identified a proteomic signature of aging involving 76 proteins and predicting accumulation of chronic diseases and all-cause mortality. You et al. (2023)10 developed a disease specific proteomic risk score, which stratified the risk for 45 common disease conditions, resulting into an equivalent predictive performance over established clinical indicators for almost all endpoints. Similarly, Gadd et al. (2024)11 demonstrated the utility of proteomic scores in predicting several 10-year incident outcomes beyond factors, such as age, sex, lifestyle and clinically relevant biomarkers, showing the relevance of early proteomic contributions to major age-related diseases. Pietzner et al. (2021)12 used untargeted metabolomics to investigate signatures of multimorbidity and found that 420 metabolites are shared between at least two chronic diseases. A recent work demonstrated the potential of metabolomic profiles as a multi-disease assay to inform on the risk of many common diseases simultaneously. For 10-year outcome prediction of 15 selected endpoints a combination of age, sex and the metabolomic state was equal or outperformed established predictors13.

Advances in different omics technologies and computing capabilities have also enabled the integration of multi-omics data to capture the complex molecular interplay of health and disease14. In TwinsUK, using data from 510 women, Zierer et al. (2016)15 integrated four high-throughput omics datasets and demonstrated the interconnectivity of age-related diseases by highlighting molecular markers of the aging process, which might drive disease comorbidities.

Here, we aimed to identify multi-omics signatures of general health among adult individuals using cross sectional data from the population-based Cooperative Health Research in South Tyrol (CHRIS) study, an Alpine population study designed to investigate how human biology, environment, and lifestyle factors contribute to people’s health over time. We specifically included targeted serum metabolomics16 and plasma proteomics data17. The Cumulative Illness Rating Scale (CIRS) based Comorbidity Index (CMI)18, reflecting disease status and severity in 14 relevant organ systems, was used to assess health status by classifying individuals into having any morbidity (CMI≥1) or being healthy (CMI=0). We then applied predictive models using a random forest classifier to determine health status. The analyses were complemented by multiple linear regression models investigating associations and inter-dependencies of individual proteins or metabolites with morbidity assessed by each organ-specific CIRS domains, such as the endocrine-metabolic or the renal domains.

## 2. Results

The main analytic sample consisted of n=3,142 adult individuals from the CHRIS study with available metabolomics and proteomics data. The AbsoluteIDQ® p180 kit from Biocrates (Biocrates Life Sciences AG, Innsbruck, Austria) was used for metabolite quantification in fasting serum samples16. The high abundance plasma proteome was determined using the Scanning SWATH mass spectrometry-based approach17. We first present the study sample’s main characteristics by health status (any morbidity vs. healthy) and the relationships between the co-occurring comorbidities. Next, we provide findings from the random forest (RF) analysis. The analyses were complemented by the integration of multiple linear regression models to better characterize actual associations of each significant feature from the RF analysis with health status. Finally, we investigated associations of omics signatures with organ specific morbidity using linear regression models.

### 2.1 Health status and characteristics of the study sample

The characteristics for the main analytic sample are presented in **Table 1**. The CIRS organ domains most completely described by the available data (completeness≥50%) were the hypertension, cardiac, respiratory, neurological, renal, vascular, endocrine-metabolic, hepatic and psychiatric/behavioral domains (**Table 1**). The remaining domains had <50% completeness and in general a lower proportion of unhealthy individuals was observed for these domains (**Table 1**). Among all individuals, 56% (n=1,751) were affected by at least one morbidity condition (CMI≥1). As expected, these were on average older than healthy individuals (CMI=0; **Table 1**, **Figure 1)**. The top five organ domains affected by health problems were the hepatic, vascular, hypertension, endocrine-metabolic and the respiratory domain, with 27.4%, 14.9%, 12.8%, 10.3%, and 8.8% of morbidity prevalence estimates, respectively. We additionally provide the characteristics for the CHRIS cohort, regardless of available omics data, to confirm the robustness of main characteristics and estimated disease prevalences in our analytic sample. These are reported in **Supplementary Table S1 and Figure S1**.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F1)

Figure 1. 
Distribution of health status and age in the CIRSHS sample. **A)** Age distribution by Comorbidity Index, which is expressed as the total number of CIRS organ domains scoring ≥2. **B)** Absolute distribution of “Any morbidity” (yes,no) by age-group. The relative proportion of red-bars within each stacked column-bar represents the age-group specific prevalence of any morbidity. **C)** Age distribution by morbidity conditions in the specific CIRS domains.

View this table:
[Table 1.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/T1)

Table 1. Characteristics of the main analytic sample with information on health status (any morbidity vs. healthy)*a* and CIRS domain specific morbidityb.

Next, we explored relationships between the 14 CIRS domains through ordinary correspondence analysis (OCA; **Figure 2)**. We observed proximity between the hypertension, renal and endocrine domains, the cardiac and the vascular domains, and between the neurological and the psychiatric domains. To compare the robustness of the comorbidity relationships we present the OCA analysis results for the CHRIS cohort in **Supplementary Figure S2**.

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F2)

Figure 2. 
Biplot of ordinary correspondence analysis considering all subjects with available metabolomics and proteomics data, presenting relations between the 14 CIRS domains. CIRS domains with a stronger relation have longer (size consistency) and closer (direction consistency) loadings.

### 2.2 Multi omics signatures of health status

To avoid confounding of results due to the impact of medication17, we performed the analysis on metabolites and protein abundances adjusted for use of medications that were not considered in the CIRS definition (**Supplementary Table S2**).

For the RF analysis we built a model including age, sex, 174 metabolites and 148 proteins as predictors. Overall, 100 RF models were generated, each containing 500 trees per model, using repeated random subsampling with 80% training and 20% validation set sizes, respectively. We compared the performance measures using the area under the receiver operating curve (ROC AUC;, as well as the Matthew’s correlation coefficient (MCC) and MCC-F1. The RF model showed moderate performance (AUC=0.743, 95%Conficence Interval (CI)=0.740, 0.747; **Figure 3**; MCC and MCC-F1 are presented in **Supplementary Figure S3).** In addition, we built models that included varying sets of the predictors, which were a) age and sex, b) age, sex and metabolites, c) age, sex and proteins, and d) age, sex, metabolites, proteins. The performance comparison of these to the full model are presented in **Supplementary Text S1**. The metabolomics/proteomics based models (b,c,d) show greater performance but the differences are not statistically significant.

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F3)

Figure 3. 
Performance evaluation of 100 random forest models including as predictors age, sex, 174 metabolites and 148 proteins to classify health status (any morbidity vs. healthy) represented as ROC AUCs. In the plots each validation run is shown as a gray line with the average curve shown in black.

We next estimated the significance of the selected individual features using permutation testing. We obtained 33 significant features, including 21 metabolites and 10 proteins, with age being the most important variable and sex the least relevant (**Figure 4A**). Among the top ten omics features were the metabolites serotonin, glutamate, hexose, three acylcarnitines (C18:1, C16:1, C16), ornithine, and the proteins CFH, A2M and IGFALS. The medication-adjusted abundance distributions of these metabolites and proteins stratified by health status are presented in **Figure 4B**. Individuals with any morbidity had lower mean abundance of serotonin, taurine and lysoPC C18:2, and higher mean abundance of all other metabolites. Regarding proteins, individuals with any morbidity had lower mean abundances of A2M, IGFALS, IGHM, and F2, and higher abundance of CFH, C4BPA, A1BG, APOH, AFM, and RBP4.

![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F4.medium.gif)

[Figure 4.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F4)

Figure 4. 
Evaluation of importance for significant features identified through the random forest models including as predictors age, sex, 174 metabolites and 148 proteins, for health status (any morbidity vs healthy). #: Rank of the specific feature. Predictor significance was estimated by permutation testing, which generates 100 background distributions, yielding 100 *p*-values for each predictor variable. Box plots are color-coded by the number of times a *p*-value was significant for such a variable (*p* <0.05): red, all 100 runs returned a significant *p*-value; orange, between 80 and 99 runs returned a significant *p*-value; blue, between 50 and 79 runs returned a significant *p*-value. Panels are restricted to features that are significant in at least 50% of all permutation runs. Density plots are scaled to have the same width, a median of zero (0) (horizontal bar) and standard deviation of one (1) for the healthy group. For comparability, the any morbidity group data have been scaled to the standard deviation of the healthy group. **A)** Mean decrease in Gini Index for significant features. **B)** Violin plots presenting scaled abundance distributions stratified by health status for significant features identified through RF, ordered by importance from left to right.

To better characterize the actual association of each significant feature from the RF analysis with health status, we performed separate regression analyses with the abundances of each identified molecular feature as the response variable and health status (any morbidity vs. healthy), age and sex as the explanatory variables, allowing us to evaluate the influence of the feature on health status independently of age or sex. Coefficients for health status were extracted from these models and are presented in **Figure 5** and **Supplementary Table S3**. Individuals with any morbidity had, on average, 22% lower mean abundance of serotonin and 12% higher abundance of serum glutamate. Twelve other features (C18:1, C16:1, lyso PC a C18:2, PC aa C32:1, tyrosine, taurine, hexose, kynurenine, AFM, CFH, RBP4, and A1BG) passed the multiple-testing correction, however, the observed differences in abundances did fall within the range of the technical variability and were thus not considered significant. Given that age was the strongest predictor for health status in the RF model, we further investigated and compared the coefficients for age and health association from the regression models. For some features, such as citrulline, abundances were almost entirely explained by age and the coefficient for the association with health status from the regression model was only very small, and not significant. For others, such as serotonin and glutamate, associations with health status were strong, even in these age-adjusted models, suggesting an age-independent association of these features with health.

![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F5.medium.gif)

[Figure 5.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F5)

Figure 5. 
Volcano plot for the differential abundance of metabolomic and proteomic features of health status (any morbidity vs. healthy). Coefficients represent the log2□difference in average concentrations between the groups.

### 2.3 Metabolomic and proteomic signatures related to CIRS domain specific morbidity

We further evaluated morbidity markers, implementing separate regression models for medication adjusted abundances of all 174 metabolites and 148 proteins with each CIRS domain as well as age and sex as explanatory variables, and evaluated whether markers were shared across - or specific for any domain (**Figure 6; Supplementary Table S4)**. In total, 83 significant omics-disease associations were identified, with 40 metabolites and 17 proteins being significant for ≥1 CIRS domain. Associations were observed with the cardiac, vascular, hypertension, endocrine-metabolic, renal, hepatic, psychiatric, neurological, respiratory and lower gastrointestinal and genitourinary domains (**Figure 6**). Eleven metabolites (serotonin, glutamate, isoleucine, taurine, dihydroxyphenylalanine, several glycerophospholipids and acylcarnitines) and three proteins (F2, C3, A2M) were shared across multiple domains. For example, serotonin was related to the cardiac, vascular, hypertension and psychiatric systems, and glutamate to the hypertension, endocrine-metabolic, respiratory and hepatic domains. The proteins F2 and A2M were related to the cardiac, vascular, and the renal domains, and C3 to the hypertension and endocrine-metabolic domains. Overall, phosphatidylcholines and sphingolipids were all negatively associated to morbidity conditions in the related CIRS domain, whereas for acylcarnitines positive associations were observed. The directionality of biogenic amines and amino acids was not consistent across classes but remained consistent across CIRS domains. For proteins, negative associations were observed with APOB, APOD, APOM, IGHM, CD5L, PON1, FCN3, F2, C4BPA, IGHG2 and IGKC, whereas positive associations were found with SERPINA1, AFM, VTN, C3, HP, SERPIND1 and A2M. In general, all metabolites and proteins that were significantly associated with multiple domains showed consistent effect directions, being either negative or positive.

![Figure 6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/15/2024.07.15.24310410/F6.medium.gif)

[Figure 6.](http://medrxiv.org/content/early/2024/07/15/2024.07.15.24310410/F6)

Figure 6. 
Heatmap presenting associations between metabolites, proteins and CIRS organ domains. Associations were obtained from multiple linear regression models adjusted for age and sex. Hierarchical clustering using the Eucledian distance as the similarity measure was applied. Only metabolites and proteins that were significantly associated with at least one specific CIRS domain are presented. *: highlights significant associations. β-coefficients are color-coded: red hues represent positive relations and blue hues represent negative relations.

## 3. Discussion

In this cross-sectional analysis of CHRIS study data we identified age and 31 molecular features predicting overall health status (any morbidity vs healthy) in adults using a random forest classifier. These features included 21 metabolites (5 biogenic amines; 4 amino acids; 6 acylcarnitines; 5 glycerophospholipids; and hexose monosaccharides), and 10 plasma proteins. Subsequent regression analyses confirmed a sizeable association of health status with both serotonin and glutamate, which was independent of age and sex. Analyses on single CIRS domains further identified multiple metabolite- and protein-disease associations, most being related to cardiovascular, hypertension, endocrine-metabolic and renal morbidity, revealing strong molecular interconnectivity across these related domains.

Several studies have investigated omics markers of aging, longevity6,19, aging-related chronic diseases7,11–13,20,21 but only few have integrated multi-omics data with regards to general health assessment15,22. Independent of the approach used, great heterogeneity exists in the included omics technologies, protein and/or metabolite coverage, analytic tools as well as the outcome of interest, which makes comparison of studies challenging. In our study, multi-omics RF models identified several molecular features relevant for predicting health status (any morbidity vs. healthy).

Among the age-independent predictive features was serotonin, which is involved in the regulation of energy, glucose and lipid metabolism. Changes in the serotonin system are known risk factors for many age-related diseases, such as diabetes and cardiovascular disease23,24, which was also observed in our study. Up to 95% of serotonin is produced in the gut, and only 5% of serotonin is synthesized by neurons, mainly in the central nervous system24. Although serotonin does not cross the blood-brain barrier, intestinal serotonin release causes neuronal activation in the brain stem, thus indirectly affecting the brain23. No other study has linked serotonin as a healthy aging marker per se. However, serotonin is a tryptophan derivative, and as inflammation and stress activate the tryptophan metabolism through the kynurenine pathway25, this consequently causes decreased production of serotonin26. The age associated upregulation of kynurenine and downregulation of serotonin therefore indicate a relevant role of tryptophan metabolism in inflammaging and aging27. Although tryptophan was included in the metabolomics panel in this study, it was not identified among the selected features in the RF model. These results indicate a robust relation between health and circulating serotonin levels, but the role of tryptophan metabolism and related pathways, and the putative causal relationships deserves further investigation.

Glutamate was also identified as an age-independent predictor for health status. This amino acid has been previously associated with physical frailty in elderly individuals28, supporting our findings. It has further been linked to cardiovascular disease and been suggested to be a potential biomarker of abdominal obesity and metabolic risk29. Glutamate is an important excitatory neurotransmitter in the brain, and although concentrations in the brain are much higher than in plasma as the blood brain barrier is not very permeable to glutamate30, it is also one of the most abundant amino acids in the liver, kidney and skeletal muscle, showing great metabolic versatility31. Glutamate plays a key role in protein synthesis and degradation32 and is a by-product of the catabolism of branched chain amino acids29. By linking amino acid and carbohydrate metabolism glutamate supports energy production, which has further implications for insulin secretion32. More specifically, glutamate is also a source of alpha-ketoglutarate, which plays a key role in energy metabolism and aging processes and that has been implicated in improved life and health span33–36. In this study the morbidity group is associated with higher levels of circulating glutamate, which might reflect a depletion of alpha-ketoglutarate relative to healthy individuals.

Other relevant molecular features (hexose, C18:1, C16:1, lyso PC a C18:2, PC aa C32:1, tyrosine, taurine, kynurenine, AFM, CFH, RBP4, A1BG) were determined with the RF model, and the age independent associations confirmed by linear regression, but the difference in abundances was lower than the cut-off set relative to the coefficient of variation. We provide further discussion of those features in **Supplementary Text S2**.

When looking into organ-specific morbidity, most health conditions in our study were related to the hepatic domain, followed by the vascular, hypertension, endocrine-metabolic and the renal domains. Using OCA, we observed closer relatedness between the renal, hypertension and endocrine-metabolic, as well as the vascular and cardiac domains. Our analyses linking omics markers to specific CIRS domains supported such connections. For example, we observed the metabolites serotonin, glutamate, taurine, isoleucine, three glycerophospholipids (lysoPC a C17.0/18.1/18.2), and two acylcarnitines (C3, C6 (C4)-1-DC) and the proteins C3, APOB, F2, A2M and HP as common markers among the cardiac, vascular, hypertension, endocrine-metabolic, renal but also the respiratory domains. The co-occurrence of type-2 diabetes and cardiovascular diseases is very common, and its high degree of connectivity has been found with other diseases as well37. In addition, cardiovascular and kidney disease are closely interrelated and disease of one organ is known to cause dysfunction of the other38. Remaining molecular signatures were domain specific showing no connections among each other, such as the proteins VTN, APOD and a set of other amino acids (valine, leucine, alanine) being only related to the endocrine-metabolic domain. Such associations have also been reported previously in the literature39–41. Untargeted metabolomics analysis in the prospective, population-based EPIC-Norfolk study identified 420 metabolites that were shared among two or more chronic diseases12, further observing high connectivity among cardiometabolic and respiratory diseases across different biochemical classes of metabolites. Those findings highlight potential biological pathways related to the onset of multiple chronic diseases, such as liver and kidney function, lipid and glucose metabolism and low-grade inflammation, among others12.

Strengths of our investigation are the wealth of the available data resource, foremost with the availability of both plasma metabolomics and proteomics data among the same participants. Additionally available phenotypic parameters, both quantitative (such as blood parameters), self-reported or collected by trained study-nurses allowed a detailed characterization of the participants and enabled the assessment of the health status through CIRS, which has been shown to be a useful tool to measure morbidity in clinical research42.

A first limitations is given by the study design, due to which participants with severe morbidity might have been underrepresented. In addition, metabolite and protein concentrations might be influenced by lifestyle, hormonal changes such as introduced by menopausal status and its treatment, as well as medication. Available information on medication use, irrelevant for the CIRS assessment, allowed adjustments to exclude spurious influence of this factor on the results. However, as the CIRS itself considers medication status to characterize certain diseases, it was not possible to distinguish whether the observed associations were driven by the disease itself or by the treatment for the given or related diseases. The present analyses are also limited to the set of metabolites and proteins that are possible to quantify by the analytical approaches used. Finally, given the cross-sectional design of the study we are only able to assess associations, hence no conclusions on temporal antecedence and causality can be drawn.Finally, we did not validate our models on an independent testing set.Despite these limitations, we were able to replicate several findings from previous studies, which supports reliable data and procedural quality within this study. Overall, using multi-omics data for profiling health status has great potential to identify changes in health trajectories at an earlier stage in life. This could help to develop new, effective target therapies for treating related as well as seemingly unrelated diseases occurring at the same time by uncovering common biological pathways connecting different underlying pathogenic mechanisms43.

To conclude, we identified several molecular signatures of overall health status. Specifically, circulating serotonin is suggested as a promising novel predictor for health and morbidity independent of age, implicating a potential key role of tryptophan metabolism and serotonin related pathways in sustaining health. The results also point to glutamate as another predictor for health and morbidity in adults, in agreement with previous studies relating this amino acid to frailty, metabolic and cardiovascular health. Future studies are needed to investigate the mediating role of these signatures in relation to lifestyle and the environment to promote healthy aging. In this regard the application of mendelian randomization approaches should be considered to further investigate causal links between circulating serotonin, serotonin metabolism and chronic disease.

## 4. Methods

### 4.1 Study cohort

The CHRIS study is a population-based cohort of 13,393 adults aged 18 and over recruited from 13 towns in the alpine Val Venosta/Vinschgau district in the Bolzano-South Tyrol province of northern Italy. The study was designed to investigate the genetic and molecular basis of age-related common chronic conditions and their interaction with lifestyle and environment in the general population44. The study was approved by the Ethics Committee of the Healthcare System of the Autonomous Province of Bolzano. The study conforms to the Declaration of Helsinki, and with national and institutional legal and ethical requirements.

Metabolomics data were available for n=6,415 individuals and proteomics data for n=3,541. After excluding participants who were not fasting (n=475) or had missing information on fasting status (n=2), as well as women who were pregnant (n=25) or unsure about pregnancy (n=9), n=3,142 individuals with overlapping omics data were available for analysis.

### 4.2 Data collection

Data, including collection of anthropometric measurements, blood and urine standard laboratory tests, blood pressure measurement and lifestyle information were collected at the study center following overnight fasting44. Laboratory test data included all main cardiovascular and metabolic risk factors, and markers of iron metabolism, coagulation, renal damage, thyroid, and liver function. Blood (serum and plasma) and urine samples were collected and stored in a biobank.

To obtain information on disease history, participants were interviewed by trained study assistants. Specific clinical domains covered by the questionnaires were the circulatory and nervous system, as well as psychiatric disorders, cognition, autonomic and genitourinary function, endocrine, nutritional, and metabolic diseases. A detailed overview of the mode of assessment can be found elsewhere44. In addition, each questionnaire contained a section for “other diseases”, where participants could report any other condition not explicitly included in the domains, as free text. Detailed medication information was collected by scanning the barcodes of the boxes of the medication used within the seven days prior to study center visit and brought by the study participants to the interview. The Anatomical Therapeutic Chemical (ATC) medicinal product classification coding system45 and the mode, frequency, and duration of drug administration was recorded for each scanned medication.

### 4.3 Assessment of the health status

To assess health in the CHRIS study sample, we used the Cumulative Illness Rating Scale (CIRS), which is a clinically relevant tool to measure the chronic medical illness burden by taking the number and the severity of chronic diseases into account46. For the purpose of this study we used the revised CIRS18, which assesses 14 organ related domains rating each of them according to the degree of severity ranging from: grade 0 (no impairment) to grade 4 (extremely severe impairment). Based on this guideline, we determined for all CHRIS participants the CIRS score for each domain by screening our questionnaire and interview data, as well as laboratory and clinical parameters for any given medical condition. We next derived the CIRS Comorbidity Index (CMI), which calculates the total number of CIRS organ domains with a score≥2 (domains with moderate or severe morbidity). To define health status we classified individuals as being healthy if the CMI=0 and as having any morbidity if CMI≥1, which means that at least one CIRS organ domain was identified with moderate or severe morbidity. We additionally considered organ specific health by classifying individuals as being healthy if the CIRS organ domain scored<2, or having morbidity if the respective domain scored≥2.

We further assessed the completeness of the CIRS score with regard to the guidelines18 based on available CHRIS data for each domain. Detailed information on the completeness assessment is presented in **Supplementary Text S3**.

#### Metabolite and protein quantification

The AbsoluteIDQ® p180 kit from Biocrates (Biocrates Life Sciences AG, Innsbruck, Austria) was used for metabolite quantification in fasting serum samples. Details on data generation, quality assessment and normalization are provided in Verri Hernandez et al. (2022)16. In total, concentrations of 175 metabolites and lipids were quantified. Due to the large number of missing values, sarcosine was excluded from the present analysis. An overview of the included metabolites is presented in the **Supplementary Table S5**. Prior to the main analysis all metabolite concentrations were log2 transformed.

The high abundance plasma proteome was determined using the Scanning SWATH mass spectrometry-based approach. Details on sample processing, data acquisition and normalization are provided in Dordevic et al. (2024)17. In total 148 highly abundant proteins were quantified and included in this analysis. An overview of the included proteins is presented in **Supplementary Table S6**.

Metabolite and protein abundances were adjusted for frequent medication use with a linear model based approach: models were fitted separately for each metabolite or protein with their abundance as response, and medication (as individual binary variables) as explanatory variables. Only medications taken at least twice per week and not considered for CIRS scoring were used. The residuals from these models were used to construct medication-independent abundances for the RF and subsequent regression analyses. **Supplementary Table S2** lists the medications for which abundances were adjusted.

### 4.4 Statistical analysis

Differences in characteristics were presented as mean (standard deviation, SD) for continuous variables and as percentages for categorical variables.

To investigate the relations between CIRS-domain specific morbidity (morbidity vs healthy), ordinary correspondence analysis (OCA) was performed using the rda function in the vegan package for R47.

#### Random forest

We used a random forest (RF) regression model to predict health status (any morbidity vs healthy), including as predictors age, sex, 174 metabolites and 148 proteins, respectively. Overall, 100 RF models were generated, each containing 500 trees per model, considering 80% training set sizes. Predictions were validated by 100 times stratified repeated random subsampling with 20% test set sizes, such that the ratio between the healthy and the any morbidity group of the original set sizes remained the same for the subsampled sets. We used the randomForest R package48 to perform the RF analysis.

Model performance was measured based on the receiver operating characteristic (ROC) area under the curve (AUC), the Matthew’s correlation coefficient (MCC), and the unit-normalized Matthews correlation coefficient (MCC-F1).

The Gini Index was used to measure feature importance. We further estimated the significance of importance metrics for the random forest models by permuting the response variable using the rfpermut*e* R package 49. This generates a background distribution and recalculates the 100 random forests, each with 500 trees. Usually, a background distribution is calculated only once; however, to increase the robustness of our results, we created 100 background distributions, yielding 100 *p*-values for each predictor variable. We considered associations with a predictor to be significant if ≥50 out of 100 test runs yielded a *p*-value<0.05.

To further investigate the influence of the significant features obtained from the RF analysis we additionally fitted multiple linear regression models to estimate the association between metabolites and proteins from the selected significant features, each implemented as the response variable, with health status as the explanatory variable and by further adjusting the models for age (years) and sex.

#### Investigation of omics-markers related to CIRS domains

To aid interpretation from the RF analysis, we further investigated the relation between metabolites, proteins and the specific CIRS domains using again multiple linear regression models with implementing all available metabolites and proteins (response variable) one by one with each CIRS domain (explanatory variable), adjusted for age and sex.

We adjusted all *p*-values for multiple hypothesis testing using the Bonferroni correction method (metabolites: 0.05/(14 CIRS domains*174 metabolites); proteins: 0.05/(14 CIRS domains*148 proteins)), thus an adjusted *p*-value <0.05 was considered statistically significant. For all linear regression analyses we additionally required the difference in abundance between categories to be ≥2* coefficient of variation (CV) for the given metabolite and >1*CV for the given protein to consider the markers significant. All analyses were conducted using the R statistical software, version 4.1.0 ([www.R-project.org](http://www.R-project.org)).

## Supporting information

Supplementary Information [[supplements/310410_file03.docx]](pending:yes)

Supplementary Text S3 [[supplements/310410_file04.xlsx]](pending:yes)

## Author contributions

All authors contributed to the study conception and design. Material preparation and data collection were performed by J.R, V.H, F.D, P.P, M.G, L.F, C.P, F.A, V.F, M.R, M.M and E.H. Formal analysis and investigation was performed by C.W, E.H, N.D, J.R and F.D. All authors contributed to the interpretation of the findings. The first draft of the manuscript was written by E.H, C.W, N.D, J.R and F.D and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

## Data availability

The data are not openly available due to reasons of sensitivity. CHRIS study data can be requested for research purposes by submitting a dedicated request to the CHRIS Access Committee (access.request.biomedicine{at}eurac.edu).

## Conflict of interest

The authors have no conflict of interest to declare.

## Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

## Financial Disclosure Statement

The CHRIS study was funded by the Department of Innovation, Research and University of the Autonomous Province of Bolzano-South Tyrol and supported by the European Regional Development Fund (FESR1157). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

## Acknowledgments

CHRIS study investigators thank all study participants, the Healthcare System of the Autonomous Province of Bolzano-South Tyrol, and all Eurac Research staff involved in the study. Bioresource Impact Factor Code: BRIF6107.

*   Received July 15, 2024.
*   Revision received July 15, 2024.
*   Accepted July 15, 2024.


*   © 2024, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/)

## References

1.  1.NCD Countdown 2030 collaborators. NCD Countdown 2030: worldwide trends in non-communicable disease mortality and progress towards Sustainable Development Goal target 3.4. Lancet Lond. Engl. 392, 1072–1088 (2018).
    
    
2.  2.Tchkonia, T. & Kirkland, J. L. Aging, Cell Senescence, and Chronic Disease: Emerging Therapeutic Strategies. JAMA 320, 1319 (2018).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2018.12440&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

3.  3.López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M. & Kroemer, G. Hallmarks of aging: An expanding universe. Cell 186, 243–278 (2023).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.CELL.2022.11.001&link_type=DOI) 

4.  4.Martin-Ruiz, C. & von Zglinicki, T. Biomarkers of healthy ageing: expectations and validation. Proc. Nutr. Soc. 73, 422–429 (2014).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

5.  5.Deelen, J. Targeting multimorbidity: Using healthspan and lifespan to identify biomarkers of ageing that pinpoint shared disease mechanisms. EBioMedicine 67, 103364 (2021).
    
    
6.  6.Menni, C. et al. Metabolomic markers reveal novel pathways of ageing and early development in human populations. Int. J. Epidemiol. 42, 1111–1119 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyt094&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23838602&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000325167800027&link_type=ISI) 

7.  7.Tanaka, T. et al. Plasma proteomic biomarker signature of age predicts health and life span. eLife 9, e61073 (2020).
    
    
8.  8.Chaleckis, R., Murakami, I., Takada, J., Kondoh, H. & Yanagida, M. Individual variability in human blood metabolites identifies age-related differences. Proc. Natl. Acad. Sci. U. S. A. 113, 4252–4259 (2016).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMToiMTEzLzE2LzQyNTIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNy8xNS8yMDI0LjA3LjE1LjI0MzEwNDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

9.  9.Johnson, C. H. & Gonzalez, F. J. Challenges and opportunities of metabolomics. J. Cell. Physiol. 227, 2975–2981 (2012).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jcp.24002&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22034100&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

10. 10.You, J. et al. Plasma proteomic profiles predict individual future health risk. Nat. Commun. 14, 7817 (2023).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-023-43575-7&link_type=DOI) 

11. 11.Gadd, D. A. et al. Blood protein assessment of leading incident diseases and mortality in the UK Biobank. Nat. Aging (2024) doi:10.1038/s43587-024-00655-7.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s43587-024-00655-7&link_type=DOI) 

12. 12.Pietzner, M. et al. Plasma metabolites to profile pathways in noncommunicable disease multimorbidity. Nat. Med. 27, 471–479 (2021).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

13. 13.Buergel, T. et al. Metabolomic profiles predict individual multidisease outcomes. Nat. Med. 28, 2309–2320 (2022).
    
    
14. 14.Babu, M. & Snyder, M. Multi-Omics Profiling for Health. Mol. Cell. Proteomics 22, 100561 (2023).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.mcpro.2023.100561&link_type=DOI) 

15. 15.Zierer, J. et al. Exploring the molecular basis of age-related disease comorbidities using a multi-omics graphical model. Sci. Rep. 6, 37646 (2016).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/srep37646&link_type=DOI) 

16. 16.Verri Hernandes, V., et al. Age, Sex, Body Mass Index, Diet and Menopause Related Metabolites in a Large Homogeneous Alpine Cohort. Metabolites 12, 205 (2022).
    
    
17. 17.Dordevic, N., et al. Hormonal Contraceptives Are Shaping the Human Plasma Proteome in a Large Population Cohort. [http://medrxiv.org/lookup/doi/10.1101/2023.10.11.23296871](http://medrxiv.org/lookup/doi/10.1101/2023.10.11.23296871) (2023) doi:10.1101/2023.10.11.23296871.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMy4xMC4xMS4yMzI5Njg3MXYzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMTUvMjAyNC4wNy4xNS4yNDMxMDQxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

18. 18.Salvi, F. et al. A manual of guidelines to score the modified cumulative illness rating scale and its validation in acute hospitalized elderly patients. J. Am. Geriatr. Soc. 56, 1926–1931 (2008).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1532-5415.2008.01935.x&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18811613&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000259900100023&link_type=ISI) 

19. 19.Menni, C. et al. Circulating Proteomic Signatures of Chronological Age. J. Gerontol. Ser. A 70, 809–816 (2015).
    
    
20. 20.Orwoll, E. S. et al. Proteomic assessment of serum biomarkers of longevity in older men. Aging Cell 19, (2020).
    
    
21. 21.Santos-Lozano, A. et al. Successful aging: insights from proteome analyses of healthy centenarians. Aging 12, 3502–3515 (2020).
    
    
22. 22.Zhang, W. et al. A population-based study of precision health assessments using multi-omics network-derived biological functional modules. Cell Rep. Med. 3, 100847 (2022).
    
    
23. 23.Fidalgo, S., Ivanov, D. K. & Wood, S. H. Serotonin: from top to bottom. Biogerontology 14, 21– 45 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10522-012-9406-3&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23100172&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

24. 24.Martin, A. M. et al. The Diverse Metabolic Roles of Peripheral Serotonin. Endocrinology 158, 1049–1063 (2017).
    
    
25. 25.Calder, P. C. et al. Health relevance of the modification of low grade inflammation in ageing (inflammageing) and the role of nutrition. Ageing Res. Rev. 40, 95–119 (2017).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.arr.2017.09.001&link_type=DOI) 

26. 26.Kanova, M. & Kohout, P. Tryptophan: A Unique Role in the Critically Ill. Int. J. Mol. Sci. 22, 11714 (2021).
    
    
27. 27.Lassen, J. K. et al. LARGEDSCALE metabolomics: Predicting biological age using 10,133 routine untargeted LC–MS measurements. Aging Cell (2023) doi:10.1111/acel.13813.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/acel.13813&link_type=DOI) 

28. 28.Calvani, R. et al. A Distinct Pattern of Circulating Amino Acids Characterizes Older Persons with Physical Frailty and Sarcopenia: Results from the BIOSPHERE Study. Nutrients 10, 1691 (2018).
    
    
29. 29.Maltais-Payette, I., Allam-Ndoul, B., Pérusse, L., Vohl, M.-C. & Tchernof, A. Circulating glutamate level as a potential biomarker for abdominal obesity and metabolic risk. Nutr. Metab. Cardiovasc. Dis. 29, 1353–1360 (2019).
    
    
30. 30.Hawkins, R. A. The blood-brain barrier and glutamate. Am. J. Clin. Nutr. 90, 867S–874S (2009).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiYWpjbiI7czo1OiJyZXNpZCI7czo5OiI5MC8zLzg2N1MiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNy8xNS8yMDI0LjA3LjE1LjI0MzEwNDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

31. 31.Brosnan, J. T. Glutamate, at the Interface between Amino Acid and Carbohydrate Metabolism. J. Nutr. 130, 988S–990S (2000).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10736367&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000086274200020&link_type=ISI) 

32. 32.Kelly, A. & Stanley, C. A. Disorders of glutamate metabolism. Ment. Retard. Dev. Disabil. Res. Rev. 7, 287–295 (2001).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mrdd.1040&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11754524&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000172552800010&link_type=ISI) 

33. 33.Treberg, J. R., Banh, S., Pandey, U. & Weihrauch, D. Intertissue differences for the role of glutamate dehydrogenase in metabolism. Neurochem. Res. 39, 516–526 (2014).
    
    
34. 34.Canfield, C.-A. & Bradshaw, P. C. Amino acids in the regulation of aging and aging-related diseases. Transl. Med. Aging 3, 70–89 (2019).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.tma.2019.09.001&link_type=DOI) 

35. 35.Nassar, K. et al. The significance of caloric restriction mimetics as anti-aging drugs. Biochem. Biophys. Res. Commun. 692, 149354 (2024).
    
    
36. 36.Kurhaluk, N. Tricarboxylic Acid Cycle Intermediates and Individual Ageing. Biomolecules 14, 260 (2024).
    
    
37. 37.Guasch-Ferré, M. et al. Metabolomics in Prediabetes and Diabetes: A Systematic Review and Meta-analysis. Diabetes Care 39, 833–846 (2016).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czo4OiIzOS81LzgzMyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzE1LzIwMjQuMDcuMTUuMjQzMTA0MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

38. 38.Liu, M. et al. Cardiovascular disease and its relationship with chronic kidney disease. Eur. Rev. Med. Pharmacol. Sci. 18, 2918–2926 (2014).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25339487&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 

39. 39.Cai, X. et al. Population serum proteomics uncovers a prognostic protein classifier for metabolic syndrome. Cell Rep. Med. 4, 101172 (2023).
    
    
40. 40.Coan, P. M. et al. Complement Factor B Is a Determinant of Both Metabolic and Cardiovascular Features of Metabolic Syndrome. Hypertension 70, 624–633 (2017).
    
    
41. 41.Kollerits, B. et al. Plasma Concentrations of Afamin Are Associated With Prevalent and Incident Type 2 Diabetes: A Pooled Analysis in More Than 20,000 Individuals. Diabetes Care 40, 1386– 1393 (2017).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czoxMDoiNDAvMTAvMTM4NiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzE1LzIwMjQuMDcuMTUuMjQzMTA0MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

42. 42.de Groot, V., Beckerman, H., Lankhorst, G. & Bouter, L. How to measure comorbiditya critical review of available methods. J. Clin. Epidemiol. 56, 221–229 (2003).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0895-4356(02)00585-1&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12725876&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000182631700003&link_type=ISI) 

43. 43.Dash, P., Mohapatra, S. R. & Pati, S. Metabolomics of Multimorbidity: Could It Be the Quo Vadis? Front. Mol. Biosci. 9, 848971 (2022).
    
    
44. 44.Pattaro, C. et al. The Cooperative Health Research in South Tyrol (CHRIS) study: rationale, objectives, and preliminary results. J. Transl. Med. 13, 348 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12967-015-0704-9&link_type=DOI) 

45. 45.WHO Collaborating Centre for Drug Statistics Methodology, ATC classification index with DDDs, 2022. Oslo, Norway 2021.
    
    
46. 46.Linn, B. S., Linn, M. W. & Gurel, L. Cumulative illness rating scale. J. Am. Geriatr. Soc. 16, 622–626 (1968).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1532-5415.1968.tb02103.x&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=5646906&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F15%2F2024.07.15.24310410.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1968B134400016&link_type=ISI) 

47. 47.Oksanen J, Simpson G, Blanchet F, Kindt R, Legendre P, Minchin P, O’Hara R, Solymos P, Stevens M, Szoecs E, Wagner H, Barbour M, Bedward M, Bolker B, Borcard D, Carvalho G, Chirico M, De Caceres M, Durand S, Evangelista H, FitzJohn R, Friendly M, Furneaux B, Hannigan G, Hill M, Lahti L, McGlinn D, Ouellette M, Ribeiro Cunha E, Smith T, Stier A, Ter, & Braak C, Weedon J. \_vegan: Community Ecology Package\_. R package version 2.6-4. (2022).
    
    
48. 48.Liaw A, Wiener M. Classification and Regression by randomForest. R News, 2(3), 18–22. [https://CRAN.R-project.org/doc/Rnews/](https://CRAN.R-project.org/doc/Rnews/). (2002).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000323281&link_type=DOI) 

49. 49.Archer, E. rfPermute: Estimate Permutation p-Values for Random Forest Importance Metrics. 2.5.2 doi:10.32614/CRAN.package.rfPermute (2011).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.32614/CRAN.package.rfPermute&link_type=DOI)