Abstract
Background Iron overload has been proposed as a risk factor for atherosclerosis but the available data are controversial. Here we investigated whether iron biomarkers within physiological limits that frequently differ between sexes are associated with peripheral arterial disease (PAD)
Methods Using two different analytical approaches (machine-learning and logistic regression), we studied the association between blood iron biomarkers and PAD in 368 individuals from the Heidelberg Study on Diabetes and Complications (HEIST-DiC) and in 5101 individuals from the National Health and Nutrition Examination Survey (NHANES 1999-2004).
Results We found that iron biomarkers were among the top predictors of PAD in the machine-learning classification in both cohorts. In the HEIST-DiC cohort, ferritin, iron, and transferrin were ranked among the top predictive markers, while in the NHANES cohort ferritin, total iron binding capacity (TIBC), transferrin saturation (TSAT) and iron showed high predictive power. In the regression analysis ferritin showed a positive interaction among females in the HEIST-DiC (OR 2.68, 95% CI 0.94-7.61, p=0.057) and NHANES cohorts (OR 1.76, 95% CI 1.16-2.67, p=0.008). The multivariable regression analysis of the NHANES cohort, detected a nonlinear relationship between ferritin and PAD in that certain ferritin ranges (48-97 ng/mL: OR 14.59, 95% CI 1.6-135.93, p= 0.019; 98-169 ng/mL: OR 171.07, 95% CI 1.27-23404, p=0.039) in females associated positively with PAD.
Conclusion Taken together our data show that iron biomarkers, importantly, elevated ferritin is associated with clinically apparent PAD in females. These findings add to the body of evidence suggesting sex differences in PAD and highlight a possible role of iron (directly or indirectly) in this relationship.
Introduction
Peripheral arterial disease (PAD) is a common atherosclerotic vascular disorder of the arteries of the lower limbs that affects millions of individuals worldwide1,2. Compared to cardiovascular and cerebrovascular disease, PAD is less well studied, and our understanding of the disease process is still poor. The known risk factors of PAD include old age, smoking, and comorbidities such as diabetes, hypertension or Vitamin D excess1.
Iron is an essential micronutrient, however, its accumulation in excess contributes to the generation of reactive oxygen species. Various regulatory systems maintain iron homeostasis by controlling absorption, transport, and storage3. Under physiological conditions, iron is bound to transferrin in the plasma, and transported to cells that require iron. Usually, less than one-third of transferrin’s binding sites are occupied by iron, allowing free iron entering the circulation to be efficiently sequestered. However, in pathological states associated with iron overload, the binding capacity of transferrin is nearly saturated resulting in the appearance of ‘non-transferrin-bound iron (NTBI)’. Therefore, iron overload is discussed as a risk factor for the development of atherosclerosis4. Accordingly, inhibition of ferroptosis, an iron-dependent form of cell death, has been shown to protect from atherosclerosis5. Iron and PAD may be linked by sex hormones. Changes in estrogen and progesterone levels associated with menopause were shown to impair endothelial function, especially among women6,7. Both hormones further control hepcidin, the master regulator of iron homeostasis in opposing directions, with estrogen8 inhibiting and progesterone9 activating hepcidin expression.
In our previous work, we demonstrated that iron overload in mouse models and humans with hemochromatosis show an aggravation of atherosclerosis and vessel damage, respectively10. Specifically, we found that NTBI in the circulation contributes to atherosclerosis by oxidising low-density lipoproteins (LDL) and by inducing vascular dysfunction; in addition, iron deposition in the vasculature of hemochromatotic mice was associated with more severe atherosclerosis10. Whether or not iron levels within physiological limits affect atherogenesis is currently unclear and available data are controversial. Most studies undertaken so far have investigated the relationships between iron-related parameters [e.g. plasma iron, ferritin, transferrin, transferrin saturation (TSAT)] and cardiovascular disease11–17 but associations with PAD are rarely addressed.
To address this, we analyzed iron biomarkers for their associations with PAD and by considering sex-specific differences. We used two different datasets and orthogonal approaches: 1) machine learning (ML) classification and 2) logistic regression. Both methods can be used for analyzing relationships between variables, but they differ in their approach and application. ML classification tasks can analyze complex patterns in multidimensional data and identify nonlinear interactions among variables. They are particularly useful when manual specification of relationships is challenging as is the case for human clinical data. In contrast, regression analysis quantifies the strength and direction of associations between variables and provides interpretable results in terms of odds ratios. Both approaches carry their strengths and limitations. However, we expect that true associations in the data (in this case, sex-specific differential associations between iron biomarkers and PAD) should be detected by both methods. Therefore, we initially used the ML approach to provide unbiased insight into our data and subsequently regression analyses to validate identified relationship. An outline of the study is shown in Figure 1.
Methods
HEIST-DiC study
The Heidelberg Study on Diabetes and Complications (HEIST-DiC; https://clinicaltrials.gov; NCT03022721) is a prospective, observational study designed to analyze the development of complications in individuals with diabetes and prediabetes. We analyzed the data from 368 individuals enrolled in the HEIST-DiC study. The selection criteria for participants have been described previously18. We excluded participants with values of serum iron biomarkers suggestive of hemochromatosis19. In addition, since ferritin not only indicates iron stores, but also is an acute phase protein that responds to inflammation, we excluded participants with ferritin> 400 ng/mL. For all participants, a detailed medical history was obtained. The ankle-brachial index (ABI) was measured using noninvasive blood pressure measurements of arms and ankles (ABI System 1000; Boso d.o.o.). Blood samples were obtained in the fasting state and all baseline parameters were analyzed under standardized conditions in the central laboratory of the University Hospital of Heidelberg. Serum samples were stored at −80C until further analysis. We analyzed iron, ferritin, and TSAT as biomarkers of systemic iron status, and assayed serum intercellular adhesion molecule-1 (ICAM1) and vascular-cell adhesion molecule-1 (VCAM1) as markers of vascular dysfunction. We measured ICAM1 (#171B6009M) and VCAM1 (#171B6022M) in serum using multiplex magnetic bead-array-based technology on a BioPlex200 system (Bio-Rad) according to the manufacturer’s instructions. All participants provided written informed consent, and the ethics committees of the University of Heidelberg approved this study (Decision No. 204/2004, 400/2010, and S-383/2016) according to the Declaration of Helsinki.
NHANES
The National Health and Nutrition Examination Survey (NHANES) is designed to assess the health and nutritional status of adults and children in the United States. The survey includes both an interview (in which participants are asked questions about their health and lifestyle), and a physical examination component (in which various measurements and samples are collected). We obtained data from 15,969 participants over 40 years in 3 consecutive cycles of NHANES (1999-2004) for whom ABI was measured.
The demographic variables, age (in years) and body mass index (BMI) were used as is. Smoking and alcohol were categorized according to a previous study20. We further categorized diabetes based on self-reported history, current use of insulin or oral hypoglycemic agents, fasting plasma glucose (FPG) ≥7.0 mM, or glycated hemoglobin (HbA1c) ≥6.5%. Prediabetes was defined among those with an absent self-reported history of diabetes by either FPG 5.6-6.9 mM or HbA1c 5.7-6.4%. We categorized hypertension based on self-reported history or current use of anti-hypertensive agents or systolic blood pressure >130 mmHg or diastolic blood pressure >80 mmHg. The presence of nonalcoholic fatty liver disease (NAFLD) was categorized by calculating the Fibrosis-4 Index score (FIB-4 score>3.25)21. As an index of insulin resistance, we calculated HOMA2-IR using C-peptide values instead of insulin22. The use of antiplatelet and antihyperlipidemic medications was categorized based on medication intake.
We applied the following exclusion criteria: not even one ABI measurement available, likely hemochromatosis19, anemia (hemoglobin <10g/L), elevated ferritin (>400 ng/mL), cancer, on current hormonal medications or iron supplements, donated blood in the last 1 month, with features of kidney disease (on dialysis or severe albuminuria), pregnant or breastfeeding in the last year, high risk of liver fibrosis, concurrent inflammation [C-Reactive Protein (CRP) >3 mg/dL] and with a feature of familial hyperlipidemia [LDL-Cholesterol (LDLc) >4.9 mM]. This resulted in a final sample number of 5101 participants. We obtained data from the following iron biomarkers: iron, ferritin, total iron binding capacity (TIBC), and TSAT. Assays for the analytes in NHANES were performed under strict laboratory protocols (for more details on data fields and procedure manuals, see Supplementary Table 1).
Categorization as PAD
In both cohorts, ABI values were measured on both the right and left limbs. We categorized PAD when the ABI value on one of the sides was <0.9 or >1.4 as per recommendations23.
Implementation of the ML framework
First, we preprocessed the HEIST-DiC and the NHANES dataset and removed all features that contained more than 10% of missing values. Additionally, to made both cohorts comparable by only considering the cases that did not have missing values for ferritin, diabetes, CRP and high blood pressure in the NHANES cohort. For the HEIST-DiC dataset, we have used the leave-one-out split strategy, and for each fold, we filled in the missing values using the average of each column (SimpleImputer method, from Scikit-learn). Moreover, we used SMOTE (v0.12.0)24 to oversample the minority class (PAD), to match the majority class (No PAD). For the NHANES dataset, we used the 10-fold cross-validation strategy, and for each fold, we filled the missing values and oversampled the minority class using the same methods, and performed the so-called one-hot encoding, where we encoded the categorical variables into numerical values. Next, for each dataset separately, we trained the XGBoost classifier (v2.0.3)25 and predicted the class of the instances on the test set, from where we derived the balanced accuracy, the F1 Score, and quantified the feature importance. Finally, we also implemented a version of the ML framework that allowed us to assess the variable importance using SHAP (SHapley Additive exPlanations, v0.44.0).26 Here, we performed the same steps for both HEIST and NHANES datasets, except that we did not use any split strategy; instead, we filled the missing values, oversampled the minority class and trained the classifier using the complete dataset. The ML pipeline is shown in Figure 2A.
Regression analysis
For the regression analysis, we first tested the interaction effect of each of the iron biomarkers with sex in both cohorts as iron metabolism is hallmarked by sex-specific differences. Further, in the NHANES cohort, we conducted multivariable regression analyses by flexibly modelling the probability of PAD’s presence with one of the iron biomarkers (iron, ferritin, or TSAT) by spline regression (degree of freedom=4). We chose variables from the feature importance in ML analysis as covariates in the regression models on the NHANES data: age, CRP, HDLc, triacylglycerols, HbA1c, ACR, BMI, ethnicity (non-Hispanic whites as the reference), smoking (non-smokers as reference), the presence of diabetes, AST, ALT, Hb, creatinine, antihyperlipidemic medications; the former seven variables as continuous and the rest as categorical). All continuous variables were mean-centered for the analysis, and variables were included in the models only when their missing data was <10%. In addition, sample weights were included in the analyses of NHANES data.
All regression analyses were performed using R Statistical Software (v4.2.227). We used Python (v3.10.1228), and Scikit-learn (v1.4.029) for the implementation of the ML framework. The reporting of the study is as per The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines30. We have also developed a data exploration tool for readers to explore the datasets and interactively create their visualizations from the paper (https://anandr.shinyapps.io/NHANES_HEISTDiC/) using Quarto31 and Shiny32.
Results
Demographics
The baseline characteristics of the HEIST-DiC participants are shown in Table 1. The overall prevalence of PAD was 6.6% with a higher proportion among males (17/192) than females (6/165). Among both sexes, the participants with PAD were older and diabetes and hypertension was a common feature (Table 1). The sICAM1 and sVCAM1 levels were significantly higher only among males with PAD (Table 1). Among the iron biomarkers, ferritin was elevated among females, but not males, with PAD (Median 141, IQR 93-254, p=0.035) while iron, transferrin, and TSAT were similar between HEIST-DIC participants with and without PAD. (Table 1).
The analysis of NHANES builds on a previous report33 by including data from the NHANES 2003-2004 cycle. The baseline characteristics of the NHANES participants included in the study are shown in Table 2. Overall, the prevalence of PAD in the population was 6.25% with a significantly higher prevalence among females (males: 4.96%, females: 7.9%; p=0.001). Individuals with PAD showed differences in ethnicity, sex, presence of diabetes, smoking, alcohol status, physical activity, and use of antihyperlipidemic and antiplatelet medications (Table 2). Among the iron biomarkers, serum iron (Median 80, IQR 64-108 μg/dL; p<0.001) and transferrin saturation (Median 24, IQR 17-30; p=0.021) were reduced in males with PAD while females with PAD showed higher ferritin (Median 87, IQR 56-252 ng/mL; p<0.001) and lower TIBC (Median 60, IQR 54-68; p<0.001). Differences were also observed in some laboratory parameters within both sexes with PAD (e.g. albumin, AST, urea, creatinine, CRP, and ACR; Table 2).
Establishing a machine learning predictive framework for PAD
We next aimed to establish a predictive ML framework to anticipate the occurrence of PAD based on biomarkers. A so-called ML classifier is trained on a dataset containing instances with known outcomes (i.e., the presence or absence of PAD), and its relationship with input features. During the test phase, the trained classifier is used to predict PAD in participants unseen during the training phase, where we purposefully hid the true diagnostic. The result is then compared to the real diagnosis, and the quality of the framework is evaluated. Overall, this process enables the identification of subtle correlations between variables that might be imperceptible to human analysis, but that can be used to anticipate PAD and enhance our diagnostic capabilities.
Our framework received the available molecular and physiological biomarkers from participants from two distinct cohorts, HEIST-DiC and NHANES as input. These cohorts have unique characteristics that require the development of separate data preprocessing, training, and testing regimens. First, the HEIST-DiC cohort comprised 357 instances (23 PAD and 334 No PAD cases) and had 35 numerical and categorical features. Here, for the training and testing of the ML framework, we used a leave-one-out split methodology. This approach involves using each instance as a test case once, while the remainder forms the training set, thus maximizing the use of available data for training. In contrast, the NHANES cohort was considerably larger, with 1188 instances (100 PAD and 1088 No PAD), and 28 features; hence, we used the 10-fold cross-validation strategy. This method divides the dataset into ten parts, with each part successively used for testing while the other 9 parts are used for training. Together, these strategies ensured robustness and reliability in our model’s predictive performance.
We found that the ML framework achieved a balanced accuracy of 95% for the HEIST cohort (Figure 2B-D) and 96% for the NHANES cohort (Figure 2E-G). Balanced accuracy is an important metric in the context of imbalanced datasets, where one of the classes has many times more instances than the other (in the case of HEIST-DiC, No PAD cases were 14 times more frequent, and in NHANES, 10 times). Thus, the balanced accuracy represents the average accuracy obtained on both classes (PAD and No PAD), which in turn ensures that our model is equally adept at identifying both conditions.
Moreover, the ML framework also achieved high F1 scores of 0.89 for HEIST-DiC and 0.93 for NHANES, pointing to a strong balance between precision (the proportion of true positive results in all positive predictions), and recall (the proportion of true positive results in all actual positives). Thus, the F1 score in combination with the balanced accuracy provides realistic measures of a model’s strength, particularly in datasets where one class might significantly outnumber the other. In summary, the ML framework was effective in predicting the occurrence of PAD across different cohorts, even in the face of data imbalance. Further, sex and iron biomarkers were ranked among the top predictive markers for the presence of PAD in the HEIST-DiC (ferritin, iron and transferrin) and the NHANES cohorts (ferritin, TIBC, TSAT and iron; Supplementary Figure 2; Figure 3A-B).
Regression analyses
We next performed regression analyses on the HEIST-DiC data and tested the interaction effect of each of the iron biomarkers with sex and found that ferritin showed a positive association in females with PAD (Ferritin*Sex, OR 2.68, 95% CI 0.94-7.61, p=0.057) while, iron (Iron*Sex, OR 1.17, 95% CI 0.44-2.96, p=0.7), transferrin (Transferrin*Sex, OR 1.28, 95% CI 0.48-3.17, p=0.6) and TSAT (TSAT*Sex, OR 1.1, 95% CI 0.35-3.2, p=0.9) were non-significant. Consistent with our observations in the HEIST-DiC cohort, we found that ferritin also showed a significant interaction with sex in the NHANES cohort (Ferritin*Sex, OR 1.76, 95% CI 1.16-2.67, p=0.008). TIBC also had a significant negative interaction (TIBC*Sex, OR 0.61, 95% CI 0.4-0.92, p=0.018), while iron (Iron*Sex, OR 0.95, 95% CI 0.69-1.31, p=0.76) and TSAT (TSAT*Sex, OR 1.15, 95% CI 0.84-1.58, p=0.37) were nonsignificant.
In the multivariable regression analyses of the NHANES data, we found that, ferritin or sex alone do not show a statistically significant relationship with PAD. On the other hand, the interaction between ferritin and sex reveals a significant positive association for two spline components (Table 3). Specifically, ferritin showed positive and statistically significant effects in females with two positions (48-97 ng/mL: OR 14.59, 95% CI 1.6-135.93, p= 0.019; 98-169 ng/mL: OR 171.07, 95% CI 1.27-23404, p= 0.039); this suggests that the relationship between PAD and ferritin is nonlinear and that within high ferritin ranges females show a positive association with PAD. Other significant covariates include age, CRP, and smoking, all of which also positively impact the outcome. By contrast, no associations were significant for iron and TSAT (Supplementary Table 2).
Discussion
Our study provides a comparison between ML and regression analyses to demonstrate the association between iron biomarkers in the context of PAD. Taken together, our findings from both analyses show that serum iron biomarkers are associated with PAD, with ferritin demonstrating a consistent association specifically in females with the two methodologies applied.
The positive association for higher serum ferritin among females with PAD strengthens previous observations on the NHANES 1999-2001 cohort33. When this observation is compared to other related disease contexts (e.g. carotid/coronary atherosclerosis/cardiovascular disease), ferritin’s positive association agrees with some studies11,15,16, but not others34–36. Our findings also agree with The Iron and Atherosclerosis Study (FeAST; https://clinicaltrials.gov, NCT00032357), a major trial that investigated the benefits of iron reduction on the clinical outcomes in a cohort of individuals with PAD37. While the initial analysis of the FeAST trial did not report a favorable outcome following phlebotomy37, the reanalysis of the results adjusting for the effects of age38 and smoking39 have suggested that iron reduction (with ferritin and TSAT as endpoints) could be of significant benefit for younger individuals and smokers. However, the FeAST trial did not contain sufficient participants to study sex-specific differences. Our study also advances current knowledge. We extend the analyses by Menke et al33, by including additional participants from the NHANES 2003-2004 cycle in the analysis. In contrast to the FeAST trial37, we have included participants with prediabetes, adjusted more stringently for inflammation [excluded samples with features of inflammation (ferritin>400 ng/mL or CRP>3 mg/dL) and used CRP as a covariate] and increased statistical power to study sex-specific differences.
If the high ferritin values observed in our study were solely interpreted as an indicator of body iron stores, the data suggest that there is an increased risk of PAD at the upper end of, what is clinically accepted as “physiological range”. However, ferritin is also an acute-phase protein and inflammation influences most iron parameters. During inflammatory states, iron export via ferroportin from macrophages and duodenal enterocytes is reduced by hepcidin-controlled40 and transcriptional mechanisms41,42. Consequently, macrophages become iron loaded contributing to elevated circulating ferritin levels and hypoferremia (low iron levels in the plasma)43. Therefore, an alternative interpretation of our findings is that elevated ferritin may be an epiphenomenon secondary to inflammation. In addition, the presence of microinflammation in tissues cannot be completely excluded from the available data.
To summarize, our study adds to the body of evidence suggesting sex-specific differences in PAD44 and highlights a possible role of iron (directly or indirectly) in this relationship. We have partly controlled for hormonal influences by excluding NHANES participants who are currently pregnant, menstruating, lactating, or on hormonal contraceptives, but the hormone levels were not available to be included as covariates. It is further possible that additional sex-specific mechanisms are in play.
In both cohorts analyzed, we observed a similar prevalence of PAD (∼6%) associated with well-known risk factors (old age, diabetes, and hypertension). Markers related to metabolic and vascular health, such as blood pressure, plasma glucose levels and CRP, were elevated among PAD participants in both cohorts. Additionally, BMI was higher in PAD participants compared to non-PAD participants in HEIST-DiC, while NHANES shows more consistent BMI values across groups, with a minor increase only in females with PAD. It is important to note that the cohorts analyzed differed in that the HEIST-DiC cohort consisted of mainly European individuals with diabetes and prediabetes from a hospital-based study, while NHANES is a survey cohort, representative of the US population, and therefore contains a mix of ethnicities.
The strength of our study lies in the analyses of these two independent cohorts by orthogonal approaches, allowing for a generalization of the findings, an increased robustness of the data, and a reduction of cohort-specific biases. The examination of multiple iron biomarkers, including serum iron, ferritin, TSAT, and TIBC provides a comprehensive analysis of the relationship between iron and clinically apparent PAD. To the best of our knowledge, this is the first study to evaluate the relationship between iron biomarkers and PAD in prediabetes and diabetes. Our approach also underscores the complementary nature of ML and regression analyses: while ML excels at predictive accuracy and feature selection, regression analyses offer insights into interaction effects and the nuanced relationship between predictors and outcomes. Therefore, integrating both methodologies can leverage the strengths of each approach to provide a comprehensive understanding of complex disease dynamics. Limitations include the cross-sectional nature of the study which limits causal inferences and precludes the assessment of temporal relationships. Further, we were unable to control for all possible confounding factors, such as genetic predisposition, lifestyle, medications, or other comorbidities. The study described here also did not investigate potential mechanisms underlying the observed associations.
Conclusion
Overall, our data show that elevated ferritin within the physiological range is associated with clinically apparent PAD in females but not males. Further studies are necessary to confirm whether the observed associations are indeed causal and to explore the potential mechanisms underlying this relationship. Our findings need to be considered explorative to encourage the investigation of potential biological pathways underlying the sex-specific differences in iron’s influence on atherosclerosis.
Data Availability
The data from this study can be explored here: https://anandr.shinyapps.io/NHANES-HEISTDiC/. The HEIST-DiC dataset analyzed in the current study is available from the corresponding authors on reasonable request. The NHANES dataset is publicly available. Codes used for data analysis and visualization are accessible here (https://github.com/griffindoc/pad)
Ethics statement
The ethics committees of the University of Heidelberg approved this study (Decision No. 204/2004, 400/2010, and S-383/2016) according to the Declaration of Helsinki. All participants entered the study according to the guidelines of the local ethics committees following written informed consent to participate.
Funding
The HEIST-DiC study was funded by the Deutsche Forschungsgemeinschaft (SFB1118). MUM acknowledges funding from the Deutsche Forschungsgemeinschaft (DFG) (SFB1118; GRK 2727; FerrOs - FOR5146; Ferroptosis SPP2306: Project No.461704553) and from the Federal Ministry of Education and Research (NephrESA Nr 031L0191C; Translational Lung Research Center Heidelberg (TLRC-H), German Center for Lung Research (DZL); Project No. FKZ 82DZL004A1).
Author contributions
Anand Ruban Agarvas: Conceptualization, Methodology, Software, Formal analysis, Investigation, Data curation, Writing- Original draft, Visualization
Stefan Kopf: Investigation, Resources, Writing - Review & Editing
Tiago J.S Lopes: Formal analysis, Investigation, Writing- Original draft, Visualization
Paul Thalmann: Validation, Writing - Review & Editing
José Manuel Fernández-Real: Methodology, Writing - Review & Editing, Supervision
Peter Nawroth: Project administration, Funding acquisition, Supervision, Writing - Review & Editing
Martina U. Muckenthaler: Conceptualization, Methodology, Supervision, Writing - Review & Editing, Project administration, Funding acquisition
Conflict of interest
None to declare
Data and Code Availability Statement
The data from this study can be explored here: https://anandr.shinyapps.io/NHANES_HEISTDiC/. The HEIST-DiC dataset analyzed in the current study is available from the corresponding authors on reasonable request. The NHANES dataset is publicly available. Codes used for data analysis and visualization are accessible here (https://github.com/griffindoc/pad)
Supplementary files
Supplementary Table 1. Data fields and variables from NHANES 1999-2004.
The table shows the variables and the corresponding codes used for accessing data from the NHANES. The hyperlinks to the procedure manuals and datafiles are also provided in the table.
Supplementary Table 2. Multivariable regression analysis of iron biomarkers and PAD in the NHANES Cohort
We conducted multivariable regression analyses by flexibly modelling the probability of PAD’s presence with iron biomarkers (iron or TSAT) by spline regression (degree of freedom=4). The following variables were included aas covariates: age, CRP, HDLc, triacylglycerols, HbA1c, ACR, BMI, ethnicity (non-Hispanic whites as the reference), smoking (non-smokers as reference), the presence of diabetes, AST, ALT, Hb, creatinine, antihyperlipidemic medications; the former seven variables as continuous and the rest as categorical). All continuous variables were mean-centered for the analysis. In addition, sample weights were included in the analyses
BMI: Body Mass Index, HDL: High-Density Lipoprotein, CRP: High sensitivity C-reactive protein, AST: Aspartate Aminotransferase, ALT: Alanine Aminotransferase; ACR: Albumin-creatinine ratio, OR: Odds Ratio, 95% CI: 95% Confidence Interval, SE: Standard Error
Supplementary Figure 1. Feature importance of xGBoost classification
(A-B) We used the XGBoost classifier and its output to quantify the importance of the features to the prediction of PAD occurrence. In both panels, higher values indicate a higher contribution of a given feature, and iron-related markers are color coded, and appear among the top features in terms of relevance to the prediction of this condition.
Footnotes
Data has been reanalysed using machine learning; Methods and results portions updated. Figures removed and replaced by regression output tables; Supplemental files updated; authors list updated.