Abstract
Background Performance and portability of contemporary polygenic risk scores (PRS) for atherosclerotic cardiovascular disease (ASCVD) phenotypes vary based on different methods, training data, and trait ascertainment.
Objectives We aimed to investigate performance and portability of contemporary PRS for ASCVD subtypes: coronary heart disease (CHD), abdominal aortic aneurysm (AAA), ischemic stroke (IS), and peripheral artery disease (PAD), using the All of Us Workbench which provides access to a large diverse cohort with phenotype and whole genome sequence data. We also developed and evaluated a multi-trait PRS for each subtype.
Methods Performance of PRS for 4 ASCVD traits and related risk factors was compared across genetic ancestry groups in 245,388 All of Us participants. Genetic EUR, African (AFR), Admixed American (AMR), and remaining ancestry groups (combined as Other, OTH) were defined by All of Us based on principal components. PRS for CHD, IS, AAA, PAD, and multi-trait (combining PRS for the 4 traits as well as PRS for ASCVD risk factors) were assessed for portability across genetic ancestry groups using hazard ratios (HR) per SD increase.
Results For CHD, CHDPGS003725 performed the best (HR for 1 SD increase [95% CI]), across 4 genetic ancestry groups (EUR: 1.72[1.67-1.78], AFR: 1.24[1.18-1.31], AMR: 1.48[1.37-1.59], OTH: 1.65[1.52-1.79]). The best performing PRS for AAA was AAAPGS003972 (EUR: 1.68[1.59-1.78], AFR: 1.29[1.13-1.48], AMR: 1.30[1.06-1.60], OTH: 1.45[1.20-1.75]). The best performing IS PRS was ISPGS000039 in AFR (1.12[1.06-1.17]), AMR (1.11[1.04-1.19]), and OTH (1.23[1.09-1.38]), and ISPGS004939 in EUR (1.16[1.12-1.20]). For PAD, PADPGS004940 performed best in EUR (1.26[1.22-1.30]), AFR (1.11[1.05-1.18]), AMR (1.08[1.01-1.16]), and OTH (1.13[1.04-1.22]). Multi-trait PRS performed better than individual trait PRS for each ASCVD phenotype. Also, PRS derived from multi-ancestry cohorts performed better than those derived from single ancestry.
Conclusions PRS for ASCVD developed from multi-ancestry cohorts and multiple related traits performed best across ancestrally diverse and admixed individuals. PRS for CHD and AAA performed better than those for IS and PAD.
Background
Atherosclerotic cardiovascular disease (ASCVD) is the leading cause of mortality globally1,2 and includes four subtypes: coronary heart disease (CHD), abdominal aortic aneurysm (AAA), ischemic stroke (IS), and peripheral artery disease (PAD)3. PRS for predicting risk of ASCVD subtypes3,4 are derived from genome-wide association studies (GWAS) of primarily European genetic ancestry (EUR) individuals. Performance of PRS depend on genetic architecture of phenotypes, size, heterogeneity of development and training cohorts, as well as quality of summary statistics5,6. Newer PRS aim to include larger GWAS datasets and improve cross-ancestry prediction, resulting in a need to benchmark methods to compare performance in independent and diverse cohorts. In addition, recent studies suggest that linearly combining PRS for multiple related traits as well as training on multi-ancestry data improves prediction7.
Herein, we validate and benchmark available PRS for ASCVD phenotypes (i.e., PRS for CHD (PRSCHD), PRS for AAA (PRSAAA), PRS for IS (PRSIS), and PRS for PAD (PRSPAD)) in major genetic ancestry populations and explore the utility of newly developed multi-trait and multi-ancestry PRS for each subtype by leveraging data available in the AoU cohort. The All of Us (AoU) Researcher Workbench includes short read whole genome sequence data for 245,388 individuals8 and corresponding electronic health record (EHR) data9. In addition to information regarding ASCVD phenotypes, risk factor data is available for individuals genetically defined as EUR, Middle Eastern (MID), African (AFR), Admixed American (AMR), East Asian (EAS), and South Asian (SAS) ancestry based on principal component information.
We hypothesized that multi-trait PRS would perform better for all ASCVD subtypes given shared genetic architecture of the phenotypes. We evaluated PRS performance portability across genetic ancestry groups10.
Methods
Data Availability
The general workflow of the study is displayed in Figure 1. Additional information is provided in Supplemental Files (Tables S1-4, Figures S1-7)3,7,8,11–21. This study was approved by Mayo Clinic IRB application 10-00278. PRS chosen for validation were publicly available through the PGS Catalog (https://www.pgscatalog.org/), except for two internally developed PRS which will be published on PGS Catalog upon publication (i.e., PGS004939, PGS004940). We used data from the Researcher Workbench for All of Us Research Programs Controlled Tier Dataset version 7 and requests for access to data should be made directly with the All of Us Research Program.
Genotype, Phenotype, and Genetic Ancestry Ascertainment
The All of Us Researcher Workbench (v7) genomic data released in March 2023 include 245,388 individuals 8. We utilized EHR data to ascertain cases and controls for ASCVD subtypes as well as demographic and conventional risk factors. Incident cases for ASCVD subtypes were defined as individuals diagnosed after 6 months since entry into the EHR using algorithms published on pheKB and validated by the eMERGE Network11 (https://phekb.org/network-associations/emerge). Controls for ASCVD phenotypes were defined as having no ASCVD diagnosis to date (Table 1).
Analyses were restricted to adults (18 years of age or older at first EHR record) and individuals with differing sex at birth and gender designations were excluded (Figure S1). Type 2 Diabetes (T2D), systolic blood pressure (SBP), low-density lipoprotein cholesterol (LDL-C), antihypertensives, and statin use were determined at the most recent data entry before the median time to disease diagnosis for each ancestry group (Table S1) in the control group, or at the time of disease diagnosis for cases12.
Genetic ancestry was predetermined by AoU based on principal component analysis and random forest classification using the 1000 Genomes and GNOMAD data. Groups including AFR, AMR, EAS, EUR, MID, SAS, and Other (OTH) were defined in the “ancestry_preds_other” file. We analyzed genetically classified AFR, AMR, and EUR individuals, with the remaining individuals classified as OTH (i.e., highly admixed individuals or those belonging to genetic ancestry groups not large enough to analyze independently)8. Hereafter, any reference to ancestry refers to the genetic ancestry derived as stated.
PRS Performance in AoU
PRS for each subtype were downloaded from the PGS Catalog (www.pgscatalog.org, November 2023), prioritizing recent PRS trained on multi-ancestry populations when available. We included internally developed scores (ISPGS004939 and PADPGS004940) due to limited availability for IS and PAD PRS in the PGS Catalog. PRS evaluated in this study are referred to as TRAITPGSCatalogReference (i.e., PRSCHD by Patel et al. 13 is CHDPGS003725). Four PRSCHD were identified for testing as more multi-ancestry studies have been published with this phenotype. Two PRSCHD were chosen from one publication14 as one of the PRS being implemented in the eMERGE clinical trial15. Methods of PRS development with genetic ancestry distributions are outlined in Table 1. PRS for ASCVD risk factors (i.e., T2D, SBP, BMI, LDL-C) were also tested in the AoU cohort. One PRS for each risk factor, (chosen based on recently reported performance and inclusion of large, multi-ancestry cohorts) were used in multi-trait PRS development.
We used the “snp_match” function in the bigsnpR package to match and flip alleles as needed16. The “score” function in plink 2 was then used to sum the PRS over all individuals17. Testing was performed within genetic ancestry groups (EUR, AFR, AMR) and OTH. We normalized all PGS to zero-mean and unit-variance within each genetic ancestry group.
We followed reporting guidelines3,18. We assessed the associations of PRS with ASCVD phenotypes by fitting logistic regression models within each genetic ancestry group, adjusting for age and sex. We tested four different regression models that adjusted for i) age and sex, ii) age, sex, and PRS, iii) Cox proportional hazards using average time to event from first EHR record, sex and PRS, and iv) Cox proportional hazards using average time to event from first EHR record, sex, PRS, and risk factors in each genetic ancestry group (EUR, AFR, AMR, and OTH) for CHD, AAA, IS, PAD, and all ASCVD (Table 3). We additionally estimated OR for subtypes in the top 5% of PRS distribution compared to the rest (Table S2). C-statistics from cox models were reported in Table 3.
C-Statistics (validate package) were assessed for a model that included age and sex, as well as a model including age, sex, and PRS (Table S2). PRS results were examined using calibrate and validate functions in the rms package using the bootstrap method with 40 repetitions (Figure S5-8). Nagelkerke and calibration R2 values were obtained to assess performance of the PRS across genetic ancestry populations and Brier Scores calculated as a measure of accuracy of the predictions (Table S2). Pearson correlations between PRS within each ASCVD phenotype were assessed (Figure S2). HR per SD adjusted for time interval (age at EHR entry to age of diagnosis) and sex were assessed using Cox proportional hazards regression. Schoenfeld residuals were examined for deviations from the proportional-hazards assumption. Adjustments for conventional ASCVD risk factors (i.e., T2D, SBP, LDL-C), as well as statin and antihypertensive use were included in the cox model fit for comparison. Summary information for each PRS validated is in Table 3 and Table S2.
Multi-trait PRS
We gathered association weights from PRS for T2D, SBP, BMI, and LDL-C in the AoU cohort and included these PRS in an adjusted weight (multi-trait PRS) following previously described methods (Norland et al. 2022)7. This method involves linearly combining the trait of interest PRS with risk factor PRS7. We utilized one PRS for each ASCVD subtype based on performance (HR per SD): CHDPGS003356, AAAPGS003972, ISPGS000039, and PADPGS004940, excluding PRS that had previously accounted for risk factors included in the analysis (i.e., CHDPGS003725 13) to minimize statistical bias. Coefficients to weight the multi-trait PRS components were determined using 10-fold cross validation with glmnet and glmnetUtils R packages19 for each ASCVD phenotype and ancestry group (Figure S#). Multivariable linear regression was performed as previously described for all multi-trait PRS subtypes including the combined ASCVD phenotype.
Integrated Risk Scores
We also evaluated 10-year ASCVD risk using pooled cohort equations (PCEs)20. We then combined PCEs with all PRS to create an integrated risk score (IRS) for each trait to examine accuracy with the equation as follows:
Net reclassification indices (NRI) were evaluated for all PRS and all phenotypes using the nricens package with function “nribin()” for categorical and continuous NRI. Continuous and categorical NRI were used to evaluate risk prediction accuracy at a threshold of 10% individuals. Sensitivity and specificity were calculated for all clinical and integrated risk assessments (Supplemental File 2). Risk category reclassifications were also examined based on a threshold of 10-year ASCVD risk of 10% (Figure S9-12).
Results
Overall, PRSCHD performed best out of all ASCVD subtypes (Figure 2). CHDPGS003725 had the strongest associations with CHD in all population-based subsets of the AoU cohort (HR [95% CI]) with the highest performance in EUR (1.72[1.67-1.78]), followed by OTH (1.65[1.52-1.79]), AMR (1.48[1.37-1.59]) and AFR (1.24[1.18-1.31]) (Figure 2, Table S2). The CHDMulti PRS showed similar results to CHDPGS003725, with improvements in AFR (1.26[1.20-1.32]) and AMR (1.48[1.37-1.59]) populations, with a decrease in EUR (1.68[1.62-1.73]) and OTH (1.52[1.40-1.65]; Figure 2, Table 2, Table S2). Performance of PRSCHD for AFR populations was much lower (Figure 2, Table 2). Examining the Nagelkerke R2 values for these validations suggest that the model used to develop CHDPGS003725 that integrates other related traits provides better predictive accuracy than CHDMulti (Figure 3, Table S2).
We compared three PRSAAA (i.e., AAAPGS001784, and AAAPGS003972) in the 4 ancestry groups in AoU. AAAPGS002054 was developed using LDPred2, and the remaining two PRS were developed with PRS-CS21–23. AAAPGS001784 and AAAPGS003972 were developed using EUR cohorts, and both utilized multi-ancestry GWAS summary statistics (Table 2)22,23. PRS-CS scores outperformed the LDPred2 score in all ancestry groups, though it is noteworthy to state GWAS summary statistics used in training data differed (Figure 2-3, Table S2). AAAPGS003972 has the largest GWAS training data for AAA and was the best performing PRS for AAA across ancestry groups with HR per SD of 1.68[1.59-1.78] for EUR, 1.45[1.20-1.75] for OTH, 1.29[1.13-1.48] for AFR, and 1.30[1.06-1.60] for AMR, despite limitations in training diversity. AAAMulti increased calibration and Nagelkerke R2 value of the single trait PRS by an average of 18.5% (Table S2).
Three PRS 24,25 varying in linkage disequilibrium adjustment methods were compared using the AoU cohort26–30. The PRSIS were trained on EUR populations, limiting the portability of these scores between ancestry groups, however, ISPGS000039 and ISPGS004939 utilized multi-ancestry GWAS summary statistics7,29, improving the cross-ancestry performance compared to ISPGS000053. There was also a difference in PRS development method, allowing comparison of metaGRS as a multi-trait method and PRS-CS as a single-trait continuous shrinkage method (ISPGS000039 and ISPGS004939)28,29,31. For EUR populations, ISPGS004939 performed best (1.16[1.12-1.20]), whereas ISPGS000039 performed best for OTH (1.25[1.16-1.35]), AFR (1.12[1.06-1.17]), and AMR (1.11[1.04-1.19]; Figure 2-3, Table S2). ISMulti performed better in the 4 ancestry groups (EUR: 1.28[1.24-1.33], AFR: 1.21[1.16-1.27], AMR: 1.22[1.14-1.30], OTH: 1.45[1.35-1.55]) with an average increase of HR per SD 1.16-fold.
Portability of the PRSPAD was limited overall with essentially no predictive value for AFR (HR for 1 SD ∼1 with large standard errors). PADPGS004940 consistently outperformed PADPGS002055 for EUR (1.26[1.22-1.30]), OTH (1.13[1.04-1.22]), and AFR (1.11[1.05-1.18]) and PADPGS002055 performed best for AMR (1.11[1.03-1.19]; Figure 2, Table S2). However, PADMulti performed better with a HR per SD increase of 9.5% for EUR, 8.5% for AFR, 16% for AMR, and 13.8% for OTH from the best performing PRS.
Adjustment for conventional risk factors [type II diabetes (T2D), systolic blood pressure (SBP), low-density lipoprotein cholesterol (LDL-C)], as well as antihypertensive and statin use12 attenuated HR per SD for the PRS (∼0.1 HR per SD decrease), while increasing C-statistic (∼0.1 C-statistic increase). Overall, the multi-trait PRS for each ASCVD phenotype performed better than the single trait PRS, supporting the inclusion of ASCVD subtypes and risk factors in PRS development. However, the multi-trait PRS for each trait was trained and tested using AoU and results may be inflated. Analyses for OTH expectedly showed varying performance across ASCVD subtypes and individual PRS for performance statistics as OTH includes a broad variety of admixed and non-admixed individuals with limited sample sizes and resembled the performance for AMR. Calibrations and correlations for each PRS, trait, and ancestry group showed variable results relative to phenotype and ancestry group (Figures S5-8). R2 statistics from calibration are shown in Table S2 and Nagelkerke R2 with 95% CI are displayed in Figure 3. As EUR had larger training data sets, PRS performed better in EUR than in other ancestries for all ASCVD subtypes. However, EUR-trained PRS continue to perform poorly in AFR32.
We also examined performance of integrated risk scores (IRS) that included PRS and the pooled cohort equation (PCE) to estimate 10-year ASCVD risk in the AoU cohort. PRS were included as shown in Equation 1 (Methods). C-Statistics for all ASCVD subtype IRS indicate reasonable discrimination (0.682-0.778; Table S7). We identified changes in risk categorization for individuals using a 10% 10-year ASCVD risk threshold as shown in Figures S8-11, as well as continuous and categorical NRI (Table S7). IRS showed improvement of prediction for 10-year ASCVD risk scores. Predictive performance of PRS varied across subtypes. Multi-trait and multi-ancestry PRS had better predictive performance than scores derived from single ancestry.
Discussion
Performance of contemporary PRS for ASCVD subtypes varied across traits, ancestry groups, and development methods. PRS for CHD and AAA subtypes performed better across genetic ancestry groups than IS and PAD (Figures 2, 3). Expectedly, performance was best in those of EUR ancestry. Multi-ancestry and multi-trait PRS tended to perform better. Lower performance in AFR and admixed populations was consistent across all analyses.
PRSCHD have evolved and improved over the last decade33 but remain suboptimal for non-EUR groups. For example, the LDPred2-based method used to develop CHDPGS00372513 combined GWAS summary statistics for CHD in addition to other ASCVD phenotypes and risk factors, then was trained on primarily EUR populations in the UK Biobank13. This method had comparable performance to the multi-trait PRS developed in this study. Expanding the multi-trait method to other ASCVD subtypes increased performance, indicating that leveraging genetic correlations between other traits improves performance for each ASCVD trait compared to single-trait scores34–37. Despite the performance gains using multi-trait methods, PRS for non-EUR groups remain suboptimal.
PRSPAD and PRSIS performed less well, implying need for more data and diversity for these phenotypes before implementation into clinical practice (Figure 3). PRSPAD performs similarly for AMR and OTH but has a lower average R2 for EUR than PRSCHD and PRSAAA with broader confidence intervals overall (Figure 3). HR per SD of up to 1.69 (IS) and OR per SD of 1.44 (PAD) have been reported for cohort-based studies, however, the scores were not externally tested and were not available publicly or on PGS Catalog38,39. As many publications reporting metrics for PRS preceded or did not adhere the current guidelines3,18, direct comparison of PRS performance is difficult. Additionally, PAD is a heterogenous phenotype, and case ascertainment may vary across cohorts. Variable phenotyping can lead to variability in heritability estimates as well. For instance, heritability for PAD has been reported from modest (11%)38 to moderate (55%)40.
Differential performance of PRS for an ASCVD subtype across genetic ancestry groups could be due to varying number of cases across population groups, case ascertainment algorithms within GWAS and PRS development, PRS development methods, and overall diversity of training populations. Consistently lower performances of PRS for AFR is likely due to the greater genetic differences from EUR, which compose most training data available at varying stages of PRS development process13,14. Whether differences of causal variants and subsequent effect sizes contribute to performance is unclear41,42. As in other traits, PRS for ASCVD perform less well for AFR, highlighting the need for inclusion of more diverse individuals and improving portability of scores through methods to mitigate performance variability43.
IRS results (Table S4) had similar trends of performance across genetic ancestry groups and subtype (Table S2). It should be noted that PCEs include self-reported race designations10, but risk is likely better estimated by using social determinants of health methods instead of race when they become more available44.
Clinical Implications
The All of Us cohort is useful for validation of contemporary PRS as it includes a large, diverse group of individuals and has not yet been included in training datasets, thus avoiding overfitting. PRSAAA and PRSCHD performed robustly, suggesting potential for clinical utility. However, PRS for IS and PAD performed less well. Previous use of PRS for ASCVD phenotypes in clinical settings have shown promising results. For example in the MI-GENES trial, patients choose to implement changes based on PRS to reduce risk of CHD by including statins and/or healthier lifestyles45. The Electronic Medical Records and Genomics (eMERGE) Network is testing clinical implementation15 of PRS for 9 common conditions46. Performance of PRS for ASCVD phenotypes is limited for AFR10,46. However, the process of benchmarking in a highly diverse cohort enabled the identification of best-performing PRS for use in clinical risk scores. IRS used for clinical risk evaluation showed an increase in performance, when integrating PRSCHD (Table S4). Therefore, IRS with the best performing PRS will provide the most accurate risk estimates for patients.
Limitations
The AoU cohort defined genetic ancestry based on projection of samples onto the PC space, rather than more accurate methods of determining genetic ancestry such as ADMIXTURE and SCOPE, which may have labeled some individuals differently47,48. Additionally, few individuals present in the AoU cohort could also be subjects in other study cohorts used in development of the PRS weights.
Conclusions
PRS for ASCVD developed from multi-ancestry cohorts and multiple related traits performed best across ancestrally diverse and admixed individuals. PRS for CHD and AAA performed better than those for IS and PAD. Limited sample sizes and diversity within development cohorts restrict the portability of these PRS. As PRS continue to evolve, periodic assessment of performance is necessary. This study highlights the need for diverse cohorts for GWAS and PRS development for all ASCVD subtypes.
Data Availability
All data produced in the present work are contained in the manuscript. Individual-level data is available upon request or application to All of Us Research Program.
Non-standard Abbreviations and Acronyms
- PRS
- Polygenic risk score
- IRS
- Integrated risk score
- AoU
- All of Us
- ASCVD
- Atherosclerotic cardiovascular disease
- CHD
- Coronary heart disease
- AAA
- Abdominal aortic aneurysm
- IS
- Ischemic stroke
- PAD
- Peripheral artery disease
- EUR
- European genetic ancestry
- AFR
- African genetic ancestry
- AMR
- Admixed American genetic ancestry
- OTH
- Other genetic ancestry
- GWAS
- Genome-wide association study
- PRSCHD
- Polygenic risk score for coronary heart disease
- PRSAAA
- Polygenic risk score for abdominal aortic aneurysm
- PRSIS
- Polygenic risk score for ischemic stroke
- PRSPAD
- Polygenic risk score for peripheral artery disease
- Multi
- Multiple trait inclusion
- eMERGE Network
- electronic Medical Records and Genomics Network
- PGS Catalog
- Polygenic Score Catalog
- LDL-C
- Low-density lipoprotein cholesterol
- SBP
- Systolic blood pressure
- T2D
- Type II diabetes
- BMI
- Body mass index
- OR per SD
- Odds ratio per standard deviation increase
- HR per SD
- Hazard ratio per standard deviation increase
- Top 5% OR vs rest
- Odds ratio for the top 5% risk individuals versus the rest of the cohort
- CI
- Confidence interval
- C-Statistic
- Concordance Statistic
- NRI
- Net reclassification index
- AUC
- Area under the operating curve
- SNV
- Single nucleotide variant
Acknowledgements
We acknowledge the All of Us participants for contributions to the All of Us Researcher Workbench for the opportunity to contribute this research. We thank the National Institute of Health’s All of Us Research Program for making the data available for use in this study.
Sources of Funding
This study was done as part of the Polygenic Risk Methods in Diverse Populations (PRIMED) Consortium funded by the National Human Genome Research Institute (NHGRI) grant U01HG11710, as well as T32 grant HL07111-45, National Heart, Lung, and Blood grant K24 HL137010 and R35 GM140487.
Disclosures
The authors declare that they have no conflict of interest. The article used data from previously published human studies.