ABSTRACT
Purpose Considering the heightened risk of cancer treatment-related cardiomyopathy and cardiac death in long-term survivors of childhood cancer, we aimed to develop and validate a clinically-applicable risk prediction model for cardiomyopathy.
Patients and Methods Childhood cancer survivors from St. Jude Lifetime Cohort, (SJLIFE, model-development; n=3,479; median age 32.3 years, IQR 24.4-40.9) and Childhood Cancer Survivor Study (CCSS, model-validation; n=6,875; median age 33.2 years, IQR 27.9-38.9) were assessed for demographic and cardiovascular risk factors, treatment exposures, and polygenic risk scores (PRSs) for cardiomyopathy, heart failure, cardiac structure and function, and anthracycline-related cardiomyopathy risk. Multivariable Poisson regression predicted the 10-year risk of cardiomyopathy (CTCAE grade ≥3: requiring heart failure medications or heart transplantation or leading to death) following baseline visit/survey. Model performance was assessed by area under the receiver operating characteristic curve (AUC).
Results Cardiomyopathy was clinically identified in 75 (2.2%, SJLIFE) and self-reported in 87 (1.3%, CCSS) survivors within 10 years of the baseline assessment. AUC of a clinical model with sex, age at cancer diagnosis, cumulative anthracycline and mean heart radiation doses was 0.833 (SJLIFE) and 0.812 (CCSS). Age at baseline, hypertension and genetic ancestry showed associations with higher cardiomyopathy rates in SJLIFE but did not increase AUC in CCSS (0.812). Adding PRSs for hypertrophic cardiomyopathy and left ventricular end-systolic volume improved AUC in CCSS (0.822; P=0.016). Compared to existing survivorship-care guidelines, the PRS model classified fewer survivors as high-risk or moderate-risk, while identifying survivors in those categories as having 1.5-times greater risk.
Conclusion We developed and validated a model with highest-to-date performance for estimating the 10-year risk of cardiomyopathy in survivors of childhood cancer. Results could enhance identification of at-risk survivors beyond current guidelines.
INTRODUCTION
In the United States, over 85% of children diagnosed with cancer survive for at least five years, resulting in nearly half a million childhood cancer survivors today1. However, these survivors often face chronic health issues from their cancer and treatments, increasing their risk of early death2,3. Cardiovascular disease, including cardiomyopathy and heart failure, is the leading non-cancer cause of mortality in this population4–6, with prevalence increasing with age7. Previous exposure to cardiotoxic treatments, such as anthracycline chemotherapy and chest-directed radiotherapy, are known risk factors. Additionally, cardiovascular risk factors (CVRFs), such as hypertension, dyslipidemia, and diabetes, further exacerbate the cardiac risks associated with these cancer treatments8,9.
Risk-prediction models guide primary cardiovascular prevention and treatment in the general population10–13. However, these models likely underestimate risks in childhood cancer survivors because they do not incorporate cardiotoxic cancer therapy exposures. Two previous risk prediction models for heart failure (HF) in survivors incorporated cancer treatment and/or CVRFs but were based on self-reported data from the CCSS and did not consider age or race as predictors14,15. Compared to clinically-assessed outcomes, self-reported data can lead to misclassification and imprecise predictor estimates, adversely impacting model performance. In the general population, genetic variants are associated with cardiomyopathy16, HF17, and cardiac structure and function16,18. In survivors, genetic variants have been associated with anthracycline-related cardiomyopathy19–24. However, the role of these genetic factors in predicting cardiomyopathy risk among survivors has not been fully explored.
This study aimed to utilize unique data from the clinically-assessed SJLIFE cohort of childhood cancer survivors to develop accurate and clinically-applicable risk prediction models for cardiomyopathy. These models incorporated survivors’ characteristics including age at baseline visit, treatment exposures, and inherited genetic variation. Models were independently validated using survivors from the CCSS.
PATIENTS AND METHODS
Study population
Participants were from two large cohort studies of childhood cancer survivors in North America, SJLIFE25,26 and CCSS27,28. SJLIFE is a retrospectively constructed cohort initiated in 2007, with prospective clinical assessments of ≥5-year survivors of childhood cancer treated at St. Jude Children’s Research Hospital (SJCRH) since 1962. CCSS is also a retrospectively constructed cohort study with prospective follow-up of ≥5-year survivors diagnosed before age 21 and treated for various cancers at 31 participating institutions in the U.S and Canada between 1970 and 1999. Here, we included survivors in SJLIFE (model-development) and CCSS (model-validation; independent of those also in SJLIFE) who had genome-wide genetic data and completed at least one campus visit or questionnaire. Institutional review boards at SJCRH and each of the CCSS participating centers approved the study, and all participants provided written informed consent.
Definitions of outcomes
Cardiomyopathy in SJLIFE was clinically assessed based on left ventricular ejection fraction (LVEF) and pharmacologic treatment and graded per a modified version of the National Cancer Institute’s Common Terminology Criteria for Adverse Events (CTCAE v4.03)29. Participants in CCSS completed a multi-item questionnaire at baseline and follow-up evaluations that included age at onset of HF based on self-report of medical diagnosis and pharmacological treatment. Questionnaire items related to HF in CCSS were classified and graded using the CTCAE methodology8,30. Survivors in both SJLIFE and CCSS with CTCAE grade ≥3 were considered affected. Detailed CTCAE grading criteria for both cohorts are provided in Supplementary Table 1.
Cancer treatment exposures and CVRFs
Information pertaining to chemotherapy and radiotherapy exposures was abstracted from medical records (Supplementary Methods). The cumulative anthracycline dose (in mg/m2) was determined by doxorubicin toxicity equivalence31, and mean radiation dose to the heart for each survivor was calculated using established radiation dosimetry methodologies32–34. CVRFs including hypertension, dyslipidemia, and diabetes were clinically-assessed in SJLIFE (comprising laboratory measurements and blood pressure values in addition to pharmacologic treatment) and were self-reported in CCSS. These CVRFs were defined as CTCAE grade ≥2 in both cohorts.
Polygenic risk scores
Based on studies in the general population and childhood cancer survivors, we considered multiple PRSs for cardiomyopathies (dilated [DCM] and hypertrophic [HCM]), HF, measures of cardiac structure and function, and anthracycline-related cardiomyopathy. Specifically, we considered six separate PRSs for DCM (PRSDCM)16, HCM (PRSHCM)16, HF (PRSHF)17, LVEF (PRSLVEF)18, LV end-systolic volume index (PRSLVESVi)18 and anthracycline-related cardiomyopathy (PRSACT)19–24. Details regarding genotyping, genetic ancestry determination, and the PRS calculation are described in Supplementary Methods.
Statistical Analysis
Multivariable Poisson regression models were fit sequentially to the SJLIFE model-development cohort to evaluate associations with incident cardiomyopathy, starting with variables selected a priori from the existing HF risk prediction models previously developed in the CCSS35,36. The variables included in this clinical model were age at primary cancer diagnosis, sex, cumulative anthracycline dose, and mean heart radiation dose. Specifically, we predicted the incidence of CTCAE grade ≥3 cardiomyopathy over a 10-year period, incorporating an offset to account for variations in follow-up length: i.e., the offset was the time between a survivor’s baseline assessment (or cohort entry) and the earliest of his/her last follow-up, 10 years after the baseline, or death. This approach allowed us to scale each survivor’s risk according to the actual follow-up duration, ensuring accurate rate estimation while adjusting for differences in follow-up length. Survivors with grade ≥3 cardiomyopathy (or grade ≥3 HF) prior to baseline, or those whose first cardiomyopathy was grade 2 and occurred before baseline, were excluded from the analysis. The subsequent models included age and prevalent CVRFs at baseline. The CVRFs retained in the model were selected through a backward selection approach. Genetic ancestry was then entered into the model (with a sensitivity analysis that replaced genetic ancestry with self-reported race), followed by an addition of each of the six PRSs as z-scores – PRSDCM, PRSHCM, PRSLVEF, PRSLVESVi, PRSHF, and PRSACT. The PRSs with P<0.1 in this univariate analysis were added to the model simultaneously.
Receiver operating characteristic (ROC) analyses were performed to assess the predictive ability, measured using the area under the ROC curves (AUC), of the following five models – model 1 (clinical model); model 2 (model 1 with age at baseline); model 3 (model 2 with CVRFs at baseline); model 4 (model 3 with genetic ancestry or self-reported race); and model 5 (model 4 with the PRSs). The prediction models 1-5 developed based on SJLIFE participants were validated using the independent model-validation cohort of CCSS participants, calculating AUCs of the exact same models developed from SJLIFE using the same predictors and the model parameter values fixed as the estimates from SJLIFE. The statistical significance of AUC improvement by including additional predictor(s) over the model without them was assessed using the DeLong’s test37. All five models, including the univariate analyses of PRSs, were also assessed among survivors of European ancestry in both cohorts, since over 80% of survivors in SJLIFE/CCSS were of European ancestry, and the PRSs were primarily derived from studies involving individuals of European ancestry.
The CCSS model-validation cohort used for all model validations was required to have genome-wide genotype data, which is available for only about one-third of the entire CCSS cohort. For generalizability, however, models 1-4 without requiring the genotype-data were also evaluated in the entire CCSS cohort (n=21,352; 217 with CTCAE grade ≥3 HF after baseline). Additionally, the models were evaluated among survivors exposed to cardiotoxic therapies (anthracyclines and/or chest-directed radiotherapy [chest RT]), and those in the following risk groups based on the current International Late Effects of Childhood Cancer Guideline Harmonization Group (IGHG)38 recommendations: high-risk group (≥250 mg/m2 anthracyclines or ≥30 Gray chest RT, or a combination of ≥100 mg/m2 anthracyclines and ≥15 Gray chest RT); moderate-risk group (100 mg/m2≤anthracyclines<250 mg/m2 or 15 Gray≤chest RT<30 Gray); and low-risk group (0 mg/m2<anthracyclines<100 mg/m2 or 0 Gray<chest RT<15 Gray).
To assess the calibration of the best-performing model, the predicted probability of developing cardiomyopathy in the 10-year period from the baseline for each survivor (assuming no death) was compared to the observed cumulative incidence of cardiomyopathy. Based on these predicted probabilities, survivors were then categorized into “high” (predicted probability>10%), “moderate” (5%≤predicted probability≤10%), and “low” (predicted probability<5%) risk groups. This <5% threshold is the criterion used to identify low-risk individuals for atherosclerotic cardiovascular disease in the general population, as defined by the pooled cohort equations39.The 10-year cumulative incidence of cardiomyopathy in each risk group was estimated and compared to those of the IGHG-based risk groups.
RESULTS
A total of 3,479 survivors in SJLIFE were available for model development and 6,875 survivors in CCSS were included for model validation. Over the 10 years following baseline, cardiomyopathy was clinically identified in 75 (2.2%) SJLIFE survivors and self-reported in 87 (1.3%) CCSS survivors. Clinical, demographic, and treatment characteristics of study participants are provided in Table 1.
The fit of the multivariable Poisson-regression models 1-4 based on SJLIFE survivors is shown in Supplementary Table 3. Model 2 revealed a significant association between the 10-year cardiomyopathy incidence and baseline age >35-45 years (relative rate [RR]=2.40; P=0.027) and ≥45 years (RR=3.35; P=0.014) compared to age ≤25 years. In model 3, the only CVRF retained was hypertension (RR=2.20; P=2.7×10-3). In model 4, African genetic ancestry was suggestively associated (RR=1.70; P=0.078), with similar results for self-reported Black race (RR=1.66; P=0.084). Adding each of the six PRSs to model 4 showed none with P<0.1 for all survivors (Supplementary Table 4). However, in survivors of European ancestry, two PRSs showed P<0.1 (PRSLVESVi: RR per standard deviation=1.26; P=0.082 and PRSHCM: RR per standard deviation=0.72; P=0.024). Considering over 80% of survivors in both SJLIFE and CCSS cohorts were of European ancestry, these two PRSs were added simultaneously to model 5 (Table 2).
In SJLIFE model-development cohort, AUC of the clinical model 1 was 0.833 (95% CI=0.789-0.877) (Figure 1). Adding age at baseline (model 2), hypertension (model 3) and genetic ancestry (model 4) did not significantly improve the AUCs (0.844, P=0.15; 0.853, P=0.15; and 0.852, P=0.97, respectively). Using self-reported race instead of genetic ancestry yielded comparable results (Supplementary Table 5). Further adding two PRSs (PRSLVESVi and PRSHCM) also did not significantly enhance the AUC (0.856, P=0.41) (Figure 1 and Supplementary Table 6). In CCSS model-validation cohort, the AUC of the clinical model 1 was 0.812 (95% CI=0.770-0.853). This AUC value remained unchanged with the additions of age at baseline (model 2), hypertension (model 3), and genetic ancestry (model 4) to model 1 (Figure 1). Similar results were observed when genetic ancestry was replaced by self-reported race (Supplementary Table 5). However, adding the two PRSs (model 5) significantly increased the AUC to 0.822 (95% CI=0.782-0.863; P=0.016) compared to model 4 (Figure 1 and Supplementary Table 6). Consistent results were observed among survivors of European ancestry in both SJLIFE model-development and CCSS model-validation cohorts (Supplementary Table 7). A sensitivity analysis showed that the AUC estimates for models 1-4 without genotype data in the entire CCSS cohort were comparable to those in CCSS-model validation cohort with genotype data (Supplementary Table 8).
In the subgroup of survivors exposed to cardiotoxic therapies, model 5 (which includes age at baseline, hypertension, genetic ancestry, and two PRSs) achieved an AUC of 0.844 (95% CI=0.796-0.892) in SJLIFE model-development cohort and 0.791 (95% CI=0.746-0.835) in CCSS model-validation cohort (Table 3). Adding these factors to the clinical model modestly improved prediction performance, though changes in AUC were not statistically significant. Similar results were observed in the IGHG-based moderate-risk group, with model 5 yielding an AUC of 0.836 (95% CI=0.754-0.917) in SJLIFE and 0.757 (95% CI=0.677-0.838) in CCSS. However, in the IGHG-based high and low-risk groups, the models did not further stratify risk among survivors.
Compared to the IGHG-based risk groups, the best-performing model (model 5) more effectively classified the 10-year risk of cardiomyopathy. It identified fewer survivors at high-risk (260 vs. 506) and moderate-risk (454 vs. 821) in SJLIFE, with similar trends in CCSS (high-risk: 361 vs. 1,476; moderate-risk: 856 vs. 1,566). At the same time, model 5 predicted more survivors at low-risk than IGHG guidelines. Notably, cumulative incidences of cardiomyopathy at 10 years from baseline were nearly 1.5 times higher in higher-risk groups based on model 5 compared to IGHG: specifically, in high-risk (SJLIFE: 23.9% vs. 16.7%; CCSS: 7.7% vs. 4.8%) and moderate-risk groups (SJLIFE: 9.9% vs. 6.2%; CCSS: 3.6% vs. 1.3%) (Figure 2). The low-risk group remained stable (SJLIFE: 2.3% vs. 2.0%; CCSS: 1.0% vs. 0.4%). While model 5 showed good risk discrimination, the CCSS dataset indicated slight overestimation in risk: high and moderate-risk groups were calibrated to be >10% and 5%≤predicted probability≤10%, but CCSS showed 7.7% and 3.6%, respectively.
DISCUSSION
Using a large, clinically-assessed cohort of long-term survivors of childhood cancer, we developed a prediction model for estimating the 10-year risk of cardiomyopathy. The model, including cancer treatments, age, hypertension and two PRSs developed from the general population, demonstrated good discrimination in both model-development (AUC 0.856) and model-validation cohorts (AUC 0.822), highlighting its clinical utility. Notably, this model was more effective in identifying high- and moderate-risk survivors compared to IGHG guidelines. It is available as an online risk calculator at https://ccss.stjude.org/resources/calculators/cardiomyopathy-prediction-calculator.html.
Our risk prediction model, based on clinically-assessed outcomes, showed higher AUC values than those using self-reported data among CCSS survivors14,15. While our model targeted the 10-year risk from baseline assessment (with most survivors being adults at baseline), earlier CCSS models focused on risk after 5 years post-childhood cancer diagnosis. Chow et al.14 developed a model for predicting HF after 5-year survival before age 40, using data from 13,060 survivors, with an AUC of 0.76. Chen et al.15 expanded this by including traditional CVRFs and predicting HF from ages 20, 25, 30, and 35 by age 50, yielding AUCs between 0.69 and 0.77. Unlike these previous models, which were set-up to predict HF by a certain age and therefore, did not include attained age as a predictor, our model was established for use in current long-term survivors who are substantially older at the time of prediction and the models were designed for predicting cardiomyopathy over the next 10 years. Thus, age at the time of prediction was included to account for age-related risk.
The AUC of our clinical model 1, with demographic and treatment factors, increased by 2% in the model-development cohort when age, hypertension, and genetic ancestry were added, but showed no change in the model-validation cohort. Age and hypertension were associated with higher cardiomyopathy rates in the model-development cohort, however, and genetic ancestry showed marginal evidence of association. Consistent with the observed associations, the cardiovascular disease risk and incidence of CVRFs are known to increase with survivors’ age40. Furthermore, age is a well-established risk factor for cardiovascular disease and is included in most cardiovascular risk scores of the general population41,42. Survivors of African genetic ancestry or self-reported Black race are known to have significantly higher cardiomyopathy rates compared to European genetic ancestry or self-reported White race counterparts,21 which underscores the importance of including this factor in risk prediction models.
Two PRSs from the general population enhanced the 10-year cardiomyopathy risk prediction in the model-validation cohort, even though neither showed statistically significant association with cardiomyopathy rate in the model-development cohort. The PRS for hypertrophic cardiomyopathy was associated with lower risk of cardiomyopathy in our model-development cohort, consistent with the UK Biobank findings that hypertrophic and dilated cardiomyopathy share genetic pathways with opposite effect directions16. The observed association of the PRS for left ventricular end-systolic volume was consistent with that of the UK Biobank where the risks of dilated cardiomyopathy18 and HF43 were found to be associated with this PRS. This may suggest shared genetic mechanisms for certain cardiac changes in childhood cancer survivors and the general population.
Currently, IGHG estimates cardiomyopathy risk in childhood cancer survivors based solely on cardiotoxic treatment exposures. In contrast, our best-performing model 5, which includes two PRSs, more effectively predicted the 10-year risk of cardiomyopathy, specifically in high and moderate-risk groups. Compared to IGHG, it classified fewer survivors as high-risk and moderate-risk and identified more as low-risk. Notably, the cumulative incidence of cardiomyopathy was 1.5-times higher in the high-risk and moderate-risk groups identified by our model than by IGHG. Both IGHG and the Children’s Oncology Group recommend routine echocardiography every two years for high-risk and every five years for moderate-risk survivors, while it is no longer recommended for low-risk survivors 38. We defined low-risk survivors as those with a <5% predicted probability of developing cardiomyopathy over the next 10 years, observing 2.3% in SJLIFE and 1.0% in CCSS in this group. Leerink and colleagues have proposed a similar re-stratification approach using real-time risk estimates from echocardiogram results, suggesting potential reductions in screening intensity for some survivors44. If validated by cost-effectiveness analyses, our model could enhance risk stratification for cardiomyopathy in adult survivors of childhood cancer and potentially lower screening costs for high- and moderate-risk individuals.
Our study has several limitations. First, we did not consider lifestyle factors such as smoking, alcohol consumption, obesity, physical activity, and diet. However, the inclusion of CVRFs, which are often more common in individuals with suboptimal lifestyles, may reflect some of these behaviors. Second, some PRSs were based on a limited number of variants, potentially missing the full extent of genetic effects. Third, our model was not well calibrated in the CCSS validation cohort (Supplementary Figure 1), where the cardiomyopathy rate was consistently lower than in SJLIFE, with a multiplicative factor of 0.6, despite using the same risk thresholds. This uniformly lower risk suggests under-ascertainment or under-reporting of cardiomyopathy in CCSS and does not necessarily suggest poor prediction or lower risk in CCSS survivors. Fourth, while our model adjusted for genetic ancestry, over 80% of survivors were of European ancestry, which may limit its generalizability to non-European ancestry survivors. Given the higher risk of cardiomyopathy in African American survivors21, it is essential to collect data and develop race-specific risk prediction models for minority groups.
In conclusion, we developed and validated a prediction model to estimate the 10-year risk of cardiomyopathy in childhood cancer survivors. This model incorporates cancer treatments, age, hypertension, ancestry, and PRSs from the general population, resulting in improved accuracy in risk classification. It more effectively identified at-risk survivors compared to the current IGHG-based groups recommended for cardiac surveillance. With the highest prediction performance to date based on clinically assessed outcomes, our model is well-suited for long-term survivors facing age-related risks and can help identify those who would benefit most from preventive interventions and risk reduction strategies.
Data Availability
All data produced in the present study are available upon reasonable request to the authors
https://ccss.stjude.org/resources/calculators/cardiomyopathy-prediction-calculator.html