Structured Abstract
Background Pancreatic cancer is projected to become the second leading cause of cancer-related death by 2030 in the United States. DNA methylation (DNAm) age may reflect age-related variations in the biological changes and abnormalities related to cancer development.
Method We conducted a pooled analysis using prediagnostic blood samples of pancreatic cancer cases and matched controls selected from the Nurses’ Health Study (NHS), the Physician’s Health Study (PHS), and the Health Professionals Follow-up Study (HPFS). We used three DNAm aging clocks (Hannum, Horvath, and PhenoAge) to estimate subjects’ DNAm age, epigenetic age acceleration (AA) and intrinsic epigenetic age acceleration (IEAA) metrics. We performed conditional logistic regression and multivariable Cox proportional hazard regression to examine associations between six AA and IEAA metrics and risk of pancreatic cancer and survival, respectively.
Results A total of 393 incidence pancreatic cancer cases and 431 matched controls from the NHS, PHS, and HPFS cohorts were included in this analysis. The medians of all three epigenetic AA and three IEAA metrics were consistently above zero (indicating accelerated age) among cases, while they were below zero (indicating decelerated age) among the matched controls. Comparing participants in the highest quartile of age acceleration metrics, the pancreatic cancer risks were significantly increased by 67% to 83% for Hannum and PhenoAge AA or IEAA metrics with minimal of 7- to 9-years accelerated ages. Except for Hovarth AA and IEAA metrics, there were significant dose-response trends, such that higher age accelerations were associated with higher pancreatic cancer risk, but the relationships were nonlinear. Stratified analyses showed heterogeneous associations, varying by participants’ characteristics and by epigenetic AA or IEAA metrics. As time to diagnosis increased, the ORs of pancreatic cancer for the Hannum AA and Horvath AA or IEAA metrics trended upwards, while the ORs for the PhenoAge AA or IEAA and Hannum IEAA metrics trended downward. Overall, we observed no significant association between pancreatic cancer survival and any of the prediagnostic epigenetic AA or IEAA metrics.
Conclusion Our results indicate DNAm age acceleration is associated with an increased risk of pancreatic cancer in a nonlinear, dose-response manner. Epigenetic IEAA metrics may be a useful addition to current methods for pancreatic cancer risk prediction.
Background
DNA methylation (DNAm) clocks are derived from epigenetic DNAm markers that are strongly correlated with chronological age or time and can accurately quantify an age-related phenotype or outcome, or both (1). DNA methylation aging clocks capture age-related epigenetic variations, which can be divided into intrinsic (or intra-cellular) and extrinsic (or broadly within-tissue and external) aspects of the aging process (2). Intrinsic DNAm age is independent of age-related changes in blood-cell composition, while extrinsic DNAm age incorporates age-related changes in blood cell composition in the clocks’ algorithms.
Deviations between DNAm age and chronological age are commonly referred to as “age acceleration,” or positive deviation between DNAm age and chronological age. Numerous cohort studies have reported that subjects with age acceleration were at an increased risk for all-cause mortality after controlling for known risk factors (2-7). Furthermore, a study showed that the offspring of semi-supercentenarians (subjects who reached an age of 105-109 years) have a lower epigenetic age than age-matched controls (8). A meta-analysis of six longitudinal cohorts found that DNAm age increases at a slower rate than chronological age across the life course, and the likely explanation for this phenomenon is survival bias, where healthy individuals are those maintained within a longitudinal study (9). Based on these findings, it has been hypothesized that epigenetic processes play a role in healthy aging, and DNAm age captures some aspects of biological age. DNAm is thus thought to be a marker of susceptibility to disease and is associated with multiple health outcomes (10).
Pancreatic cancer incidence rates have been steadily increasing since 2010, and pancreatic cancer is projected to become the second leading cause of cancer-related death by 2030 in the United States (11). DNAm age may reflect age-related variations in the biological changes and abnormalities related to cancer development. Observational studies have reported that age acceleration estimated using different DNAm clocks is associated with increased all cancer risk and shorter cancer survival independent of major health risk factors (12, 13).
Consistent findings have been reported in studies that examined the association between DNAm age acceleration and specific type of cancer. In a pooled analysis of seven cohort studies, age acceleration had stronger associations with the risk of kidney cancer and B-cell lymphoma than colorectal, gastric, lung, prostate, and urothelial cancer (12). Intrinsic DNAm age acceleration estimated by Horvath’s clock (Horvath IEAA) was significantly associated with an increased risk of lung cancer in the Women’s Health Initiative (WHI) cohort (14). Horvath IEAA was also associated with an increased risk of breast cancer in a nested case-control study embedded in the European Prospective Investigation into Cancer and Nutrition (EPIC) cohort (15). Consistently, in a recent case-cohort study, significant positive associations were observed between age acceleration estimated by three different DNAm clocks (Hannum, Horvath, and PhenoAge) and risk of breast cancer (16).
To our knowledge, no study has examined epigenetic clocks in relation to pancreatic cancer risk or survival. Here, we assessed the associations of DNAm age acceleration and pancreatic cancer risk using incident pancreatic cancer cases and matched controls identified from three large cohort studies in the United States.
Methods
Study sample and blood collection
A pooled analysis using prediagnostic bloods of pancreatic cancer cases and matched controls selected from the Nurses’ Health Study (NHS) (17), the Physician’s Health Study (PHS) (18), and the Health Professionals Follow-up Study (HPFS) (19) was conducted. All blood specimens (for each cohort) were drawn by participants and mailed overnight, and upon receipt, the samples were aliquotted and frozen for future analyses.
Pancreatic cancer cases and matched controls
For any report of cancer (except basal cell skin cancer) in the three cohorts, written permission from study participants or next-of-kin was obtained to review their medical records. In the case permission could not be obtained, cancer registries were used to obtain details of diagnosis. Non-respondents are telephoned to obtain verbal confirmation of the information reported on the follow-up questionnaire (probing for details of diagnosis and treatment), and the social security death index is used to monitor death for remaining non-respondents. The active surveillance for mortality ensures virtually complete ascertainment of cancer deaths. All medical records were reviewed by trained physicians. The report is classified as confirmed cancer only if confirmed by the pathology report.Over 90% of participants have granted permission for medical record review.
During follow-up (through 2018 for HPFS and NHS, and through 2010 for PHS), 403 incident cases were confirmed to have pancreatic cancer among the participants who provided blood samples. A control subject was matched to each case on cohort (which also matches on sex), age (+/- 1 year), date of blood draw (month 3+/- and year), smoking (never, former, current) and race (White/other). Due to low DNA concentrations in some of the samples, and samples removed after data processing of the arrays (see Supplemental Methods), the initial 1:1 matching was not always conserved and resulted in some cases and controls with no matched pair. The final dataset consisted of 393 incidence pancreatic cases and 431 matched controls from the three large cohort studies where blood samples had been collected between 6 months and 26 years prior to cancer diagnosis.
DNA extraction and bisulfite conversion, and DNA methylation processing
DNA extracted from buffy coats were bisulfite-treated and DNA methylation was measured with the Illumina Infinium MethylationEPIC BeadChip [850K] arrays (Illumina, Inc, CA, USA). Details on DNA methylation measurements and data processing are provided in the Supplementary Methods.
Methylation age and age acceleration estimation
Three DNAm clocks (Hannum, Horvath, and PhenoAge) were used to estimate subjects’ DNAm age and age acceleration. All three DNAm clocks were developed based on Illumina 450K arrays; some CpGs from the 450K arrays were missing in the most updated Illumina EPIC [850K] arrays but the DNAm age estimates are highly correlated between the two arrays (r > 0.91) (20). The missing CpG values were imputed using the ENmix Bioconductor package (21). The Hannum’s clock is comprised of 71 CpG that strongly capture changes in chronological age, which is partly driven by age-related shifts in blood cell composition (22). The DNAm clock algorithm developed by Horvath was constructed across multiple tissues and included 353 CpGs (23). The more recent PhenoAge clock, developed by Levine and colleagues, was trained on age-related and disease phenotypes in combination with chronological age. It incorporates age-related biochemical measures and included 513 CpGs (24). Of the 513 CpGs, 41 and 6 CpGs were overlapped with Horvath’s clock and Hannum’s clock, respectively.
For each of the three DNAm clocks, DNAm age acceleration (AA) was defined by regressing DNAm age on chronologic age and calculating the difference between the observed chronological age and the fitted DNAm age (i.e., the residual). Additionally, intrinsic epigenetic age acceleration (IEAA) metrics were calculated using the residuals from the linear regression of DNAm age on the chronologic age and the measures of blood cell compositions. For all age acceleration metrics, a positive value indicates that DNAm age is higher than expected given the individual’s chronological age (accelerated aging), whereas a negative value indicates that DNAm age is lower than expected given the individual’s chronological age (decelerated aging). Subjects with the absolute value of the age acceleration estimate greater than three standard deviations (SDs) were excluded.
Statistical analysis
All statistical analysis was performed in R (version 3.6.1). Spearman’s rank correlation was used to calculate the correlations between age acceleration metrics. Results were plotted using ggplot2.
To examine association with pancreatic cancer, conditional logistic regression was performed, given that our samples were matched by cohorts (HPFS, NHS, or PHS), age, date of blood draw, race and smoking status using an incident sampling design. Body mass index (BMI) was adjusted in all conditional logistic regression models. Quartiles were assigned according to distribution of each DNAm age acceleration among controls. Similar results were observed when using unconditional logistic regression models including unmatched cases and controls (Supplemental Table 1).
Stratified analyses were also performed to explore whether odd ratios differed by sex, age groups (≤ 65 or >65 years), time to cancer diagnosis (≤10, 10-15, or >15 years), smoking status (current, former, or never smokers), and BMI categories (<25, ≥25 kg/m2). To reduce the number of group comparisons, participants were grouped into two groups, i.e., epigenetically older (values of AA or IEAA metrics ≤ 0) versus epigenetically younger (values of AA or IEAA metrics > 0), in these analyses. All stratified analysis models adjusted for BMI, except for the analysis by BMI categories.
The association between survival time and age acceleration among pancreatic cancer cases was examined using multivariable Cox proportional hazard model. Covariates included in the model were cohorts, age at blood draw, race, smoking status, and BMI. Subjects with survival time equal to zero were excluded because these subjects had no medical records and were identified using the National Death Index. Participants with missing covariate data were excluded from the analyses.
Results
Study population
Participants in the nested case-control study were diagnosed with pancreatic cancer an average of 13 years (range 6 months to 26 years) after providing a blood sample. About 45% of cases/controls were women (mean age = 59.4), all selected from the NHS cohort. The remaining cases/controls were men from the PHS (mean age = 56.5) and HPFS (mean age = 63.7) cohorts. The descriptive statistics of Hannum, Horvath, and PhenoAge epigenetic metrics by the characteristics of the study participants are provided in Table 1. About 40% of the study participants were never smokers, 45% former smokers, and 15% current smokers. About half of the study participant were overweight or obese. For all three AA metrics, there were more pancreatic cancer cases with accelerated epigenetic age than the matched controls. There were fewer female participants had accelerated epigenetic age than male.
The correlations between the three DNAm age and chronological age were moderate for all three epigenetic clocks (r = 0.42, 0.40, and 0.32 for Hannum, Horvath, and PhenoAge, respectively) (Figure 1). Correlations between each pair of age acceleration (AA) or intrinsic age acceleration (IEAA) metrics were high (r ranged from 0.65 to 0.98) in our study sample (Figure 2). The distributions of epigenetic AA and IEAA metrics among pancreatic cancer cases and their matched controls are shown in Figure 3. Consistently, the medians of all three epigenetic AA and three IEAA metrics were above zero (indicating accelerated age) among pancreatic cancer cases, while they were below zero (indicating decelerated age) among the matched controls.
DNAm age acceleration and pancreatic cancer risk
Conditional logistic regression was performed to examine the associations between epigenetic age acceleration and pancreatic cancer risk. All statistical models adjusted for matching factors, including cohorts (HPFS, NHS, or PHS), age, date of blood draw, race, and smoking status, in addition to BMI as a covariate. Participants were grouped into quartiles according to the distribution of each epigenetic AA and IEAA metric among controls. Quartiles 1 and 2 are comprised of pancreatic cancer cases and their matched controls with values of epigenetic AA or IEAA metrics less than zero (decelerated aging). Conversely, quartiles 3 and 4 are comprised of participants with values of epigenetic AA or IEAA metrics greater than zero (accelerated aging). Comparing participants in the highest quartile of age acceleration metrics (the cutoffs for these metrics varied) to those in the lowest quartile (the cutoffs for these metrics varied), the pancreatic cancer risks were significantly increased by 67% to 83% with minimal of 7- to 9-years accelerated ages for Hannum and PhenoAge AA or IEAA metrics. Comparing the highest to the lowest quartile, pancreatic cancer risks were increased by 28% for Horvath AA and IEAA metrics although these associations were not statistically significant (Figure 4); however, for those measures, the risks were higher when comparing participants in the third quartile to the lowest quartile (Horvath AA: odds ratio [OR] = 1.46; 95% 95% confidence interval [CI] = 0.95 to 2.24; Horvath IEAA: OR = 1.89; 95% CI = 1.23 to 2.96). Except for Hovarth AA and IEAA metrics, there were significant dose-response trends, such that higher age accelerations were associated with higher pancreatic cancer risks, but the relationship appeared nonlinear. (Figure 4)
Stratified analyses were conducted to explore whether the associations between DNAm age acceleration and pancreatic cancer risk differed by participants’ characteristics. In these analyses, the associations were heterogeneous, varying by participants’ characteristics and by epigenetic AA or IEAA metrics (Table 2), although some general trends were observed. Specifically, the associations were larger in male than female for Hannum IEAA and Horvath AA or IEAA metrics, but the associations were similar comparing male to female for Hannum AA and PhenoAge AA or IEAA metrics. Compared to the elderly (age >65 years), the associations were generally smaller for Hannum and PhenoAge AA or IEAA in the younger age group (age ≤65 years). In contrast, the Horvath AA or IEAA metrics were more strongly associated with risk in the younger age group compared to the elderly. For all epigenetic AA and IEAA metrics, the associations were larger in overweight or obese participants (BMI ≥25 kg/m2) than in normal or underweight participants. These results are mostly consistent with the results when analyses were conducted using quartiles of epigenetic age acceleration metrics (Supplemental Tables 2-5), except for the stratified analyses by smoking status. Specifically, strong positive associations (with wide CIs) between all AA or IEAA metrics and pancreatic cancer risk were observed among current smokers comparing participants in the higher quartiles to the lowest quartile (ORs ranged from 1.49 to 3.55). There were fewer current smokers in the lowest quartile (larger decelerated epigenetic ages) than higher quartiles, especially for the PhenoAge AA or IEAA metrics (15% and 17% of smokers in quartile 1, respectively). Thus, the median of all three epigenetic AA and three IEAA metrics were above zero for both pancreatic cancer cases and matched controls.
DNAm age acceleration and time to diagnosis
To examine whether the epigenetic clocks were affected by clinically occult cancer, we conducted stratified analyses by time to cancer diagnosis (≤10, 10-15, or >15 years). After accounting for the matching factors and BMI, epigenetic AA and IEAA metrics showed different trends in associations with cancer risk across categories of time to cancer diagnosis (Table 3). As time to diagnosis increased, the ORs of pancreatic cancer for the Hannum AA and Horvath AA or IEAA metrics trended upwards, while the ORs for the PhenoAge AA or IEAA and Hannum IEAA metrics trended downward. Additionally, stratified analyses by time to cancer diagnosis revealed that Hannum IEAA was significantly positively associated with cancer risk among participants who were within 10 years or less to cancer diagnosis (Hannum IEAA Q3 vs. Q1 OR□=□2.83, 95% CI = 1.08 to 7.39, P□= 0.033). All age acceleration metrics, except for Horvath AA, showed significant positive associations with cancer risk among those who were 15 years or more to cancer diagnosis (Table 3)
DNAm age acceleration and pancreatic cancer survival
We conducted case-only analyses to examine the associations between DNAm age acceleration and pancreatic cancer survival. Hazard ratios for pancreatic cancer survival were similar across all epigenetic AA and IEAA metrics. Overall, we observed little evidence of any significant association between pancreatic cancer survival and any of the epigenetic AA or IEAA metrics (Table 4).
Discussions
Our results indicate DNAm age acceleration is associated with an increased risk of pancreatic cancer that is independent of several key risk factors for cancer including chronological age. We found positive associations for all three epigenetic clocks (Hannum, Horvath, and PhenoAge), with stronger associations for intrinsic epigenetic age acceleration (IEAA) metrics suggesting that blood cell composition is a weak confounder in our analyses. Moreover, our findings suggest that the association between DNAm age acceleration and pancreatic cancer risk is possibly in a nonlinear, dose-response manner.
While prior studies have examined the epigenetic clock associations with cancer risk (12-16), only one study examined PhenoAge clock in relation to breast cancer risk (16) and another assessed nonlinear relationships between Hannum or Horvath IEAA and all cancer incidence and mortality (13). Analyzing multiple epigenetic age acceleration over time, the latter study found complex linear and nonlinear, dynamic time-dependent relationship with all cancer incidence and mortality (13). We did not find a significant association between age acceleration and pancreatic cancer survival. It should be noted that the prognosis of pancreatic cancer is poor (> 90% of pancreatic cancer cases died within 2.5 years after diagnosis) so associations between age acceleration and pancreatic cancer survival would be hard to detect, if they exist (Supplemental Figure 1)
The three epigenetic clocks have only a few CpGs in common, but the AA or IEAA metrics were highly correlated with each other in our study sample. The correlations between these epigenetic AA or IEAA metrics were moderate to high (r ranged from 0.39 to 0.98) in a case-cohort study of breast cancer risk (16). Unlike previous studies (none of these studies included pancreatic cancer cases) (12, 15, 16), the correlations between the epigenetic age estimated by the three epigenetic clocks and chronological age were only moderate in our study sample. The PhenoAge clock showed the lowest correlation to chronological age, which is not surprising because PhenoAge clock was trained on age-related and disease phenotypes in addition to chronological age (24). While different cancers are biologically distinct from each other, epigenetic age acceleration may not be cancer specific. The process of carcinogenesis is almost universally associated with both inflammation and activation of immune senescence pathways (25, 26). The fact that our findings also are consistent with those of the only prior study that examined the associations between age acceleration estimated by Hannum, Horvath, or PhenoAge clocks and risk of breast cancer (16) may also suggest that age acceleration reflects a more general cancer-associated biological process.
We observed heterogeneous associations, varying by participants’ characteristics, time to diagnosis, and AA or IEAA metrics in the stratified analyses. These findings suggest that the three epigenetic clocks (Hannum, Horvath, and PhenoAge) capture different aspects of biological aging and abnormal changes to cancer development, although the biological implications of the epigenetic clock are not well understood. Nonetheless, studies have consistently shown that BMI or obesity is positively associated with Hannum and Horvath age acceleration metrics (27-30), which support our findings with regard to the associations with pancreatic cancer that were larger in overweight or obese participants than in normal or underweight participants. Since studies have identified many common genetic variants associated with BMI or obesity (31), these findings point to a genetic component of these epigenetic clocks (12). Additionally, our stratified analyses showed that associations between all AA or IEAA metrics and pancreatic risk are modified by smoking status, suggesting that these epigenetic clocks are affected by lifestyle factors including smoking (27, 30, 32). Compared to 5 different DNAm aging clocks (including Hannum and Horvath), PhenoAge clock was shown to have stronger associations with all-cause mortality, smoking status, leukocyte telomere length, naïve CD8+ T cells and CD4+ T cells (24).
Lastly, our results overall suggest that Hannum and PhenoAge IEAA metrics may be more suitable for evaluating the associations with pancreatic cancer risks than Horvath clock, because smaller associations and opposite trends in associations with cancer risk across categories of time to cancer diagnosis were shown for Horvath IEAA metrics compared to Hannum and PhenoAge IEAA in our study sample. Age-related DNAm signatures may be influenced by tissue’s cell composition, which is altered with age and may partially mask age acceleration associations to disease risk. IEAA metrics are not confounded by the blood cell compositions by definition.
Our study has several strengths. We employed a prospective study design by using prediagnostic bloods of incidence pancreatic cancer cases and matched controls from three large cohort studies in the United States. Our analyses minimize confounding through matching and statistical adjustments. Although we find evidence that age acceleration is associated with increased risk of pancreatic cancer, our study has several limitations. First, our findings do not provide mechanistic insights into how these age-related changes influence pancreatic cancer risks. Second, our study population is comprised of mostly Caucasians so our results may not be generalizable to other races or ethnicity. Lastly, our stratified analyses may have spurious findings due to multiple testing and small sample sizes in some subgroups.
In sum, our study is first prospective study to examine Hannum, Horvath, and PhenoAge epigenetic clocks in relation to pancreatic cancer risk or survival, and we find that DNAm age acceleration is associated with an increased risk of pancreatic cancer in a nonlinear, dose-response manner. Epigenetic IEAA metrics may be used to improve pancreatic cancer risk prediction when used in concert with known disease risk factors. Little is known regarding the trajectories of these epigenetic clock in relation to cancer risk. To build evidence base to support epigenetic age as a biomarker for cancer early detection, future prospective studies should evaluate the changes in DNAm age (ideally at multiple time points) relating to cancer development.
Data Availability
All data from this study have been deposited in dbGAP and will be available on January 3, 2020 [DNA Methylation Markers and Pancreatic Cancer Risk in 3 Cohort Studies (NHS, PHS, HPFS) phs001917.v1.p1].
https://www.ncbi.nlm.nih.gov/projects/gapprev/gap/cgi-bin/study.cgi?study_id=phs001917.v1.p1
Footnotes
Financial Support: The research reported in this publication was primarily supported by the NIH/National Cancer Institute grant R01 CA207110. In addition, other NIH funds contributed to the support of the investigators: R01 CA163451, and P30 CA168525, and the Kansas IDeA Network of Biomedical Research Excellence Bioinformatics Core, supported in part by the National Institute of General Medical Science (NIGMS) Award P20GM103418.