Abstract
Changes in DNA methylation patterns exhibit a high correlation with chronological age. Epigenetic clocks, developed through statistical models that estimate epigenetic age using the methylation levels of cytosine-guanine dinucleotide (CpG) sites, have emerged as powerful tools in understanding aging and age-related diseases. Despite their popularity, the generalizability of these clocks across diverse populations remains a challenge. We find that some of the widely used epigenetic clocks, such as Horvath’s clock (Horvath, 2013) and PedBE clock (McEwen et al., 2020) do not perform well in our target cohort. This lack of representativeness raises concerns about applying these clocks to quantify biological age in distinct demographic and ethnic groups. In addition, the feature space between existing clocks and our target data is different: most existing clocks are trained with data from older platforms, such as the Illumina HumanMethylation450 BeadChip (450K). In contrast, our target data are profiled with a more recent Illumina HumanMethylationEPIC BeadChip (EPIC) array. To address these gaps, we propose a transfer learning framework to adapt existing epigenetic clocks to underrepresented populations, using shared knowledge from diverse datasets. Furthermore, we develop imputation- and DNN-based methods for feature adaptation between existing clocks and our target data. Using data collected from 593 blood samples from a cohort of children and adolescents in the ELEMENT study, we find that our proposed transfer learning methods greatly improve the prediction performance compared to applying existing clocks directly. Performance is further enhanced by using the CpG sites profiled on the EPIC array. Our methodology showcases the potential to bridge the gap between different DNAm datasets and different profiling platforms, thus improving the applicability of epigenetic clocks in diverse population groups and contributing to more accurate aging research.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Research reported in this publication was supported by the National Institute On Aging of the National Institutes of Health under Award Number R21AG083364.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
IRB of Rutgers University gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.