Abstract
Clonal hematopoiesis (CH) is one of the most extensively studied somatic mutational phenomena, yet its causes and consequences remain poorly understood. We identify 10,924 individuals with CH amongst 200,453 whole-exome sequenced UK Biobank participants and use their linked genome-wide DNA genotypes to map the landscape of inherited predisposition to CH. We increase the number of European-ancestry genome-wide significant (P<5×10−8) germline associations with CH from four to 14 and identify one new transcriptome-wide significant (P<3.2×10−6) association. Genes at new loci implicate DNA damage repair (PARP1, ATM, and CHEK2), hematopoietic stem cell migration/homing (CD164), and myeloid oncogenesis (SETBP1) in CH development. Several associations were CH-subtype specific and, strikingly, variants at TCL1A and CD164 had opposite associations with DNMT3A-versus TET2-mutant CH, mirroring recently reported differences in lifelong behavior of these two most common CH subtypes and proposing important roles for these loci in CH pathogenesis. Using Mendelian randomization, we show, amongst other findings, that smoking and longer leukocyte telomere length are causal risk factors for CH and demonstrate that genetic predisposition to CH increases risks of myeloproliferative neoplasia, several non-hematological malignancies, atrial fibrillation, and blood epigenetic age acceleration.
Competing Interest Statement
GSV is a consultant to STRM.BIO and AstraZeneca.
Funding Statement
This work was funded by a joint grant from the Leukemia and Lymphoma Society (RTF6006-19) and the Rising Tide Foundation for Clinical Cancer Research (CCR-18-500), and by the Wellcome Trust (WT098051). SPK is supported by a United Kingdom Research and Innovation (UKRI) Future Leaders Fellowship (MR/T043202/1) and leads the somatic genomics theme of the Integrative Cancer Epidemiology Programme at the University of Bristol that is funded by Cancer Research UK (C18281/A29019). PMQ is funded by the Miguel Servet Program (CP20/00130). MAF is funded by a Wellcome Clinical Research Fellowship (WT098051). RL is supported by Cancer Research UK (C18281/A29019). PC is supported by a British Heart Foundation Clinical Training Research Fellowship. SB is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (204623/Z/16/Z). GSV is supported by a Cancer Research UK Senior Cancer Fellowship (C22324/A23015) and work in his lab is also funded by the European Research Council, Kay Kendall Leukaemia Fund, Blood Cancer UK, and the Wellcome Trust. This research was conducted using the UK Biobank resource under applications 56844 and 29202. We thank the participants and investigators involved in the UK Biobank resource and in the other genome-wide association studies cited in this work who collectively made this research possible.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Individual-level UK Biobank data used in this study can be requested via application to the UK Biobank (https://www.ukbiobank.ac.uk).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Individual-level UK Biobank data used in this study can be requested via application to the UK Biobank (https://www.ukbiobank.ac.uk). Publicly-available genome-wide association study summary statistics used in this study can be accessed via the IEU OpenGWAS project (gwas.mrcieu.ac.uk).