RT Journal Article SR Electronic T1 The Genetic Determinants and Genomic Consequences of Non-Leukemogenic Somatic Point Mutations JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.08.22.24312319 DO 10.1101/2024.08.22.24312319 A1 Weinstock, Joshua S. A1 Chaudhry, Sharjeel A. A1 Ioannou, Maria A1 Viskadourou, Maria A1 Reventun, Paula A1 Jakubek, Yasminka A. A1 Alexander Liggett, L. A1 Laurie, Cecelia A1 Broome, Jai G. A1 Khan, Alyna A1 Taylor, Kent D. A1 Guo, Xiuqing A1 Peyser, Patricia A. A1 Boerwinkle, Eric A1 Chami, Nathalie A1 Kenny, Eimear E. A1 Loos, Ruth J. A1 Psaty, Bruce M. A1 Russell, Tracy P. A1 Brody, Jennifer A. A1 Yun, Jeong H. A1 Cho, Michael H. A1 Vasan, Ramachandran S. A1 Kardia, Sharon L. A1 Smith, Jennifer A. A1 Raffield, Laura M. A1 Bidulescu, Aurelian A1 O’Brien, Emily A1 de Andrade, Mariza A1 Rotter, Jerome I. A1 Rich, Stephen S. A1 Tracy, Russell P. A1 Chen, Yii Der Ida A1 Gu, C. Charles A1 Hsiung, Chao A. A1 Kooperberg, Charles A1 Haring, Bernhard A1 Nassir, Rami A1 Mathias, Rasika A1 Reiner, Alex A1 Sankaran, Vijay A1 Lowenstein, Charles J. A1 Blackwell, Thomas W. A1 Abecasis, Goncalo R. A1 Smith, Albert V. A1 Kang, Hyun M. A1 Natarajan, Pradeep A1 Jaiswal, Siddhartha A1 Bick, Alexander A1 Post, Wendy S. A1 Scheet, Paul A1 Auer, Paul A1 Karantanos, Theodoros A1 Battle, Alexis A1 Arvanitis, Marios YR 2024 UL http://medrxiv.org/content/early/2024/08/26/2024.08.22.24312319.abstract AB Clonal hematopoiesis (CH) is defined by the expansion of a lineage of genetically identical cells in blood. Genetic lesions that confer a fitness advantage, such as point mutations or mosaic chromosomal alterations (mCAs) in genes associated with hematologic malignancy, are frequent mediators of CH. However, recent analyses of both single cell-derived colonies of hematopoietic cells and population sequencing cohorts have revealed CH frequently occurs in the absence of known driver genetic lesions. To characterize CH without known driver genetic lesions, we used 51,399 deeply sequenced whole genomes from the NHLBI TOPMed sequencing initiative to perform simultaneous germline and somatic mutation analyses among individuals without leukemogenic point mutations (LPM), which we term CH-LPMneg. We quantified CH by estimating the total mutation burden. Because estimating somatic mutation burden without a paired-tissue sample is challenging, we developed a novel statistical method, the Genomic and Epigenomic informed Mutation (GEM) rate, that uses external genomic and epigenomic data sources to distinguish artifactual signals from true somatic mutations. We performed a genome-wide association study of GEM to discover the germline determinants of CH-LPMneg. After fine-mapping and variant-to-gene analyses, we identified seven genes associated with CH-LPMneg (TCL1A, TERT, SMC4, NRIP1, PRDM16, MSRA, SCARB1), and one locus associated with a sex-associated mutation pathway (SRGAP2C). We performed a secondary analysis excluding individuals with mCAs, finding that the genetic architecture was largely unaffected by their inclusion. Functional analyses of SMC4 and NRIP1 implicated altered HSC self-renewal and proliferation as the primary mediator of mutation burden in blood. We then performed comprehensive multi-tissue transcriptomic analyses, finding that the expression levels of 404 genes are associated with GEM. Finally, we performed phenotypic association meta-analyses across four cohorts, finding that GEM is associated with increased white blood cell count and increased risk for incident peripheral artery disease, but is not significantly associated with incident stroke or coronary disease events. Overall, we develop GEM for quantifying mutation burden from WGS without a paired-tissue sample and use GEM to discover the genetic, genomic, and phenotypic correlates of CH-LPMneg.Competing Interest StatementL.M.R. is a consultant for the TOPMed Administrative Coordinating Center (through Westat). B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. J.Y. reports grant support from Bayer. M.C. reports grant support from Bayer and GSK, Consulting and speaking fees from Illumina and AstraZeneca. A.G.B., P.N, and S.J. are cofounders, equity holders, and on the scientific advisory board of TenSixteen Bio. G.R.A. is an employee of Regeneron Pharmaceuticals and receives salary, stock and stock options as compensation.Funding StatementWhole genome sequencing (WGS) for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the National Heart, Lung and Blood Institute (NHLBI). See Supplementary Information 1 for study omics support information. Centralized read mapping and genotype calling, along with variant quality metrics and filtering were provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract HHSN268201800002I). Phenotype harmonization, data management, sample-identity quality control and general study coordination were provided by the TOPMed Data Coordinating Center (R01HL-120393; U01HL-120393; contract HHSN268201800001I). We thank the studies and participants who provided biological samples and data for TOPMed. The full study-specific acknowledgments are included in Supplementary Cohort Acknowledgements.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Ethics committee/IRB of Johns Hopkins University gave ethical approval for this workI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesIndividual whole-genome sequence data for TOPMed whole genomes, individual-level harmonized phenotypes and the CHIP variant call sets used in this analysis are available through restricted access via the dbGaP TOPMed Exchange Area available to TOPMed investigators.