RT Journal Article SR Electronic T1 Blood- and brain-based genome-wide association studies of smoking JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.05.21.24307663 DO 10.1101/2024.05.21.24307663 A1 Chybowska, Aleksandra D. A1 Bernabeu, Elena A1 Yousefi, Paul A1 Suderman, Matthew A1 Hillary, Robert F. A1 MacGillivray, Louise A1 Murphy, Lee A1 Harris, Sarah E. A1 Corley, Janie A1 Campbell, Archie A1 Spires-Jones, Tara L. A1 McCartney, Daniel L. A1 Cox, Simon R. A1 Price, Jackie F. A1 Evans, Kathryn L. A1 Marioni, Riccardo E. YR 2024 UL http://medrxiv.org/content/early/2024/05/21/2024.05.21.24307663.abstract AB Background Self-reported smoking is often incorporated into disease prediction tools but suffers from recall bias and does not capture passive exposure. Blood-based DNA methylation (DNAm) is an objective way to assess smoking. However, studies have not fully explored tissue-specificity or epigenome-wide coverage beyond array data. Here, we update the existing biomarkers of smoking and conduct a detailed analysis of the associations between blood DNAm and self-reported smoking.Methods and Findings A blood-based Bayesian epigenome-wide association study (EWAS) of smoking was carried out in 17,865 Generation Scotland individuals at ∼850k CpG sites (Illumina EPIC array). For 24 pairs of smokers and non-smokers a high-resolution approach was implemented (∼4 million sites, TWIST methylome panel). A DNAm-derived biomarker of smoking (mCigarette) was tested in the independent Lothian Birth Cohort 1936 (n=882, Illumina 450k array) and in the ALSPAC parents and offspring at four time points (range n=496–1,207). To explore tissue specific signals, EWASs of smoking were run across five brain regions for 14 individuals using DNAm from the EPIC array. Lastly, genome-wide association studies (GWASs) of smoking pack years and an epigenetic score for smoking (GrimAge DNAm pack years) were conducted (n=17,105). The primary EWAS analyses identified two novel genome-wide significant loci, mapping to genes related to addiction and carcinogenesis. Associations with CpG sites which are currently absent from methylation arrays were identified by the high resolution EWAS of smoking (n=48). The mCigarette pack years biomarker showed excellent discrimination across all smoking categories (current, former, never), and outperformed existing predictors in associations with pack years in an external test dataset (Pearson r=0.75). Several CpGs showed near-perfect discrimination of smoking status in both blood and brain, but these loci did not overlap across tissues. The GWAS of DNAm (but not self-reported) pack years identified novel and established smoking-related loci. However, the self-reported phenotype GWAS had a higher genetic correlation with a large meta-analysis GWAS of self-reported pack years. Among the study shortcomings are its potential lack of generalizability to non-Europeans and the absence of serum cotinine data.Conclusion A multi-tissue, multi-cohort analysis of the relationship between smoking, DNA and DNAm (assessed via arrays and targeted sequencing) has improved our understanding of the biological consequences of smoking.Competing Interest StatementR.E.M has received a speaker fee from Illumina and is an advisor to the Epigenetic Clock Development Foundation. R.F.H. has received consultant fees from Illumina. R.E.M and R.F.H. have received consultant fees from Optima partners. All other authors declare no competing interests.Funding StatementGeneration Scotland: Generation Scotland received core support from the Chief Scientist Office of the Scottish Government Health Directorates (CZD/16/6) and the Scottish Funding Council (HR03006). Genotyping and DNA methylation profiling of the Generation Scotland samples was carried out by the Genetics Core Laboratory at the Edinburgh Clinical Research Facility, Edinburgh, Scotland and was funded by the Medical Research Council UK and the Wellcome Trust (Wellcome Trust Strategic Award STratifying Resilience and Depression Longitudinally (STRADL; Reference 104036/Z/14/Z). The DNA methylation data assayed for Generation Scotland was partially funded by a 2018 NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation (Ref: 27404; awardee: Dr David M Howard) and by a JMAS SIM fellowship from the Royal College of Physicians of Edinburgh (Awardee: Dr Heather C Whalley). LBC1936: The LBC1936 is supported by the BBSRC, and the Economic and Social Research Council [BB/W008793/1] (which supports S.E.H.), Age UK (Disconnected Mind project), the Milton Damerel Trust, the Medical Research Council (MR/M01311/1), and the University of Edinburgh. Methylation typing of LBC1936 was supported by the Centre for Cognitive Ageing and Cognitive Epidemiology (Pilot Fund award), Age UK, The Wellcome Trust Institutional Strategic Support Fund, The University of Edinburgh, and The University of Queensland. Genotyping was funded by the BBSRC (BB/F019394/1). S.R.C. is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 221890/Z/20/Z). ALSPAC: The UK Medical Research Council and Wellcome (Grant ref: 217065/Z/19/Z) and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors and they will serve as guarantors for the contents of this paper. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). Funding for ALSPAC DNAm measurements were supported by the Wellcome (102215/2/13/2); the University of Bristol; the UK Economic and Social Research Council (ES/N000498/1); the UK Medical Research Council (MC_UU_12013/1, MC_UU_12013/2); the Biotechnology and Biological Sciences Research Council (BBI025751/1 and BB/I025263/1); and the John Templeton Foundation (60828). P.Y. and M.S. work is supported by the National Institute for Health and Care Research Bristol Biomedical Research Centre, the Medical Research Council Integrative Epidemiology Unit at the University of Bristol (MC_UU_00032/3, MC_UU_00032/4, MC_UU_00032/6), and Cancer Research UK [C18281/A29019, EDDISA-Jan22\100003]. A.D.C. is supported by a Medical Research Council PhD Studentship in Precision Medicine with funding from the Medical Research Council Doctoral Training Program and the University of Edinburgh College of Medicine and Veterinary Medicine. R.F.H is supported by an MRC IEU Fellowship. E.B. and R.E.M. are supported by Alzheimer's Society major project grant AS-PG-19b-010. This research was funded in whole, or in part, by the Wellcome Trust (104036/Z/14/Z, 108890/Z/15/Z, 220857/Z/20/Z, and 221890/Z/20/Z). For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:All components of Generation Scotland received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89). All participants provided broad and enduring written informed consent for biomedical research. Generation Scotland has also been granted Research Tissue Bank status by the East of Scotland Research Ethics Service (REC Reference Number: 15/0040/ES), providing generic ethical approval for a wide range of uses within medical research. This study was performed in accordance with the Helsinki declaration. Ethical approval for the LBC1936 study was obtained from the Multi-Centre Research Ethics Committee for Scotland (MREC/01/0/56) and the Lothian Research Ethics committee (LREC/1998/4/183; LREC/2003/2/29). All participants provided written informed consent. These studies were performed in accordance with the Helsinki declaration. Ethical approval for the ALSPAC study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. Consent for biological samples has been collected in accordance with the Human Tissue Act (2004). Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAccording to the terms of consent for Generation Scotland participants, access to data must be reviewed by the Generation Scotland Access Committee. Applications should be made to access{at}generationscotland.org.Lothian Birth Cohort data are available on request from the Lothian Birth Cohort Study, University of Edinburgh (https://www.ed.ac.uk/lothian-birth-cohorts/data-access-collaboration). Lothian Birth Cohort data are not publicly available due to them containing information that could compromise participant consent and confidentiality.ALSPAC data are available on request from bona fide researchers. The study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool (http://www.bristol.ac.uk/alspac/researchers/our-data/).All custom R (version 4.3.1), Python (version 3.9.7), and bash code is available with open access at the following GitHub repository: https://github.com/aleksandra-chybowska/Smoking_EpiScore/GWAS and EWAS summary statistics will be made available on Edinburgh DataShare on publication.