RT Journal Article SR Electronic T1 Fine Mapping Coronary Artery Disease Loci in UK Biobank’s Whole Genome Sequencing Data by Optimising Usage of the Cloud-Based Research Analysis Platform JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.09.23.24313932 DO 10.1101/2024.09.23.24313932 A1 Sng, Letitia M.F. A1 Kaphle, Anubhav A1 O’Brien, Mitchell J. A1 Hosking, Brendan A1 Reguant, Roc A1 Verjans, Johan A1 Jain, Yatish A1 Twine, Natalie A. A1 Bauer, Denis C. YR 2024 UL http://medrxiv.org/content/early/2024/10/03/2024.09.23.24313932.abstract AB We conducted the first comprehensive association analysis of a coronary artery disease (CAD) cohort within the recently released UK Biobank (UKB) whole genome sequencing dataset. We employed fine mapping tool PolyFun and pinpoint rs10757274 as the most likely causal SNV within the 9p21.3 CAD risk locus. Notably, we show that machine-learning (ML) approaches, REGENIE and VariantSpark, exhibited greater sensitivity compared to traditional single-SNV logistic regression, uncovering rs28451064 a known risk locus in 21q22.11. Our findings underscore the utility of leveraging advanced computational techniques and cloud-based resources for mega-biobank analyses. Aligning with the paradigm shift of bringing compute to data, we demonstrate a 44% cost reduction and 94% speedup through compute architecture optimisation on UK Biobank’s Research Analysis Platform using our RAPpoet approach. We discuss three considerations for researchers implementing novel workflows for datasets hosted on cloud-platforms, to pave the way for harnessing mega-biobank-sized data through scalable, cost-effective cloud computing solutions.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study did not receive any fundingAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:IRB of the Commonwealth Scientific and Industrial Research Organisation gave ethical approval for this workI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAccess to the UK Biobank dataset is upon application and with permission of UKB Research Ethics Committee