Data Availability
In terms of data processed or generated as part of this study, we provide genetic association statistics for LD-independent lead SNPs and fine-mapped variants in UKB in addition to colocalization results (Supplementary tables 2-4). Full GWAS summary statistics from UKB and AoU will be made available in Zenodo upon peer-review. All GWAS sample sizes for each genetic ancestry group, meta-analysis, and phenotype can be found in Supplementary table 1. AoU policy does not currently permit public release of individual-level data due to important ethical and privacy considerations: https://www.researchallofus.org/wp-content/themes/research-hub-wordpress-theme/media/2020/05/AoU_Policy_Data_and_Statistics_Dissemination_508.pdf In terms of external data used in this study, we leveraged GWAS summary statistics, and ancestry-specific LD-matrices, and a curated list of 29 common, high-quality disease phenotypes generated as part of the Pan UKBB project (Pan UKBB Initiative, 2022), with more information available online (https://pan.ukbb.broadinstitute.org). UKB phenotype and whole genome sequencing data can be accessed via the UKB Research Analysis Platform after completing a UKB access application: https://ukbiobank.dnanexus.com/landing. AoU phenotype and genotype data can be accessed via access to the Controlled Tier v6 on the AoU researcher workbench: workbench.researchallofus.org. Published mtscATACseq data used for chrM:302 analysis can be obtained via approval from dbGaP. Gene-sets for enrichment analyses can be obtained using COMPARTMENTS (https://compartments.jensenlab.org) and MitoCarta 2.0 (https://www.broadinstitute.org/files/shared/metabolism/mitocarta/human.mitocarta2.0.html) as described previously (Gupta et al., 2021). The GRCh37 and GRCh38 reference genomes as well as other standard reference data are available via the GATK resource bundle: https://gatk.broadinstitute.org/hc/en-us/articles/360035890811-Resource-bundle. Annotations for the baseline v1.1 and BaselineLD v2.2 models for S-LDSC as well certain other relevant reference data, including the HapMap3 SNP list, can be obtained from https://alkesgroup.broadinstitute.org/LDSCORE/. BLASTn was used as available from the NCBI: https://blast.ncbi.nlm.nih.gov/Blast.cgi. Known reference and polymorphic NUMTs were obtained from supplemental data as provided in published work (Calabrese et al., 2012; Dayama et al., 2014; Li et al., 2012; Wei et al., 2022).