Multi-organ imaging-derived polygenic indexes for brain and body health ======================================================================= * Xiaochen Yang * Patrick F. Sullivan * Bingxuan Li * Zirui Fan * Dezheng Ding * Juan Shu * Yuxin Guo * Peristera Paschou * Jingxuan Bao * Li Shen * Marylyn D. Ritchie * Gideon Nave * Michael L. Platt * Tengfei Li * Hongtu Zhu * Bingxin Zhao ## Abstract The UK Biobank (UKB) imaging project is a crucial resource for biomedical research, but is limited to 100,000 participants due to cost and accessibility barriers. Here we used genetic data to predict heritable imaging-derived phenotypes (IDPs) for a larger cohort. We developed and evaluated 4,375 IDP genetic scores (IGS) derived from UKB brain and body images. When applied to UKB participants who were not imaged, IGS revealed links to numerous phenotypes and stratified participants at increased risk for both brain and somatic diseases. For example, IGS identified individuals at higher risk for Alzheimer’s disease and multiple sclerosis, offering additional insights beyond traditional polygenic risk scores of these diseases. When applied to independent external cohorts, IGS also stratified those at high disease risk in the All of Us Research Program and the Alzheimer’s Disease Neuroimaging Initiative study. Our results demonstrate that, while the UKB imaging cohort is largely healthy and may not be the most enriched for disease risk management, it holds immense potential for stratifying the risk of various brain and body diseases in broader external genetic cohorts. Keywords * Abdominal MRI * Brain MRI * Cardiovascular magnetic resonance * Disease risk assessment * OCT imaging * Genetic prediction * Alzheimer’s disease ## Introduction The large-scale imaging data from the UK Biobank (UKB) imaging study has proven to be an immensely valuable resource for characterizing the structural and functional organizations of the human brain and body1–4. These data have been used to establish links with clinical biomarkers5,6, predict biological aging7–9, study socioeconomic outcomes10,11, and facilitate early disease detection12. Launched in 2014, the UKB imaging project reached a milestone in 2022 by completing the multimodal brain magnetic resonance imaging (MRI) of its 60,000th participant. Although it is already the world’s largest imaging study, the UKB imaging study will ultimately include nearly 100,000 participants4, or about 20% of all UKB participants. Given the cost and difficulty of collecting additional imaging data, it is crucial to develop strategies that extend the utility of the existing UKB imaging data. Polygenic indices13 (also known as genetic scores, polygenic scores, or polygenic risk scores [PRS]) can be used to predict traits or disease genetic risk by aggregating genetic information across the genome14–21. The development of numerous prediction methods22, reporting standards23, genetic data resources24, and data sharing platforms25 has enabled the application of these indices to a wide variety of complex diseases and traits. Both family and population-based studies have shown that variations in brain and body, as measured by structural and functional MRI, are heritable2,26–32. Recent genome-wide association studies (GWAS)2,29,33–41 have identified many genetic loci associated with brain and body imaging-derived phenotypes (IDPs). Given the cost and complexity of imaging phenotypes, it is natural to ask whether genetic data can be used to predict IDPs for UKB participants who are not imaged or who are part of other cohorts of interest (e.g., another biobank-scale cohort, or a cohort dedicated to Alzheimer’s disease). Several GWAS have reported the prediction accuracy (out-of-sample *R*2) of genetically predicted brain IDPs33,42,43 in small-scale independent testing data, indicating that genetic predictors could partially recover imaging variations. These pilot studies demonstrate that IDP genetic scores (IGS) can serve as valuable proxy biomarkers of the brain and body in the absence of readily available imaging data29,40,44. In particular, since IGS are genetic predictions of imaging traits, we expect to see links between IGS and various phenotypes when there are shared genetic effects. Here, we systematically developed and evaluated IGS for UKB brain and body images. We examined 4,206 brain IDPs from various imaging modalities and independent processing pipelines, including 3,905 traits generated by the UKB brain imaging team1,2,34 (referred to as UKB-Oxford data hereafter) and 301 traits processed by the Brain Imaging Genetics Knowledge Portal (BIG-KP)33,35,37. These imaging biomarkers spanned major brain MRI categories, including structural MRI (sMRI) such as regional brain volumes45, cortical thicknesses, and surface areas1; diffusion MRI (dMRI) such as diffusion tensor imaging (DTI) parameters46,47; resting-state functional MRI (rfMRI) such as independent component analysis (ICA)1,2,48,49 or parcellation-based50 functional connectivity and activity/amplitude51 traits; task-based functional MRI (tfMRI) such as activation z- statistics1; and susceptibility weighted brain MRI, incorporating regional median T2*5. Moreover, we analyzed 169 body IDPs, including 82 from cardiovascular magnetic resonance (CMR)3,29, 41 from abdominal MRI52–54 (covering organs such as liver, kidney, and lung), and 46 from optical coherence tomography (OCT) imaging40. The **Methods** section provides more detailed information of these 4,375 IDPs (4,206 brain + 169 body) (**Table S1**). **Figure 1A** presents an overview of the study design. ![Fig. 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F1.medium.gif) [Fig. 1](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F1) Fig. 1 Study overview and prediction analysis**. (A)** An overview of the study design. **(B)** The incremental prediction *R2* of brain and body IGS in the hold-out UK Biobank (UKB) testing sets of European, Asian, and African ancestries. The structural MRI (sMRI) modality consisted of 101 BIG-KP regional brain volumes, the diffusion MRI (dMRI) modality consisted of 110 BIG-KP DTI parameters, and the resting functional MRI (rfMRI) modality consisted of 90 BIG-KP parcellation-based traits and 82 UKB-Oxford ICA-based rfMRI traits. The abdominal MRI (Abd MRI) modality consisted of 41 abdominal MRI traits, the eye OCT modality consisted of 46 eye traits, and the heart CMR modality consisted of 82 UKB heart CMR traits. The results of other IGS can be found in **Figure S1**. We display the IGS that significantly predicted the corresponding IDPs in the European dataset after controlling the FDR rate at a 5% level. **(C)** The incremental prediction *R2* of these IGS in the European and Asian testing datasets. **(D)** The incremental prediction *R2* of these IGS in the European and African testing datasets. In **(C)-(D)**, the IGS with the highest prediction *R2* in the European testing set in each modality was marked by an arrow and text. ICA stands for independent component analysis, INL stands for inner nuclear layer, and RPE stands for retinal pigment epithelium. ## RESULTS ### Genetic scores for 4,206 brain IDPs and 169 body IDPs To develop the genetic scores for IDPs, we used data from UKB individuals of British ancestry with both imaging and genotyping array for training (average *n* = 34,293 for MRI and 54,761 for OCT). An independent hold-out dataset containing participants of non- British European ancestry served as a testing dataset to evaluate the predictive performance of the IGS (average *n* = 5,116 for MRI and 4,801 for OCT). We generated GWAS summary statistics for each IDP, which were then used as input to derive IGS for all UKB participants without imaging data. We used PRS-CS55 to construct the IGS with genotyping data, and 461,488 genetic variants were included in the prediction model after standard genetic data quality controls (**Methods**). We found that 68.14% (2,866/4,206) of brain IDPs and 95.27% (161/169) of body IDPs were significantly predicted in the UKB European testing dataset (Benjamini-Hochberg false discovery rate [FDR] < 5%; **Table S2**). Among the IDPs showing significant prediction, the average *R2* was 1.40% (range = [0.10%, 7.62%]) for brain MRI, 0.85% (range = [0.06%, 6.32%]) for heart CMR, 1.80% (range = [0.26%, 5.67%]) for abdominal MRI, and 4.99% (range = [1.10%, 10.01%]) for eye OCT. In the brain, we observed genetically predictable IDPs in both UKB-Oxford and BIG-KP across all brain MRI modalities, and IDPs from the same imaging modality exhibited similar predictive accuracy ranges in the two separate processing pipelines (**Fig. S1A**). Among BIG-KP sMRI regional brain volumes, the highest prediction accuracy was observed for brain stem volume (*R2* = 7.01%). Similarly, the UKB-Oxford sMRI IDP with the highest prediction accuracy (*R2* = 7.62%) was the volume of pons by subcortical volumetric sub-segmentation of the brain stem. Among BIG-KP dMRI DTI parameters, the highest prediction accuracy was the mean fractional anisotropy (FA) of the posterior limb of internal capsule (*R2* = 4.72%). For rfMRI IDPs, the highest prediction accuracy was observed on one ICA-based global functional connectivity trait that captured the central executive, salience, and default mode networks (*R2* = 5.04%). In addition, the IDP with the highest prediction accuracy was the ascending aorta minimum area for heart CMR (*R2* = 6.32%) and the liver iron corrected T1 for abdominal MRI (*R2* = 7.38%). For eye OCT, the thickness from the inner nuclear layer (INL) to the retinal pigment epithelium of the right eye had the highest prediction accuracy (*R2* = 10.01%). UKB has smaller numbers of non-European participants with imaging data. In an exploratory analysis, we found that many developed IGS exhibited predictive utility for individuals of Asian and African ancestries (average *n* = 460 and 252, respectively), although the prediction accuracy was generally smaller in these cohorts (**Figs. 1B-1D**, **S1B- S1D**, and **Tables S3-S4**). To ensure the robustness of our prediction models, we also conducted the same analyses on all 4,375 IDPs using DBSLMM56 and evaluated the consistency of prediction performance (**Figs. S2A-S2B**). Moreover, we performed prediction using imputed genetic data, which required far greater computational resources. We observed similar patterns with improved prediction accuracy (**Figs. S2C- S2D**). In summary, our results provide converging evidence that multi-organ IGS can be reliably developed using different prediction methods and genetic data types, and these IGS can be potentially applied to diverse populations. ### IGS can stratify participants with a higher risk of brain and body diseases Stratifying participants into different disease risk groups and identifying high-risk individuals is crucial in clinical research and has potential for clinical translation57–59. Various factors and biomarkers have been examined for risk stratification, such as disease polygenic risk scores (dPRS)60–63, lifestyle factors64,65, and imaging traits66,67. Here we evaluated the performance of IGS in disease risk stratification for 871 phecode-based68,69 diseases (**Table S5**). Considering the comparable prediction accuracy and association results (**Supplementary Note, Figs. S3-S7,** and **Tables S6-S9**) between BIG-KP and UKB- Oxford brain IGS, we focused on a selected set of 383 multi-modality brain IGS together with 41 abdominal IGS, 82 heart IGS, and 46 eye IGS in the following stratification analysis. For both brain and body IGS, we randomly divided 318,781 unrelated UKB discovery (non- imaging) participants into three groups (after removing the effects of covariates): the lowest 10%, the highest 10%, and the middle 80%. We then compared the difference in disease case rate of these three IGS-stratified groups using a chi-squared test70 (**Methods**). We conducted a stratification analysis of 69 brain disorders using 383 brain IGS, which consisted of 101 sMRI IGS (BIG-KP regional brain volumes33), 110 dMRI IGS (BIG-KP DTI parameters35), and 172 rfMRI IGS (82 ICA-based2 and 90 parcellation-based37 traits). After controlling FDR at 5% (69 × 383 tests), on UKB discovery cohort, we found that 141 brain IGS stratified 25 disorders into groups with significantly different disease case rates, resulting in a total of 294 IGS-disease pairs, and 247 of them were replicated on the UKB replication cohort (*P* range = [5.31 × 10-45, 5.49 × 10-4]) (**Figs. 2A-2C** and **S8,** and **Table S10**, **Methods**). Among them, we found that rfMRI IGS accounted for almost two-thirds of the identified stratifications with brain disorders (167/247, 67.61%). The majority (150/167, 89.82%) of the rfMRI IGS-brain disorder stratifications involved delirium, dementia, and Alzheimer’s disease (IGS tail case ratio range = [1.16, 2.46], *P* range = [5.31 × 10-45, 5.43 × 10-4]) (**Supplementary Note**), and the rest mainly related to multiple sclerosis and cerebrovascular diseases (IGS tail case ratio range = [1.06, 1.92], *P* range = [2.34 × 10-7, 5.06 × 10-4]). Besides, both the ICA-based and parcellation-based rfMRI IGS displayed a consistent pattern in stratifying delirium, dementia, and Alzheimer’s disease, with the ICA-based traits exhibiting higher power. For example, 14 parcellation-based rfMRI IGS had 38 stratification pairs with delirium, dementia, and Alzheimer’s disease, while 39 ICA- based rfMRI IGS had 119 such pairs, representing 67.86% (38/56) and 90.84% (119/131) of all parcellation-based rfMRI and ICA-based stratification results, respectively (**Table S10**). ![Fig. 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F2.medium.gif) [Fig. 2](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F2) Fig. 2 IGS stratification of brain and heart disorders. **(A)-(D)** The IGS-disease pairs after controlling the FDR rate at a 5% level on UKB discovery cohort, replicated on UKB replication cohort, and with IGS tail case ratio greater than 1.1 (brain) or 1.2 (heart) are displayed. Each point represents an IGS-disease pair. The *y*-axis shows the disease name and the associated phecode (for example, the phecode of Alzheimer’s Disease is 290.11). The *x*-axis shows the disease case ratio between the two IGS tails. We show the results for UKB discovery cohort. **(E)-(F)** For the pair between aortic aneurysm (AA) and AAo\_max_area (ascending aorta maximum area), we show the disease percentage (*y*-axis) in quantile-based groups defined by AAo_max_area IGS (*x*- axis) in the UKB discovery and UKB replication cohorts. The horizontal grey dashed line represents the disease percentage for the entire cohort. The IGS derived from dMRI and sMRI also stratified multiple brain disorders. Out of the 110 dMRI IGS, 33 significantly stratified 14 disorders (IGS tail case ratio range = [1.04, 2.10], *P* range = [1.65 × 10-8, 5.12 × 10-4]), corresponding to 54 IGS-disease pairs, including multiple sclerosis, dementia, Alzheimer’s disease, and cerebrovascular diseases such as intracranial hemorrhage, cerebral ischemia, and occlusion of cerebral arteries (**Fig. 2B**). Among the 101 sMRI IGS, 14 significantly stratified 9 disorders (IGS tail case ratio range = [1.09, 1.98], *P* range = [3.50 × 10-11, 5.49 × 10-4]), with a total of 26 IGS-disease pairs (**Fig. 2C**). We found that both dMRI and sMRI IGS showed strong stratifications with multiple sclerosis, a heritable common neurodegenerative disease and a leading cause of nontraumatic disability among young adults71–73. For example, the IGS of the cingulum axial diffusivity was found to have significantly more MS patients in its lower 10% tail than its upper 10% tail (IGS tail case ratio = 1.76, *P* = 3.45 × 10-8), consistent with results of previous association analysis using dMRI data74. Participants with the smallest 10% IGS of the left and right thalamus proper volumes contained 1.73 and 1.96-fold (*P* < 5.37 × 10-5) more MS cases than those with the largest 10% IGS, respectively. Similarly, IGS of the left and right putamen and the accumbens area volumes all stratified MS patients such that the lower 10% IGS tail had a significantly higher case number than the upper 10% IGS tail (IGS tail case ratio range = [1.55, 1.85], *P* range = [2.09 × 10-10, 4.11 × 10-4]). These findings are in line with previous studies identifying decreased brain volume in the thalamus75,76, putamen77, and accumbens78 in multiple sclerosis patients, highlighting the potential of these brain IGS in identifying individuals at high risk for multiple sclerosis without actual imaging data. Body IGS also stratified diseases in their corresponding organs and systems. Using the 41 abdominal IGS, we stratified 104 digestive system diseases and 100 genitourinary system diseases (204 × 41 tests) and identified 280 significant IGS-disease pairs after controlling FDR at 5% level, where 229 of them were replicated on the UKB replication cohort, associated with 37 abdominal IGS and 51 diseases (*P* range = [4.80 × 10-250, 1.66 × 10-3]) (**Fig. S9A**). For example, we found that chronic renal failure patients were significantly stratified by the IGS of kidney volume and kidney parenchyma volume (IGS tail case ratio range = [1.25, 1.37], *P* range = [2.05 × 10-18, 6.42 × 10-10]). The lower 10% IGS tail contained more patients, and indeed the kidney volume and kidney parenchyma volume have been found associated with kidney function (such as positive correlation with as glomerular filtration rate and negative correlation with creatinine)79,80. In addition, patients with chronic liver disease and cirrhosis were significantly stratified by the IGS of several abdominal MRI traits, including liver proton density fat fraction and total abdominal fat (IGS tail case ratio range = [1.22, 1.99], *P* range = [4.88 × 10-40, 1.55 × 10-3]), consistent with previous studies on the association between liver disease risk and liver/abdominal fat81–83. Furthermore, we examined the stratification of 90 circulatory system diseases using 82 heart IGS (90 × 82 tests), discovered 202 IGS-disease pairs after controlling FDR at 5% level, and replicated 180 pairs between 54 heart IGS and 40 diseases (*P* range = [2.40 × 10-34, 1.37 × 10-3]) (**Fig. 2D**). The strongest stratifications were found between aortic aneurysms and the IGS of ascending/descending aorta maximum area, and ascending/descending aorta minimum area (IGS tail case ratio range = [1.32, 1.75], *P* range = [5.71 × 10-14, 6.75 × 10-4]), which is the area of the region where aortic aneurysm occurs84. We found that aortic aneurysm was monotonely stratified across all values of the IGS of the ascending aorta maximum area such that larger IGS corresponded to higher risk (**Figs. 2E-2F**). MRI-measured aortic dimensions are frequently used for diagnosis of aortic aneurysms85, and large aortic diameters are considered abnormal and even diagnostic of aortic aneurysms86. Similarly, we stratified 38 eye and adnexa-related disorders using 46 eye IGS (38 × 46 tests) and replicated 67 of the 107 IGS-disease pairs with significant stratifications, corresponding to 27 eye IGS and 13 eye disorders (*P* range = [5.65 × 10-49, 2.25 × 10-3]) (**Fig. S9B**). For example, we found significantly more glaucoma cases presented in the 10% upper tail of the IGS of the raw vertical cup-to-disc ratio (VCDR) of the left eye and the VCDR regressed on disc diameter87 (IGS tail case ratio range = [1.51, 1.98], *P* range = [3.41 × 10-22, 3.77 × 10-9]). The large value of observed VCDR has been used clinically as a robust indicator for glaucoma88. We also evaluated the performance of IGS in cross-organ applications. First, we stratified 69 brain disorders by the 41 abdominal IGS, 82 heart IGS, and 46 eye IGS. We identified and replicated 19 pairs between 19 IGS (6 heart IGS and 13 abdominal IGS) and 9 brain disorders (*P* range = [4.16 × 10-13, 9.97 × 10-5]) (**Fig. S9C** and **Table S11**). For example, the IGS of the right ventricular ejection fraction had 1.62-fold more multiple sclerosis patients in its lower 10% tail than its upper 10% tail (*P* = 7.03 × 10-6). This result matched previous findings that multiple sclerosis patients had decreased ventricular ejection fractions89,90. Second, we stratified 90 circulatory system diseases by the 383 brain IGS, 41 abdominal IGS, and 46 eye IGS. After controlling FDR at 5% level and replicating on UKB replication cohort, we found that 27 circulatory system diseases were significantly stratified by 13 abdominal IGS, 79 brain IGS, and 3 eye IGS (*P* range = [1.04 × 10-10, 2.78 × 10-4]) (**Fig. S9D**). Among these, peripheral vascular disease was widely stratified by all modalities of brain IGS (IGS tail case ratio range = [1.10, 1.36], *P* range = [2.25 × 10-8, 2.71 × 10-4]), with the majority of signals in dMRI IGS (56.36%, 31/55). For example, the FA IGS of the body of corpus callosum and anterior corona radiata both had more cases in their upper 10% tail, and the mean diffusivity IGS of these two had fewer cases in their upper 10% tail, consistent with prior findings on the associations between dMRI traits and peripheral vascular diseases91. Additionally, the IGS of the thickness of the inner nuclear of both eyes significantly stratified hypertension patients, consistent with the reported associations between higher blood pressure and thicker INL92, as well as our association results between eye IGS and quantitative measures of blood pressure (**Supplementary Note**). ### **IGS broadly contribute to disease stratification in addition to disease genetic risk** Following disease stratification by IGS, we measured the predictive power of IGS for disease by area under the curve (AUC), as AUC was widely used to measure power of PRS93,94 and allowed us to better measure the clinical utility of IGS. For each IGS-disease pair that showed significant stratification (**Figs. 2A-2D** and **S9**, and **Tables S10-S11**), we evaluated the AUC gain by including the IGS for disease prediction, compared to a baseline model. Since dPRS is a frequently used genetic risk measure, we also analyzed the AUC gain from the additional inclusion of IGS when FinnGen95-derived dPRS was already incorporated (**Methods**). As previously shown, brain and body IGS demonstrated marginal stratification capability for diseases in both same-organ and cross-organ applications (**Figs. 2A-2D** and **S10**). To measure the AUC gain from IGS, for each of the 110 diseases significantly stratified by IGS and for each IGS that significantly stratified the disease, we constructed two logistic regression models to measure the AUC gain: a baseline model that predicts disease status using basic covariates only, and an IGS model that additionally uses an IGS. The increment in the AUC of the IGS model relative to the baseline model is defined as the AUC gain of IGS. Out of the 919 pairs IGS-disease pairs showing replicated stratification patterns, 916 pairs had positive AUC gain for disease prediction from the IGS. Comparing the baseline model to the IGS model using DeLong test96, after controlling for FDR at level 5%, 80.52% (740/919) pairs had significant AUC gain with the median being 0.0208 (range = [7.25 × 10-5, 2.11 × 10-1], **Figs. 3A-3C** and **S10**, and **Tables S12-S13**). For example, in **Figure 3A**, the largest AUC gain from brain IGS for brain disorder prediction was achieved by Net25_Node9 IGS for Alzheimer’s disease (AUC gain = 0.0858, *P* = 8.04 × 10-23), whose disease stratification patterns were visualized in **Figure S8**. All dMRI IGS that stratified Alzheimer’s disease in **Figure 2B** also had significant AUC gain (**Fig. 3B**). The IGS of the ascending aorta maximum area predicted aortic aneurysm with AUC gain 0.0462 (*P* = 5.93 × 10-9), the largest among heart IGS that significantly stratified circulatory system diseases (**Fig. 3C**). Overall, the AUC gains of IGS suggest that the majority of disease-stratifying IGS not only stratify subsets of subjects at high risk but also contribute to disease prediction at the cohort level. ![Fig. 3](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F3.medium.gif) [Fig. 3](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F3) Fig. 3 AUC of IGS for disease prediction. **(A)-(C)** AUC gain of disease prediction on UKB discovery cohort by including the stratifying IGS for IGS-disease pairs that were significant on UKB discovery cohort after controlling the FDR rate at a 5% level, replicated on UKB replication cohort, and with IGS tail case ratio greater than 1.1 (brain) or 1.2 (heart). Each point represents an IGS-disease pair. The *y*-axis shows the disease name and the associated phecode. The *x*-axis shows the AUC gain from the IGS model compared with the baseline model. **(D)** AUC gain of disease prediction on UKB discovery cohort by including the most stratifying IGS or all stratifying IGS in the IGS-dPRS model, compared with the baseline-dPRS model. The diseases with IGS stratification patterns visualized in **Figures 3E-3H and S11** was marked by an arrow and text. **(E)-(H)** For four select diseases, for each disease polygenic risk score (dPRS) quantile group (*x*-axis), we show the disease percentage (*y*-axis) for all participants (“All subjects”), participants in the low 10% tail of the prediction of the IGS-dPRS model of the disease (“Prediction lower tail”), and participants in the high 10% tail of the prediction of the IGS-dPRS model (“Prediction higher tail”). We show the results for UKB discovery cohort. We further assessed the contribution of IGS to disease prediction and stratification beyond that provided by dPRS. Briefly, we constructed dPRS using FinnGen GWAS summary statistics of phecode-based diseases. We manually mapped 105 of the 110 UKB diseases that had IGS stratification signals to 78 FinnGen disease endpoints (case number range = [216, 111,581], median = 7,576) following the FinnGen-phecode mapping in Sun et al97 and disease description in FinnGen R995 (**Table S14**). After Bonferroni correction at 5% level, 98 out of the 105 diseases gained significantly in AUC from inclusion of IGS on top of dPRS (AUC gain range = [5.85 × 10-5, 3.16 × 10-2], AUC gain median = 4.26 × 10-3, *P* range = [7.24 × 10-35, 2.27 × 10-3]) (**Fig. 3D** and **Table S15**). The largest AUC gain (0.0316) was achieved for nephrotic syndrome (without mention of glomerulonephritis, phecode 580.20), a kidney disorder associated with excessive protein in the urine. Examining the AUC from dPRS alone and the additional AUC gain from IGS in **Figure 3D**, diseases can be categorized into three types: (1) those with uninformative dPRS and large relative AUC gains from IGS, (2) those with moderate dPRS and moderate AUC gains from IGS, and (3) those with informative dPRS and minimal AUC gains from IGS. This suggests that IGS can help disease stratification, even if it does not always substantially boost the AUC. It makes sense because AUC primarily reflects overall predictive performance across the entire cohort and might not capture variations within specific subgroups (e.g., subjects with high/low risk). Multiple diseases from each category were discussed in **Supplementary Note**, and below we highlight a few examples spanning all categories. For diseases with “uninformative dPRS,” where subjects may have similar disease risks across all dPRS percentile groups, IGS can still effectively stratify these subjects into high and low risk within each dPRS group (**Fig. 3E**). Typically, these diseases lack large GWAS data resources, and incorporating IGS could substantially improve the AUC. For example, the dPRS of nephrotic syndrome had an AUC gain of 0.0083 and was generated using FinnGen GWAS (FinnGen ID: N14_NEPHROTICSYND), which had a limited sample size of 853 patients but was already large among the existing GWAS of this disease98,99. Cerebral degeneration (phecode 331.90) had a dPRS AUC gain of 0.0084 (G6_DEGENOTH) and gained additional AUC of 0.0188 by including three brain IGS. As the predictive performance of dPRS improved, resulting in “moderate dPRS,” the stratification ability of the IGS-dPRS model remained consistent on top of dPRS across almost all dPRS groups (**Figs. 3F-3G**). IGS AUC gain remained nontrivial for some diseases, such as delirium (phecode 290.20, AUC gain 0.0122) and aortic aneurysm (phecode 442.10, AUC gain 0.0101). Furthermore, Alzheimer’s disease and multiple sclerosis had “informative dPRS” such that the dPRS AUC gain was 0.1707 and 0.1503, respectively, and thus the further AUC gain from IGS was significant yet small, both being 0.0022. Nevertheless, individuals with high Alzheimer’s disease dPRS, especially the top 10% individuals with highest dPRS, were stratified into subgroups of distinct disease risk by IGS (**Fig. 3H**). Similar stratification trends from the IGS-dPRS model were observed across multiple sclerosis and various other diseases (**Fig. S11**). In summary, we conducted a comprehensive AUC analysis of IGS prediction of diseases. Most disease-stratifying IGS showed significant AUC contribution, and various diseases, such as nephrotic syndrome, cerebral degeneration, delirium, and aortic aneurysms, had nontrivial AUC gain from IGS in addition to dPRS. Besides, even for diseases with informative dPRS and seemingly minimal AUC gain from IGS, IGS still stratified individuals with high disease genetic risk into subgroups with distinct risk. We will conduct detailed analyses of IGS stratification for Alzheimer’s disease and multiple sclerosis in the next section to provide more insights into the IGS contribution to disease stratification in scenarios with minimal AUC gains. It is important to note that these cases may represent the lower bound of IGS utility, as their contribution would likely be greater if the AUC gains were more substantial. ### IGS stratification of Alzheimer’s disease and multiple sclerosis: in-depth analysis and generalizability As presented earlier, Alzheimer’s disease and multiple sclerosis are the two neurodegenerative disorders that were most frequently stratified by brain IGS on UKB discovery cohort, with 55 and 27 stratifying brain IGS, respectively. In this section, we provide an in-depth analysis of the stratification patterns of these two diseases using all stratifying brain IGS. Furthermore, we constructed IGS for participants from two independent cohorts: the All of Us research program (AOU, *n* = 245,394)100 and Alzheimer’s Disease Neuroimaging Initiative study (ADNI, *n* = 1,152)101 (**Methods**). We showed that the Alzheimer’s disease stratification patterns observed on UKB can be replicated on AOU and ADNI. Moreover, we considered *APOE* status, and found that IGS can effectively stratify Alzheimer’s disease among participants of same *APOE* haplotypes. *APOE* is known to be the strongest genetic factor risk for Alzheimer’s disease and its homozygote is considered a form of Alzheimer’s disease102. Similar to Alzheimer’s disease, multiple sclerosis stratification patterns on UKB were replicated on AOU as well. We have observed the marginal stratification of Alzheimer’s disease on UKB participants by many brain IGS (**Figs. 2A-2B**). To stratify Alzheimer’s disease using all the stratifying IGS, we constructed an IGS burden score by counting the number of IGS high-risk tails a subject was in, across the 55 IGS that were marginally stratifying (**Methods**). As expected, Alzheimer’s disease risk rose as IGS burden score increased (**Fig. S12**). Two additional risk factors, age and dPRS, were then evaluated. Ageing is known to be closely related to Alzheimer’s disease progression103,104. Upon grouping participants by age, participants of similar age were further stratified into “lower IGS burden score” or “higher IGS burden score” groups if their IGS burden scores fell within the lower or upper 20% extremes for their respective stratum. We found a consistent rise in the percentage of disease cases with increasing age. Within each age group, particularly beginning from around age 60 on the UKB cohorts and from around age 65 on the AOU and ADNI cohorts, participants with higher IGS burden score always had much higher disease percentage than those with lower IGS burden score (**Figs. 4A-4D** and **Table S16**). ![Fig. 4](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F4.medium.gif) [Fig. 4](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F4) Fig. 4 IGS stratification of Alzheimer’s disease across age and disease polygenic risk score groups. **(A)-(D)** For each age group (*x*-axis), we show the Alzheimer’s disease (AD) or dementia percentage (*y*-axis) for all participants (“All subjects”), participants in the low 20% tail of IGS burden score (“Lower IGS burden score”), and participants in the high 20% tail of IGS burden score (“Higher IGS burden score”). **(E)-(H)** For each dPRS quantile group (*x*-axis), we show the AD or dementia percentage (*y*-axis) for all participants (“All subjects”), participants in the low 20% tail of IGS burden scores (“Lower IGS burden score”), and participants in the high 20% tail of IGS burden scores (“Higher IGS burden score”). We show the results for UKB discovery (left panels), UKB replication (middle-left panels), AOU European (middle-right panels), and ADNI (right panels) cohorts. For Alzheimer’s disease dPRS, we first verified that the FinnGen-derived dPRS effectively stratified patients in UKB, AOU, and ADNI cohorts (**Fig. S13**). We found that the IGS burden score further stratified participants from the same dPRS percentile group. For example, in **Figure 4E**, among the 31,797 participants with the top 10% dPRS, 2.38% had developed Alzheimer’s disease. The subgroup with higher IGS burden scores had an disease percentage of 3.09% and the one with lower IGS burden scores had a disease percentage of 1.85%. That is, inside this dPRS group, the higher IGS burden score subgroup contained 1.67-fold of cases, compared with the lower IGS burden score subgroup (*P* = 2.58 × 10-6). Similar patterns were observed in UKB replication and AOU cohorts (**Figs. 4F-4G**). In ADNI, we also found that within each dPRS percentile group, the subgroup of higher IGS burden score always had a higher dementia percentage than the lower IGS burden score subgroup (**Fig. 4H**). These results demonstrated the contribution of IGS stratification on top of dPRS and the generalizability of UKB-derived IGS on external genetic cohorts. We then examined *APOE* and found the IGS stratification power was not dominated by *APOE* status. In the UKB discovery cohort, we focused on subjects of age at least 55, and belong to one of the four common APOE haplotypes (*n* = 202,304): ε2 carriers, ε3 homozygotes, ε4 heterozygotes, and ε4 homozygotes (**Methods**). ε3 homozygotes were the most common *APOE* haplotype, ε2 carriers had lower disease risk, ε4 heterozygotes had higher risk, and ε4 homozygotes were recently considered a genetic form of Alzheimer’s disease102. ε4 heterozygotes and ε4 homozygotes were also combined into ε4 carriers. In **Figure 5A**, we found that IGS burden score stratified disease risk among participants of same *APOE* haplotypes, and IGS stratification power remained when both disease dPRS and *APOE* status were considered (**Figs. 5B-5F**). These results demonstrated that IGS stratification of Alzheimer’s disease was not simply driven by *APOE*, suggesting that IGS could be useful for risk stratification when combined with other genetic risk factors. ![Fig. 5](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F5.medium.gif) [Fig. 5](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F5) Fig. 5 IGS stratification of Alzheimer’s disease on the basis of *APOE* status and disease polygenic risk score. **(A)** For each APOE group (*y*-axis), we show the relative change in Alzheimer’s Disease (AD) percentage (*x*-axis) in the low 20% tail of IGS burden score (“Lower IGS burden score”) and in the high 20% tail of IGS burden score (“Higher IGS burden score”) compared to the overall AD percentage of the APOE group (decimals at *x*=0). **(B)-(F)** For five *APOE* groups, for each dPRS quantile group (*x*-axis), we show the AD percentage (*y*-axis) for all participants (“All subjects”), participants in the low 20% tail of IGS burden score (“Lower IGS burden score”), and participants in the high 20% tail of IGS burden score (“Higher IGS burden score”). We show the results for the UKB discovery cohort. Brain IGS and burden scores also stratified multiple sclerosis on both UKB and AOU cohorts. For example, the IGS of the left putamen volume had 1.85-fold of multiple sclerosis patients in its lower 10% tail than its upper 10% tail (**Fig. 2C** and **Table S10**). Decreased putamen volume was observed among multiple sclerosis patients in previous studies77. Further examining participants across all IGS percentile groups, we found that both UKB discovery participants and AOU European participants had roughly monotone decreasing disease percentage as the left putamen volume IGS increased, with the monotone trend more visible among females (**Figs. 6A-6B** and **Table S17**). We then included dPRS as an additional risk factor to stratify multiple sclerosis. Similar to Alzheimer’s disease dPRS, we first verified that multiple sclerosis dPRS were able to stratify multiple sclerosis patients in UKB and AOU cohorts (**Fig. S14**). IGS burden score was then constructed and was able to further stratify participants of similar dPRS into subgroups of different disease percentage (**Methods**). Among both UKB and AOU participants, further stratification of multiple sclerosis by IGS burden score on top of dPRS was observed, and the stratification was weaker on non-European cohorts as expected (**Figs. 6C-6F**). The multiple sclerosis stratification patterns on UKB and AOU using UKB- derived IGS provided evidence that stratification patterns by UKB-derived IGS could generalize well to external cohort, especially when the ancestry of the external cohort is close the IGS development cohort. We also examined stratification of bipolar disorder in the AOU study (**Supplementary Note** and **Table S18**), which represents a scenario where the external cohort has a much larger number of disease cases compared to the UKB study. ![Fig. 6](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/06/03/2023.04.18.23288769/F6.medium.gif) [Fig. 6](http://medrxiv.org/content/early/2024/06/03/2023.04.18.23288769/F6) Fig. 6 IGS stratification of multiple sclerosis. **(A)-(B)** For the pair between multiple sclerosis (MS) and left.putamen (left putamen volume), we show the disease percentage (*y*-axis) in quantile-based groups defined by left.putamen IGS (*x*-axis) in the UKB discovery and AOU European cohorts. The horizontal grey dashed line represents the disease percentage for the entire cohort. **(C)-(E)** For each dPRS quantile group (*x*-axis), we show the MS percentage (*y*-axis) for all participants (“All subjects”), participants in the low 20% tail of IGS burden scores (“Lower IGS burden score”), and participants in the high 20% tail of IGS burden scores (“Higher IGS burden score”). We show the results for UKB discovery, UKB replication, AOU European, and AOU non-European cohorts. ## DISCUSSION In this study, we developed 4,375 brain and body IGS and evaluated their performance in disease risk stratification for UKB participants without imaging data and external participants from UKB-independent studies. We confirmed that these IGS can provide biologically relevant information to complex traits and diseases. Brain and body IGS effectively stratified the risk of various diseases not only within their respective organ systems but also in other organs, such as the pair of right ventricular ejection fraction and multiple sclerosis. These results reflect both intra- and inter-organ connections between IGS and clinical outcomes. We also conducted AUC analysis of IGS and found IGS can stratify disease risk on top of disease genetic risk score, regardless of the overall AUC gain. These results highlight differences in AUC measures and stratification capabilities, suggesting the potential of IGS for clinical utility. We conducted a thorough analysis of Alzheimer’s disease, detailing age-related trends, replication of findings in both UKB and independent cohorts, and the distinct contributions of IGS beyond dPRS and *APOE*. Our IGS data resources have been made publicly available, facilitating similar in-depth analyses for numerous brain and somatic diseases. We also provide a second example of this application to multiple sclerosis. Ageing is well-recognized for its close association with the progression of Alzheimer’s disease. Our findings indicate that IGS burden score consistently stratifies individuals at higher risk of Alzheimer’s disease in those aged over 55 in UKB and over 65 in AOU and ADNI (**Figs. 4A-4D**). The IGS burden score was derived from multiple brain MRI IDPs, many of which have been previously associated with Alzheimer’s disease in studies with actual imaging. Given that IGS was constructed solely from genotyping data, it can be readily applied to participants with genetic data, enabling a quick and efficient assessment of disease risk in older individuals, without undergoing real imaging data collection. We further demonstrated that IGS, when combined with dPRS, offered a multi-layered stratification of disease risk, which had better performance than using only the disease genetic risk score (**Figs. 4E-4H**). Moreover, IGS contributed to Alzheimer’s disease stratification when both *APOE* status and dPRS were considered (**Figs. 5A-5F**), suggesting stratification power of IGS was not purely driven by *APOE*. These results indicate that although the UKB imaging study, with its largely healthy participant base, may not have direct proven applicability for disease risk management, it possesses the capacity to stratify disease risk in broader external genetic cohorts. We anticipate that the IGS developed in our study can be widely applied in the stratification and management of Alzheimer’s disease risk. When real imaging data are not available, IGS can be used as genetically predicted variables for brain and body structures and functions. In addition to contribute to disease stratification and prediction, established IGS provides an objective pseudo-imaging biomarker associated with a specific organ region, which may serve as informative measure of diseases such as early-stage Alzheimer’s disease105,106, early-stage multiple sclerosis107,108, and bipolar disorder109. Furthermore, IGS may also contribute to ongoing efforts on disease subtyping and integration110,111. However, as shown in our prediction analysis, IGS was typically only able to partially reconstruct the imaging phenotypes. It has generally been observed that IGS has demonstrated imperfect performance in most complex traits and diseases, which can be attributed to many factors, including a limited number of training samples, heritability, and weak genetic effects112–114. Another challenge in IGS applications lies in ancestry and population differences115–117. As the current UKB imaging cohort had the majority of the participants of European ancestry, generating IGS in non-UKB and/or non-European studies may have further reduced performance113. In our UKB replication cohort, comprising non-British and non-European participants, we observed that the IGS demonstrated consistent performance similar to that in the discovery cohort (for example, **Figs. 4B, 4F,** and **6E**). More powerful IGS methods that better account for these IGS limitations and cohort differences may result in more informative IGS for potential clinical applications. In addition, we expect to see improvement in IGS prediction accuracy and stratification when more imaging samples and IDPs are released by the UKB and other large imaging genetic studies. ## METHODS Methods are available in the ***Methods*** section. ## Supporting information supplementary_info [[supplements/288769_file03.pdf]](pending:yes) supplementary_table [[supplements/288769_file04.zip]](pending:yes) ## METHODS ### Imaging traits The imaging data used in our study were obtained from the UKB study, which recruited around 500,000 individuals between the ages of 37 and 73 (229,114 males) between 2006 and 2010118 ([https://www.ukbiobank.ac.uk/](https://www.ukbiobank.ac.uk/)). The ethics approval of the UKB study was obtained from the North West Multicentre Research Ethics Committee (approval number: 11/NW/0382). We used a total of 4,206 brain MRI IDPs, 41 abdominal MRI IDPs, 82 heart CMR IDPs, and 46 eye OCT IDPs. The brain MRI IDPS consisted of 301 BIG-KP33,35,37 and 3,905 UKB-Oxford1,2,34 traits. The BIG-KP traits were from our previous studies on three brain MRI modalities. First, we obtained 101 regional brain volumes33 from sMRI by applying the advanced normalization tools45. Second, we generated 110 tract-averaged DTI parameters35 from dMRI using the ENIGMA-DTI pipeline46,47. Third, for rfMRI, we partitioned the cerebral cortex into 360 brain areas using the Glasser360 parcellation50. We obtained 90 functional activity (amplitude) and functional connectivity (edge) traits37 for 12 functional networks119. The UKB-Oxford had 1,437 IDPs from sMRI (including susceptibility-weighted structural imaging), 675 from dMRI, 1,777 from rfMRI, and 16 from task-based tfMRI. Specifically, the sMRI IDPs consisted of FIRST (Category 1102), FAST (Category 1101), FreeSurfer ASEG (Category 190), FreeSurfer BA exvivo (Category 195), FreeSurfer a2009s (Category 197), FreeSurfer DKT (Category 196), FreeSurfer desikan gw (Category 194), FreeSurfer desikan pial (Category 193), FreeSurfer desikan white (Category 192), FreeSurfer subsegmentation (Category 191), regional T2* (Category 109), and white matter hyperintensity volume (Category 112). The 675 dMRI IDPs included 432 TBSS-processed IDPs from Category 134 and 243 ProbtrackX-processed IDPs from Category 135. The 1,777 rfMRI IDPs consisted of 76 activity amplitude (node) traits and 1,701 functional connectivity (edge) traits (Category 111). They were parcellation- free and generated by whole brain spatial ICA1,2,48,49. Lastly, there were 16 tfMRI IDPs from Category 106. The details of the image acquisition, preprocessing procedures, and quality controls were available in the UKB Brain Imaging Documentation ([https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf](https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf)). The 41 abdominal MRI IDPs53,120–128 were from abdominal organ composition (Category 158), kidney-derived measures (Category 159), liver MRI (Category 126), and abdominal composition (Category 149). For the 82 heart CMR IDPs, the details of the image acquisition, preprocessing procedures, and quality controls were described in previous studies3,29. The 46 eye OCT IDPs were all from derived OCT measures40 (Category 100079). A list of these IDPs can be found in **Table S1**. ### IGS and dPRS constructions We performed the following genetic quality controls (QCs) for the set of participants with both IDP and genetic data37: 1) removed individuals with missing genotype rate > 0.1; 2) removed variants with missing genotype rate > 0.1; 3) removed variants with minor allele frequency (MAF) < 0.01; and 4) removed variants that failed the Hardy-Weinberg test for equilibrium at 1 × 10-7 level. Using individuals of British European ancestry, the GWAS was performed using linear mixed effect models via fastGWA129 (average *n* = 34,286 for brain, average *n* = 31,875 for heart, average *n* = 39,830 for abdomen, and average *n* = 54,761 for eye). The adjusting covariates included age (at imaging), age-squared, sex, the interaction between age and sex, the interaction between age-squared and sex, first 40 genetic principal components130 (PCs) for all organs; and additionally the estimated total intracranial volume (eTIV), head motion measurements and their squares, brain position measurements and their squares, and volumetric scaling for brain IDPs. Additionally, for BIG-KP regional brain volumes, the total brain volume (TBV) was included as an adjusting covariate to remove global effects33. For TBV, the eTIV and volumetric scaling were not included as covariates due to their high linearity. With the GWAS summary statistics as input, we used PRS-CS55 and DBSLMM56 to obtain the weights for IGS. The hyperparameters of both methods were the default values and/or the automatically tuned values. We then used PLINK131 to generate risk scores in testing data by summarizing across genetic variants, weighed by their weights. The prediction accuracy of IGS was measured by the incremental *R2*, which was the additional phenotypic variation that can be explained by the IGS while adjusting for the effects of covariates in a linear regression model. The covariates included age, age- squared, sex, the interaction between age and sex, the interaction between age-squared and sex, and the first 40 genetic PCs. The prediction accuracy was estimated separately in three independent hold-out datasets corresponding to European, Asian, and African ancestries. The European hold-out dataset consisted of UKB individuals of British or non- British European ancestry with IDP data and unrelated to the IGS training set used to generate IDP GWAS (average *n* = 4,541). The Asian hold-out dataset consisted of UKB individuals of Bangladeshi, Chinese, Indian, or Pakistani ancestry with IDP data (average *n* = 460). The African hold-out dataset consisted of UKB individuals of African or Caribbean ancestry with IDP data (average *n* = 252). We also used PRS-CS55 to construct dPRS. Out of the 110 phecode-based diseases that had IGS stratification signals on UKB (**Tables S10-S11**), we mapped 105 diseases to 75 FinnGen R9 disease endpoints that had publicly available GWAS summary statistics (**Table S14**). We used the FinnGen-phecode mapping in Sun et al97 and disease description from the FinnGen release R9. For the bipolar disorder analysis on AOU, we used GWAS summary statistics from the Psychiatric Genomics Consortium132 to construct BD dPRS. ### IGS-phenotype associations We used a discovery-replication design to examine associations between IGS and phenotypes in UKB participants without brain or body IDPs. We randomly selected 70% of UKB unrelated British European individuals (average *n* = 202,893) as the discovery dataset for IGS-phenotype associations, while the remaining 30% of UKB unrelated British European individuals, all UKB unrelated non-British European individuals, and all unrelated non-European individuals (average *n* = 129,333) were used as the replication dataset. We treated the values greater than five times the median absolute deviation from the median as outliers and removed these values. A total of 189 UKB phenotypes were tested, which represented a wide range of traits from various domains. Specifically, the 189 UKB phenotypes included 24 mental health traits (Category 100060), 5 cognitive traits (Category 100026), 12 physical activity traits (Category 100054), 6 electronic device use traits (Category 100053), 8 sun exposure traits (Category 100055), 3 sexual factor traits (Category 100056), 3 social support traits (Category 100061), 21 diet traits (Category 100052), 9 alcohol drinking traits (Category 100051), 6 smoking traits (Category 100058), 34 blood biochemistry biomarkers (Category 17518), 3 blood pressure traits (Category 100011), 3 spirometry traits (Category 100020), 17 early life factors (Categories 135, 100033 and 100072), 9 greenspace and coastal proximity (Category 151), 2 hand grip strength (Category 100019), 13 residential air pollution traits (Category 114), 5 residential noise pollution traits (Category 115), 2 body composition traits by impedance (Category 100009), 3 female specific factors (Category 100069), and 1 education trait (Category 100063) (**Table S6**). Association testing was then conducted to examine the linear relationship between the 4,375 IDP-derived IGS and the 189 UKB phenotypes. We adjusted for the same set of covariates separately in the discovery set and the replication set, including age, age- squared, sex, the interaction between age and sex, the interaction between age-squared and sex, and 40 genetic PCs. Specifically, we regressed the IDP-derived IGS onto the UKB phenotypes and calculated *P*-values using a two-sided t-test. We prioritized the results that met the following three criteria: 1) significant at 0.05 level after Bonferroni correction in the discovery dataset; 2) significant at a nominal significance level of 0.05 in the replication dataset; and 3) had regression coefficients with matching directions in both the discovery and replication datasets. ### Disease risk stratifications We used IGS to stratify 871 phecode-based68,69 diseases (**Table S5**) separately on the UKB discovery cohort and the UKB replication cohort. The UKB discovery cohort consisted of unrelated British European participants without brain or body IDPs (average *n* = 318,781). The UKB replication cohort consisted of unrelated non-British European participants and unrelated non-European participants, all without brain or body IDPs (average *n* = 48,015). In each cohort, we residualized IGS with the same covariates used in the association analysis, standardized the residualized IGS to mean 0 and standard deviation 1, sorted all participants according to the value of the standardized IGS, and split participants into three groups: the group of smallest 10% IGS, the group of largest 10% IGS, and the group of middle 80% IGS. The group of the smallest 10% IGS was referred to as the lower 10% tail, and the group of the largest 10% IGS was referred to as the upper 10% tail. For each disease, the number of disease cases was counted, and a chi-squared test of three levels of IGS-based groups (the smallest 10%, middle 80%, and the largest 10%) and two levels of disease status (case/control) was conducted. The original IGS tail case ratios between the two 10% tails were computed for all IGS-disease pairs and were reported in supplementary tables (**Tables S10-S11**), which were defined as (the number of patients in IGS upper 10% tail) / (the number of patients in IGS lower 10% tail). In **Figures 2** and **S9**, we visualized the maximum of the original IGS tail case ratio and its inverse, that is, a number always greater than or equal to 1. We also only visualized the IGS-disease pairs whose IGS tails contained above-average disease patients. That is, one or both of the two 10% tails of IGS contained at least 10% of all cases on the UKB discovery cohort. In total, 6 pairs between brain IGS and brain/mental disorders, 3 pairs between abdominal IGS and genitourinary/digestive diseases, 3 pairs between heart IGS and circulatory system diseases, and 2 pairs between brain/abdominal/eye IGS and circulatory systems diseases were not visualized because of this. For Alzheimer’s disease/dementia analysis on AOU, unlike UKB age distribution (**Fig. S15A**), AOU had a bimodal distribution (**Fig. S15B**), and thus we restricted our AOU analysis to participants of age between 60 and 90 (at the age of incidence for patients and at the time of survey otherwise), and only included those self-identified as European (‘White’) (*n* = 59,261, 26,881 males at birth). For the Alzheimer’s disease/dementia patients, we used age at incidence for the analysis in **Figure 4C**, and age at survey for the controls. Due to the lower disease percentage in AOU, we used both Alzheimer’s disease (ICD-10 code: G30) and dementia (ICD-10 code: F01-F03) for cases, and the rest as controls. For multiple sclerosis analysis on AOU, we used all participants self-identified as European (*n* = 128,515, 51,390 males at birth, age range = [17, 113]) and all participants self-identified as African (‘Black’), or Hispanic, or Asian(African *n* = 50,139, 21,684 males at birth, age range = [17, 104]; Hispanic *n* = 41,575, 13,461 males at birth, age range = [17, 100]; Asian *n* = 7,603, 3,066 males at birth, age range = [17, 100]). For bipolar disorder analysis on AOU, we used all participants self-identified as European. Multiple sclerosis patients were those diagnosed with ICD-10 G35, and bipolar disorder patients were those diagnosed with ICD-10 F31. Using PLINK, we developed IGS and dPRS using the Allele Count/Allele Frequency threshold genotyping data prepared by AOU and the UKB weights generated from PRS-CS. We removed the effects of age, sex and 16 genetic PCs from IGS and dPRS, where the genetic PCs were prepared by AOU as well133. For ADNI, we used participants of age at least 60 (*n* = 1,116, 656 males, age range = [60, 91]), the participants whose final diagnosis was dementia were used as cases and the rest as controls (dementia percentage = 42.92%). After performing the standard genetic QCs33, we developed the IGS and dPRS using UKB-derived weights and removed the effects of age and sex . ### AUC analysis To measure the contribution from a single IGS to disease prediction, we trained the following logistic regression models: (1) a baseline model that predicts disease status using age and sex as predictors, and (2) an IGS model that predicts disease status using age, sex, and one IGS. We constructed an IGS model for each IGS-disease pair that had significant stratification (**Tables S12-S13**), a total of 919 IGS models. The increment in the AUC of the IGS model compared with the baseline model is the AUC gain by including a single IGS and visualized in **Figures 3A-C and S9**. DeLong test96 was used to test for the difference in AUC between the baseline model and the IGS model. All AUC was obtained using case-control matched individuals. For each disease, each patient was matched with five participants free of this disease134, of same age and sex, and of British European ancestry. To measure the contribution from IGS when dPRS was included, we considered two logistic regression models for disease status prediction: (1) a baseline-dPRS model that included age, sex, and the corresponding dPRS, and (2) an IGS-dPRS model that included age, sex, dPRS, and IGS that marginally stratified this disease. The IGS-dPRS model was selected from the better one between using all stratifying IGS and using only the IGS with the largest stratification tail case ratio. The difference between the AUC of the IGS-dPRS model and the AUC of the baseline-dPRS model was the measure of IGS contribution to disease prediction given dPRS. We averaged the AUC across repeated 100 runs with two- fold cross-validation. The increment in the average AUC of the IGS-dPRS model compared with the average AUC of the baseline-dPRS model is the AUC gain visualized in **Figure 3D**. Wilcoxon rank sum test was used to test for the difference in AUC between the baseline- dPRS model and the IGS-dPRS model across all cross-validation replications. ### IGS burden scores The brain IGS for Alzheimer’s disease stratification on UKB cohorts was selected according to four criteria below: (1) on the UKB discovery cohort, the IGS stratified disease cases so that the chi-squared test was significant after controlling the FDR at a 5% level; (2) the IGS had one of its two 10% tails containing at least 10% of all disease cases of UKB discovery cohort; (3) the IGS had one of its two 10% tails containing at most 10% of all disease cases of UKB discovery cohort; and (4) the IGS tail case ratio direction was consistent between the UKB discovery cohort and the UKB replication cohort (that is, the 10% tail that contained more disease cases was consistent on both cohorts). There were 55 IGS that satisfied these conditions and were selected for Alzheimer’s disease stratification (**Table S16**). In **Figures 4A-4B**, **4E-4F** and **5**, within each group (defined by age, or by dPRS, or by *APOE* status, or by both *APOE* status and dPRS), brain IGS were selected with the following two criteria: (1) the IGS was selected for disease stratification on the entire cohort (that is, among the 55 selected IGS above); and (2) the 10% IGS tail in the group that contained more cases was also the 10% IGS tail that had contained more cases on the entire cohort (that is, the IGS high-risk tail that contained more disease cases remained consistent in this age/dPRS/*APOE*-defined group). IGS burden score of a participant was the number of “IGS high-risk tails” a participant was in. The “IGS high-risk tail” for IGS burden score was the 20% top tail of an IGS that contained more patients compared to the lower 20% tail. We constructed an IGS burden score by counting the number of IGS high-risk tails a subject was in, across the selected IGS. For example, if a subject was ranked in the top 20% for five out of the selected brain IGS, they would be assigned an IGS burden score of five. As IGS burden score is an integer as small as zero, for each group, we selected the “Low IGS burden score” to be the smallest nonnegative integer values that corresponded to around 20% of all participants in this group, and the “Higher IGS burden score” to be the largest integer values that corresponded to around 20% of all participants in this group. Within a group, the participants with IGS burden score among the “Lower IGS burden score” values were categorized as “Lower IGS burden score”, and likewise, those with IGS burden score among the “Higher IGS burden score” values were categorized as “Higher IGS burden score”. Due to smaller sample size and higher dementia percentage in ADNI, ADNI IGS burden score used 10% tail for “IGS high-risk tail”. Besides, ADNI had “Lower IGS burden score” cutoff at 10% for age analysis (**Fig. 4D**). Similar to UKB analysis, we grouped AOU participants of European ancestry into different groups (by age as in **Fig. 4C** or by dPRS as in **Fig. 4G**). For each stratum, we again calculated the IGS tail case ratio for the 55 selected IGS and only kept the IGS that had case ratio direction consistent with the UKB discovery cohort, and categorized participants inside each stratum into “Lower IGS burden score” and “Higher IGS burden score”. Similar analysis was performed for ADNI participants (**Figs. 4D** and **4H**). For multiple sclerosis stratification on UKB and AOU, similar to Alzheimer’s disease, we applied the following criterion to select the brain IGS for IGS burden score: (1) on the UKB discovery cohort, the IGS stratified multiple sclerosis cases so that the chi-squared test was significant after controlling the FDR at a 5% level; (2) the IGS had one of its two 10% tails containing at least 10% of all multiple sclerosis cases of UKB discovery cohort; (3) the IGS had one of its two 10% tails containing at most 10% of all multiple sclerosis cases of UKB discovery cohort; and (4) the IGS tail case ratio direction was consistent between the UKB discovery cohort and the UKB replication cohort (that is, the 10% tail that contained more cases was consistent on both cohorts). There were 25 IGS that satisfied these conditions and were selected for IGS burden score of multiple sclerosis (**Table S17**), and then stratification within group defined by multiple sclerosis dPRS were carried out using IGS burden score, similar to Alzheimer’s disease analysis described above. ### *APOE* status We extracted *APOE* genotypes for UKB participants based on the genetic variants rs429358 and rs7412. Following prior studies on *APOE*135, we considered five common *APOE* genotypes (ε4ε4, ε3ε4, ε3ε3, ε2ε3 and ε2ε2) and mapped the alleles of rs429358 and rs7412 as follows: rs429358 CC and rs7412 CC to ε4ε4, rs429358 TC and rs7412 CC to ε3ε4, rs429358 TT and rs7412 CC to ε3ε3, rs429358 TT and rs7412 TC to ε2ε3, and rs429358 TT and rs7412 TT to ε2ε2. Here ε4ε4 are ε4 homozygotes, ε3ε4 are ε4 heterozygotes, ε3ε3 are ε3 homozygotes, and ε2ε2 and ε2ε3 are ε2 carriers. We also combined ε4 homozygotes and ε4 heterozygotes as “ε4 carriers” in **Figures 5A and 5E**. ### Code availability We made use of publicly available software and tools. The list of genetic variants and their weights used for the construction of IGS for UKB brain and body IDPs are available at [https://github.com/xcyang17/IPRS\_UKB](https://github.com/xcyang17/IPRS_UKB). ### Data availability The individual-level data used in this study can be obtained from [https://www.ukbiobank.ac.uk/](https://www.ukbiobank.ac.uk/) and [http://adni.loni.usc.edu/data-samples/](http://adni.loni.usc.edu/data-samples/). The disease GWAS data can be downloaded from [https://www.finngen.fi/en/access\_results](https://www.finngen.fi/en/access_results) and [https://figshare.com/articles/dataset/bip2021_noUKBB/22564402](https://figshare.com/articles/dataset/bip2021_noUKBB/22564402). Our IGS data resources can be downloaded at Zenodo ([https://doi.org/10.5281/zenodo.7709788](https://doi.org/10.5281/zenodo.7709788)). ## ACKNOWLEDGEMENTS Research reported in this publication was supported by the National Institute On Aging of the National Institutes of Health under Award Number RF1AG082938. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The study has also been partially supported by funding from the Wharton Dean’s Research Fund, Analytics at Wharton, and Purdue Statistics Department. This research has been conducted using the UK Biobank resource (application number 76139), subject to a data transfer agreement. We would like to thank the individuals who represented themselves in the UK Biobank for their participation and the research teams for their efforts in collecting, processing, and disseminating these datasets. We gratefully acknowledge all the studies and databases that made GWAS summary-level data publicly available. We would like to thank the research computing groups at Purdue University, the University of North Carolina at Chapel Hill, and the Wharton School of the University of Pennsylvania for providing computational resources and support that have contributed to these research results. ## Footnotes * # *Lead contact:* Bingxin Zhao 413 Academic Research Building 265 South 37th Street, Philadelphia, PA 19104. E-mail: bxzhao{at}upenn.edu Phone: (215) 898-8222 * We have included more data analyses to illustrate the power of imaging genetic scores and replicate the findings in more genetic cohorts. * Received April 18, 2023. * Revision received June 2, 2024. * Accepted June 3, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## REFERENCES 1. 1.Alfaro-Almagro, F. et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. NeuroImage 166, 400–424 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.NEUROIMAGE.2017.10.034&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 2. 2.Elliott, L.T. et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature 562, 210–216 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0571-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305740&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 3. 3.Bai, W. et al. A population-based phenome-wide association study of cardiac and aortic structure and function. Nature Medicine 26, 1654–1662 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-1009-y&link_type=DOI) 4. 4.Littlejohns, T.J. et al. The UK Biobank imaging enhancement of 100,000 participants: rationale, data collection, management and future directions. Nature communications 11, 1–12 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-15305-w&link_type=DOI) 5. 5.Miller, K.L. et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nature Neuroscience 19, 1523–1536 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nn.4393&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27643430&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 6. 6.Smith, S.M. & Nichols, T.E. Statistical challenges in “big data” human neuroimaging. Neuron 97, 263–268 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2017.12.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29346749&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 7. 7.Cole, J.H. Multimodality neuroimaging brain-age in UK biobank: relationship to biomedical, lifestyle, and cognitive factors. Neurobiology of aging 92, 34–42 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neurobiolaging.2020.03.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32380363&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 8. 8.Cox, S.R. et al. Ageing and brain white matter structure in 3,513 UK Biobank participants. Nature communications 7, 13629 (2016). 9. 9.Peng, H., Gong, W., Beckmann, C.F., Vedaldi, A. & Smith, S.M. Accurate brain age prediction with lightweight deep neural networks. Medical image analysis 68, 101871 (2021). 10. 10.Kweon, H. et al. Human brain anatomy reflects separable genetic and environmental components of socioeconomic status. Science advances 8, eabm2923 (2022). 11. 11.Spreng, R.N. et al. The default network of the human brain is associated with perceived social isolation. Nature communications 11, 6393 (2020). 12. 12.12. Azevedo, T., et al. Identifying healthy individuals with Alzheimer neuroimaging phenotypes in the UK Biobank. medRxiv (2022). 13. 13.Becker, J. et al. Resource profile and user guide of the Polygenic Index Repository. Nature human behaviour 5, 1744–1758 (2021). 14. 14.Lewis, C.M. & Vassos, E. Polygenic risk scores: from research tools to clinical instruments. Genome medicine 12, 1–11 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13073-019-0703-1&link_type=DOI) 15. 15.Torkamani, A., Wineinger, N.E. & Topol, E.J. The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics 19, 581–590 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-018-0018-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29789686&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 16. 16.Rietveld, C.A. et al. Replicability and robustness of genome-wide-association studies for behavioral traits. Psychological science 25, 1975–1986 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0956797614545132&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25287667&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 17. 17.Choi, S.W., Mak, T.S.-H. & O’Reilly, P.F. Tutorial: a guide to performing polygenic risk score analyses. Nature protocols 15, 2759–2772 (2020). 18. 18.Wray, N.R., Goddard, M.E. & Visscher, P.M. Prediction of individual genetic risk to disease from genome-wide association studies. Genome research 17, 1520–1528 (2007). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjEwOiIxNy8xMC8xNTIwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDYvMDMvMjAyMy4wNC4xOC4yMzI4ODc2OS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 19. 19.Consortium, I.S. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature08185&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19571811&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268670300041&link_type=ISI) 20. 20.Kullo, I.J. et al. Polygenic scores in biomedical research. Nature Reviews Genetics 23, 524–532 (2022). 21. 21.Chatterjee, N., Shi, J. & García-Closas, M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nature Reviews Genetics 17, 392 (2016). 22. 22.Ma, Y. & Zhou, X. Genetic prediction of complex traits with polygenic scores: a statistical review. Trends in Genetics 37, 995–1011 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.tig.2021.06.004&link_type=DOI) 23. 23.Wand, H. et al. Improving reporting standards for polygenic scores in risk prediction studies. Nature 591, 211–219 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03243-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33692554&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 24. 24.Uffelmann, E. et al. Genome-wide association studies. Nature Reviews Methods Primers 1, 1-21 (2021). 25. 25.Lambert, S.A. et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics 53, 420–425 (2021). 26. 26.Zhao, B. et al. Heritability of regional brain volumes in large-scale neuroimaging and genetic studies. Cerebral Cortex 29, 2904–2914 (2018). 27. 27.Kochunov, P. et al. Homogenizing estimates of heritability among SOLAR-Eclipse, OpenMx, APACE, and FPHI software packages in neuroimaging data. Frontiers in Neuroinformatics 13, 16 (2019). 28. 28.Thompson, P.M. et al. ENIGMA and global neuroscience: A decade of large-scale studies of the brain in health and disease across more than 40 countries. Translational psychiatry 10, 1–28 (2020). 29. 29.Zhao, B. et al. Heart-brain connections: Phenotypic and genetic insights from magnetic resonance images. Science 380, abn6598 (2023). 30. 30.Colclough, G.L. et al. The heritability of multi-modal connectivity in human brain activity. Elife 6, e20178 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.20178&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28745584&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 31. 31.Busjahn, C.A. et al. Heritability of left ventricular and papillary muscle heart size: a twin study with cardiac magnetic resonance imaging. European heart journal 30, 1643–1647 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehp142&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19406865&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000267666300025&link_type=ISI) 32. 32.Chien, K.-L., Hsu, H.-C., Su, T.-C., Chen, M.-F. & Lee, Y.-T. Heritability and major gene effects on left ventricular mass in the Chinese population: a family study. BMC cardiovascular disorders 6, 1–9 (2006). 33. 33.Zhao, B. et al. Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co- architecture with cognitive and mental health traits. Nature genetics 51, 1637–1644 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0516-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31676860&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 34. 34.Smith, S.M. et al. An expanded set of genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature neuroscience 24, 737–745 (2021). 35. 35.Zhao, B. et al. Common genetic variation influencing human white matter microstructure. Science 372, eabf3736 (2021). 36. 36.Grasby, K.L. et al. The genetic architecture of the human cerebral cortex. Science 367, eaay6690 (2020). 37. 37.Zhao, B. et al. Genetic influences on the intrinsic and extrinsic functional organizations of the cerebral cortex. medRxiv, 2021.07. 27.21261187 (2021). 38. 38.Hofer, E. et al. Genetic correlations and genome-wide associations of cortical structure in general population samples of 22,824 adults. Nature communications 11, 1–16 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-15305-w&link_type=DOI) 39. 39.Satizabal, C.L. et al. Genetic architecture of subcortical brain structures in 38,851 individuals. Nature genetics 51, 1624–1636 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0511-y&link_type=DOI) 40. 40.Zhao, B. et al. Eye-brain connections revealed by multimodal retinal and brain imaging genetics in the UK Biobank. medRxiv, 2023.02. 16.23286035 (2023). 41. 41.Zhao, B. et al. Common variants contribute to intrinsic human brain functional networks. Nature Genetics 54, 508–517 (2022). 42. 42.Zhao, B. et al. Large-scale GWAS reveals genetic architecture of brain white matter microstructure and genetic overlap with cognitive and mental health traits (n = 17,706). Molecular Psychiatry (2019). 43. 43.Zhao, B. et al. Common variants contribute to intrinsic human brain functional networks. Nat Genet 54, 508–517 (2022). 44. 44.44. Cordova-Palomera, A., et al. Cardiac Imaging of Aortic Valve Area From 34 287 UK Biobank Participants Reveals Novel Genetic Associations and Shared Genetic Comorbidity With Multiple Disease Phenotypes. Circ Genom Precis Med 13, e003014 (2020). 45. 45.Avants, B.B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage 54, 2033–2044 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2010.09.025&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20851191&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286302000027&link_type=ISI) 46. 46.Jahanshad, N. et al. Multi-site genetic analysis of diffusion images and voxelwise heritability analysis: A pilot project of the ENIGMA–DTI working group. Neuroimage 81, 455–469 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2013.04.061&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23629049&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000322934400044&link_type=ISI) 47. 47.Kochunov, P. et al. Multi-site study of additive genetic effects on fractional anisotropy of cerebral white matter: comparing meta and megaanalytical approaches for data pooling. Neuroimage 95, 136–150 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2014.03.033&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24657781&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 48. 48.Beckmann, C.F. & Smith, S.M. Probabilistic independent component analysis for functional magnetic resonance imaging. IEEE transactions on medical imaging 23, 137–152 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/TMI.2003.822821&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14964560&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188858300001&link_type=ISI) 49. 49.Hyvarinen, A. Fast and robust fixed-point algorithms for independent component analysis. IEEE transactions on Neural Networks 10, 626–634 (1999). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/72.761722&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18252563&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000080122700019&link_type=ISI) 50. 50.Glasser, M.F. et al. A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature18933&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27437579&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 51. 51.Bijsterbosch, J. et al. Investigations into within-and between-subject resting- state amplitude variations. Neuroimage 159, 57–69 (2017). 52. 52.McKay, A. et al. Measurement of liver iron by magnetic resonance imaging in the UK Biobank population. PLoS One 13, e0209340 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0209340&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 53. 53.West, J. et al. Feasibility of MR-Based Body Composition Analysis in Large Scale Population Studies. PLoS One 11, e0163332 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0163332&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27662190&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 54. 54.Wilman, H.R. et al. Genetic studies of abdominal MRI data identify genes regulating hepcidin as major determinants of liver iron concentration. Journal of Hepatology 71, 594–602 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jhep.2019.05.032&link_type=DOI) 55. 55.Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C.A. & Smoller, J.W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nature Communications 10, 1776 (2019). 56. 56.Yang, S. & Zhou, X. Accurate and scalable construction of polygenic scores in large biobank data sets. The American Journal of Human Genetics 106, 679–693 (2020). 57. 57.Xiang, R. et al. Recent advances in polygenic scores: translation, equitability, methods and FAIR tools. Genome Med 16, 33 (2024). 58. 58.Lewis, C.M. & Vassos, E. Polygenic risk scores: from research tools to clinical instruments. Genome Med 12, 44 (2020). 59. 59.Moaddel, R. et al. Proteomics in aging research: A roadmap to clinical, translational research. Aging Cell 20, e13325 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/acel.13325&link_type=DOI) 60. 60.Mavaddat, N. et al. Polygenic Risk Scores for Prediction of Breast Cancer and Breast Cancer Subtypes. Am J Hum Genet 104, 21–34 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2018.11.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30554720&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 61. 61.Aly, M. et al. Polygenic risk score improves prostate cancer risk prediction: results from the Stockholm-1 cohort study. Eur Urol 60, 21–8 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.eururo.2011.01.017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21295399&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 62. 62.Desikan, R.S. et al. Genetic assessment of age-associated Alzheimer disease risk: Development and validation of a polygenic hazard score. PLoS Med 14, e1002258 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1002258&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28323831&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 63. 63.de Rojas, I. et al. Common variants in Alzheimer’s disease and risk stratification by polygenic risk scores. Nat Commun 12, 3417 (2021). 64. 64.Olsson, T., Barcellos, L.F. & Alfredsson, L. Interactions between genetic, lifestyle and environmental risk factors for multiple sclerosis. Nature Reviews Neurology 13, 25–36 (2017). 65. 65.Virtanen, R., Jula, A., Kuusela, T., Helenius, H. & Voipio-Pulkki, L.M. Reduced heart rate variability in hypertension: associations with lifestyle factors and plasma renin activity. J Hum Hypertens 17, 171–9 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/sj.jhh.1001529&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12624607&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000182115100004&link_type=ISI) 66. 66.Persyn, E. et al. Genome-wide association study of MRI markers of cerebral small vessel disease in 42,310 participants. Nat Commun 11, 2175 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-15932-3&link_type=DOI) 67. 67.Thompson, P.M. et al. ENIGMA and global neuroscience: A decade of large-scale studies of the brain in health and disease across more than 40 countries. Translational psychiatry 10, 100 (2020). 68. 68.Bastarache, L. Using Phecodes for Research with the Electronic Health Record: From PheWAS to PheRS. Annu Rev Biomed Data Sci 4, 1–19 (2021). 69. 69.Dey, R. et al. Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks. Nature communications 13, 5437 (2022). 70. 70.Bakshi, A. et al. A polygenic risk score predicts incident prostate cancer risk in older men but does not select for clinically significant disease. Cancers 13, 5815 (2021). 71. 71.Dobson, R. & Giovannoni, G. Multiple sclerosis - a review. Eur J Neurol 26, 27–40 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/ene.13819&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 72. 72.Hauser, S.L. & Cree, B.A.C. Treatment of Multiple Sclerosis: A Review. Am J Med 133, 1380–1390 e2 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amjmed.2020.05.049&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 73. 73.Patsopoulos, N.A. Genetics of Multiple Sclerosis: An Overview and New Directions. Cold Spring Harb Perspect Med 8(2018). 74. 74.Koenig, K.A. et al. The relationship between cognitive function and high- resolution diffusion tensor MRI of the cingulum bundle in multiple sclerosis. Multiple Sclerosis Journal 21, 1794–1801 (2015). 75. 75.Azevedo, C.J. et al. Thalamic atrophy in multiple sclerosis: A magnetic resonance imaging marker of neurodegeneration throughout disease. Ann Neurol 83, 223–234 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ana.25150&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29328531&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 76. 76.Minagar, A. et al. The thalamus and multiple sclerosis: modern views on pathologic, imaging, and clinical aspects. Neurology 80, 210–9 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/WNL.0b013e31827b910b&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23296131&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 77. 77.Kramer, J. et al. Early and Degressive Putamen Atrophy in Multiple Sclerosis. Int J Mol Sci 16, 23195–209 (2015). 78. 78.Tokarska, N., Tottenham, I., Baaklini, C. & Gawryluk, J.R. How does the brain age in individuals with multiple sclerosis? A systematic review. Frontiers in Neurology 14, 1207626 (2023). 79. 79.Sanusi, A.A. et al. Relationship of ultrasonographically determined kidney volume with measured GFR, calculated creatinine clearance and other parameters in chronic kidney disease (CKD). Nephrol Dial Transplant 24, 1690–4 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ndt/gfp055&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19264744&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000265275000054&link_type=ISI) 80. 80.Jovanovic, D., Gasic, B., Pavlovic, S. & Naumovic, R. Correlation of kidney size with kidney function and anthropometric parameters in healthy subjects and patients with chronic kidney diseases. Ren Fail 35, 896–900 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3109/0886022X.2013.794683&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23751145&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 81. 81.Xia, T. et al. Association between liver MRI proton density fat fraction and liver disease risk. Radiology 309, e231007 (2023). 82. 82.Zhang, Y.N. et al. Liver fat imaging-a clinical overview of ultrasound, CT, and MR imaging. Br J Radiol 91, 20170959 (2018). 83. 83.van der Poorten, D. et al. Visceral fat: a key mediator of steatohepatitis in metabolic liver disease. Hepatology 48, 449–57 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/hep.22350&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18627003&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000258247400012&link_type=ISI) 84. 84.Heikkinen, M., Salenius, J.-P. & Auvinen, O. Ruptured abdominal aortic aneurysm in a well-defined geographic area. Journal of vascular surgery 36, 291–296 (2002). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1067/mva.2002.125479&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12170209&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000177489000014&link_type=ISI) 85. 85.van Hout, M.J. et al. How to measure the aorta using MRI: a practical guide. Journal of Magnetic Resonance Imaging 52, 971 (2020). 86. 86.Litmanovich, D., Bankier, A.A., Cantin, L., Raptopoulos, V. & Boiselle, P.M. CT and MRI in diseases of the aorta. American Journal of Roentgenology 193, 928–940 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2214/AJR.08.2166&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19770313&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000270033300004&link_type=ISI) 87. 87.Han, X. et al. Genome-wide association analysis of 95 549 individuals identifies novel loci and genes influencing optic disc morphology. Human Molecular Genetics 28, 3680–3690 (2019). 88. 88.Crowston, J.G. et al. The effect of optic disc diameter on vertical cup to disc ratio percentiles in a population based cohort: the Blue Mountains Eye Study. Br J Ophthalmol 88, 766–70 (2004). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTI6ImJqb3BodGhhbG1vbCI7czo1OiJyZXNpZCI7czo4OiI4OC82Lzc2NiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA2LzAzLzIwMjMuMDQuMTguMjMyODg3NjkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 89. 89.Olindo, S. et al. Decrease in heart ventricular ejection fraction during multiple sclerosis. Eur J Neurol 9, 287–91 (2002). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11985637&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 90. 90.Mincu, R.I. et al. Impaired Cardiac Function in Patients with Multiple Sclerosis by Comparison with Normal Subjects. Sci Rep 8, 3300 (2018). 91. 91. Rowe Bijanki, K., et al. Characterizing white matter health and organization in atherosclerotic vascular disease: a diffusion tensor imaging study. Psychiatry Res 214, 389–94 (2013). 92. 92.Xie, H. et al. Arterial hypertension and retinal layer thickness: the Beijing Eye Study. Br J Ophthalmol 108, 105–111 (2023). 93. 93.Sinnott-Armstrong, N. et al. Genetics of 35 blood and urine biomarkers in the UK Biobank. Nat Genet 53, 185–194 (2021). 94. 94.Lin, J. et al. Integration of Biomarker Polygenic Risk Score Improves Prediction of Coronary Heart Disease. JACC Basic Transl Sci 8, 1489–1499 (2023). 95. 95.Kurki, M.I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-02205473-8&link_type=DOI) 96. 96.DeLong, E.R., DeLong, D.M. & Clarke-Pearson, D.L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics, 837–845 (1988). 97. 97.Sun, B.B. et al. Genetic associations of protein-coding variants in human disease. Nature 603, 95–102 (2022). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-022-04394-w&link_type=DOI) 98. 98.Okamoto, K. et al. Common variation in GPC5 is associated with acquired nephrotic syndrome. Nature genetics 43, 459–463 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.792&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21441931&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 99. 99.Sanchez-Rodriguez, E., Southard, C.T. & Kiryluk, K. GWAS-Based Discoveries in IgA Nephropathy, Membranous Nephropathy, and Steroid-Sensitive Nephrotic Syndrome. Clin J Am Soc Nephrol 16, 458–466 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2xpbmphc24iO3M6NToicmVzaWQiO3M6ODoiMTYvMy80NTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNi8wMy8yMDIzLjA0LjE4LjIzMjg4NzY5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 100.100.Investigators, A.o.U.R.P.G. Genomic data in the All of Us research program. Nature 627, 340 (2024). 101.101.Petersen, R.C. et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): clinical characterization. Neurology 74, 201–9 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/WNL.0b013e3181cb3e25&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20042704&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 102.102.Fortea, J. et al. APOE4 homozygozity represents a distinct genetic form of Alzheimer’s disease. Nature Medicine, 1–8 (2024). 103.103.Braak, H., Thal, D.R., Ghebremedhin, E. & Del Tredici, K. Stages of the pathologic process in Alzheimer disease: age categories from 1 to 100 years. J Neuropathol Exp Neurol 70, 960–9 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/NEN.0b013e318232a379&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22002422&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 104.104.Herrup, K. Reimagining Alzheimer’s disease--an age-based hypothesis. J Neurosci 30, 16755–62 (2010). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Njoiam5ldXJvIjtzOjU6InJlc2lkIjtzOjExOiIzMC81MC8xNjc1NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA2LzAzLzIwMjMuMDQuMTguMjMyODg3NjkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 105.105.Counts, S.E., Ikonomovic, M.D., Mercado, N., Vega, I.E. & Mufson, E.J. Biomarkers for the Early Detection and Progression of Alzheimer’s Disease. Neurotherapeutics 14, 35–53 (2017). 106.106.Dubois, B., von Arnim, C.A.F., Burnie, N., Bozeat, S. & Cummings, J. Biomarkers in Alzheimer’s disease: role in early and differential diagnosis and recognition of atypical variants. Alzheimers Res Ther 15, 175 (2023). 107.107.Barillot, C., Edan, G. & Commowick, O. Imaging biomarkers in multiple Sclerosis: From image analysis to population imaging. Med Image Anal 33, 134–139 (2016). 108.108.Gill, A.J., Schorr, E.M., Gadani, S.P. & Calabresi, P.A. Emerging imaging and liquid biomarkers in multiple sclerosis. Eur J Immunol 53, e2250228 (2023). 109.109.Grewal, S., McKinlay, S., Kapczinski, F., Pfaffenseller, B. & Wollenhaupt-Aguiar, B. Biomarkers of neuroprogression and late staging in bipolar disorder: A systematic review. Aust N Z J Psychiatry 57, 328–343 (2023). 110.110.Lu, Y. et al. Subtyping Schizophrenia Using Psychiatric Polygenic Scores. medRxiv, 2023.10. 12.23296915 (2023). 111.111.Dahl, A. et al. Phenotype integration improves power and preserves specificity in biobank-based genetic studies of major depressive disorder. Nat Genet 55, 2082–2093 (2023). 112.112.Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS genetics 9, e1003348 (2013). 113.113.Wang, Y. et al. Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations. Nature communications 11, 1–9 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-15305-w&link_type=DOI) 114.114.Zhao, B. & Zou, F. On polygenic risk scores for complex traits prediction. Biometrics 78, 499–511 (2022). 115.115.Kachuri, L. et al. Principles and methods for transferring polygenic risk scores across global populations. Nature Reviews Genetics, 1–18 (2023). 116.116.Duncan, L. et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nature communications 10, 1–9 (2019). 117.117.117. Zhao, B., Yang, X. & Zhu, H. Estimating trans-ancestry genetic correlation with unbalanced data resources. arXiv preprint arXiv:2203.12154 (2022). 118.118.Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 12, e1001779 (2015). 119.119.Ji, J.L. et al. Mapping the human brain’s cortical-subcortical functional network organization. Neuroimage 185, 35–57 (2019). 120.120.Liu, Y. et al. Genetic architecture of 11 organ traits derived from abdominal MRI using deep learning. Elife 10(2021). 121.121.Sorokin, E.P. et al. Analysis of MRI-derived spleen iron in the UK Biobank identifies genetic variation linked to iron homeostasis and hemolysis. Am J Hum Genet 109, 1092–1104 (2022). 122.122.Wilman, H.R. et al. Characterisation of liver fat in the UK Biobank cohort. PLoS One 12, e0172921 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0172921&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28241076&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 123.123.Mojtahed, A. et al. Reference range of liver corrected T1 values in a population at low risk for fatty liver disease-a UK Biobank sub-study, with an appendix of interesting cases. Abdom Radiol (NY*)* 44, 72–84 (2019). 124.124.Karlsson, A. et al. Automatic and quantitative assessment of regional muscle volume by multi-atlas segmentation using whole-body water-fat MRI. J Magn Reson Imaging 41, 1558–69 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jmri.24726&link_type=DOI) 125.125.Borga, M. et al. Validation of a fast method for quantification of intra-abdominal and subcutaneous adipose tissue for large-scale human studies. NMR Biomed 28, 1747–53 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/nbm.3432&link_type=DOI) 126.126.Linge, J. et al. Body Composition Profiling in the UK Biobank Imaging Study. Obesity (Silver Spring*)* 26, 1785–1795 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/oby.22210&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29785727&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 127.127.Borga, M. et al. Reproducibility and repeatability of MRI-based body composition analysis. Magn Reson Med 84, 3146–3156 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mrm.28360&link_type=DOI) 128.128.Langner, T. et al. Kidney segmentation in neck-to-knee body MRI of 40,000 UK Biobank participants. Sci Rep 10, 20963 (2020). 129.129.Jiang, L. et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nat Genet 51, 1749–1755 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588–019–0530–8&link_type=DOI) 130.130.Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0579-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 131.131.Purcell, S. et al. PLINK: a tool set for whole-genome association and population- based linkage analyses. The American Journal of Human Genetics 81, 559–575 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/519795&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17701901&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 132.132.Mullins, N. et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nature genetics 53, 817–829 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00857-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34002096&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F06%2F03%2F2023.04.18.23288769.atom) 133.133.Program, A.o.U.R. All of Us Research Program Genomic Research Data Quality Report, All of Us Curated Data Repository (CDR) release C2022Q4R9. Vol. 2024 (2024). 134.134.Guo, Y. et al. Plasma proteomic profiles predict future dementia in healthy adults. Nature Aging, 1–14 (2024). 135.135.Lumsden, A.L., Mulugeta, A., Zhou, A. & Hypponen, E. Apolipoprotein E (APOE) genotype-associated disease risks: a phenome-wide, registry-based, case-control study utilising the UK Biobank. EBioMedicine 59, 102954 (2020).