Body mass index and birth weight improve polygenic risk score for type 2 diabetes

Avigail Moldovan; Yedael Y. Waldman; Nadav Brandes; Michal Linial

doi:10.1101/2021.05.16.21257279

Abstract

One of the major challenges in the post-genomic era is elucidating the genetic basis of human diseases. In recent years, studies have shown that polygenic risk scores (PRS), based on aggregated information from millions of variants across the human genome, can estimate individual risk for common diseases. In practice, the current medical practice still predominantly relies on physiological and clinical indicators to assess personal disease risk. For example, caregivers mark individuals with high body mass index (BMI) as having an increased risk to develop type 2 diabetes (T2D). An important question is whether combining PRS with clinical metrics can increase the power of disease prediction in particular from early life. In this work we examined this question, focusing on T2D. We show that an integrated approach combining adult BMI and PRS achieves considerably better prediction than each of the measures on unrelated Caucasians in the UK Biobank (UKB, n=290,584). Likewise, integrating PRS with self-reports on birth weight (n=172,239) and comparative body size at age ten (n=287,203) also substantially enhance prediction as compared to each of its components. While the integration of PRS with BMI achieved better results as compared to the other measurements, the latter are early-life measurements that can be integrated already at childhood, to allow preemptive intervention for those at high risk to develop T2D. Our integrated approach can be easily generalized to other diseases, with the relevant early-life measurements.

Introduction

Predicting the risk of an individual to develop a specific disease is a key challenge in clinical decision making [1]. Based on such predictions, individuals can be identified for early intervention to prevent, delay the onset or better manage the disease and its outcome. Understanding the genetic component of the disease can highlight individuals at risk based on their genetic profile. Indeed, with more genetic and phenotypic information available for large cohorts, genome-wide association studies (GWAS) have been used to find genetic variants associated with complex diseases and traits [2–4] Nevertheless, in most GWAS studies, variants that are significantly associated with the disease or trait explained only a small fraction of its presumed genetic heritability component. The shortage of GWAS contribution to complex disease risk has been addressed as the missing heritability problem with various explanations that were presented to address it [5–7]. A likely explanation argues that complex diseases are signified by complex intracellular interactions. However, the many variants that are below significance in GWAS, actually affect the trait, and cumulatively contribute to the phenotype even more than the relatively few statistically significant GWAS variants [8, 9]. In light of this possibility, different studies developed polygenic risk scores (PRS) that consider the accumulative effect of millions of genetic markers to predict the probability of an individual to develop a complex disease [1, 10–13]. In some cases, the PRS methodology was able to highlight individuals with the same risk as individuals with rare monogenic mutations linked to a disease. The greater effect on public health reflects the fact that the PRS-based approach cover many more individuals (up to 20 folds) as compared to rare monogenic mutation carriers [14]. In addition, it was shown that the penetrance of rare monogenic high-risk variants in various diseases is also affected by the polygenic risk background as reflected by PRS [15].

The etiology of common complex diseases is presumed to be a combination of both genetic and environmental factors and the interactions between them [16]. Various physical and clinical measures are often taken to highlight individuals with high risk for diseases, and these measures reflect both genetic and non-genetic factors. For example, high body mass index (BMI), which has both genetic and non-genetic components [17], is a major risk factor for type 2 diabetes (T2D) [18, 19]. Birth weight is yet another example of a physical measure that combines effects from both genetic and environmental factors [20]. However, the direction of the association between birth weight and T2D (low birth weight being a risk or also high birth weight), its scale and whether it is sex-dependent are still not clear [21–25].

In this work we asked whether a combined approach that utilizes both genetic factors (e.g., PRS) and quantitative measures (that have non-genetic components) can improve disease prediction. We evaluated this approach by using both the PRS and physical measurements associated with T2D prevalence (BMI, birth weight and comparative body size at age ten) to predict disease risk, based on the UK Biobank (UKB) cohort [26].

Our results demonstrate that such a combined risk predictor significantly enhances prediction as compared to PRS or each of the underlying measures alone. Importantly, our analysis includes early-life measurements, meaning that individuals at high risk can be identified early in life, leading to more effective intervention.

Methods

UK Biobank (UKB) data

The analysis in this work is based on the information available for UKB participants [26]. We focused on Caucasians by limiting the analysis to participants who self-reported themselves as White (being White, British, Irish or any other white background [codes 1, 1001, 1002, 1003, respectively, in Ethnic background, UKB data-field 21000]) and being classified as Caucasians based on their genetic ancestry (Genetic ethnic group, data-field 22006). We also required the individuals to have both genotyping data and information on T2D disease status. Disease classification was based on clinical information provided for UKB participants and encoded by ICD-10 code for T2D (E11.X). Additional phenotypes were used for the analysis: BMI (taken at the UKB Assessment Centre, UKB data field 21001), birth weight (based on self-reporting, UKB data field 20022) and comparative body size at age ten (based on self-reporting, UKB data field 1687). In each of the analyses we focused on individuals with the relevant phenotypic information. To address possible sex differences, the analysis was done separately for males and females. Following the filtering steps, 332,338, 184,288 and 318,260 participants were included in the analysis for BMI, birth weight and body size at age ten, respectively. Finally, we focused on participants evaluated at age 40-70 and removed genetic relatives, by keeping only one representative of each kinship group of related individuals from the same sex (recall that analysis was done separately for each sex). This resulted in sets of 290,584, 172,239 and 287,203 participants for BMI, birth weight and body size at age ten, respectively.

Polygenic risk score calculation

The PRS of an individual is calculated as the weighted sum of his/her allele values over the set of genotyped markers. This score is based on the genotype of each individual and does not considers sex or age. Therefore, we refer to it as a “raw” PRS. Let m be the number of markers used for raw PRS calculation, let G_i be the allelic status of marker i in a specific individual (G_i ∈ {0,1,2}), and let w_ibe the weight of marker i (based on the association of the marker with the trait). Raw PRS of that individual is then defined as: The weights for PRS calculation for T2D on a set of approximately 6.5 million markers (both genotyped and imputed), based on a previous work [14], were downloaded from The Cardiovascular Disease Knowledge Portal (http://www.broadcvdi.org/informational/data). We applied these weights, which had been fit on the UKB data, on the markers of UKB participants to obtain raw PRS values for each individual.

Composite risk score

In this work, we defined a composite risk score (CRS) which is composed of three components: genetic profile (raw PRS), phenotypic information P (i.e., BMI, birth weight or comparative body size at age 10), and age. For each of the components, we estimate an individual’s disease risk based on the disease prevalence observed within the relevant UKB cohort across individuals with similar scores (e.g., similar raw PRS for the genetic component). A weighted sum of the different components is taken to obtain a CRS that reflects an individual’s disease risk. The estimated risk scores and weights for each component are learned in a training set and evaluated on a test set (as described below). The rationale behind transforming the original measures into estimated disease prevalence is to allow incorporation of measures that are not necessarily monotonic with respect to disease prevalence. In addition, transforming the measures into disease prevalence also normalizes the different measures, that often span different ranges (e.g., raw PRS and BMI values). The analysis was done separately for each sex.

Formally, we sorted all the individuals in the training set based on their raw PRS values and divided them into 100 equal-size bins (i.e., raw PRS percentiles of UKB participants). For each bin we calculated T2D prevalence in that bin (i.e., the number of cases divided by the total number of individuals in that bin) and defined it as the genetic risk (GR) of the members of that bin. For example, if in a specific bin, 5% of the individuals were reported as having a disease, the GR of that bin was defined as 0.05. Thus, the GR reflects the actual disease risk in the UKB, based on individuals with similar raw PRS scores, sharing the same bin. Let raw PRS_i be the raw PRS value of sample. We define GR _i as the GR of the bin that raw PRS_i belongs to.

The same procedure was also applied to the phenotypic measure P: we sorted all individuals in the training set based on their P measures and divided them into 100 equal-size bins and calculated for each bin the phenotypic risk (PR) of members of that bin. In the case of comparative body size at age 10, which included only three values (“Thinner”, “About average” and “Plummer”), people were divided to three bins based on this classification and the PR was calculated for each of these three predefined bins. We denote the PR of individual by PR_i.

In addition, we also considered age for the composite score. We divided all individuals in the training set according to their age (measured in rounded years) and for each age calculated the age risk (AR) of members with the same age. We denote the AR of individual i by AR_i.

The composite risk score (CRS) of sample i, CRS_i, was then defined as a weighted sum of the three risk measures: Where: These parameters are trained in and learned in the training set, as described below.

In addition to CRS, we also converted each of the measures alone to disease risk estimates and included age, without including the other measure. Formally, the PRS of sample i, PRS_i, was defined as follows: Where: Thus, as opposed to the original raw PRS, PRS considers age as well (but does not include the phenotypic measure).

Similarly, for the phenotypic measures BMI and birth weight, we defined a measure risk score that combines them with age, but without PRS. Thus, BMI risk of sample i (as opposed to raw BMI that included only the original BMI measurement) was defined as: Where: This was also done for birth weight risk (as opposed to raw birth weight) but not to comparative body size at age ten that includes only three distinct values. Finally, we also considered age alone, to examine whether the other measures provide additional predictive information beyond age alone. In that case, it was defined as AR_i.

For each measure (CRS, PRS and the phenotypic measures BMI and birth weight), we trained our model on 70% of the individuals (comprising the training set) to estimate the optimal weights (α, β, γ depending on the specific measure) that maximize the area under curve (AUC) in the receiver operating characteristic (ROC) for the specific measure. We sampled all combinations for the values of the α, β, γ weights, in steps of 0.025 in the range [0,1]. Evaluation of the measures was performed on the remaining 30% of the individuals (comprising the test set), based on odds ratio (OR) analysis, as described below. For the age measure (AR) alone there was no weight to learn, but the measure itself (i.e., T2D prevalence per age for each sex) was estimated on the training set and evaluated on the test set.

Evaluation of the results

We evaluated and compared the different measures (CRS, PRS, BMI, birth weight and age) by examining the resulting T2D OR. For each measure, we divided the participants in the test set into 100 equal-size bins (i.e., percentiles 0-99). We then calculated for each bin its OR. Formally, let D_p be the number of individuals diagnosed with T2D among all individuals in the p percentile, and let H_p be the number of individuals not diagnosed with T2D among all individuals in the p percentile. Similarly, let D _⌐p and H _⌐p be the number of individuals diagnosed with T2D among all individuals except those in the p percentile and the number of individuals not diagnosed with T2D among all individuals except those in the p percentile, respectively. The OR of percentile p, OR (p) was then defined as: To estimate the robustness of the results (e.g., calculating standard deviations for the OR), we repeated the procedure of randomly dividing the dataset into training and test sets, and evaluating the OR from the classification results of 1000 repetitions.

Results

PRS and BMI

In the current study we used the UK Biobank (UKB) cohort [26], focusing on participants whose ethnic background was classified as White, where genotyping information and disease status for T2D was available (see Methods). As there are known sex differences in and T2D prevalence and risk factors [27, 28], we preformed the analysis separately for males and females. Raw PRS (based on [14]), BMI and disease state (case/control) information was available for 290,584 participants, among them 157,813 (54.31%) were females.

Figure 1A show the relation between raw PRS and BMI and T2D disease prevalence. As can be seen, both measures were strongly associated with disease prevalence in both sexes. T2D disease prevalence was higher in males as compared to females. The analysis also showed that raw BMI was a better predictor for the disease risk as compared to raw PRS. This was also demonstrated with respect to OR across the different percentiles (Figure 1B). For example, the OR in the 99^th percentile was 8.62 vs. 2.87 and 6.79 vs. 2.84 for raw BMI vs. raw PRS in females and males, respectively. The receiver operating characteristic (ROC) curves also confirmed this. The area under the curve (AUC) of the raw BMI measure was larger than the AUC of the raw PRS measure in both sexes: 0.767 vs. 0.626 and 0.721 vs. 0.629 for raw BMI vs. PRS in females and males, respectively (Figure 1C). These results also indicate that the differences between the two measures were larger in females than in males, and that BMI is a better predictor in females than in males for identifying individuals at high risk to develop T2D.

Figure 1.

Raw BMI and PRS as predictors for T2D risk. (A) T2D disease prevalence based on raw BMI and PRS percentiles for females and males. For each measure (raw BMI and PRS), UKB participants were divided into percentiles, and T2D prevalence was calculated for each percentile. (B) T2D odds ratio (OR) for each percentile is shown for females and males, where the horizontal line represents a neutral OR of 1. (C) Based on these percentiles, the receiver operating characteristic (ROC) curve is presented for the two measures for females and males, to compare the AUC of the two measures.

Next, we examined whether combining PRS and BMI together can increase their prediction power. For that purpose, we defined a new composite risk score (CRS) which combines both the raw PRS and BMI measures, as well as age. For each of these measures (raw PRS, raw BMI, age) we estimated an individual’s risk based on disease prevalence of people with similar values (e.g., people in the same raw PRS percentile) and combined them into a composite score. The AUC of the combined score was significantly higher as compared to the other measures in both sexes (Wilcoxon signed rank test P-value<10⁻¹⁶; Supplementary Figure S1). Comparison of OR revealed that for both sexes, BMI exhibited better performance as compared to PRS, but CRS outperformed both measures across all percentiles (Figure 2). All measures (BMI, PRS and CRS) outperformed age alone.

Figure 2.

Odds ratio (OR) for T2D, based on BMI, PRS, CRS or age percentiles. (A) OR for all percentiles and all measures for females and males. Vertical lines correspond to the standard deviation of the average OR across 1000 random splits of the dataset. The horizontal line represents a neutral OR of 1. OR values for females and males in specific percentiles are also presented: (B) 90^th, (C) 95^th, (D) 97^th and (E) 99^th.

Specifically, the average OR of the top percentile in males was 3.99, 7.84 and 9.38 for PRS, BMI and CRS, respectively. In females, the average OR of the top percentiles was 3.94, 9.10 and 10.27 for PRS, BMI and CRS, respectively. Additional results of the top percentiles are summarized in Figures 2C-2E and Table 1. Both PRS and BMI measures that included age achieved higher OR values than the raw PRS and BMI measures that did not include age (Figure 1B), demonstrating the importance of adding age into the predictive model.

View this table:

Table 1. Average OR values for T2D for the different measures (BMI, PRS, CRS) by percentiles

These results also demonstrate sex differences with respect to the predictive power of BMI, and therefore of CRS: higher OR values were achieved for females, in accordance with the results reported for the raw measures (Figure 1).

PRS and birth weight

After evaluating BMI, we turned to another physical measure associated with T2D – birth weight. We studied a cohort of 172,239 participants, 105,438 (61.21%) of which were females, who had birth weight values, PRS, and T2D disease state information was available. Similar to the analysis performed for the BMI, we analyzed the association between disease risk and raw birth weight, for males and females separately (Figure 3).

Figure 3. T2D disease prevalence across raw birth weight and PRS percentiles for females and males.

Lower birth weight was associated with higher disease prevalence in both males and females. High birth weight (mainly in the top percentiles) was also associated with higher T2D risk in both sexes, but to a lesser extent.

Next, and similar to the analysis for BMI, we defined a combined score that reflects both the risk associated with birth weight and PRS, based on disease prevalence for different PRS and birth weight percentiles, while also accounting for age. The predictive power (AUC) of the combined score was significantly higher than the individual measures in both sexes (Wilcoxon signed rank test P-value <10⁻¹⁶ ; Supplementary Figure S1). CRS also achieved higher OR values in the top percentiles (Table 2). Specifically, in females it achieved an average OR of 4.64 for the top percentile, compared to 3.81 and 3.62 for PRS and birth weight, respectively. In males the OR values were even higher: 4.83 vs. 4.54 and 3.08 for PRS and birth weight, respectively. For detailed trends across all measurement range (in percentiles) see Supplementary Figure S2.

View this table:

Table 2. Average OR values for T2D for the different measures (birth weight, PRS, CRS) by percentiles

While BMI was more predictive of T2D risk than birth weight, the latter also significantly improved prediction power (as part of the combined score) over PRS. Comparing males and females, we observed that males had higher OR values in the higher percentiles, for both PRS and CRS measures (but not for birth weight).

PRS and body size at age ten

Studies have shown that childhood obesity increases the risk for adult T2D and coronary artery disease (CAD) [29, 30]. Information on childhood BMI was not available for UKB participants but a related childhood measure of a comparative body size at age ten was available for 287,203 participants, among them 156,307 (54.42%) were females. While this measure is subjective and retrospective, and included only three predetermined categorical values (thinner, about average and plumper), it was still associated with T2D risk in adulthood (Figure 4).

Figure 4. T2D disease prevalence for different categories of body size at age ten for females and males.

People who had described themselves as being plumper at age ten were at higher risk to develop T2D in adulthood compared to people reporting average weight at that age. Similarly, but to a lesser extent, people who described themselves as being thinner at age ten were also at higher risk to develop T2D later in life. This was observed in both sexes These differences in T2D prevalence between the three groups were highly significant (Chi square test P-value<10⁻¹⁶).

Next, we defined a combined score that considers PRS, comparative body size at age ten and age. Even with this subjective and simplistic categorical measure, the CRS significantly outperformed PRS with respect to AUC (Wilcoxon signed rank test P-value<10⁻¹⁶; Supplementary Figure S1) and OR (Figure 5 and Table 3). Results for males and females were very similar, with slightly higher OR values in males (for both PRS and CRS). Specifically, the average OR in the top CRS percentile was 4.18 vs. 3.83 for PRS in females and 4.24 vs. 3.98 in males.

View this table:

Table 3. Average OR values for T2D for PRS and CRS by percentiles

Figure 5.

Odds ratio (OR) for T2D, based on PRS, CRS or age percentiles. (A) OR for all percentiles and all measures for females and males. Vertical lines correspond to the standard deviation of the average OR across 1000 random splits of the dataset. The horizontal line represents a neutral OR of 1. OR values for females and males in specific percentiles are also presented: (B) 90^th, (C) 95^th, (D) 97^th and (E) 99^th.

Discussion

In recent years, PRS has attracted increasing attention as a potential tool to estimate disease risk for common conditions and diseases based on the genetics of individuals [1, 12]. In the current work we enhanced PRS prediction potential by integrating the raw genetic signal with available physical measures that capture non-genetic (environmental) components of human diseases, focusing on T2D. First, we integrated information on BMI into the PRS model, as high BMI is a well-known risk factor for T2D [18, 19]. We found that while both PRS and BMI can highlight individuals with higher risk to develop T2D, a combined approach was superior to each of the measures alone, for both males and females, demonstrating the added value in such an approach.

Recently, several studies used integrated approaches for disease risk estimation by adding PRS information to standard clinical predictors. Conceptually, these studies applied the combined approach from both sides of its components: either to augment standard disease risk predictors with PRS or to augment PRS with disease risk predictors. Studies that focused on coronary artery disease (CAD) showed no [31] or little [32] improvement when adding PRS to clinically accepted risk predictors. These results raised again the question and the ongoing debate regarding the clinical utility of PRS [1, 33, 34].

A different study on CAD did find significant improvement by adding PRS to the routinely used risk predictors [35]. Another study on CAD, T2D, atrial fibrillation, breast and prostate cancer found that PRS improved the prediction power of such predictors [36]. Similarly, augmenting PRS with additional information such as BMI, and lab results such as HDL and LDL measures improved prediction power for T2D [37]. Similarly, augmenting PRS by traditional measures for cardiovascular disease risk modestly enhanced its prediction power [38]. In addition, a recent study added mortality risk factors to disease PRS to mark individuals with higher mortality risk [39].

Importantly, these studies used measures collected at adulthood while PRS values can be calculated earlier at life to indicate individuals at risk. Indeed, measurements that are taken at adulthood are likely to have stronger prediction power, as more relevant information on the disease and its risk predictors is revealed. However, interventions at the adult stage may be less effective, as some of the biological processes leading to diseases may have already started. Naturally, a composite score that includes adult BMI measures also suffers from this limit. Therefore, we examined whether augmenting PRS with early-life measures can increase their predictive utility. While genetic risk itself cannot be modified, additional risk factors that impact long-term health outcomes and are obtained at early life can be addressed through routine healthcare policy. In our study we used two such early-life measures that were available for many UKB participants: birth weight and three categories of body size at age ten. Similar to previous studies, we found association with low birth rate and high T2D prevalence, with stronger association in females. This is in accordance with the developmental origins theory, which suggests that low birth weight reflects under nutrition in utero that can lead to permanent changes in body functions, posing higher risk for certain metabolic diseases [40]. A weaker association was also found for high birth weight. Importantly, the number of UKB participants that were included in this analysis was relatively large as compared to many previous studies analyzing the relation between birth weight and T2D [22]. A combined approach that included birth weight and PRS improved the prediction power of each of its components. Turning to comparative body size at age ten, we found that adding this information to PRS improved its prediction power as well. Indeed, BMI had a better predictive power as compared to these early-life measures. However, these measures may only partially reveal the component they intend to reflect. Specifically, the body size categories at age ten measure is retrospective, subjective and included only three categories. Therefore, the labels for the body size at age ten only roughly estimated the actual body size at that age. Despite these limitations, early-life measures significantly improved PRS prediction power. We anticipate that more accurate and relevant measures such as childhood BMI or other relevant measures (that are routinely collected at the clinic), as well as their trajectories (across different ages), will further improve disease risk estimation and may inform early intervention.

This work also introduces a revised approach with respect to integrate age and sex into a predictive risk model. Traditionally, the sex of an individual is considered a covariate that is controlled for when learning raw PRS weights [41]. Therefore, when these weights are used, the resulting PRS is no longer affected by sex, and an individual’s PRS is determined solely based on their genetic background, regardless of their sex. In practice, like in other diseases, there are substantial sex differences in T2D prevalence and pathophysiology [27, 28]. In this work we addressed this issue by performing the analysis for each sex separately. Therefore, two people with the same raw PRS value but different sex may be given a completely different risk score. Indeed, we observed differences between the sexes. First, T2D prevalence was much higher in males as compared to females. In addition, T2D risk in the top percentiles for the PRS measure was slightly higher in males. This may perhaps explain why T2D risk in the top percentiles for the CRS measure (which is partially based on the PRS measure) was also higher in males when PRS was integrated with birth weight and comparative body size at age ten. However, when PRS was combined with BMI, the CRS measure achieved higher OR scores in females. This is likely because BMI, which outperforms PRS in its prediction power, is a better predictor in females for highlighting individuals at higher risk for T2D [42].

Similar to sex, age is also often considered a covariate that is controlled for when learning PRS weights. The inferred PRS of an individual is constant and does not change with age. However, similar to other diseases, T2D prevalence increases with age [43]. Here we addressed the role of age as a principal risk factor by adding it into the predictive model. As a result, our score reflects an individual’s risk to develop T2D around their age, and it changes throughout life, resulting in risk score trajectories.

We designed our combined risk score to be simple and easy for application and generalization. Thus, the PRS measure was based on raw PRS weights that had been calculated in a previous work [14]. While we focused on T2D, such summary statistics are available for numerous other diseases and traits (e.g., the Polygenic Score Catalog, [44]). Therefore, with additional relevant phenotypes and measures (based on the nature of the disease), our approach can also be applied to other complex diseases. In addition, we converted each of the measures used in the study into disease prevalence measures (based on the average disease prevalence in people with similar values of that measure). This conversion allowed us to easily integrate measures whose relation with disease prevalence is not monotonic (e.g., birth weight and comparative body size at age ten), and to integrate measures of different scales without explicit normalization. We integrated the different measures through a simple linear model. Taken together, this method can be applied relatively easily to various diseases, using various relevant measurements.

Even with this simplified approach, we achieved significant improvements that highlighted the importance of an integrated approach to estimate disease risk. Future works can further improve this through complementary ways to calculate and integrate risk factors. Below we briefly outline some suggestions for such improvements, mainly in the integration of sex and age into the model. First, our sex-specific approach was applied after the calculation of the raw PRS values, which can also be calculated for each sex alone. Indeed, several recent works used sex-specific PRS values because of the putative role of sex in many diseases and mortality [39, 45]. Second, for simplicity of the combined approach, age was taken as an independent measure with a constant effect. However, the role of some T2D risk factors changes throughout life [46]. Specifically, the weight of the genetic component of T2D varies across different ages of onsets [47], and this can lead to differential power of PRS to predict disease risk across different age groups, as was demonstrated in other diseases [48, 49]. Hence, the integration of age into the model can be done in more sophisticated ways (e.g., nonlinear), reflecting the apparent different weights of each component at different ages.

In summary, we demonstrated the benefit of adding measures to enhance PRS prediction. Specifically, we integrated PRS with early-life measures to pave the way for early intervention. We hope this will encourage future work on the integration of PRS with additional measures to provide more accurate clinical risk estimates for T2D and other complex diseases.

Data Availability

The entire analysis is based on UKBB aggregated data.

Funding

This study was supported by the ISF grant number: 2753/20 (to M.L.)

Competing interests

AM and YYW are employees of NRGene Ltd.

Ethics and Regulation

The UK-Biobank application ID 26664 (Linial lab). Ethical committee approval, The Hebrew University #13082019.

Supplementary Figures

Supplementary Figure S1.

Comparison of AUC values in the test sets for the different measures. In each analysis (BMI, birth weight and comparative body size at age ten), we compared the AUC values of the different measures across 1000 random test sets. Results are shown for the analysis of PRS with (A) BMI, (B) birth weight and (C) comparative body size at age ten. In all cases, the AUC of the combined measure (CRS) achieved significantly higher values as compared to the AUC of all other measures (Wilcoxon signed rank test P-value<10⁻¹⁶).

Supplementary Figure S2.

Odds ratio (OR) for T2D, based on birth weight, PRS, CRS or age percentiles. (A) OR for all percentiles and all measures for females and males. Vertical lines correspond to the standard deviation of the average OR across 1000 random splits of the dataset. The horizontal line represents a neutral OR of 1. OR values for females and males in specific percentiles are also presented: (B) 90^th, (C) 95^th, (D) 97^th and (E) 99^th.

Acknowledgments

We thank Ido Margaliot for useful discussion. We thank Center for Interdisciplinary Data Science (CIDR) and the CSE system team for support in data storage.

References

1.↵
Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics. 2018;19:581–90.
OpenUrl CrossRef PubMed
2.↵
Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nature Reviews Genetics. 2005;6:95–108.
OpenUrl CrossRef PubMed Web of Science
3.
Lander ES. Initial impact of the sequencing of the human genome. Nature. 2011;470:187–97.
OpenUrl CrossRef PubMed Web of Science
4.↵
Bush WS, Moore JH. Chapter 11: Genome-Wide Association Studies. PLoS Comput Biol. 2012;8.
5.↵
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–53.
OpenUrl CrossRef PubMed Web of Science
6.
Eichler EE, Flint J, Gibson G, Kong A, Leal SM, Moore JH, et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nature Reviews Genetics. 2010;11:446–50.
OpenUrl CrossRef PubMed Web of Science
7.↵
Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012;109:1193–8.
OpenUrl Abstract/FREE Full Text
8.↵
Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010;42:565–9.
OpenUrl CrossRef PubMed Web of Science
9.↵
Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169:1177–86.
OpenUrl CrossRef PubMed
10.↵
Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nature Reviews Genetics. 2016;17:392–406.
OpenUrl CrossRef PubMed
11.
Inouye M, Abraham G, Nelson CP, Wood AM, Sweeting MJ, Dudbridge F, et al. Genomic Risk Prediction of Coronary Artery Disease in 480,000 Adults: Implications for Primary Prevention. J Am Coll Cardiol. 2018;72:1883–93.
OpenUrl FREE Full Text
12.↵
Lambert SA, Abraham G, Inouye M. Towards clinical utility of polygenic risk scores. Hum Mol Genet. 2019;28(R2):R133–42.
OpenUrl CrossRef PubMed
13.↵
Lewis CM, Vassos E. Polygenic risk scores: From research tools to clinical instruments. Genome Medicine. 2020;12.
14.↵
Khera A V., Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nature Genetics. 2018;50:1219–24.
OpenUrl CrossRef PubMed
15.↵
Fahed AC, Wang M, Homburger JR, Patel AP, Bick AG, Neben CL, et al. Polygenic background modifies penetrance of monogenic variants for tier 1 genomic conditions. Nat Commun. 2020;11.
16.↵
Wang WYS, Barratt BJ, Clayton DG, Todd JA. Genome-wide association studies: Theoretical and practical concerns. Nature Reviews Genetics. 2005;6:109–18.
OpenUrl CrossRef PubMed Web of Science
17.↵
Khera A V., Chaffin M, Wade KH, Zahid S, Brancale J, Xia R, et al. Polygenic Prediction of Weight and Obesity Trajectories from Birth to Adulthood. Cell. 2019;177.
18.↵
Chan JM, Rimm EB, Colditz GA, Stampfer MJ, Willett WC. Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care. 1994;17:961–9.
OpenUrl Abstract/FREE Full Text
19.↵
Tirosh A, Shai I, Afek A, Dubnov-Raz G, Ayalon N, Gordon B, et al. Adolescent BMI Trajectory and Risk of Diabetes versus Coronary Disease. N Engl J Med. 2011;364:1315–25.
OpenUrl CrossRef PubMed Web of Science
20.↵
Warrington NM, Beaumont RN, Horikoshi M, Day FR, Helgeland Ø, Laurin C, et al. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat Genet. 2019;51:804–14.
OpenUrl CrossRef PubMed
21.↵
Whincup PH, Kaye SJ, Owen CG, Huxley R, Cook DG, Anazawa S, et al. Birth weight and risk of type 2 diabetes a systematic review. JAMA - Journal of the American Medical Association. 2008;300.
22.↵
Zhao H, Song A, Zhang Y, Zhen Y, Song G, Ma H. The association between birth weight and the risk of type 2 diabetes mellitus: A systematic review and meta-analysis. Endocr J. 2018;65.
23.
Knop MR, Geng TT, Gorny AW, Ding R, Li C, Ley SH, et al. Birth weight and risk of type 2 diabetes mellitus, cardiovascular disease, and hypertension in adults: A meta-analysis of 7 646 267 participants from 135 studies. Journal of the American Heart Association. 2018;7.
24.
Mi D, Fang H, Zhao Y, Zhong L. Birth weight and type 2 diabetes: A meta-analysis. Exp Ther Med. 2017;14:5313–20.
OpenUrl
25.↵
Zimmermann E, Gamborg M, Sørensen TIA, Baker JL. Sex differences in the association between birth weight and adult type 2 diabetes. Diabetes. 2015;64:4220–5.
OpenUrl Abstract/FREE Full Text
26.↵
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–9.
OpenUrl CrossRef PubMed
27.↵
Kautzky-Willer A, Harreiter J, Pacini G. Sex and gender differences in risk, pathophysiology and complications of type 2 diabetes mellitus. Endocrine Reviews. 2016;37:278–316.
OpenUrl CrossRef PubMed
28.↵
Huebschmann AG, Huxley RR, Kohrt WM, Zeitler P, Regensteiner JG, Reusch JEB. Sex differences in the burden of type 2 diabetes and cardiovascular risk across the life course. Diabetologia. 2019;62:1761–72.
OpenUrl PubMed
29.↵
Geng T, Smith CE, Li C, Huang T. Childhood BMI and Adult Type 2 Diabetes, Coronary Artery Diseases, Chronic Kidney Disease, and Cardiometabolic Traits: A Mendelian Randomization Analysis. Diabetes Care. 2018;:dc172141.
30.↵
Dong SS, Zhang K, Guo Y, Ding JM, Rong Y, Feng JC, et al. Phenome-wide investigation of the causal associations between childhood BMI and adult trait outcomes: a two-sample Mendelian randomization study. Genome Med. 2021;13:1–17.
OpenUrl
31.↵
Mosley JD, Gupta DK, Tan J, Yao J, Wells QS, Shaffer CM, et al. Predictive Accuracy of a Polygenic Risk Score Compared with a Clinical Risk Score for Incident Coronary Heart Disease. JAMA - J Am Med Assoc. 2020;323:627–35.
OpenUrl
32.↵
Elliott J, Bodinier B, Bond TA, Chadeau-Hyam M, Evangelou E, Moons KGM, et al. Predictive Accuracy of a Polygenic Risk Score-Enhanced Prediction Model vs a Clinical Risk Score for Coronary Artery Disease. JAMA - J Am Med Assoc. 2020;323:636–45.
OpenUrl
33.↵
Khan SS, Cooper R, Greenland P. Do Polygenic Risk Scores Improve Patient Selection for Prevention of Coronary Artery Disease? JAMA - Journal of the American Medical Association. 2020;323:614–5.
OpenUrl
34.↵
Wald NJ, Old R. The illusion of polygenic disease risk prediction. Genetics in Medicine. 2019;21:1705–7.
OpenUrl CrossRef PubMed
35.↵
Riveros-Mckay F, Weale ME, Moore R, Selzam S, Krapohl E, Sivley RM, et al. An integrated polygenic and clinical risk tool enhances coronary artery disease prediction. medRxiv. 2020.
36.↵
Mars N, Koskela JT, Ripatti P, Kiiskinen TTJ, Havulinna AS, Lindbohm J V., et al. Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat Med. 2020;26.
37.↵
Liu W, Zhuang Z, Wang W, Huang T, Liu Z. An Improved Genome-Wide Polygenic Score Model for Predicting the Risk of Type 2 Diabetes. Front Genet. 2021;12:632385.
OpenUrl
38.↵
Sun L, Pennells L, Kaptoge S, Nelson CP, Ritchie SC, Abraham G, et al. Polygenic risk scores in cardiovascular risk prediction: A cohort study and modelling analyses. PLoS Med. 2021;18:e1003498.
OpenUrl
39.↵
Meisner A, Kundu P, Zhang YD, Lan L V., Kim S, Ghandwani D, et al. Combined utility of 25 disease and risk factor polygenic risk scores for stratifying risk of all-cause mortality. medRxiv. 2020.
40.↵
Barker DJP. The origins of the developmental origins theory. Wiley Online Libr. 2007;261:412–7.
OpenUrl
41.↵
Choi SW, Mak TSH, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nature Protocols. 2020;15:2759–72.
OpenUrl
42.↵
Censin JC, Peters SAE, Bovijn J, Ferreira T, Pulit SL, Mägi R, et al. Causal relationships between obesity and the leading causes of death in women and men. PLoS Genet. 2019;15:e1008405.
OpenUrl CrossRef PubMed
43.↵
Halim M, Halim A. The effects of inflammation, aging and oxidative stress on the pathogenesis of diabetes mellitus (type 2 diabetes). Diabetes and Metabolic Syndrome: Clinical Research and Reviews. 2019;13:1165–72.
OpenUrl
44.↵
Lambert SA, Gil L, Jupp S, Ritchie SC, Xu Y, Buniello A, et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics. 2021;53:420–5.
OpenUrl
45.↵
Fan CC, Banks SJ, Thompson WK, Chen CH, McEvoy LK, Tan CH, et al. Sex-dependent polygenic effects on the clinical progressions of Alzheimer’s disease. bioRxiv. 2019;:613893.
46.↵
Alva ML, Hoerger TJ, Zhang P, Gregg EW. Identifying risk for type 2 diabetes in different age cohorts: Does one size fit all? BMJ Open Diabetes Res Care. 2017;5:e000447.
OpenUrl Abstract/FREE Full Text
47.↵
Padilla-Martínez F, Collin F, Kwasniewski M, Kretowski A. Systematic review of polygenic risk scores for type 1 and type 2 diabetes. International Journal of Molecular Sciences. 2020;21:1703.
OpenUrl
48.↵
Thomas M, Sakoda LC, Hoffmeister M, Rosenthal EA, Lee JK, van Duijnhoven FJB, et al. Response to Li and Hopper. American Journal of Human Genetics. 2021;108:527–9.
OpenUrl
49.↵
Li S, Hopper JL. Age dependency of the polygenic risk score for colorectal cancer. American Journal of Human Genetics. 2021;108:525–6.
OpenUrl

View the discussion thread.

Posted May 18, 2021.

Download PDF

Data/Code

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (403)
Allergy and Immunology (712)
Anesthesia (207)
Cardiovascular Medicine (2970)
Dentistry and Oral Medicine (336)
Dermatology (253)
Emergency Medicine (446)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1050)
Epidemiology (12816)
Forensic Medicine (12)
Gastroenterology (830)
Genetic and Genomic Medicine (4622)
Geriatric Medicine (423)
Health Economics (732)
Health Informatics (2943)
Health Policy (1073)
Health Systems and Quality Improvement (1092)
Hematology (393)
HIV/AIDS (933)
Infectious Diseases (except HIV/AIDS) (14145)
Intensive Care and Critical Care Medicine (854)
Medical Education (430)
Medical Ethics (116)
Nephrology (476)
Neurology (4412)
Nursing (238)
Nutrition (652)
Obstetrics and Gynecology (817)
Occupational and Environmental Health (739)
Oncology (2296)
Ophthalmology (652)
Orthopedics (260)
Otolaryngology (327)
Pain Medicine (282)
Palliative Medicine (84)
Pathology (503)
Pediatrics (1200)
Pharmacology and Therapeutics (510)
Primary Care Research (503)
Psychiatry and Clinical Psychology (3802)
Public and Global Health (7008)
Radiology and Imaging (1545)
Rehabilitation Medicine and Physical Therapy (920)
Respiratory Medicine (921)
Rheumatology (444)
Sexual and Reproductive Health (446)
Sports Medicine (386)
Surgery (491)
Toxicology (60)
Transplantation (212)
Urology (185)

[1] 1.↵
Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics. 2018;19:581–90.
OpenUrl CrossRef PubMed

[2] 2.↵
Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nature Reviews Genetics. 2005;6:95–108.
OpenUrl CrossRef PubMed Web of Science

[3] 3.
Lander ES. Initial impact of the sequencing of the human genome. Nature. 2011;470:187–97.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Bush WS, Moore JH. Chapter 11: Genome-Wide Association Studies. PLoS Comput Biol. 2012;8.

[5] 5.↵
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature. 2009;461:747–53.
OpenUrl CrossRef PubMed Web of Science

[6] 6.
Eichler EE, Flint J, Gibson G, Kong A, Leal SM, Moore JH, et al. Missing heritability and strategies for finding the underlying causes of complex disease. Nature Reviews Genetics. 2010;11:446–50.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012;109:1193–8.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010;42:565–9.
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169:1177–86.
OpenUrl CrossRef PubMed

[10] 10.↵
Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nature Reviews Genetics. 2016;17:392–406.
OpenUrl CrossRef PubMed

[11] 11.
Inouye M, Abraham G, Nelson CP, Wood AM, Sweeting MJ, Dudbridge F, et al. Genomic Risk Prediction of Coronary Artery Disease in 480,000 Adults: Implications for Primary Prevention. J Am Coll Cardiol. 2018;72:1883–93.
OpenUrl FREE Full Text

[12] 12.↵
Lambert SA, Abraham G, Inouye M. Towards clinical utility of polygenic risk scores. Hum Mol Genet. 2019;28(R2):R133–42.
OpenUrl CrossRef PubMed

[13] 13.↵
Lewis CM, Vassos E. Polygenic risk scores: From research tools to clinical instruments. Genome Medicine. 2020;12.

[14] 14.↵
Khera A V., Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nature Genetics. 2018;50:1219–24.
OpenUrl CrossRef PubMed

[15] 15.↵
Fahed AC, Wang M, Homburger JR, Patel AP, Bick AG, Neben CL, et al. Polygenic background modifies penetrance of monogenic variants for tier 1 genomic conditions. Nat Commun. 2020;11.

[16] 16.↵
Wang WYS, Barratt BJ, Clayton DG, Todd JA. Genome-wide association studies: Theoretical and practical concerns. Nature Reviews Genetics. 2005;6:109–18.
OpenUrl CrossRef PubMed Web of Science

[17] 17.↵
Khera A V., Chaffin M, Wade KH, Zahid S, Brancale J, Xia R, et al. Polygenic Prediction of Weight and Obesity Trajectories from Birth to Adulthood. Cell. 2019;177.

[18] 18.↵
Chan JM, Rimm EB, Colditz GA, Stampfer MJ, Willett WC. Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care. 1994;17:961–9.
OpenUrl Abstract/FREE Full Text

[19] 19.↵
Tirosh A, Shai I, Afek A, Dubnov-Raz G, Ayalon N, Gordon B, et al. Adolescent BMI Trajectory and Risk of Diabetes versus Coronary Disease. N Engl J Med. 2011;364:1315–25.
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Warrington NM, Beaumont RN, Horikoshi M, Day FR, Helgeland Ø, Laurin C, et al. Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors. Nat Genet. 2019;51:804–14.
OpenUrl CrossRef PubMed

[21] 21.↵
Whincup PH, Kaye SJ, Owen CG, Huxley R, Cook DG, Anazawa S, et al. Birth weight and risk of type 2 diabetes a systematic review. JAMA - Journal of the American Medical Association. 2008;300.

[22] 22.↵
Zhao H, Song A, Zhang Y, Zhen Y, Song G, Ma H. The association between birth weight and the risk of type 2 diabetes mellitus: A systematic review and meta-analysis. Endocr J. 2018;65.

[23] 23.
Knop MR, Geng TT, Gorny AW, Ding R, Li C, Ley SH, et al. Birth weight and risk of type 2 diabetes mellitus, cardiovascular disease, and hypertension in adults: A meta-analysis of 7 646 267 participants from 135 studies. Journal of the American Heart Association. 2018;7.

[24] 24.
Mi D, Fang H, Zhao Y, Zhong L. Birth weight and type 2 diabetes: A meta-analysis. Exp Ther Med. 2017;14:5313–20.
OpenUrl

[25] 25.↵
Zimmermann E, Gamborg M, Sørensen TIA, Baker JL. Sex differences in the association between birth weight and adult type 2 diabetes. Diabetes. 2015;64:4220–5.
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–9.
OpenUrl CrossRef PubMed

[27] 27.↵
Kautzky-Willer A, Harreiter J, Pacini G. Sex and gender differences in risk, pathophysiology and complications of type 2 diabetes mellitus. Endocrine Reviews. 2016;37:278–316.
OpenUrl CrossRef PubMed

[28] 28.↵
Huebschmann AG, Huxley RR, Kohrt WM, Zeitler P, Regensteiner JG, Reusch JEB. Sex differences in the burden of type 2 diabetes and cardiovascular risk across the life course. Diabetologia. 2019;62:1761–72.
OpenUrl PubMed

[29] 29.↵
Geng T, Smith CE, Li C, Huang T. Childhood BMI and Adult Type 2 Diabetes, Coronary Artery Diseases, Chronic Kidney Disease, and Cardiometabolic Traits: A Mendelian Randomization Analysis. Diabetes Care. 2018;:dc172141.

[30] 30.↵
Dong SS, Zhang K, Guo Y, Ding JM, Rong Y, Feng JC, et al. Phenome-wide investigation of the causal associations between childhood BMI and adult trait outcomes: a two-sample Mendelian randomization study. Genome Med. 2021;13:1–17.
OpenUrl

[31] 31.↵
Mosley JD, Gupta DK, Tan J, Yao J, Wells QS, Shaffer CM, et al. Predictive Accuracy of a Polygenic Risk Score Compared with a Clinical Risk Score for Incident Coronary Heart Disease. JAMA - J Am Med Assoc. 2020;323:627–35.
OpenUrl

[32] 32.↵
Elliott J, Bodinier B, Bond TA, Chadeau-Hyam M, Evangelou E, Moons KGM, et al. Predictive Accuracy of a Polygenic Risk Score-Enhanced Prediction Model vs a Clinical Risk Score for Coronary Artery Disease. JAMA - J Am Med Assoc. 2020;323:636–45.
OpenUrl

[33] 33.↵
Khan SS, Cooper R, Greenland P. Do Polygenic Risk Scores Improve Patient Selection for Prevention of Coronary Artery Disease? JAMA - Journal of the American Medical Association. 2020;323:614–5.
OpenUrl

[34] 34.↵
Wald NJ, Old R. The illusion of polygenic disease risk prediction. Genetics in Medicine. 2019;21:1705–7.
OpenUrl CrossRef PubMed

[35] 35.↵
Riveros-Mckay F, Weale ME, Moore R, Selzam S, Krapohl E, Sivley RM, et al. An integrated polygenic and clinical risk tool enhances coronary artery disease prediction. medRxiv. 2020.

[36] 36.↵
Mars N, Koskela JT, Ripatti P, Kiiskinen TTJ, Havulinna AS, Lindbohm J V., et al. Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat Med. 2020;26.

[37] 37.↵
Liu W, Zhuang Z, Wang W, Huang T, Liu Z. An Improved Genome-Wide Polygenic Score Model for Predicting the Risk of Type 2 Diabetes. Front Genet. 2021;12:632385.
OpenUrl

[38] 38.↵
Sun L, Pennells L, Kaptoge S, Nelson CP, Ritchie SC, Abraham G, et al. Polygenic risk scores in cardiovascular risk prediction: A cohort study and modelling analyses. PLoS Med. 2021;18:e1003498.
OpenUrl

[39] 39.↵
Meisner A, Kundu P, Zhang YD, Lan L V., Kim S, Ghandwani D, et al. Combined utility of 25 disease and risk factor polygenic risk scores for stratifying risk of all-cause mortality. medRxiv. 2020.

[40] 40.↵
Barker DJP. The origins of the developmental origins theory. Wiley Online Libr. 2007;261:412–7.
OpenUrl

[41] 41.↵
Choi SW, Mak TSH, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nature Protocols. 2020;15:2759–72.
OpenUrl

[42] 42.↵
Censin JC, Peters SAE, Bovijn J, Ferreira T, Pulit SL, Mägi R, et al. Causal relationships between obesity and the leading causes of death in women and men. PLoS Genet. 2019;15:e1008405.
OpenUrl CrossRef PubMed

[43] 43.↵
Halim M, Halim A. The effects of inflammation, aging and oxidative stress on the pathogenesis of diabetes mellitus (type 2 diabetes). Diabetes and Metabolic Syndrome: Clinical Research and Reviews. 2019;13:1165–72.
OpenUrl

[44] 44.↵
Lambert SA, Gil L, Jupp S, Ritchie SC, Xu Y, Buniello A, et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics. 2021;53:420–5.
OpenUrl

[45] 45.↵
Fan CC, Banks SJ, Thompson WK, Chen CH, McEvoy LK, Tan CH, et al. Sex-dependent polygenic effects on the clinical progressions of Alzheimer’s disease. bioRxiv. 2019;:613893.

[46] 46.↵
Alva ML, Hoerger TJ, Zhang P, Gregg EW. Identifying risk for type 2 diabetes in different age cohorts: Does one size fit all? BMJ Open Diabetes Res Care. 2017;5:e000447.
OpenUrl Abstract/FREE Full Text

[47] 47.↵
Padilla-Martínez F, Collin F, Kwasniewski M, Kretowski A. Systematic review of polygenic risk scores for type 1 and type 2 diabetes. International Journal of Molecular Sciences. 2020;21:1703.
OpenUrl

[48] 48.↵
Thomas M, Sakoda LC, Hoffmeister M, Rosenthal EA, Lee JK, van Duijnhoven FJB, et al. Response to Li and Hopper. American Journal of Human Genetics. 2021;108:527–9.
OpenUrl

[49] 49.↵
Li S, Hopper JL. Age dependency of the polygenic risk score for colorectal cancer. American Journal of Human Genetics. 2021;108:525–6.
OpenUrl

Body mass index and birth weight improve polygenic risk score for type 2 diabetes

Abstract

Introduction

Methods

UK Biobank (UKB) data

Polygenic risk score calculation

Composite risk score

Evaluation of the results

Results

PRS and BMI

PRS and birth weight

PRS and body size at age ten

Discussion

Data Availability

Funding

Competing interests

Ethics and Regulation

Supplementary Figures

Acknowledgments

References

Citation Manager Formats

Subject Area