Weighted burden analysis in 200,000 exome-sequenced subjects characterises rare variant effects on risk of type 2 diabetes
==========================================================================================================================

* David Curtis

## Abstract

Type 2 diabetes (T2D) is a disease for which both common genetic variants and environmental factors influence risk. A few genes have been identified in which very rare variants have large effects on risk and here we carry out a weighted burden analysis of rare variants in a sample of over 200,000 exome-sequenced participants in the UK Biobank project, of whom over 13,000 have T2D. Variant weights were allocated based on allele frequency and predicted effect, as informed by a previous analysis of hyperlipidaemia. There was an exome-wide significant increased burden of rare, functional variants in three genes, *GCK, HNF4A* and *GIGYF1. GIGYF1* has not previously been identified as a diabetes risk gene but its product is plausibly involved in the modification of insulin signalling. A number of other genes did not attain exome-wide significance but were highly ranked and potentially of interest, including *ALAD, PPARG, GYG1* and *GHRL*. Loss of function (LOF) variants were associated with T2D in *GCK* and *GIGYF1* whereas nonsynonymous variants annotated as probably damaging were associated in *GCK* and *HNF4A*. Overall, fewer than 1% of T2D cases carried one of these variants. In two genes previously implicated in diabetes aetiology, *HNF1A* and *HNF1B*, there was an excess of LOF variants among cases but the small numbers of these fell well short of statistical significance, suggesting that even larger datasets will be helpful for more fully elucidating the contribution of rare genetic variants to T2D risk. This research has been conducted using the UK Biobank Resource.

Keywords
*   Diabetes
*   biobank
*   exome
*   *GCK*
*   *HNF4A*
*   *GIGYF1*
*   *ALAD*
*   *PPARG*
*   *GYG1*
*   *GHRL*

## Introduction

Genome-wide association studies of type 2 diabetes (T2D) have implicated a large number of common genetic variants (Xue et al., 2018). In the UK Biobank, a genetic risk score derived from common variants was associated with T2D and incorporating it alongside conventional risk factors in order to predict T2D increased the area under the curve (AUC) from 0.851 to 0.855 (Chen et al., 2021). Common variants have individually modest effects on risk but a small number of genes have been identified in which rare coding variants with large effects can result in a phenotype of non-insulin dependent diabetes. Maturity onset diabetes of the young (MODY) is caused by mutations in a number of genes including *HNF1A, HNF4A, GCK, HNF1B, KCNJ11, ABCC8* (Murphy et al., 2008). Loss of function (LOF) and nonsynonymous variants in *PPARG* can cause a familial lipodystrophy with insulin resistant diabetes (Agostini et al., 2018). Recessively acting mutations in the *INSR* gene can cause insulin resistance with hyperglycaemia (Semple et al., 2011). These rare coding variants have typically been identified by targeted studies of familial cases and individuals with severe phenotypes but the availability of large samples of exome sequenced subjects now allows the possibility to explore the effects of variations in these genes in the population more broadly and potentially to discover novel genes. A study of 16,000 exome and genome sequenced cases and controls failed to detect a substantial contribution from rare variants to risk of type 2 diabetes (Fuchsberger et al., 2016). However a larger study using 20,791 cases and 24,440 controls identified three exome-wide significant genes, *MC4R, SLC30A8* and *PAM*, as well as specific variant in *MC4R*, rs79783591 (Ile269Asn), which was exome-wide significant in single variant analyses (Flannick et al., 2019).

Exome sequence data is now available for 200,000 of the 500,000 UK Biobank subjects (Szustakowski et al., 2020). We have recently analysed this in order to illuminate the effect of rare, coding variants on susceptibility to hyperlipidaemia and we now apply the same approach to study T2D (Curtis, 2021a).

## Methods

The UK Biobank dataset was downloaded along with the variant call files for 200,632 subjects who had undergone exome-sequencing and genotyping by the UK Biobank Exome Sequencing Consortium using the GRCh38 assembly with coverage 20X at 95.6% of sites on average (Szustakowski et al., 2020). UK Biobank had obtained ethics approval from the North West Multi-centre Research Ethics Committee which covers the UK (approval number: 11/NW/0382) and had obtained informed consent from all participants. The UK Biobank approved an application for use of the data (ID 51119) and ethics approval for the analyses was obtained from the UCL Research Ethics Committee (11527/001). All variants were annotated using the standard software packages VEP, PolyPhen and SIFT (Adzhubei et al., 2013; Kumar et al., 2009; McLaren et al., 2016). To obtain population principal components reflecting ancestry, version 2.0 of *plink* ([https://www.cog-genomics.org/plink/2.0/](https://www.cog-genomics.org/plink/2.0/)) was run with the options *--maf 0*.*1 --pca 20 approx* (Chang et al., 2015; Galinsky et al., 2016).

The UK Biobank sample contains 503,317 subjects of whom 94.6% are of white ethnicity. As we have discussed previously, it has become standard practice for investigators to simply discard data from participants with other ancestries and we regard this as regrettable (Curtis, 2021b). We demonstrated that if population principal components are included as covariates then it is possible to include all participants, regardless of ancestry, in the type of weighted burden analysis described here without any inflation of the test statistic.

To define cases, a similar approach was used as was previously implemented for the investigation of hyperlipidaemia (Curtis, 2021a, 2019). The T2D phenotype was determined from three sources in the dataset: self-reported diabetes or type 2 diabetes (but not type 1 or gestational diabetes); reporting taking any of a list of named medications commonly used to treat T2D in the UK ([https://www.diabetes.co.uk/Diabetes-drugs.html](https://www.diabetes.co.uk/Diabetes-drugs.html)); having an ICD10 code for non-insulin-dependent diabetes mellitus in hospital records or as a cause of death. Subjects in any of these categories were deemed to be cases while all other subjects were taken to be controls.

The SCOREASSOC program was used to carry out a weighted burden analysis to test whether, in each gene, sequence variants which were rarer and/or predicted to have more severe functional effects occurred more commonly in cases than controls. Attention was restricted to rare variants with minor allele frequency (MAF) <= 0.01 in both cases and controls. As previously described, variants were weighted by overall MAF so that variants with MAF=0.01 were given a weight of 1 while very rare variants with MAF close to zero were given a weight of 10 (Curtis, 2021b). Variants were also weighted according to their functional annotation using the GENEVARASSOC program, which was used to generate input files for weighted burden analysis by SCOREASSOC (Curtis, 2016, 2012). The weights were informed from the analysis of the effects of different categories of variant in *LDLR* on hyperlipidaemia risk (Curtis, 2021a). Variants predicted to cause complete loss of function (LOF) of the gene were assigned a weight of 100. Nonsynonymous variants were assigned a weight of 5 but if PolyPhen annotated them as possibly or probably damaging then 5 or 10 was added to this and if SIFT annotated them as deleterious then 20 was added. In order to allow exploration of the effects of different types of variant on disease risk the variants were also grouped into broader categories to be used in multivariate analyses as described below. The full set of weights and categories is displayed in Table 1. As described previously, the weight due to MAF and the weight due to functional annotation were multiplied together to provide an overall weight for each variant. Variants were excluded if there were more than 10% of genotypes missing in the controls or if the heterozygote count was smaller than both homozygote counts in the controls. If a subject was not genotyped for a variant then they were assigned the subject-wise average score for that variant. For each subject a gene-wise weighted burden score was derived as the sum of the variant-wise weights, each multiplied by the number of alleles of the variant which the given subject possessed. For variants on the X chromosome, hemizygous males were treated as homozygotes.

View this table:
[Table 1.](http://medrxiv.org/content/early/2021/01/21/2021.01.08.21249453/T1)

Table 1. 
The table shows the weight which was assigned to each type of variant as annotated by VEP, Polyphen and SIFT as well as the broad categories which were used for multivariate analyses of variant effects (Adzhubei et al., 2013; Kumar et al., 2009; McLaren et al., 2016).

For each gene, logistic regression analysis was carried out including the first 20 population principal components and sex as covariates and a likelihood ratio test was performed comparing the likelihoods of the models with and without the gene-wise burden score. The statistical significance was summarised as a signed log p value (SLP), which is the log base 10 of the p value given a positive sign if the score is higher in cases and negative if it is higher in controls.

Gene set analyses were carried out as before using the 1454 “all GO gene sets, gene symbols” pathways as listed in the file *c5*.*all*.*v5*.**.*symbols*.*gmt* downloaded from the Molecular Signatures Database at [http://www.broadinstitute.org/gsea/msigdb/collections.jsp](http://www.broadinstitute.org/gsea/msigdb/collections.jsp) (Subramanian et al., 2005). For each set of genes, the natural logs of the gene-wise p values were summed according to Fisher’s method to produce a chi-squared statistic with degrees of freedom equal to twice the number of genes in the set. The p value associated with this chi-squared statistic was expressed as a minus log10 p (MLP) as a test of association of the set with the hyperlipidaemia phenotype.

For selected genes, additional analyses were carried out to clarify the contribution of different categories of variant. As described previously, logistic regression analyses were performed on the counts of the separate categories of variant as listed in Table 1, again including principal components and sex as covariates, to estimate the effect size for each category (Curtis, 2021a). The odds ratios associated with each category were estimated along with their standard errors and the Wald statistic was used to obtain a p value, except for categories in which variants occurred fewer than 50 times in which case Fisher’s exact test was applied to the variant counts. The associated p value was converted to an SLP, again with the sign being positive if the mean count was higher in cases than controls.

Data manipulation and statistical analyses were performed using GENEVARASSOC, SCOREASSOC and R (R Core Team, 2014).

## Results

There were 13,938 cases of T2D and 186,694 controls. There were 20,384 genes for which there were qualifying variants. Given that there were 20,384 informative genes, the critical threshold for the absolute value of the SLP to declare a result as formally statistically significant is -log10(0.05/20384) = 5.61 and this was achieved by three genes, *GCK* (SLP = 22.24), *HNF4A* (SLP = 6.82) and *GIGYF1* (SLP = 6.22). The quantile-quantile (QQ) plot for the SLPs obtained for all genes except *GCK* is shown in Figure 1. This shows that the test appears to be well-behaved and conforms well with the expected distribution. Omitting the genes with the 100 highest and 100 lowest SLPs, which might be capturing a real biological effect, the gradient for positive SLPs is 1.06 with intercept at -0.007 and the gradient for negative SLPs is 1.02 with intercept at -0.009, indicating only modest inflation of the test statistic in spite of the fact that participants of all ancestries are included.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/21/2021.01.08.21249453/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2021/01/21/2021.01.08.21249453/F1)

Figure 1. 
QQ plot of SLPs obtained for weighted burden analysis of association with hyperlipidaemia showing observed against expected SLP for each gene, omitting results for GCK, which has SLP = 22.25.

Variants in *GCK* are recognised as the cause of up to half of cases of MODY, itself accounting for around 1-2% of cases of all diabetes diagnoses (Bishay and Greenfield, 2016). Likewise, *HNF4A* variants cause 5-10% of cases of MODY (Naylor et al., 2018). By contrast, *GIGYF1* has not previously been implicated in the aetiology of diabetes although it is known that its product binds to growth factor receptor-bound protein 10 (GRB10) and has a role in modulating the insulin-like growth factor (IGF1) receptor signalling pathway (Giovannone et al., 2003; Zhou et al., 2018). A variant in *GRB10* has been reported to be associated with decreased early-phase insulin secretion and the muscle-specific ablation of *Grb10* in mice causes increased glucose uptake into muscles with increased insulin signalling (Holt et al., 2018; Lyssenko et al., 2015). However *GRB10* itself showed no evidence for association in T2D in the current analysis (SLP = -0.02).

Table 2 shows all the genes achieving SLP with absolute value greater than 3, equivalent to an uncorrected p value of 0.001. Given that 20,384 genes were tested, one would expect that by chance about 20 would reach this level of significance whereas in fact there are 32. Thus it is possible that some of these highly ranked genes do demonstrate a biological signal which fails to reach statistical significance after correction for multiple testing and some of them seem worth commenting on. The expression of *ALAD* (SLP = 3.63) is reduced in obese subjects while the expression of Alad is reduced in rats with high-fat diet-induced weight gain (Moreno-Navarrete et al., 2017). Additionally, inhibition of ALAD with aminotriazole led to reduced glucose uptake in cultured human adipocytes. The common P12A variant of *PPARG* (SLP = 3.45) reduces risk of T2D whereas rare LOF variants and nonsynonymous variants which cause reduced activity (occurring in approximately 1 in 1,000 individuals) substantially increase risk (Majithia et al., 2014). Damaging variants in *GYG1* (SLP = 3.22) cause deficiency of glycogenin 1, resulting in glycogen storage myopathies, but have not been reported to be associated with diabetes (Ben Yaou et al., 2017). *GHRL* (SLP = -3.15) encodes the ghrelin-obestatin preproprotein which is cleaved to yield two peptides, ghrelin and obestatin, which are involved in appetite and energy metabolism and there have been some studies which have claimed that the common Leu72Met (rs696217) variant is associated with reduced risk of T2D although the effect does not seem to be consistent and the gene was not highlighted in a large GWAS meta-analysis (Rivera-León et al., 2020; Xue et al., 2018). The results for all sets are provided in Supplementary Table S1.

View this table:
[Table 2.](http://medrxiv.org/content/early/2021/01/21/2021.01.08.21249453/T2)

Table 2. 
Genes with absolute value of SLP exceeding 3 or more (equivalent to p<0.001) for test of association of weighted burden score with T2D.

In order to see if any additional genes were highlighted by analysing gene sets, gene set analysis was performed as described above after first removing all genes with absolute SLP value greater than 3. Given that 1,454 sets were tested a critical MLP to achieve to declare results significant after correction for multiple testing would be log10(1454*20) = 4.46 and this was not achieved by any set. There were two sets with MLP > 3, EXTERNAL SIDE OF PLASMA MEMBRANE (MLP = 3.29) and MONOSACCHARIDE TRANSMEMBRANE TRANSPORTER ACTIVITY (MLP = 3.06). The latter is of some interest because it consists of 10 genes, of which three were individually significant at p<0.05, these being *SLC2A2* (SLP = 2.54), *SLC2A3* (SLP = -2.37) and *SLC2A4* (SLP = 1.70). *SLC2A2*, previously known as *GLUT2*, codes for a glucose transporter expressed by beta cells which senses glucose levels and recessively acting variants in it can cause neonatal diabetes (Sansbury et al., 2012). A common intronic variant of *SLC2A2*, rs8192675, is associated with the glycaemic response to metformin (Zhou et al., 2016). *SLC2A4* codes for a glucose transporter whose levels in cell membranes increase in response to insulin but although candidate gene studies claim that common variants in it are associated with T2D these results are not supported by properly powered GWAS metanalysis (Hu et al., 2019; Xue et al., 2018). The results for all sets are provided in Supplementary Table S2.

For the genes of possible interest listed above, a logistic regression analysis of different categories of variant was carried out to elucidate their relative contributions. The results for the three exome-wide significant genes are shown in Table 3, which shows differences between the genes relating to the implicated pattern of variants. The results for *GCK* demonstrate that splice site variants and gene disruptive variants, comprising frameshift and stop variants, are associated with large effects on risk. These occur a total of 17 times among the 13,938 cases. However of note is that nonsynonymous variants annotated as probably damaging by PolyPhen are also associated with increased risk, with OR = 2.97 (1.59 - 5.54), and these occur 33 times among cases. The situation for *HNF4A* is quite different. There are no splice site variants and only 6 gene disruptive variants and these all occur in controls. Only probably damaging nonsynonymous variants show an effect, with OR = 2.97 (1.61 - 5.50), and these occur 34 times among cases. Finally, for *GIGYF1* probably damaging nonsynonymous variants have no discernible effect and it is only the splice site (OR = 7.70 (2.62 - 22.67)) and disruptive (OR = 5.65 (3.07 - 10.40)) variants which increase risk and these occur 24 times in cases.

View this table:
[Table 3.](http://medrxiv.org/content/early/2021/01/21/2021.01.08.21249453/T3)

Table 3. 
Results from logistic regression analysis showing the effects on risk of T2D of different categories of variant within exome-wide significant genes and genes of interest with absolute value of gene-wise SLP > 3. Odds ratios for each category are estimated including principal components and sex as covariates.

Table 3 also shows the results for the genes which, while not exome-wide significant, had SLP with magnitude >3 and which appeared to be potentially of interest, *ALAD, PPARG, GYG1* and *GHRL*. The signal for *ALAD* seems to be driven by the fact that although splice site and disruptive variants occur only a total of 5 times in cases this still makes them about 7 times as frequent as in controls. The signal for *PPARG* arises from the fact that disruptive variants occur 4 times in cases and 7 times in controls, OR = 8.23 (2.29 - 29.63). The findings for *GYG1* seem somewhat more robust, being based on an excess of both disruptive (OR = 1.98 (1.40 - 2.81)) and splice site (OR = 3.08 (0.96 - 9.81)) variants in cases, with a total of 37 occurrences. For *GHRL*, which has SLP = -3,15, implying that variants in it might be protective, no individual category has a statistically significant effect and there is more a general tendency for there to be fewer variants among cases which is spread over a number of categories, including 3 prime UTR, protein altering, indel, disruptive and deleterious.

The analyses described above failed to highlight a number of genes which have previously been implicated in T2D, comprising *HNF1A* (SLP = 1.66), *HNF1B* (SLP = -0.28), *ABCC8* (SLP = 1.94), *INSR* (SLP = -0.25), *MC4R* (SLP = 1.41), *SLC30A8* (SLP = -2.64) and *PAM* (SLP = 0.19). The association with T2D for each category of variant within these genes is shown in Table 4. From this it can be seen that LOF variants in *HNF1A* and *HNF1B* are commoner in cases than controls but that their absolute numbers are too small to produce statistically significant effects. By contrast, LOF variants have higher overall frequency in *ABCC8* and they have approximately equal frequencies in cases and controls. These results are consistent with reports that it is only activating variants in *ABCC8* result in diabetes whereas LOF variants can cause hyperinsulinaemia (De Franco et al., 2020). The overall frequencies of different category of nonsynonymous variant do not vary between cases and controls, possibly reflecting the inability of SIFT and PolyPhen to distinguish variants which have a gain of function effect. The frequency of LOF variants in *INSR* are similar in cases and controls, indicating that, although recessively acting variants can cause infantile hyperinsulinaemia followed by insulin dependent diabetes, the loss of function of a single copy of this gene has little discernible effect on risk of T2D (Semple et al., 2011). The results for *MC4R* suggest that variants which are disruptive or annotated as deleterious or damaging may be associated with a moderate increase in risk, with OR 1.2 – 1.5, but no individual category is statistically significant. Unfortunately genotypes for the single *MC4R* variant previously reported to be associated with T2D, rs79783591 producing an Ile269Asn substitution, were not provided in the supplied dataset even though it is clearly exonic and genotypes for nearby variants are available. The frequency of nonsynonymous variants annotated as deleterious in *SLC30A8* is higher in controls than cases, consistent with previous findings that missense variants in this gene typically result in reduced T2D risk, but again this is not statistically significant (Flannick et al., 2019). In *PAM* there is no real suggestion of any signal among the included variants. However the frequency criteria used excluded the two variants which had previously been individually implicated statistically and through functional studies (Thomsen et al., 2018). When these were analysed separately they both showed evidence for a small effect on risk, with rs35658696 (Asp563Gly) having SLP = 5.68, OR = 1.15 (1.08 – 1.21) and rs78408340 having SLP = 6.92, OR = 1.20 (1.13 – 1.28).

View this table:
[Table 4.](http://medrxiv.org/content/early/2021/01/21/2021.01.08.21249453/T4)

Table 4. 
Results from logistic regression analysis showing effect on risk of T2D of different categories of variant within genes previously implicated in diabetes pathogenesis.

## Discussion

These analyses provide a broad overview of the way very rare genetic variants contribute to risk of type 2 diabetes in a large sample broadly representative of the population. The weighted burden analysis successfully identifies two known diabetes genes, *GCK* and *HNF4A*, and implicates a novel gene, *GIGYF1*. A few other biologically plausible genes do not reach formal levels of statistical significance after correction for multiple testing but might be worthy of further investigation, including *ALAD, PPARG, GYG1, GHRL, SCL2A2* and *SLC2A4*. Any possible role for these genes will become clearer as additional data becomes available, for example from the remaining 300,000 UK Biobank subjects for whom exome sequence data has not yet been released. Typically, hundreds of variants are identified per gene, mostly occurring in only a handful of subjects each. The findings for some previously implicated genes, *HNF1A, HNF1B, ABCC8, MC4R* and *SLC30A8*, are not formally statistically significant but are consistent with an effect which could be detected once the sample size is increased.

Together, variants which can be identified as having large effects on risk occur in fewer than 1% of the cases of T2D in this sample. There is no doubt that identifying specific genetic causes may be useful to guide treatment for some patients (Agostini et al., 2018). However it needs to be acknowledged that for the vast majority of patients with T2D exome sequencing will be not be helpful in terms of identifying specific subtypes of disease which might benefit from specific treatments. Thus, the potential to apply a personalised medicine approach to T2D based on genetic testing seems to be somewhat limited.

The main potential utility of genetic investigations such as this might be to better characterise the mechanisms which can lead to disease, identify novel drug targets and develop improved therapeutic approaches which would benefit T2D patients in general, rather than only the small number carrying the relevant genetic variant. If some of the tentative findings reported here can be replicated then the genes identified could become the objects of more intensive investigation.

## Conflicts of interest

The author declares he has no conflict of interest.

## Data availability

The raw data is available on application to UK Biobank. Detailed results with variant counts cannot be made available because they might be used for subject identification. Scripts and relevant derived variables will be deposited in UK Biobank.

## Acknowledgments

This research has been conducted using the UK Biobank Resource. The author wishes to acknowledge the staff supporting the High Performance Computing Cluster, Computer Science Department, University College London. This work was carried out in part using resources provided by BBSRC equipment grant BB/R01356X/1. The author wishes to thank the participants who volunteered for the UK Biobank project.

## Footnotes

*   Added additional citations and results for additional genes.

*   Received January 8, 2021.
*   Revision received January 21, 2021.
*   Accepted January 21, 2021.


*   © 2021, Posted by Cold Spring Harbor Laboratory

The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.

## References

1.  Adzhubei, I., Jordan, D.M., Sunyaev, S.R. (2013) Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 7 Unit7.20.
    
    
2.  Agostini, M., Schoenmakers, E., Beig, J., Fairall, L., Szatmari, I., Rajanayagam, O., Muskett, F.W., Adams, C., Marais, A.D., O’Rahilly, S., Semple, R.K., Nagy, L., Majithia, A.R., Schwabe, J.W.R., Blom, D.J., Murphy, R., Chatterjee, K., Savage, D.B. (2018) A pharmacogenetic approach to the treatment of patients with PPARG mutations. Diabetes 67, 1086–1092.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2337/db18-1086-P&link_type=DOI) 

3.  Ben Yaou, R., Hubert, A., Nelson, I., Dahlqvist, J.R., Gaist, D., Streichenberger, N., Beuvin, M., Krahn, M., Petiot, P., Parisot, F., Michel, F., Malfatti, E., Romero, N., Carlier, R.Y., Eymard, B., Labrune, P., Duno, M., Krag, T., Cerino, M., Bartoli, M., Bonne, G., Vissing, J., Laforet, P., Petit, F.M. (2017) Clinical heterogeneity and phenotype/genotype findings in 5 families with GYG1 deficiency. Neurol. Genet. 3, e208.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoibm5nIjtzOjU6InJlc2lkIjtzOjg6IjMvNi9lMjA4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMjEvMjAyMS4wMS4wOC4yMTI0OTQ1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

4.  Bishay, R.H., Greenfield, J.R. (2016) A review of maturity onset diabetes of the young (MODY) and challenges in the management of glucokinase-MODY. Med. J. Aust. 205, 480–485.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5694/mja16.00458&link_type=DOI) 

5.  Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., Lee, J.J. (2015) Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13742-015-0047-8&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25722852&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

6.  Chen, X., Liu, C., Si, S., Li, Y., Li, W., Yuan, T., Xue, F. (2021) Genomic risk score provides predictive performance for type 2 diabetes in the UK biobank. Acta Diabetol. 1, 3.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00592-020-01650-1&link_type=DOI) 

7.  Curtis, D. (2012) A rapid method for combined analysis of common and rare variants at the level of a region, gene, or pathway. Adv Appl Bioinform Chem 5, 1–9.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22888262&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

8.  Curtis, D. (2016) Pathway analysis of whole exome sequence data provides further support for the involvement of histone modification in the aetiology of schizophrenia. Psychiatr. Genet. 26, 223–7.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/YPG.0000000000000132&link_type=DOI) 

9.  Curtis, D. (2019) A weighted burden test using logistic regression for integrated analysis of sequence variants, copy number variants and polygenic risk score. Eur. J. Hum. Genet. 27, 114–124.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41431-018-0272-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

10. Curtis, D. (2021a) Analysis of 200,000 exome-sequenced UK Biobank subjects illustrates the contribution of rare genetic variants to hyperlipidaemia. medRxiv.
    
    
11. Curtis, D. (2021b) Multiple Linear Regression Allows Weighted Burden Analysis of Rare Coding Variants in an Ethnically Heterogeneous Population. Hum. Hered. 1–10.
    
    
12. De Franco, E., Saint-Martin, C., Brusgaard, K., Knight Johnson, A.E., Aguilar-Bryan, L., Bowman, P., Arnoux, J.B., Larsen, A.R., May, S., Greeley, S.A.W., Calzada-León, R., Harman, B., Houghton, J.A.L., Nishimura-Meguro, E., Laver, T.W., Ellard, S., del Gaudio, D., Christesen, H.T., Bellanné-Chantelot, C., Flanagan, S.E. (2020) Update of variants identified in the pancreatic β-cell KATP channel genes KCNJ11 and ABCC8 in individuals with congenital hyperinsulinism and diabetes. Hum. Mutat. 41, 884–905.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.23995&link_type=DOI) 

13. Flannick, J., Mercader, J.M., Fuchsberger, C., Udler, M.S., Mahajan, A., Wessel, J., Teslovich, T.M., Caulkins, L., Koesterer, R., Barajas-Olmos, F., Blackwell, T.W., Boerwinkle, E., Brody, J.A., Centeno-Cruz, F., Chen, L., Chen, S., Contreras-Cubas, C., Córdova, E., Correa, A., Cortes, M., DeFronzo, R.A., Dolan, L., Drews, K.L., Elliott, A., Floyd, J.S., Gabriel, S., Garay-Sevilla, M.E., Garcí a-Ortiz, H., Gross, M., Han, S., Heard-Costa, N.L., Jackson, A.U., Jørgensen, M.E., Kang, H.M., Kelsey, M., Kim, B.J., Koistinen, H.A., Kuusisto, J., Leader, J.B., Linneberg, A., Liu, C.T., Liu, J., Lyssenko, V., Manning, A.K., Marcketta, A., Malacara-Hernandez, J.M.,Martí nez-Hernández, A., Matsuo, K., Mayer-Davis, E., Mendoza-Caamal, E., Mohlke, K.L., Morrison, A.C., Ndungu, A., Ng, M.C.Y., O’Dushlaine, C., Payne, A.J., Pihoker, C., Post, W.S., Preuss, M., Psaty, B.M., Vasan, R.S., Rayner, N.W., Reiner, A.P., Revilla-Monsalve, C., Robertson, N.R., Santoro, N., Schurmann, C., So, W.Y., Soberón, X., Stringham, H.M., Strom, T.M., Tam, C.H.T., Thameem, F., Tomlinson, B., Torres, J.M., Tracy, R.P., van Dam, R.M., Vujkovic, M., Wang, S., Welch, R.P., Witte, D.R., Wong, T.Y., Atzmon, G., Barzilai, N., Blangero, J., Bonnycastle, L.L., Bowden, D.W., Chambers, J.C., Chan, E., Cheng, C.Y., Cho, Y.S., Collins, F.S., de Vries, P.S., Duggirala, R., Glaser, B., Gonzalez, C., Gonzalez, M.E., Groop, L., Kooner, J.S., Kwak, S.H., Laakso, M., Lehman, D.M., Nilsson, P., Spector, T.D., Tai, E.S., Tuomi, T., Tuomilehto, J., Wilson, J.G., Aguilar-Salinas, C.A., Bottinger, E., Burke, B., Carey, D.J., Chan, J.C.N., Dupuis, J., Frossard, P., Heckbert, S.R., Hwang, M.Y., Kim, Y.J., Kirchner, H.L., Lee, J.Y., Lee, J., Loos, R.J.F., Ma, R.C.W., Morris, A.D., O’Donnell, C.J., Palmer, C.N.A., Pankow, J., Park, K.S., Rasheed, A., Saleheen, D., Sim, X., Small, K.S., Teo, Y.Y., Haiman, C., Hanis, C.L., Henderson, B.E., Orozco, L., Tusié-Luna, T., Dewey, F.E., Baras, A., Gieger, C., Meitinger, T., Strauch, K., Lange, L., Grarup, N., Hansen, T., Pedersen, O., Zeitler, P., Dabelea, D., Abecasis, G., Bell, G.I., Cox, N.J., Seielstad, M., Sladek, R., Meigs, J.B., Rich, S.S., Rotter, J.I., Altshuler, D., Burtt, N.P., Scott, L.J., Morris, A.P., Florez, J.C., McCarthy, M.I., Boehnke, M. (2019) Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls. Nature 570, 71–76.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-019-1231-2&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31118516&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

14. Fuchsberger, C., Flannick, J., Teslovich, T.M., Mahajan, A., Agarwala, V., Gaulton, K.J., Ma, C., Fontanillas, P., Moutsianas, L., McCarthy, D.J., Rivas, M.A., Perry, J.R.B., Sim, X., Blackwell, T.W., Robertson, N.R., Rayner, N.W., Cingolani, P., Locke, A.E., Tajes, J.F., Highland, H.M., Dupuis, J., Chines, P.S., Lindgren, C.M., Hartl, C., Jackson, A.U., Chen, H., Huyghe, J.R., Van De Bunt, M., Pearson, R.D., Kumar, A., Müller-Nurasyid, M., Grarup, N., Stringham, H.M., Gamazon, E.R., Lee, Jaehoon, Chen, Y., Scott, R.A., Below, J.E., Chen, P., Huang, J., Go, M.J., Stitzel, M.L., Pasko, D., Parker, S.C.J., Varga, T. V., Green, T., Beer, N.L., Day-Williams, A.G., Ferreira, T., Fingerlin, T., Horikoshi, M., Hu, C., Huh, I., Ikram, M.K., Kim, B.J., Kim, Y., Kim, Y.J., Kwon, M.S., Lee, Juyoung, Lee, S., Lin, K.H., Maxwell, T.J., Nagai, Y., Wang, X., Welch, R.P., Yoon, J., Zhang, W., Barzilai, N., Voight, B.F., Han, B.G., Jenkinson, C.P., Kuulasmaa, T., Kuusisto, J., Manning, A., Ng, M.C.Y., Palmer, N.D., Balkau, B., Stančáková, A., Abboud, H.E., Boeing, H., Giedraitis, V., Prabhakaran, D., Gottesman, O., Scott, J., Carey, J., Kwan, P., Grant, G., Smith, J.D., Neale, B.M., Purcell, S., Butterworth, A.S., Howson, J.M.M., Lee, H.M., Lu, Y., Kwak, S.H., Zhao, W., Danesh, J., Lam, V.K.L., Park, K.S., Saleheen, D., So, W.Y., Tam, C.H.T., Afzal, U., Aguilar, D., Arya, R., Aung, T., Chan, E., Navarro, C., Cheng, C.Y., Palli, D., Correa, A., Curran, J.E., Rybin, D., Farook, V.S., Fowler, S.P., Freedman, B.I., Griswold, M., Hale, D.E., Hicks, P.J., Khor, C.C., Kumar, S., Lehne, B., Thuillier, D., Lim, W.Y., Liu, J., Van Der Schouw, Y.T., Loh, M., Musani, S.K., Puppala, S., Scott, W.R., Yengo, L., Tan, S.T., Taylor, H.A., Thameem, F., Wilson, G., Wong, T.Y., Njolstad, P.R., Levy, J.C., Mangino, M., Bonnycastle, L.L., Schwarzmayr, T., Fadista, J., Surdulescu, G.L., Herder, C., Groves, C.J., Wieland, T., Bork-Jensen, J., Brandslund, I., Christensen, C., Koistinen, H.A., Doney, A.S.F., Kinnunen, L., Esko, T., Farmer, A.J., Hakaste, L., Hodgkiss, D., Kravic, J., Lyssenko, V., Hollensted, M., Jorgensen, M.E., Jorgensen, T., Ladenvall, C., Justesen, J.M., Kä rä jä mäki, A., Kriebel, J., Rathmann, W., Lannfelt, L., Lauritzen, T., Narisu, N., Linneberg, A., Melander, O., Milani, L., Neville, M., Orho-Melander, M., Qi, L., Qi, Q., Roden, M., Rolandsson, O., Swift, A., Rosengren, A.H., Stirrups, K., Wood, A.R., Mihailov, E., Blancher, C., Carneiro, M.O., Maguire, J., Poplin, R., Shakir, K., Fennell, T., DePristo, M., De Angelis, M.H., Deloukas, P., Gjesing, A.P., Jun, G., Nilsson, P., Murphy, J., Onofrio, R., Thorand, B., Hansen, T., Meisinger, C., Hu, F.B., Isomaa, B., Karpe, F., Liang, L., Peters, A., Huth, C., O’Rahilly, S.P., Palmer, C.N.A., Pedersen, O., Rauramaa, R., Tuomilehto, J., Salomaa, V., Watanabe, R.M., Syvänen, A.C., Bergman, R.N., Bharadwaj, D., Bottinger, E.P., Cho, Y.S., Chandak, G.R., Chan, J.C.N., Chia, K.S., Daly, M.J., Ebrahim, S.B., Langenberg, C., Elliott, P., Jablonski, K.A., Lehman, D.M., Jia, W., Ma, R.C.W., Pollin, T.I., Sandhu, M., Tandon, N., Froguel, P., Barroso, I., Teo, Y.Y., Zeggini, E., Loos, R.J.F., Small, K.S., Ried, J.S., DeFronzo, R.A., Grallert, H., Glaser, B., Metspalu, A., Wareham, N.J., Walker, M., Banks, E., Gieger, C., Ingelsson, E., Im, H.K., Illig, T., Franks, P.W., Buck, G., Trakalo, J., Buck, D., Prokopenko, I., Mägi, R., Lind, L., Farjoun, Y., Owen, K.R., Gloyn, A.L., Strauch, K., Tuomi, T., Kooner, J.S., Lee, J.Y., Park, T., Donnelly, P., Morris, A.D., Hattersley, A.T., Bowden, D.W., Collins, F.S., Atzmon, G., Chambers, J.C., Spector, T.D., Laakso, M., Strom, T.M., Bell, G.I., Blangero, J., Duggirala, R., Tai, E.S., McVean, G., Hanis, C.L., Wilson, J.G., Seielstad, M., Frayling, T.M., Meigs, J.B., Cox, N.J., Sladek, R., Lander, E.S., Gabriel, S., Burtt, N.P., Mohlke, K.L., Meitinger, T., Groop, L., Abecasis, G., Florez, J.C., Scott, L.J., Morris, A.P., Kang, H.M., Boehnke, M., Altshuler, D., McCarthy, M.I. (2016) The genetic architecture of type 2 diabetes. Nature 536, 41–47.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature18642&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27398621&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

15. Galinsky, K.J., Bhatia, G., Loh, P.R., Georgiev, S., Mukherjee, S., Patterson, N.J., Price, A.L. (2016) Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia. Am. J. Hum. Genet. 98, 456–472.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2015.12.022&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26924531&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

16. Giovannone, B., Lee, E., Laviola, L., Giorgino, F., Cleveland, K.A., Smith, R.J. (2003) Two novel proteins that are linked to insulin-like growth factor (IGF-I) receptors by the Grb10 adapter and modulate IGF-I signaling. J. Biol. Chem. 278, 31564–31573.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEyOiIyNzgvMzQvMzE1NjQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wMS8yMS8yMDIxLjAxLjA4LjIxMjQ5NDUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

17. Holt, L.J., Brandon, A.E., Small, L., Suryana, E., Preston, E., Wilks, D., Mokbel, N., Coles, C.A., White, J.D., Turner, N., Daly, R.J., Cooney, G.J. (2018) Ablation of Grb10 Specifically in Muscle Impacts Muscle Size and Glucose Metabolism in Mice. Endocrinology 159, 1339–1351.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1210/en.2017-00851&link_type=DOI) 

18. Hu, S., Ma, S., Li, X., Tian, Z., Liang, H., Yan, J., Chen, M., Tan, H. (2019) Relationships of SLC2A4, RBP4, PCK1, and PI3K gene polymorphisms with gestational diabetes mellitus in a Chinese population. Biomed Res. Int. 2019.
    
    
19. Kumar, P., Henikoff, S., Ng, P.C. (2009) Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nprot.2009.86&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19561590&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268858700007&link_type=ISI) 

20. Lyssenko, V., Groop, L., Prasad, R.B. (2015) Genetics of type 2 diabetes: It matters from which parent we inherit the risk. Rev. Diabet. Stud.
    
    
21. Majithia, A.R., Flannick, J., Shahinian, P., Guo, M., Bray, M.A., Fontanillas, P., Gabriel, S.B., Manning, A.K., Hartl, C., Agarwala, V., Green, T., Banks, E., DePristo, M., Poplin, R., Shakir, K., Fennell, T., Njølstad, P.R., Altshuler, D., Burtt, N.P., Fuchsberger, C., Kang, H.M., Sim, X., Ma, C., Locke, A.E., Blackwell, Thomas, Jackson, A., Teslovich, T.M., Stringham, H., Chines, P.S., Kwan, P., Huyghe, J.R., Tan, A., Jun, G., Stitzel, M., Bergman, R.N., Bonnycastle, L., Tuomilehto, J., Collins, F.S., Scott, L.J., Mohlke, K.L., Abecasis, G., Boehnke, M., Strom, T., Gieger, C., Müller-Nurasyid, M., Grallert, H., Kriebel, J., Ried, J., De Angelis, M.H., Huth, C., Meisinger, C., Peters, A., Rathmann, W., Strauch, K., Meitinger, T., Kravic, J., Algren, P., Ladenvall, C., Toumi, T., Isomaa, B., Groop, L., Gaulton, K., Moutsianas, L., Rivas, M., Pearson, R., Mahajan, A., Prokopenko, I., Kumar, A., Perry, J., Howie, B., Van De Bunt, M., Small, K., Lindgren, C.M., Lunter, G., Robertson, N., Rayner, W., Morris, A., Buck, D., Hattersley, A., Spector, T., McVean, G., Frayling, T., Donnelly, P., McCarthy, M., Gupta, N., Taylor, H., Fox, E., Newton-Cheh, C., Wilson, J.G., O’Donnell, C.J., Kathiresan, S., Hirschhorn, J., Seidman, J.G., Seidman, C., Williams, A.L., Jacobs, S.B.R., Moreno-Mací as, H., Huerta-Chagoya, A., Churchhouse, C., Luna, C.M., Garcí a-Ortí z, H., Gómez-Vázquez, M.J., Estrada, K., Mercader, J.M., Ripke, S., Neale, B., Stram, D.O., Fernández-López, J.C., Romero-Hidalgo, S., Aguilar-Delfín, I., Martí nez-Hernández, A., Centeno-Cruz, F., Mendoza-Caamal, E., Monsalve, C.R., Islas-Andrade, S., Córdova, E., Rodrí guez-Arellano, E., Soberón, X., González-Villalpando, M.E., Monroe, K., Wilkens, L., Kolonel, L.N., Le Marchand, L., Riba, L., Ordóñez-Sánchez, M.L., Guillén, R.R., Cruz-Bautista, I., Rodrí guez-Torres, M., Muñoz-Hernández, L.L., Sáenz, T., Gómez, D., Alvirde, U., Onofrio, R.C., Brodeur, W.M., Gage, D., Murphy, J., Franklin, J., Mahan, S., Ardlie, K., Crenshaw, A.T., Winckler, W., MacArthur, D.G., Florez, J.C., Haiman, C.A., Henderson, B.E., Aguilar-Salinas, C.A., González-Villalpando, C., Orozco, L., Tusié-Luna, T., Almeida, M., Asimit, J.L., Atzmon, G., Barber, M., Beer, N.L., Bell, G.I., Below, J., Blackwell, Tom, Blangero, J., Bowden, D.W., Chambers, J., Chen, H., Chen, P., Choi, S., Cingolani, P., Cornes, B.K., Cox, N., Day-Williams, A.G., Duggirala, R., Dupuis, J., Dyer, T., Feng, S., Fernandez-Tajes, J., Ferreira, T., Fingerlin, T.E., Frayling, T.M., Gamazon, E.R., Ghosh, S., Gloyn, A., Grossman, R.L., Grundstad, J., Hanis, C., Heath, A., Highland, H., Hirokoshi, M., Huh, I.S., Ikram, K., Jablonski, K.A., Kim, Y.J., Kato, N., Kim, J., King, C.R., Kooner, J., Kwon, M.S., Im, H.K., Laakso, M., Lam, K.K.Y., Lee, J., Lee, Selyeong, Lee, Sungyoung, Lehman, D.M., Li, H., Liu, X., Livne, O.E., Maller, J.B., Maxwell, T.J., Mazoure, A., McCarthy, M.I., Meigs, J.B., Min, B., Musani, S., Nagai, Y., Ng, M.C.Y., Nicolae, D., Oh, S., Palmer, N., Park, T., Pollin, T.I., Reich, D., Rivas, M.A., Seielstad, M., Cho, Y.S., Tai, E.S., Sladek, R., Smith, P., Tachmazidou, I., Torres, J., Trubetskoy, V., Willems, S.M., Wiltshire, S., Won, S., Wood, A.R., Xu, W., Teo, Y.Y., Yoon, J., Lee, J.Y., Zawistowski, M., Zeggini, E., Zhang, W., Zöllner, S., Rosen, E.D., Scolnick, E. (2014) Rare variants in PPARG with decreased activity in adipocyte differentiation are associated with increased risk of type 2 diabetes. Proc. Natl. Acad. Sci. U. S. A. 111, 13127– 13132.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTExLzM2LzEzMTI3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMjEvMjAyMS4wMS4wOC4yMTI0OTQ1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

22. McLaren, W., Gil, L., Hunt, S.E., Riat, H.S., Ritchie, G.R.S., Thormann, A., Flicek, P., Cunningham, F. (2016) The Ensembl Variant Effect Predictor. Genome Biol. 17, 122.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-016-0974-4&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27268795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

23. Moreno-Navarrete, J.M., Rodrí guez, A., Ortega, F., Becerril, S., Girones, J., Sabater-Masdeu, M., Latorre, J., Ricart, W., Frühbeck, G., Fernández-Real, J.M. (2017) Heme Biosynthetic Pathway is Functionally Linked to Adipogenesis via Mitochondrial Respiratory Activity. Obesity 25, 1723–1733.
    
    
24. Murphy, R., Ellard, S., Hattersley, A.T. (2008) Clinical implications of a molecular genetic classification of monogenic β-cell diabetes. Nat. Clin. Pract. Endocrinol.
    
    
25. Metab. Naylor, R., Johnson, A.K., Gaudio, D. del (2018) Maturity-Onset Diabetes of the Young Overview.
    
    
26. R Core Team (2014) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria., Austria.
    
    
27. Rivera-León, E.A., Llamas-Covarrubias, M.A., Sánchez-Enríquez,  S., Martí nez-López, E., González-Hita, M., Llamas-Covarrubias, I.M. (2020) Leu72Met polymorphism of GHRL gene decreases susceptibility to type 2 diabetes mellitus in a Mexican population. BMC Endocr. Disord. 20.
    
    
28. Sansbury, F.H., Flanagan, S.E., Houghton, J.A.L., Shuixian Shen, F.L., Al-Senani, A.M.S., Habeb, A.M., Abdullah, M., Kariminejad, A., Ellard, S., Hattersley, A.T. (2012) SLC2A2 mutations can cause neonatal diabetes, suggesting GLUT2 may have a role in human insulin secretion. Diabetologia 55, 2381–2385.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00125-012-2595-0&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22660720&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

29. Semple, R.K., Savage, D.B., Cochran, E.K., Gorden, P., O’Rahilly, S. (2011) Genetic syndromes of severe insulin resistance. Endocr. Rev.
    
    
30. Subramanian, A., Tamayo, P., Mootha, V.K., Mukherjee, S., Ebert, B.L., Gillette, M.A., Paulovich, A., Pomeroy, S.L., Golub, T.R., Lander, E.S., Mesirov, J.P. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102, 15545–15550.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTAyLzQzLzE1NTQ1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMjEvMjAyMS4wMS4wOC4yMTI0OTQ1My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

31. Szustakowski, J.D., Balasubramanian, S., Sasson, A., Khalid, S., Bronson, P.G., Kvikstad, E., Wong, E., Liu, D., Davis, J.W., Haefliger, C., Loomis, A.K., Mikkilineni, R., Noh, H.J., Wadhawan, S., Bai, X., Hawes, A., Krasheninina, O., Ulloa, R., Lopez, A., Smith, E.N., Waring, J., Whelan, C.D., Tsai, E.A., Overton, J., Salerno, W., Jacob, H., Szalma, S., Runz, H., Hinkle, G., Nioi, P., Petrovski, S., Miller, M.R., Baras, A., Mitnaul, L., Reid, J.G. (2020) Advancing Human Genetics Research and Drug Discovery through Exome Sequencing of the UK Biobank. medRxiv 2020.11.02.20222232.
    
    
32. Thomsen, S.K., Raimondo, A., Hastoy, B., Sengupta, S., Dai, X.Q., Bautista, A., Censin, J., Payne, A.J., Umapathysivam, M.M., Spigelman, A.F., Barrett, A., Groves, C.J., Beer, N.L., Manning Fox, J.E., McCarthy, M.I., Clark, A., Mahajan, A., Rorsman, P., MacDonald, P.E., Gloyn, A.L. (2018) Type 2 diabetes risk alleles in PAM impact insulin release from human pancreatic β-cells. Nat. Genet. 50, 1122–1131.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0173-1&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30054598&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

33. Xue, A., Wu, Yang, Zhu, Z., Zhang, F., Kemper, K.E., Zheng, Z., Yengo, L., Lloyd-Jones, L.R., Sidorenko, J., Wu, Yeda, Agbessi, M., Ahsan, H., Alves, I., Andiappan, A., Awadalla, P., Battle, A., Beutner, F., Bonder, M.J.J., Boomsma, D., Christiansen, M., Claringbould, A., Deelen, P., Esko, T., Favé, M.J., Franke, L., Frayling, T., Gharib, S., Gibson, G., Hemani, G., Jansen, R., Kä hönen, M., Kalnapenkis, A., Kasela, S., Kettunen, J., Kim, Y., Kirsten, H., Kovacs, P., Krohn, K., Kronberg-Guzman, J., Kukushkina, V., Kutalik, Z., Lee, B., Lehtimä ki, T., Loeffler, M., Marigorta, U.M., Metspalu, A., Milani, L., Müller-Nurasyid, M., Nauck, M., Nivard, M., Penninx, B., Perola, M., Pervjakova, N., Pierce, B., Powell, J., Prokisch, H., Psaty, B., Raitakari, O., Ring, S., Ripatti, S., Rotzschke, O., Ruëger, S., Saha, A., Scholz, M., Schramm, K., Seppä lä, I., Stumvoll, M., Sullivan, P., Teumer, A., Thiery, J., Tong, L., Tönjes, A., van Dongen, J., van Meurs, J., Verlouw, J., Völker, U., Võsa, U., Yaghootkar, H., Zeng, B., McRae, A.F., Visscher, P.M., Zeng, J., Yang, J. (2018) Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat. Commun. 9, 1–14.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-018-02974-x&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29317637&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

34. Zhou, K., Yee, S.W., Seiser, E.L., Van Leeuwen, N., Tavendale, R., Bennett, A.J., Groves, C.J., Coleman, R.L., Van Der Heijden, A.A., Beulens, J.W., De Keyser, C.E., Zaharenko, L., Rotroff, D.M., Out, M., Jablonski, K.A., Chen, L., Javorský, M., Zidzik, J., Levin, A.M., Keoki Williams, L., Dujic, T., Semiz, S., Kubo, M., Chien, H.C., Maeda, S., Witte, J.S., Wu, L., Tká, I., Kooy, A., Van Schaik, R.H.N., Stehouwer, C.D.A., Logie, L., Sutherland, C., Klovins, J., Pirags, V., Hofman, A., Stricker, B.H., Motsinger-Reif, A.A., Wagner, M.J., Innocenti, F., Hart, L.M., Holman, R.R., McCarthy, M.I., Hedderson, M.M., Palmer, C.N.A., Florez, J.C., Giacomini, K.M., Pearson, E.R. (2016) Variation in the glucose transporter gene SLC2A2 is associated with glycemic response to metformin. Nat. Genet. 48, 1055–1059.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3632&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27500523&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F21%2F2021.01.08.21249453.atom) 

35. Zhou, T., Ma, Y., Tang, J., Guo, F., Dong, M., Wei, Q. (2018) Modulation of IGF1R Signaling Pathway by GIGYF1 in High Glucose-Induced SHSY-5Y Cells. DNA Cell Biol. 37, 1044–1054.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/dna.2018.4336&link_type=DOI)