Exome sequencing reveals aggregates of rare variants in glycosyltransferase and other genes influencing immunoglobulin G and transferrin glycosylation ====================================================================================================================================================== * Arianna Landini * Paul R.H.J. Timmers * Azra Frkatović-Hodžić * Irena Trbojević-Akmačić * Frano Vučković * Tea Pribić * Regeneron Genetics Center * Gannie Tzoneva * Alan R. Shuldiner * Ozren Polašek * Caroline Hayward * Gordan Lauc * James F. Wilson * Lucija Klarić ## Abstract It is often difficult to be certain which genes underlie the effects seen in association studies. However, variants that disrupt the protein, such as predicted loss of function (pLoF) and missense variants, provide a shortcut to identify genes with a clear biological link to the phenotype of interest. Glycosylation is one of the most common post-translationalmodifications of proteins, and an important biomarker of both disease and its progression. Here, we utilised the power of genetic isolates, gene-based aggregation tests and intermediate phenotypes to assess the effect of rare (MAF<5%) pLoF and missense variants from whole exome sequencing on the N-glycome of plasma transferrin (N=1907) and immunoglobulin G (N=4912), and their effect on diseases. We identified significant gene-based associations for transferrin glycosylation at 5 genes (p<8.06×10−8) and for IgG glycan traits at 4 genes (p<1.19×10−7). Associations in three of these genes (*FUT8, MGAT3* and *RFXAP*) are driven by multiple rare variants simultaneously contributing to protein glycosylation. Association at *ST6GAL1*, with a 300-fold up-drifted variant in the Orkney Islands, was detectable by a single-point exome-wide association analysis. Glycome-associated aggregate associations are located in genes already known to have a biological link to protein glycosylation (*FUT6, FUT8* for transferrin; *FUT8, MGAT3* and *ST6GAL1* for IgG) but also in genes which have not been previously reported (e.g. *RFXAP* for IgG). To assess the potential impact of rare variants associated with glycosylation on other traits, we queried public repositories of gene-based tests, discovering a potential connection between transferrin glycosylation, *MSR1*, galectin-3, insulin-like growth factor 1 and diabetes. However, the exact mechanism behind these connections requires further elucidation. ## Introduction Genome-wide association studies (GWAS) have so far identified thousands of loci associated with human complex traits and diseases. However, the large majority of these variants are found in noncoding regions of the genome1, posing a challenge when attempting to uncover their functional impact on the phenotype. On the contrary, whole-exome sequencing (WES) studies offer the opportunity to identify rare variants of larger effect on the encoded protein, such as predicted loss of function (pLoF) and missense variants, for which causal biological mechanisms are generally easier to elucidate2. Methods for exome-wide rare variant analysis have been successfully employed to discover variants and genes associated with both complex molecular traits3 and diseases4,5. While single-variant tests, such as GWAS, are largely adopted to explore associations of common genetic variants with phenotypes of interest, they have little power to identify rare variant associations, due to the low number of observations. Therefore, a set of methods testing cumulative effects of multiple rare variants in genetic regions, where rare variants are grouped at the gene level (also known as ‘masks’) via a collapsing test, such as burden tests, or variance-component tests (e.g. sequence kernel association test, SKAT6) were developed. In addition to increasing the statistical power by aggregating multiple rare-variants, using genetically isolated populations can provide unique opportunities for novel discovery in an association study7. Recent bottlenecks, restricted immigration and limited population size lead to increased genetic drift. Consequently, in such populations some otherwise rare variants can substantially increase in frequency compared to the general population, therefore increasing association power for these variants. Glycosylation is one of the most frequent post-translational modifications, where sugar residues, called glycans, are attached to the surface of proteins. Changes in protein N-glycosylation patterns have been described in the ageing process8,9 and in a wide variety of complex diseases, including autoimmune diseases10, diabetes11, cardiovascular diseases12, neurodegenerative diseases13 and cancer14. Despite glycans having an important role in human health and serving as potential biomarkers in clinical prognosis and diagnosis15, we have just started scratching the surface of the complex network of genes regulating protein glycosylation. All studies published to date exploring the genetic regulation of total plasma protein, immunoglobulin G (IgG) and transferrin N-glycosylation have employed single variant-based GWAS tests, mostly uncovering common variants located in non-coding regions of the genome16–23. Rare variants contributing to glycan variation, and their impact on human health, thus remain unexplored. To address this knowledge gap, we used multiple gene-based aggregation tests to investigate how rare (MAF<5%) pLoF and missense variants from whole exome sequencing affect 51 transferrin (N = 1907) and 94 IgG (N = 4912) glycan traits in European-descent cohorts. IgG is both the most abundant antibody and one of the most abundant proteins in human serum. It contains evolutionary conserved N-glycosylation sites in the constant region of each of its heavy chains, occupied by biantennary, largely core-fucosylated and partially truncated glycan structures, that may carry a bisecting N-acetylglucosamine and sialic acid residues24,25. Transferrin is a blood plasma glycoprotein that binds iron (Fe) and consequently mediates its transport through blood plasma. Human transferrin has two N-glycosylation sites, with biantennary disialylated digalactosylated glycan structure without fucose being the most abundant glycan attached26,27. In this study, we used gene-based aggregation of rare variants to identify several genes associated with transferrin and IgG glycosylation traits. Significant genes include known protein glycosylation genes as well as novel genes with no previously known role in post-translational modification. Importantly, several associations would not have been detectable by single-point analysis and one association was detected thanks to enrichment of rare variants in population isolates. Finally, we highlight the impact of rare variation in these genes on health-related traits by performing gene-based aggregation tests of 116 health-related traits together with gene lookups in public repositories of gene-based association tests ## Results ### Exome variant annotation To assess the effect of rare genetic variants on glycosylation of two proteins, we sequenced the exomes of 4,801 participants of European ancestry. After quality control, a total of 233,820 distinct autosomal coding genetic variants were available in the ORCADES cohort (N=2090), 244,649 in the VIKING cohort (N=2106) and 340,203 in the CROATIA-Korcula cohort (N=2872). Percentages of variants for each effect category in the total sequenced coding variation are similar across the three cohorts (Table 1). More than half (∼53%) of the sequenced coding variants are missense variants, of which nearly half (∼28% of total coding variation) are classified as likely or possibly deleterious by multiple variant effect predictor algorithms (see Methods). The second most represented effect category is synonymous mutations (∼33%), followed by variants in splice regions (∼8%), predicted loss of function (pLoF) (∼4%) and inframe insertions/deletions (∼1.5%). Around one quarter of coding variants in the ORCADES and VIKING cohorts are singletons (minor allele count, MAC=1); this percentage is instead higher in CROATIA-Korcula cohort (∼35%), possibly due to the larger sequenced sample size. View this table: [Table 1:](http://medrxiv.org/content/early/2022/12/27/2022.12.26.22283911/T1) Table 1: Number of coding exome variants sequenced in the complete sample of 3 isolated cohorts. Counts and prevalence of autosomal variants observed in WES-targeted regions across all individuals in the ORCADES, CROATIA-Korcula and VIKING cohort, by type or functional class for all and for singleton variants (MAC= 1). ### Exome-wide aggregated rare variant analysis of transferrin and IgG glycomes We performed exome-wide gene-based tests across 51 transferrin traits (glycome subset of CROATIA-Korcula N = 948, VIKING N = 959) and 94 IgG glycan traits (glycome subset of ORCADES N = 1960, CROATIA-Korcula N = 1866, VIKING N = 1086), testing low frequency and rare (MAF <5%) pLoF and missense variants. In total, we identified 16 significant associations for transferrin-(Supplementary Table 1) and 32 significant associations for IgG-(Supplementary Table 2) glycan traits, at Bonferroni-corrected p-values of 8.06×10−8 and 1.19×10−7, respectively (Figure 1, Table 2). Most gene-aggregated rare variants were associated with protein-specific glycans (transferrin: variants in *FUT6, TIRAP, MSR1* and *FOXI1* genes, IgG: variants in *MGAT3, ST6GAL1* and *RFXAP* genes); only *FUT8* was associated with glycans from both proteins (Table 2, Supplementary Tables 1 and 2). Almost all identified genes encode key enzymes in protein glycosylation (*MGAT3, ST6GAL1, FUT6, FUT8*) or have been previously associated with transferrin and IgG glycan traits in GWAS analysis (*MSR1, FOXI1*)17,18. The exceptions are *TIRAP* and *RFXAP*, which have no previously known link to protein glycosylation. We successfully replicated (p-value < 3.2×10−4 for transferrin, p-value < 5.9×10−4 for IgG) associations of glycans with low-frequency and rare variants from 4 genes - *FUT6* and *TIRAP* with transferrin glycans, and *FUT8* and *MGAT3* for IgG glycans (Table 2) - as frequencies of variants in these genes are similar across the studied cohorts (Supplementary Table 3). While the associations of IgG glycans and variants from *FUT8* replicated, the association of transferrin glycans with variants from the same gene did not reach the significance threshold for replication (p-value in VIKING = 1.7×10−3), likely because of the 7-fold decreased frequency of the rs2229678 variant in the VIKING (MAF = 0.0056) compared to CROATIA-Korcula (MAF = 0.049) cohort (Supplementary Table 4). However, given the known biological role of *FUT8* in protein glycosylation as a fucosyltransferase (one of the enzymes involved in the synthesis of glycans), we believe this association to be real. Associations of rare variants from the CROATIA-Korcula cohort in the *MSR1* gene with transferrin glycosylation also did not formally replicate in the VIKING cohort (p-value = 8.6×10−4) (Table 2). However, the cumulative allele count of rare variants in this gene is different between CROATIA-Korcula (MAC=46) and the VIKING cohort (MAC=38) (Supplementary Table 3), decreasing the power to replicate. We also detected a couple of isolate-specific associations that are driven by variants increased in frequency compared to publicly accessible biobanks and variant repositories. Namely, the rs750567016 variant in *ST6GAL1* that affects IgG glycosylation is more than 300 times more common in ORCADES (MAF = 3.3×10−3) than in UK Biobank (MAF = 1.0×10−5) or gnomAD (MAF = 9.0×10−6) and is absent from CROATIA-Korcula and VIKING cohorts. The rs115399307 variant in *FOXI1*, associated with transferrin glycosylation, is seven times more common in VIKING (MAF = 2.1×10−2) than in CROATIA-Korcula cohort (MAF = 2.7×10−3), UK Biobank (MAF = 8.5×10−3) and gnomAD (MAF = 7.1×10−3) (Supplementary Table 4). While the role of sialyltransferase *ST6GAL1* in IgG glycosylation is well described, the roles of the transcription factor *FOXI1* and the regulatory factor X-associated protein *RFXAP* still need to be confirmed and investigated. View this table: [Table 2:](http://medrxiv.org/content/early/2022/12/27/2022.12.26.22283911/T2) Table 2: Gene-based rare variants associations of transferrin and IgG glycosylation. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/12/27/2022.12.26.22283911/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2022/12/27/2022.12.26.22283911/F1) Figure 1. Miami plot summarising the results from exome-wide gene-based tests for transferrin and IgG glycan traits. Genomic positions of the genes, calculated as the mean position of variants included in the reported mask, are labelled on the x-axis and the –log10 of the p-value for each rare-variants aggregating test on the y-axis. For each gene-glycan association, the lowest p-value across multiple masks, multiple variant aggregate tests and cohorts was selected for plotting. The Bonferroni-corrected significance threshold for transferrin glycan traits (horizontal red line in the top part of the plot) corresponds to 8.06×10−8, while Bonferroni-corrected threshold for the IgG glycan traits (horizontal red line in the bottom part of the plot) corresponds to 1.19×10−7. Genes significantly associated with transferrin/IgG glycan traits are indicated with a triangle and labelled, while genes not passing the significance threshold are indicated with dots. ### IgG glycans gene-based aggregation meta-analysis To further increase statistical power, we performed gene-based aggregation meta-analysis of IgG glycan traits for ORCADES and VIKING cohorts. In addition to two genes already found to be associated with IgG in the cohort-specific analysis (*MGAT3* and *ST6GAL1*), the combined analysis of VIKING and ORCADES cohort added *FUT6* to the list of genes whose rare variants are significantly (p-value<1.19×10−7) associated with IgG glycan traits (Supplementary Table 5). *FUT6* is another gene known to be involved in glycosylation17,20,22,23, encoding a glycosyltransferase enzyme that catalyses the transfer of fucose moieties to a growing glycan chain. ### Genetic architecture of aggregated effects of rare-variants To better understand the genetic architecture of identified associations, we next assessed whether our findings could be discoverable within a GWAS framework and whether they are driven by single variants or multiple rare variants working in concert to affect levels of transferrin/IgG glycosylation. We first performed GWAS on imputed genotypes for each glycan trait and then repeated the rare variant association tests incorporating the dosages of GWAS sentinel SNPs as additional covariates. Genome-wide significant (transferrin p-value<1.61×10−9, IgG p-value<2.38×10−9) associations reported in this study (Supplementary Table 6 for transferrin glycans and Supplementary Table 7 for IgG glycans) have been described in further details in Landini *et al*.18 and Klarić *et al*.17. Overall, aggregated associations with variants from 3 out of 8 genes, *FUT8, ST6GAL1* and *MGAT3*, remained significant (p-value <8.06×10−8 for transferrin and p-value<1.19×10−7 for IgG) after conditioning on sentinel GWAS associations (Table 3). For one gene, *RFXAP*, there were no significant associations in the GWAS analysis, while the remaining four gene-based associations (*FUT6, MSR1, FOXI1* and *TIRAP*) were explained by sentinel GWAS variants. For two of these genes, *FUT6* and *FOXI1*, the GWAS sentinel variants are low frequency (0.02 1%. Then we also performed the exome-wide association analysis (ExWAS), following exactly the same protocol, but with exome sequencing data used for genotypes. We then re-run variant aggregate analysis as previously described, but with adjusting the glycan traits for the genotype of the sentinel SNPs from the GWAS/ExWAS significant loci, in addition to the other covariates listed above. The statistical significance level was determined in the same way as outlined in the main analysis above. ##### Replication of glycome rare associations in different cohorts and associations with health-related traits To investigate whether glycome rare-variants associations were cohort specific, each significant gene-glycan trait pair from the cohort-level discovery analysis was tested for associations in the remaining cohorts. The p-value threshold for replication was set to 3.23×10−4 for transferrin (0.05/31/5) and 5.95×10−4 for IgG (0.05/21/4) glycans, correcting for the number of independent glycan traits (i.e. 31 for transferrin and 21 for IgG) and the number of discovered glycome-gene pairs (i.e. 5 for transferrin and 4 for IgG in gene-based aggregation analysis). To investigate whether the glycome associated rare-variants may also affect health-related phenotypes, we tested for association each glycome-associated gene and 116 health-related traits The significance threshold was set to 5.43×10−5, correcting for the number of health-related traits (116), and the number of discovered glycome-gene pairs and number of glycome-associated genes (8). ## Supporting information Supplementary Tables [[supplements/283911_file02.xlsx]](pending:yes) ## Data Availability There is neither Research Ethics Committee approval, nor consent from individual participants, to permit open release of the individual level research data underlying this study. The datasets generated and analysed during the current study are therefore not publicly available. Instead, the research data and/or DNA samples are available from accessQTL@ed.ac.uk on reasonable request, following approval by the QTL Data Access Committee and in line with the consent given by participants. Each approved project is subject to a data or materials transfer agreement (D/MTA) or commercial contract. The UK Biobank genotypic data used in this study were approved under application 19655, 48511 and 19655 are available to qualified researchers via the UK Biobank data access process. The expression data used for the analyses described in this manuscript were obtained from the GTEx Portal on 25/11/2022. Genebass (https://app.genebass.org/) and AstraZeneca PheWAS portal (https://azphewas.com/) were accessed on 06/12/2022. ## Code availability We used publicly available software tools for all analyses. These software tools are listed in the main text and in the Methods. ## Data availability There is neither Research Ethics Committee approval, nor consent from individual participants, to permit open release of the individual level research data underlying this study. The datasets generated and analysed during the current study are therefore not publicly available. Instead, the research data and/or DNA samples are available from accessQTL{at}ed.ac.uk on reasonable request, following approval by the QTL Data Access Committee and in line with the consent given by participants. Each approved project is subject to a data or materials transfer agreement (D/MTA) or commercial contract. The UK Biobank genotypic data used in this study were approved under application 19655, 48511 and 19655 are available to qualified researchers via the UK Biobank data access process. The expression data used for the analyses described in this manuscript were obtained from the GTEx Portal on 25/11/2022. Genebass ([https://app.genebass.org/](https://app.genebass.org/)) and AstraZeneca PheWAS portal ([https://azphewas.com/](https://azphewas.com/)) were accessed on 06/12/2022. ## Ethics All studies were approved by local research ethics committees and all participants have given written informed consent. The ORCADES study was approved by the NHS Orkney Research Ethics Committee and the North of Scotland REC. The CROATIA-Korcula study was approved by the Ethics Committee of the Medical School, University of Split (approval ID: 2181-198-03-04/10-11-0008). The VIKING study was approved by the South East Scotland Research Ethics Committee, NHS Lothian (reference: 12/SS/0151). ## Author contributions A.L.: Data analysis and interpretation, visualisation, writing—original draft preparation, writing—review and editing. P.R.H.J.T.: preparation of pipeline for gene-based aggregation test of rare variants, writing—review and editing. A.F.-H.: computation of new derived IgG glycan traits, data interpretation. I.T.-A.: Quantification of transferrin and IgG N-glycans, computation of derived transferrin glycan traits, writing—review and editing. F.V.: Glycan data quality control. T.P.: Quantification of transferrin and IgG N-glycans. G.T.: preparation, quality control and annotation of whole-exome sequencing data, writing—review and editing. A.R.S.: Funding. O.P.: Genomic and demographic data provider for CROATIA-Korcula cohort. C.H.: Genomic and demographic data provider for CROATIA-Korcula cohort. G.L.: Conceptualisation, glycan data provider, writing—review and editing. J.F.W.: Funding, conceptualisation, genomic and demographic data provider for ORCADES and VIKING cohort, supervision, data interpretation, writing—original draft preparation, writing—review and editing. L.K.: Conceptualisation, supervision, data interpretation, writing—original draft preparation, writing—review and editing. ## Competing interests P.R.H.J.T. is an employee of BioAge Labs, Inc. G.T. and A.R.S. are full-time employees of Regeneron Genetics Center and receive salary, stock and stock options as compensation. G.L. is the founder and owner of Genos Ltd, a private research organisation that specialises in the high-throughput glycomic analysis and has several patents in this field. A.F.-H., I.T.-A., F.V., and T.P. are employees of Genos Ltd. L.K. is an employee of Humanity Inc., a company developing direct-to-consumer measures of biological ageing. All other authors declare no competing interests. ## Acknowledgements We thank Dr Nicola Pirastu for his help and advice regarding the statistical methods. The Orkney Complex Disease Study (ORCADES) was supported by the Chief Scientist Office of the Scottish Government (CZB/4/276 and CZB/4/710), the Royal Society, the MRC Human Genetics Unit, Arthritis Research UK and the European Union framework program 6 EUROSPAN project (contract number LSHG-CT-2006-018947). ORCADES DNA extractions and genotyping were performed at the Genetics Core of the Clinical Research Facility, University of Edinburgh. We would like to acknowledge the invaluable contributions of the research nurses in Orkney, the administrative team in Edinburgh, and the people of Orkney. The CROATIA-Korcula study was funded by grants from the MRC (United Kingdom), European Commission Framework 6 project EUROSPAN (contract number LSHG-CT-2006-018947), Croatian Science Foundation (grant 8875) and the Republic of Croatia Ministry of Science, Education and Sports (216-1080315-0302). Genotyping was performed in the Genetics Core of the Clinical Research Facility, University of Edinburgh. We would like to acknowledge all the staff of several institutions in Croatia that supported the CROATIA-Korcula fieldwork, including, but not limited to, the University of Split and Zagreb Medical Schools, Institute for Anthropological Research in Zagreb, and the Croatian Institute for Public Health in Split. The Viking Health Study-Shetland (VIKING) was supported by the MRC Human Genetics Unit quinquennial programme grant ‘QTL in Health and Disease’. DNA extractions and genotyping were performed at the Edinburgh Clinical Research Facility, University of Edinburgh. We would like to acknowledge the invaluable contributions of the research nurses in Shetland, the administrative team in Edinburgh and the people of Shetland. We acknowledge support from the European Union’s Horizon 2020 research and innovation programme IMforFUTURE (A.L. and A.F.-H.: H2020-MSCA-ITN/721815); the RCUK Innovation Fellowship from the National Productivity Investment Fund (L.K.: MR/R026408/1) and the MRC Human Genetics Unit programme grant, ‘QTL in Health and Disease’ (J.F.W. and C.H.: MC_UU_00007/10). Finally, this research has been conducted using data from the UK Biobank Resource (under application 26041, 48511 and 19655). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission. * Received December 26, 2022. * Revision received December 26, 2022. * Accepted December 27, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (80-.). 337, 1190–1195 (2012). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzMzcvNjA5OS8xMTkwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTIvMjcvMjAyMi4xMi4yNi4yMjI4MzkxMS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 2. 2.Van Hout, C. V et al. Exome sequencing and characterization of 49,960 individuals in the UK Biobank. Nature 586, 749–756 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2853-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33087929&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 3. 3.Bomba, L. et al. Whole-exome sequencing identifies rare genetic variants associated with human plasma metabolites. Am. J. Hum. Genet. (2022). doi:10.1016/J.AJHG.2022.04.009 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.AJHG.2022.04.009&link_type=DOI) 4. 4.Flannick, J. et al. Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls. Nature 570, 71–76 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-019-1231-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31118516&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 5. 5.Jurgens, S. J. et al. Analysis of rare genetic variation underlying cardiometabolic diseases and traits among 200,000 individuals in the UK Biobank. Nat. Genet. 2022 543 54, 240–250 (2022). 6. 6.Bansal, V., Libiger, O., Torkamani, A. & Schork, N. J. Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet 11, 773–785 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg2867&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20940738&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283052800011&link_type=ISI) 7. 7.Zuk, O. et al. Searching for missing heritability: Designing rare variant association studies. doi:10.1073/pnas.1322563111 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMDoiMTExLzQvRTQ1NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 8. 8.Vanhooren, V. et al. Serum N-glycan profile shift during human ageing. Exp Gerontol 45, 738–743 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.exger.2010.08.009&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20801208&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283891500003&link_type=ISI) 9. 9.Vilaj, M., Gudelj, I., Trbojević-Akmačić, I., Lauc, G. & Pezer, M. IgG Glycans as a Biomarker of Biological Age. in Biomarkers of Human Aging 81–99 doi:10.1007/978-3-030-24970-0_7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-030-24970-0_7&link_type=DOI) 10. 10.1. Lauc, G. & 2. Trbojević-Akmačić, I. Ząbczyńska, M., Link-Lenczowski, P. & Pocheć, E. Glycosylation in Autoimmune Diseases. in The Role of Glycosylation in Health and Disease (eds. Lauc, G. & Trbojević-Akmačić, I.) 205–218 (Springer International Publishing, 2021). doi:10.1007/978-3-030-70115-4_10 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-030-70115-4_10&link_type=DOI) 11. 11.Rudman, N., Gornik, O. & Lauc, G. Altered N-glycosylation profiles as potential biomarkers and drug targets in diabetes. FEBS Letters 593, 1598–1615 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/1873-3468.13495&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31215021&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 12. 12.Gudelj, I. & Lauc, G. Protein N-Glycosylation in Cardiovascular Diseases and Related Risk Factors. Curr. Cardiovasc. Risk Rep. 12, (2018). 13. 13.Rebelo, A. L., Chevalier, M. T., Russo, L. & Pandit, A. Role and therapeutic implications of protein glycosylation in neuroinflammation. Trends Mol Med 28, 270– 289 (2022). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.molmed.2022.01.004&link_type=DOI) 14. 14.Costa, A. F., Campos, D., Reis, C. A. & Gomes, C. Targeting Glycosylation: A New Road for Cancer Drug Discovery. Trends Cancer 6, 757–766 (2020). 15. 15.Peng, W. et al. Clinical application of quantitative glycomics. Expert Rev. Proteomics 15, 1007–1031 (2018). 16. 16.Huffman, J. E. et al. Polymorphisms in B3GAT1, SLC9A9 and MGAT5 are associated with variation within the human plasma N-glycome of 3533 European adults. Hum. Mol. Genet. 20, 5000–5011 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddr414&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21908519&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 17. 17.Klarić, L. et al. Glycosylation of immunoglobulin G is regulated by a large network of genes pleiotropic with inflammatory diseases. Sci. Adv. 6, eaax0301 (2020). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo4OiJhZHZhbmNlcyI7czo1OiJyZXNpZCI7czoxMjoiNi84L2VhYXgwMzAxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTIvMjcvMjAyMi4xMi4yNi4yMjI4MzkxMS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. 18.Landini, A. et al. Genetic regulation of post-translational modification of two distinct proteins. Nat. Commun. 2022 131 13, 1–13 (2022). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-021-27838-9&link_type=DOI) 19. 19.Lauc, G. et al. Genomics Meets Glycomics—The First GWAS Study of Human N-Glycome Identifies HNF1α as a Master Regulator of Plasma Protein Fucosylation. PLoS Genet. 6, e1001256 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1001256&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21203500&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 20. 20.Lauc, G. et al. Loci Associated with N-Glycosylation of Human Immunoglobulin G Show Pleiotropy with Autoimmune Diseases and Haematological Cancers. PLoS Genet. 9, (2013). 21. 21.Sharapov, S. Z. et al. Defining the genetic control of human blood plasma N-glycome using genome-wide association study. Hum. Mol. Genet. 28, 2062–2077 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddz054&link_type=DOI) 22. 22.Shen, X. et al. Multivariate discovery and replication of five novel loci associated with Immunoglobulin G N-glycosylation. Nat. Commun. 8, 447 (2017). 23. 23.Wahl, A. et al. Genome-wide association study on immunoglobulin G glycosylation patterns. Front. Immunol. 9, 277 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2018.00277&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 24. 24.Bondt, A. et al. Immunoglobulin G (IgG) Fab glycosylation analysis using a new mass spectrometric high-throughput profiling method reveals pregnancy-associated changes. Mol. Cell. proteomics 13, 3029–3039 (2014). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoibWNwcm90IjtzOjU6InJlc2lkIjtzOjEwOiIxMy8xMS8zMDI5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTIvMjcvMjAyMi4xMi4yNi4yMjI4MzkxMS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 25. 25.Wuhrer, M. et al. Glycosylation profiling of immunoglobulin G (IgG) subclasses from human serum. Proteomics 7, 4070–4081 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/pmic.200700289&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17994628&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000251293000006&link_type=ISI) 26. 26.Karlsson, I., Ndreu, L., Quaranta, A. & Thorsén, G. Glycosylation patterns of selected proteins in individual serum and cerebrospinal fluid samples. J. Pharm. Biomed. Anal. 145, 431–439 (2017). 27. 27.Spik, G. et al. Studies on glycoconjugates. LXIV. Complete structure of two carbohydrate units of human serotransferrin. FEBS Lett. 50, 296–299 (1975). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0014-5793(75)90053-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=1116600&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1975V629400002&link_type=ISI) 28. 28.Võsa, U. et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet. 2021 539 53, 1300–1310 (2021). 29. 29.Karczewski, K. J. et al. Systematic single-variant and gene-based association testing of thousands of phenotypes in 394,841 UK Biobank exomes. Cell Genomics 2, 30. 30.Wang, Q. et al. Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature 597, 527–532 (2021). 31. 31.Povysil, G. et al. Rare-variant collapsing analyses for complex traits: guidelines and applications. Nat. Rev. Genet. 20, 747–759 (2019). 32. 32.Zuk, O. et al. Searching for missing heritability : Designing rare variant association studies. (2014). doi:10.1073/pnas.1322563111 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMDoiMTExLzQvRTQ1NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 33. 33.Fitzgerald, K. A. et al. Mal (MyD88-adapter-like) is required for Toll-like receptor-4 signal transduction. Nature 413, 78–83 (2001). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/35092578&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11544529&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170801200042&link_type=ISI) 34. 34.Horng, T., Barton, G. M. & Medzhitov, R. TIRAP: an adapter molecule in the Toll signaling pathway. Nat Immunol 2, 835–841 (2001). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ni0901-835&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11526399&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170781200023&link_type=ISI) 35. 35.Aguet, F. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature24277&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29022597&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000412829500039&link_type=ISI) 36. 36.Jones, E. Y., Fugger, L., Strominger, J. L. & Siebold, C. MHC class II proteins and disease: a structural perspective. Nat Rev Immunol 6, 271–282 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nri1805&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16557259&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000236303200016&link_type=ISI) 37. 37.Juszczak, A. et al. Plasma fucosylated glycans and C-reactive protein as biomarkers of HNF1A-MODY in young adult–onset nonautoimmune diabetes. Diabetes Care 42, 17–26 (2019). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czo3OiI0Mi8xLzE3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTIvMjcvMjAyMi4xMi4yNi4yMjI4MzkxMS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 38. 38.Pairo-Castineira, E. et al. Genetic mechanisms of critical illness in COVID-19. Nature 591, 92–98 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-03065-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33307546&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 39. 39.Verhelst, X. et al. Protein Glycosylation as a Diagnostic and Prognostic Marker of Chronic Inflammatory Gastrointestinal and Liver Diseases. Gastroenterology 158, 95– 110 (2020). 40. 40.Wittenbecher, C. et al. Plasma N-Glycans as Emerging Biomarkers of Cardiometabolic Risk: A Prospective Investigation in the EPIC-Potsdam Cohort Study. Diabetes Care 43, 661–668 (2020). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czo4OiI0My8zLzY2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 41. 41.Meyer, N. M. T. et al. Low IGF1 and high IGFBP1 predict diabetes onset in prediabetic patients. Eur J Endocrinol 187, 555–565 (2022). 42. 42.Segev, Y. et al. Systemic and renal growth hormone-IGF1 axis involvement in a mouse model of type 2 diabetes. Diabetologia 50, 1327–1334 (2007). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17443310&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 43. 43.Gardner, E. J. et al. Damaging missense variants in IGF1R implicate a role for IGF-1 resistance in the etiology of type 2 diabetes. Cell Genomics doi:10.1016/j.xgen.2022.100208 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.xgen.2022.100208&link_type=DOI) 44. 44.Pietzner, M. et al. Genetic architecture of host proteins interacting with SARS-CoV-2. bioRxiv Prepr. Serv. Biol. (2020). doi:10.1101/2020.07.01.182709 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wNy4wMS4xODI3MDl2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 45. 45.Carlsson, M. C., Bengtson, P., Cucak, H. & Leffler, H. Galectin-3 guides intracellular trafficking of some human serotransferrin glycoforms. J Biol Chem 288, 28398–28408 (2013). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEyOiIyODgvMzkvMjgzOTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMi8yNy8yMDIyLjEyLjI2LjIyMjgzOTExLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 46. 46.Cederfur, C. et al. Different affinity of galectins for human serum glycoproteins: galectin-3 binds many protease inhibitors and acute phase proteins. Glycobiology 18, 384–394 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/glycob/cwn015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18263896&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000255151000004&link_type=ISI) 47. 47.Atalar, M. N. et al. Assessment of serum galectin-3, methylated arginine and Hs-CRP levels in type 2 diabetes and prediabetes. Life Sci 231, 116577 (2019). 48. 48.Lin, D. et al. Galectin-3/adiponectin as a new biological indicator for assessing the risk of type 2 diabetes: a cross-sectional study in a community population. Aging (Albany NY) 13, 15433–15443 (2021). 49. 49.Ohkura, T. et al. Low serum galectin-3 concentrations are associated with insulin resistance in patients with type 2 diabetes mellitus. Diabetol Metab Syndr 6, 106 (2014). 50. 50.Vora, A., de Lemos, J. A., Ayers, C., Grodin, J. L. & Lingvay, I. Association of Galectin-3 With Diabetes Mellitus in the Dallas Heart Study. J Clin Endocrinol Metab 104, 4449–4458 (2019). 51. 51.Weigert, J. et al. Serum galectin-3 is elevated in obesity and negatively correlates with glycosylated hemoglobin in type 2 diabetes. J Clin Endocrinol Metab 95, 1404–1411 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1210/jc.2009-1619&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20080851&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000275197500053&link_type=ISI) 52. 52.Reid, J. G. et al. Launching genomics into the cloud: deployment of Mercury, a next generation sequence analysis pipeline. BMC Bioinformatics 15, 30 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2105-15-30&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24475911&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) 53. 53.Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btp324&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19451168&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000267665900006&link_type=ISI) 54. 54.Institute, B. Picard Tools. (2018). 55. 55.Plc, G. weCall. (2018). 56. 56.Lin, M. F. et al. GLnexus: joint variant calling for large cohort sequencing. bioRxiv (2018). doi:10.1101/343970 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czo4OiIzNDM5NzB2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 57. 57.Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4161/fly.19695&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22728672&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000305965500003&link_type=ISI) 58. 58.Chun, S. & Fay, J. C. Identification of deleterious mutations within three human genomes. Genome Res 19, 1553–1561 (2009). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjE5LzkvMTU1MyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 59. 59.Schwarz, J. M., Rodelsperger, C., Schuelke, M. & Seelow, D. MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods 7, 575–576 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth0810-575&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20676075&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000280500000014&link_type=ISI) 60. 60.Trbojević-Akmačić, I. et al. Chromatographic monoliths for high-throughput immunoaffinity isolation of transferrin from human plasma. Croat. Chem. Acta 89, 203–211 (2016). 61. 61.Pucić, M. et al. High throughput isolation and glycosylation analysis of IgG-variability and heritability of the IgG glycome in three isolated human populations. Mol. Cell. Proteomics 10, M111.010090-M111.010090 (2011). 62. 62.Trbojević Akmačić, I., Ugrina, I. & Lauc, G. Methods in Enzymology, Volume 586: Chapter Three - Comparative Analysis and Validation of Different Steps in Glycomics Studies. (2017). doi:[https://doi.org/10.1016/bs.mie.2016.09.027](https://doi.org/10.1016/bs.mie.2016.09.027) 63. 63.Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biostatistics/kxj037&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16632515&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F12%2F27%2F2022.12.26.22283911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000242715400008&link_type=ISI) 64. 64.Karssen, L. C., van Duijn, C. M. & Aulchenko, Y. S. The GenABEL Project for statistical genomics. F1000Research 5, 914 (2016). 65. 65.Huffman, J. E. et al. Comparative Performance of Four Methods for High-throughput Glycosylation Analysis of Immunoglobulin G in Genetic and Epidemiological Research. Mol. Cell. Proteomics 13, 1598–1610 (2014). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoibWNwcm90IjtzOjU6InJlc2lkIjtzOjk6IjEzLzYvMTU5OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEyLzI3LzIwMjIuMTIuMjYuMjIyODM5MTEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 66. 66.Chen, H. et al. Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies. Am J Hum Genet 104, 260–274 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2018.12.012&link_type=DOI) 67. 67.Kassambara, A. and Mundt, F. Factoextra: Extract and Visualize the Results of Multivariate Data Analyses. R Package Version 1.0.7. [https://CRAN.R-project.org/package=factoextra](https://CRAN.R-project.org/package=factoextra) (2020).