Systematic mediation and interaction analyses in an individual population study help characterize kidney function genetic loci ============================================================================================================================== * Dariush Ghasemi-Semeskandeh * David Emmert * Eva König * Luisa Foco * Martin Gögele * Laura Barin * Ryosuke Fujii * Christian Fuchsberger * Dorien J.M. Peters * Peter P. Pramstaller * Cristian Pattaro ## Abstract Chronic kidney disease (CKD) is a complex disease affecting >10% of the global population, with large between- and within-continent variability reflecting major environmental effects. To identify molecular targets for treatment, genome-wide association study meta-analyses (GWAMAs) of CKD-defining traits have identified hundreds of genetic loci in aggregated populations. However, while GWAMAs estimate the average allelic effect across studies, single population studies may be relevant to unravel specific mechanisms. To assess whether an individual study from a specific population could extend existing knowledge on kidney function genetics, we selected 147 kidney function relevant loci identified by a large European ancestry GWAMA, assessing their association with the glomerular filtration rate estimated from serum creatinine (eGFRcrea) in the Cooperative Health Research In South Tyrol (CHRIS) study (n=10,146), conducted in an Alpine region where thyroid dysfunction is common. We replicated associations with single nucleotide polymorphisms (SNPs) at 11 loci, showing up-to-5.4 times larger effect sizes than in the corresponding GWAMA, not explainable by minor allele frequency differences. Systematic mediation analysis across 70 quantitative traits identified serum magnesium, the activated partial thromboplastin time, and serum urate as partial mediators of the eGFRcrea associations at *SHROOM3*, *SLC34A1*, and *IGF1R*, respectively. Given that free triiodothyronine and thyroxine were effect modifiers across all loci, we conducted SNP-by-thyroid stimulating hormone (TSH) interaction analyses, identifying significant interactions at *STC1*: SNPs had larger effects on eGFRcrea at higher TSH levels, possibly reflecting stanniocalcin-1 autocrine and paracrine role. Individual population studies can help characterize genetic associations. The interplay between phenotypes at *SHROOM3* and *SLC34A1* and the role of thyroid function as a genetic effect modifier warrant further investigations. ## Introduction Chronic kidney disease (CKD) is a common complex disease that increases the risk of kidney failure, cardiovascular mortality, and all-cause mortality1. CKD is predicted to become the 5th leading cause of death by 20402. While affecting >10% of the population globally, CKD prevalence shows marked variability across countries, both between and within continents3. Between-continent variability is likely guided by combinations of environmental and genetic differences, such as is the case with sickle cell trait and albuminuria4. Within-continent variability could be more related to environmental differences, including lifestyle, public health policies, and assessment methods4, 5. To unravel the genetic basis of CKD, several genome-wide association study meta-analyses (GWAMAs) were conducted that analyzed the CKD-defining marker glomerular filtration rate (GFR) estimated from serum creatinine (eGFRcrea)6. More than 400 associated loci have been identified to date7. While GWAMAs estimate the average allelic effect across studies, evidence shows that individual population studies may be extremely relevant to advance discoveries not only in the field of rare diseases8 but also with regard to markers of common chronic diseases such as triglycerides and low density lipoprotein (LDL) cholesterol9, age at diabetes onset10, and mood disorders11. The evidence accumulated so far suggests that there is no general rule and that the contribution of an individual study to the dissection the biological basis of common phenotypes can only be assessed on a case-by-case basis and depends on study-specific characteristics. Here we analyzed genetic associations with eGFRcrea in the Cooperative Health Research In South Tyrol (CHRIS) study, a general population study conducted in a mountainous region of central Europe12. A sample from this same population was previously instrumental to identify genetic linkage of serum creatinine with the *APOL1*-*MYH9* locus13 and to replicate common variants associated with eGFRcrea14. Characteristics of the geographical region where the study was conducted include the demographic stability over time15, the rural environment, and a high frequency of hypothyroidism, typical of mountainous regions16. Aim of our analysis was to assess whether this particular study sample could provide information that extend current knowledge on kidney function genetics, by leveraging on local specific characteristics. For a given single population study, with a relatively limited sample size, to help expand knowledge on loci already identified by a large consortium, it is necessary and appropriate to limit investigation to the subset of loci associated with the phenotype of interest in the given study. Thus, we first identified which, among 147 loci found to be associated with kidney function in a large GWAMA from the CKDGen Consortium17, were specifically associated with eGFRcrea in the CHRIS study. Among the replicated loci, we explored whether there were local determinants enhancing specific genetic associations. Given that several replicated loci showed larger effects in CHRIS than in the discovery GWAMA, we examined the possible reasons of such larger effects and conducted extensive mediation analyses across 70 quantitative biochemical and anthropometric traits, reflecting multiple health conditions. Further, after observing a systematic modification of the genetic association with eGFRcrea when adjusting for thyroid-related traits, we conducted SNP-by-thyroid function interaction analyses. The aims of the mediation and the interaction analyses were to identify the presence of intermediate phenotypes and effect modifiers, respectively, that were specific of the study sample. ## Methods ### Study sample The present analyses were based on 10,146 individuals with complete genotype data who were included in the release 3 of the CHRIS study, a population-based study conducted in South Tyrol, Italy, between 2011 and 2018, which has been extensively described elsewhere12, 18. Briefly, participants underwent blood drawing, urine collection, anthropometric measurements, and clinical assessments early in the morning, after an overnight fast. Medical history was reconstructed through interviewer- and self-administered standardized questionnaires. Drug treatment was identified via barcode scan of the drug containers that participants were instructed to bring at the study center visit, with the drug code linked to an official drug databank12. ### eGFRcrea estimation Serum creatinine (SCr) was measured with a colorimetric assay on Roche Modular PPE (n=4,176) and Abbott Diagnostic Architect c16000 (n=5,970) instrumentations. According to previous analyses18, SCr was normalized by instrument (fixed effect) and participation period (random effect) using a linear mixed model implemented in the ‘lme4’ R package19. To reflect the same methods used in the CKDGen Consortium meta-analysis17, eGFRcrea was estimated using the 2009 CKD-EPI equation20 implemented in the R package ‘nephro’ v1.221 and winsorized at 15 and 200 ml/min/1.73m2. Finally, the natural logarithm transformation was applied. ### Genotype imputation Genotyping was performed in two batches using genotyping array chips based on the Illumina Human Omni Microarray platform. Following quality control analysis, the two data batches were merged, phased with SHAPEIT2 v2.r837, using the duoHMM method (--duohmm -W 5) with 800 states and 30 rounds22, and imputed based on the TOPMed r2 standard reference panel on the Michigan Imputation Server23. We obtained 34,084,280 SNPs with minimum imputation quality index Rsq ≥0.3 and minor allele count (MAC) ≥1, aligned with human genome assembly GRCh38. A genetic relatedness matrix (GRM) was obtained from the autosomal genotyped SNPs using EPACTS v3.2.6 modified for compatibility with assembly GRCh38. Genetic principal components (PCs) were estimated based on the genotyped variants data with minor allele frequency (MAF) >0.05 using GCTA24. ### GWAS and replication of CKDGen results We conducted a GWAS of age- and sex-adjusted residuals of ln(eGFRcrea) using the EMMAX method25 implemented in EPACTS v3.2.6, including the GRM to model the sample structure. After the analysis, we removed variants with minor allele frequency (MAF) <0.005, leaving 10,158,100 SNPs for further characterization. Summary statistics from the CKDGen European ancestry GWAMA17 were downloaded from the publicly available repository at [https://ckdgen.imbi.uni-freiburg.de](https://ckdgen.imbi.uni-freiburg.de). Given a subset of 4661 samples from the CHRIS study was previously included in the CKDGen GWAMA, we removed CHRIS data from the CKDGen GWAMA summary statistics using MetaSubtract v1.6026. After this process, the 147 variants remained genome-wide significant in CKDGen. We lifted the CKDGen genomic positions from the GRCh37 to the GRCh38 map, using CrossMap v0.5.327. To ensure consistency of the effect direction, replication of the 147 loci in CHRIS was assessed based on a one-sided test evaluated at the Bonferroni-corrected level of 0.00034 (0.05/147), for any variant in strong linkage disequilibrium (LD) (r2>0.8) with a lead CKDGen variant. LD was estimated using emeraLD v0.128. ### Mediation analysis Replicated ln(eGFRcrea)-SNP associations were submitted to mediation analysis across 70 quantitative anthropometric, blood pressure and biochemical traits (listed in Supplementary Table S1), following the flowchart represented in Fig. 1. SCr was included as a positive control. To prevent potential measurement instrument effects18, we applied quantile normalization to each trait (Supplementary Methods). Mediation analysis was conducted for each *trait* following the established 4-step framework and fitting linear regression models throughout: 1. ln(eGFRcrea) = β1 SNP +∑Ki θiXi + ε1 2. ln(eGFRcrea) = β2 SNP + γ2*trait* + ∑Ki θiXi + ε2 3. *trait* = β3 SNP +∑Ki θiXi + ε3 4. *trait* = β4 SNP + δ4ln(eGFRcrea) + ∑Ki θiXi + ε4 where ε1 to ε4 refer to Gaussian error terms, and ∑Ki θiXi indicate the inclusion of K covariates, namely age, sex, the first 10 genetic PCs and an intercept. Step 1 corresponds to the GWAS results: given we investigated the replicated variants, the estimate of β1 (b1) was always significant. We defined two criteria that must be satisfied in order for a trait to be considered at least a partial mediator of the ln(eGFRcrea)-SNP relation: (i) evidence of a substantial alteration of the SNP effect on ln(eGFRcrea) when adjusting for the trait (step 2) and (ii) the SNP is also associated with the trait (step 3) at a multiple-testing corrected level of 6.5 × 10-5 (corresponding to 0.05 / (70 traits × 11 independent loci)). Criterion (i) was assessed by analyzing the distribution of the estimate of β2 (b2): for a specific trait, b2 was classified as an outlier according to the rule b2 < P10 – 1.5 (P75 – P25) or b2 > P90 + 1.5 (P75 – P25), where P10, P25, P75, and P90 indicate the 10th, 25th, 75th, and 90th percentile of the distribution of b2 across all traits. This is essentially a more stringent version of the Tukey’s rule for outlier detection. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F1) Figure 1. Flowchart of the mediation analysis. Abbreviations: Mg, serum magnesium; aPTT, activated partial thromboplastin time. To ensure that a lack of association in step 3 was not due to lack of power in the CHRIS study, we interrogated the SNP-trait associations that passed the mediation analysis step 2 on the PhenoScanner v229 ([http://www.phenoscanner.medschl.cam.ac.uk/](http://www.phenoscanner.medschl.cam.ac.uk/), interrogated on 31-Jan-2023), GWAS Catalog30 ([https://www.ebi.ac.uk/gwas/docs/api](https://www.ebi.ac.uk/gwas/docs/api); 24-Feb-2023), and the ThyroidOmics Consortium summary statistics31 ([https://www.thyroidomics.com](https://www.thyroidomics.com)), for associations at *P*<5.0×10-8. ### Interaction with thyroid function phenotypes Free triiodothyronine (FT3) and thyroxine (FT4) did not qualify as mediators but caused a major departure of b2 from its expected distribution (see Results). These traits may reflect thyroid gland issues. Given the relevance of hypothyroidism in the region, we analyzed these traits in detail. Because FT3 and FT4 levels were measured only when the thyroid stimulating hormone (TSH) level was <0.4 or >3.8 µUI/mL, we started with a series of sensitivity analyses to exclude the presence of underlying sample stratification. First, we tested whether the distribution of the genotypes was different between individuals with and without measured FT3 and FT4, using a two-sided Wilcoxon test at a significance level of 0.05. Second, to verify whether there was any geographical cluster of individuals with extreme TSH levels (and so measured FT3 and FT4 levels), which might have implicated population stratification, we further adjusted the ln(eGFRcrea)-SNP association model in step 1 for municipality of residence. After excluding the presence of sample stratification, we tested the interaction between the SNPs and TSH levels, which were measured in the whole sample. We excluded 416 individuals reporting any condition among thyroid cancer (n=16), kidney cancer (n=1), goiter (n=277), having undergone surgery to the thyroid gland (n=312), or having both missing TSH measurement and therapy information (n=4), leaving 9730 individuals for the interaction analysis. Regression models included both the main and the interaction effect terms and were adjusted for the same covariates used in the step 1 model. In addition to quantitative TSH levels, we also tested interaction with broadly defined hyper- and hypothyroidism. Hyperthyroidism was defined as a TSH level of <0.4 µUI/mL or use of thiamazole (n=3) or propylthiouracil (n=1). Hypothyroidism was defined as TSH levels >3.8 µUI/mL or reported use of levothyroxine sodium (n=512). No other thyroid-related treatment was reported. The statistical significance level for interaction testing was set at 0.0045=0.05/11 loci. ### Ethical statement The CHRIS study was approved by the Ethical Committee of the Healthcare System of the Autonomous Province of Bozen/Bolzano, protocol no. 21/2011 (19 Apr 2011). All participants provided written informed consent. All the methods were performed in strict accordance with the approved protocol. ## Results In the study sample, the mean age was 46 years (standard deviation, SD=16), females were 55%, and the mean eGFRcrea level was 92.00 (SD=16.11) ml/min/1.73m2 (Table 1; additional clinical characteristics are reported in Supplementary Table S1). After appropriate modeling of the sample structure (genomic inflation factor=1.02; Supplementary Fig. S1), 11 of the 147 CKDGen loci were replicated in CHRIS (1-sided P-value between 3.34×10-4 and 4.51×10-7; Table 2; Supplementary Table S2). For each replicated locus we retained all variants in LD (r2>0.8) with the lead SNP, totaling 163 variants over the 11 loci (Supplementary Table S3). The replicated loci displayed very similar LD pattern in CHRIS as in CKDGen (Supplementary Fig. S2). Among the replicated loci, variants’ effect magnitude at *CASZ1*, *DDX1*, *PIP5K1B*, *GAB2*, and *IGF1R* was significantly larger in CHRIS than in CKDGen (Fig. 2; Table 2). The CHRIS-to-CKDGen effect ratio varied between 1 and 5.4 and was not entirely explained by MAF differences (Fig. 3; Supplementary Table S3). Three broad groupings characterized by MAF and effect size differences could be distinguished: 1. at *SLC34A1* and *SHROOM3*, we observed similar MAF and effect magnitude, consistent with the fact that these two loci were amongst the ones identified by the earliest GWAS, when sample size was still relatively limited and the genomic imputation quality was coarser32, 33; 2. at *DAB2*, *TMEM60*, *PIP5K1B*, *GAB2*, and *IGF1R*, MAF was similar or slightly larger in CHRIS (effects in CHRIS were 1.3-to-5 times larger than in CKDGen); 3. at *STC1*, *DDX1*, and *CASZ1*, we observed lower MAF in CHRIS and about 1.8-to-5.4 times larger effects in CHRIS. ![Supplementary Figure S1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F7.medium.gif) [Supplementary Figure S1.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F7) Supplementary Figure S1. Quantile-quantile (QQ) plot of the P-values from the GWAS of ln(eGFRcrea) in the CHRIS study. ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-12.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-12) ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-13.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-13) ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-14.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-14) ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-15.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-15) ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-16.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-16) ![](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F8/graphic-17.medium.gif) [](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F8/graphic-17) Supplementary Figure S2. Regional association plots for the 11 replicated loci. Highlighted in purple is the most associated SNP in the CKDGen GWAS meta-analysis. All SNP positions are referred to the NCBI Build 38. Plots were generated with LocusZoom version 1.4 (Pruim RJ, et al., Bioinformatics, 2010. 26(18): 2336-7). ![Supplementary Figure S3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F9.medium.gif) [Supplementary Figure S3.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F9) Supplementary Figure S3. Comparison of effect size and standard error ratio for the variants in the model regressing eGFRcrea on age, sex, FT3 (or FT4), genetic variants with and without adjusting for municipality. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F2) Figure 2. Effect sizes with 95% confidence intervals for the most associated variants at each locus in CKDGen and CHRIS following harmonization to the eGFRcrea-lowering allele. The effect is meant per copy of the effect allele on the natural logarithm of eGFRcrea in ml/min/1.73m2 (y-axis). For *CASZ1*, *DDX1*, *PIP5K1B*, *GAB2*, and *IGF1R* the magnitude of the effect in CHRIS was larger than in CKDGen. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F3) Figure 3. Comparison of the characteristics of all 163 variants in the 11 replicated loci in CKDGen versus CHRIS after alignment of the effect alleles. The ratio between CHRIS and CKDGen minor allele frequencies (MAF, x-axis) is plotted against the ratio between CHRIS and CKDGen effect estimates (y-axis). Effects are expressed in terms of change of the natural logarithm of eGFRcrea in ml/min/1.73m2 per copy of the effect allele. View this table: [Table 1.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/T1) Table 1. Main characteristics of the 10,146 study individuals. Additional clinical characteristics are described in Supplementary Table 1. View this table: [Table 2.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/T2) Table 2. The 11 loci identified from the European ancestry CKDGen Consortium GWAS that showed significant effects in CHRIS. Loci were considered replicated if any variant in strong LD with the CKDGen lead variant was significantly associated with ln(eGFRcrea) in CHRIS. Consequently, for each locus, reported are either the results on the CKDGen lead variant or the results of the replicated variant in the locus when the lead variant did not replicate itself (see Methods). ### Mediation analysis To identify possible reasons for observed larger effects, we conducted mediation analysis of the association between eGFRcrea and each of the 163 variants at the 11 loci across 70 quantitative health traits available in CHRIS (Supplementary Table S1). Adjusting the eGFRcrea-SNP association for each trait in turn (step 2 of the mediation analysis; see Methods) resulted in a substantial change of the eGFRcrea-SNP association coefficient in between 4 and 12 variant-trait associations per locus (Fig. 4). As expected, adjustment for the positive control SCr resulted in the nearly complete knockdown of the SNP effect at all loci (effect sizes toward null). Often, the effect change happened when adjusting for traits closely related to kidney function such as urate (*GAB2*, *IGF1R*, *PIP5K1B*, *SHROOM3*, and *STC1*), urinary creatinine (*SHROOM3*), and urinary albumin (*IGF1R*). Serum electrolytes such as magnesium (*SHROOM3*), corrected calcium (*PIP5K1B*), and sodium (*DDX1*, *GAB2*, *RSBN1L*, *SLC34A1*, and *STC1*) were also widely involved. Of note was the widespread effect of FT3, FT4, and the activated partial thromboplastin time (aPTT) across all loci. When adjusting for FT3 and FT4, the SNP effect on ln(eGFRcrea) was often completely altered. This behavior across all loci may suggest the presence of either a measurement artifact or a general related condition affecting the underlying population. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F4) Figure 4. Results of the 2nd step of the mediation analysis for the most associated variants at each locus. SNP effect size and their 95% confidence intervals are reported. Effects are expressed in terms of change of the natural logarithm of eGFRcrea in ml/min/1.73m2 per copy of the effect allele. Large departures from the expected effect size (‘outlier’, see Methods) are indicated in red. The vertical dashed lines identify the null. Large effect size changes were observed when adjusting for SCr (positive control; all loci), FT3 (all loci) and FT4 (10 loci), aPTT (10 loci), sodium (5 loci), homocysteine (3 loci), SBP (2 loci), urate (2 loci), and cortisol, DBP, lipase, magnesium, pulse rate, UACR, urinary albumin and urine pH (1 locus each). In the 3rd mediation analysis step, we evaluated whether the traits identified as effect modifiers in step 2 were associated with the SNP of which they modified the association with eGFRcrea (Table 3; Supplementary Table S4): SNP rs3812036 at *SLC34A1* was associated with aPTT levels (*P*=3.54×10-6) and five SNPs at *SHROOM3* were associated with serum magnesium levels (*P*-values between 5.33×10-5 and 6.12×10-5). Three of our associations reproduce previously reported associations found using Phenoscanner: at *SLC34A1*, Tang *et al.*34 reported association of rs3812036 with aPTT (*P*=2.88×10-18), and at *SHROOM3*, Meyer *et al.*35 reported associations of magnesium with rs4859682 (*P*=2.39×10-9) and rs13146355 (*P*=6.27×10-13). View this table: [Table 3](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/T3) Table 3 Results of the mediation analysis at *SHROOM3*, *SLC34A1* and *IGF1R*. Extensive mediation analysis results for all loci and all traits are reported in Supplementary Table S4. All SNP-trait associations that successfully passed the 2nd mediation analysis step were also interrogated against the Phenoscanner, GWAS Catalog, and ThyroidOmics consortium results (Methods) to identify associations that might have been missed by the 3rd step of the mediation analysis in CHRIS due to lack of power (Supplementary Table S4). This analysis additionally identified significant associations between 11 variants at *IGF1R* and serum urate (Table 3) supporting the presence of a mediation mechanism involving *IGF1R*, eGFRcrea and uric acid. In summary, aPTT, serum magnesium, and urate qualified as partial mediators of eGFRcrea associations at *SLC34A1*, *SHROOM3*, and *IGF1R*, respectively. At *SLC34A1*, the partial mediation of aPTT corresponded to a 21% larger effect of the SNP on ln(eGFRcrea); at *SHROOM3*, the partial mediation of serum magnesium corresponded to an effect attenuation of about 11%; and at *IGF1R* the mediation resulted in up-to-18% attenuation of the SNP effects on eGFRcrea (Table 3). ### Interaction with thyroid dysfunction Despite the pervasive effect change caused by FT3 and FT4 in step-2 analysis, these two traits did not result in significant step-3 associations and so did not qualify as mediators. However, motivated by the fact that hypothyroidism is amongst the leading causes of hospitalization in the region36, thyroid problems are common in Alpine areas, and the FT3 and FT4 adjustment affected the SNP-eGFRcrea association at all 11 loci, we considered it reasonable to evaluate a potential role of thyroid dysfunction as a modifier of the SNP-eGFRcrea relation. Preliminarily, given that in the CHRIS study FT3 and FT4 were measured only in individuals with TSH levels above or below specific thresholds, we first conducted sensitivity analyses to exclude the presence of artifacts. We did not observe any genotype stratification by measured versus unmeasured FT3 and FT4 levels (Supplementary Table S5). Adjustment for municipality of residence, to exclude the presence of local clusters of thyroid dysfunction, did not alter the results (Supplementary Fig. 3). We then tested for the presence of linear interaction with TSH levels, which identified a significant interaction at *STC1* SNPs rs819185 (*P*=0.00177) and rs819196 (*P*=0.00154) both below the multiple testing threshold of 0.05/11 loci (Fig. 5; Supplementary Table S6). Since for most loci the adjustment both for FT3 and for FT4 were causing an increase of the effect magnitude, we also postulated a U-shaped interaction model, with the SNP effects on ln(eGFRcrea) being larger both in hyper- and in hypothyroidism. We classified individuals as healthy (88.5%), with hypothyroidism (9.8%) or with hyperthyroidism (1.7%; Table 1). Our results did not support the presence of interaction with these two conditions at any locus after multiple testing control, despite nominally significant *P*-values observed for interaction between hyperthyroidism and SNPs in *SHROOM3* (*P*=0.03) and between hypothyroidism and SNPs in *PIP5K1B* (*P*=0.03) and *GAB2* (*P*=0.04; Supplementary Table S7). ![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F5.medium.gif) [Figure 5.](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F5) Figure 5. Interaction of ln(eGFRcrea)-associated variants with TSH levels. Effects of *STC1* SNPs rs819185 (P=0.00177; **Panel A**) and rs819196 (P=0.00154; **Panel B**) are larger at lower TSH levels. ## Discussion Out of 147 loci known for their association with kidney function, our analysis identified 11 that were more strongly associated with eGFRcrea in the CHRIS study sample than in average European-ancestry population samples. Observed differences in effect magnitude could not be explained by MAF differences. Extensive mediation analysis identified serum magnesium, aPTT, and urate as partial mediators of the association of eGFRcrea with variants at *SHROOM3*, *SLC34A1*, and *IGF1R*, respectively. SNP-by-TSH interaction analysis highlighted that the effect of variants at *STC1* on eGFRcrea would vary by TSH levels. With the exception of *SHROOM3* and *SLC34A1*, for which a relatively small sample size proved to be sufficient for their detection32, if our replicated loci would had not had a large effect size, they would not have been replicated. But the question remains as to why precisely these and no other loci showed such large effects in CHRIS. SNPs may show a larger effects at lower MAF. This situation was observed at just *DDX1* and *CASZ1*, whose effects got larger as the CHRIS-to-CKDGen MAF ratio got smaller. However, the MAF difference was not related to the larger effects observed at *GAB2*, *TMEM60*, *PIP5K1B*, and *IGF1R*, where the SNP MAF was similar or even smaller in CKDGen than in CHRIS (Fig. 2). The presence of un-modeled sample structure causing genomic inflation can be excluded as relatedness was appropriately modeled. We can also exclude the presence of genetic admixture as the CHRIS study was conducted in a small area, on a local population of homogeneous ancestry15, 37–39. Differences in LD structure between the local and the overall population can also be excluded as regional association plots showed similar shape of the association results in CHRIS and CKDGen. Epistatic gene-gene interaction would be a further possibility that cannot be easily verified. The most plausible remaining reasons would be the presence of local effect modifiers or gene-by-environment interaction. Mediation analysis based on commonly measured biochemical and anthropometric traits provided limited answers in this respect but highlighted relevant mechanisms at three loci. The first finding was the identified presence of partial mediation of serum magnesium levels on the association of eGFRcrea with the *SHROOM3* locus. Genetic variants at *SHROOM3* have been associated with CKD8,9, reduced eGFRcrea9, increased albumin-to-creatinine ratio10, and low serum magnesium levels11. *SHROOM3* is necessary to maintain the glomerular filtration barrier integrity40. Variants in this gene have been shown to be associated with increased risk of CKD41, likely through disruption of the transcription factor TCF7L2 in podocyte cells. At the same locus, *FAM47E* is highly expressed in the human glomeruli and its transcription levels were associated with eGFRcrea42. Additional genes in this locus include *STBD1* and *CCDC158*, for which no connection with kidney function was demonstrated. The kidney is one of the primary regulators of serum magnesium concentration. It has been suggested that *SHROOM3* might increase serum magnesium levels through decreased renal magnesium excretion43. At the *SHROOM3* rs13146355 SNP, a previous GWAS on serum magnesium showed that the allele related with lower magnesium levels was associated with higher eGFRcrea levels35. The authors observed that the rs13146355 association with magnesium did not change when adjusting for eGFRcrea, suggesting pleiotropic independent effects. In our analysis, the effect of rs13146355 on eGFRcrea was reduced by 11% when adjusting for magnesium, suggesting partial mediation and thus the presence of partially overlapping mechanisms. Deeper analyses at the molecular level are warranted to identify the relevant mechanism. A similar situation was observed at *SLC34A1* which, in addition to eGFRcrea33, was also previously associated with aPTT in a GWAMA of 9240 European ancestry individuals34 including participants from the same areas where the CHRIS study was sampled. By quantifying the clotting time from the activation of factor XII (*F12*) until the formation of a fibrin clot, aPTT is a measure of the integrity of the intrinsic and common coagulation pathways44. The observed mediation, in which adjustment for aPTT increases the SNP-eGFRcrea association by 21%, is likely related to the proximity between *SLC34A1* and the coagulation factor gene *F12*. The LD at this locus is compatible with the presence of common haplotypes tagging both genes, inducing pleiotropic independent effects on eGFRcrea and aPTT. Finally, at the insulin like growth factor 1 receptor (*IGF1R*) locus, we observed an eGFRcrea-SNP altered association when adjusting for urate levels. Mediation could not be confirmed using CHRIS data only, likely due to lack of power, but it was confirmed through interrogation of publicly available summary statistics: SNPs at *IGF1R* were previously associated with uric acid45. The significant mediation probably reflects the involvement of *IGR1R* in urate reabsorption in the proximal tubule of the kidney46. An additional relevant finding was the presence of an interaction between two SNPs at *STC1* and TSH levels. *STC1* is chiefly expressed in the thyroid gland ([https://gtexportal.org/home/gene/STC1](https://gtexportal.org/home/gene/STC1)). It is also highly expressed in the kidney, where it is exclusively expressed in the collecting duct47. *STC1* is also associated with urate and urea levels and is involved in the physiological response to dehydration and protein-rich diet47. The *STC1*-encoded protein, stanniocalcin-1, is involved in phosphate reabsorption in the kidney proximal tubules and in multiple pathophysiological mechanisms including ischemic kidney injury48. Stanniocalcin-1 behaves like an endocrine hormone, with both autocrine and paracrine functions. Our observation of a SNP effect on eGFRcrea depending on TSH levels would suggest that altered thyroid function levels might affect Stanniocalcin-1 levels with consequent effect on kidney function phenotypes. While intriguing, this hypothesis should be verified by appropriate studies. Our study had both strengths and limitations. Using data from a local homogeneous population sample was at the same time the most relevant strength and limitation of our study. While homogeneity may enhance the identification of specific genetic associations, it may also hinder the possibility to explain the underlying reasons of the observed larger effects compared to the average European ancestry population. The availability of a very large number of clinical parameters has allowed us to conduct an exhaustive and unbiased screening on potential mediation mechanisms. However, the analyzed traits lacked molecular specificity to identify underlying mechanisms. Finally, assessment of environmental exposures47 may also be a tool to explain the observed larger effects and would be ground for further research. In conclusion, our study has identified loci that have particularly large effect on kidney function in the specific Alpine population sample considered in this study, namely *CASZ1*, *IGF1R*, *GAB2*, and *DDX1*. Allele frequency and LD analyses did not explain the observed larger effects nor did the extensive mediation analyses conducted. On the other hand, we showed that the effect of *SHROOM3*, *SLC34A1*, and *IGF1R* on eGFRcrea is partially mediated by serum magnesium, aPTT, and urate levels, respectively. Finally, the observed *STC1*-TSH interaction implicates thyroid function involvement, which might have been favored by the high burden of thyroid-related diseases in the region. Further investigations are warranted to link the identified loci to environmental and molecular characteristics of specific population samples that may elucidate mechanisms not otherwise identifiable in large genetic meta-analyses. ## Supporting information Supplementary Tables [[supplements/288540_file03.xlsx]](pending:yes) ## Data Availability The data used for the current study and the results are stored in a controlled storage at the Eurac Research Institute for Biomedicine. The data are not openly available due to reasons of sensitivity and can be requested with an application to access.request.biomedicine{at}eurac.edu. ## Funding The CHRIS study was funded by the Department of Innovation, Research and University of the Autonomous Province of Bolzano-South Tyrol. This work was carried out within the TrainCKDis project, funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement H2020-MSCA-ITN-2019 ID:860977 (TrainCKDis). ## Author contributions Conceptualization of the research project: C.P., D.G. Recruitment and study management: M.G., P.P.P., C.P. Bioinformatics: D.G., E.K., D.E., C.F. Data quality control and harmonization: M.G., L.B., L.F. Statistical Analysis: D.G. Results interpretation: D.G., C.P., C.F., D.E., M.G., L.F., R.F. Manuscript drafting: D.G., C.P. Manuscript critical revision: D.E., E.K., R.F., L.F., L.B., M.G., D.J.M.P., P.P.P., C.F., D.G., C.P. ## Competing interests The authors declare no competing financial interests. ## Supplementary Material ### Supplementary method Quantile normalization of the 70 health traits used in the mediation analysis. We used *normalize2Reference* function in the caret R package version 6.0-34 (the function is available in the later versions here: [https://github.com/topepo/caret/tree/master/deprecated](https://github.com/topepo/caret/tree/master/deprecated)). Each trait was measured partially with an older method and partially with a newer, most recent method. The observations from the newer method (method = 1) were set as the reference to derive the quantiles of the trait distribution on which to standardize the observations from the older method (method = 0). We then replaced the observation from the older method with the standardized ones and included together with the values from the newer method. Notation: * – **trait**: the quantitative variable to which quantile normalization was applied; * – **method:** the newer method for measuring the trait (1); the older method (0); * – **df:** dataframe; the data in which the trait of interest is available ![Figure6](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/23/2023.04.15.23288540/F6.medium.gif) [Figure6](http://medrxiv.org/content/early/2023/04/23/2023.04.15.23288540/F6) ## Acknowledgements The CHRIS study is a collaborative effort between the Eurac Research Institute for Biomedicine and the Healthcare System of the Autonomous Province of Bozen/Bolzano. We thank all study participants, the general practitioners, the personnel of the Hospital of Schlanders/Silandro, the field study team and the personnel of the CHRIS Biobank (BRIF code BRIF6107) for their support and collaboration. Extensive acknowledgement is reported at [https://translationalmedicine.biomedcentral.com/articles/10.1186/s12967-015-0704-9](https://translationalmedicine.biomedcentral.com/articles/10.1186/s12967-015-0704-9). * Received April 15, 2023. * Revision received April 15, 2023. * Accepted April 23, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. Eckardt, K. U. et al. Evolving importance of kidney disease: from subspecialty to global health burden. Lancet 382, 158–169, doi:10.1016/s0140-6736(13)60439-0 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(13)60439-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23727165&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000322115300031&link_type=ISI) 2. Foreman, K. J. et al. Forecasting life expectancy, years of life lost, and all-cause and cause-specific mortality for 250 causes of death: reference and alternative scenarios for 2016-40 for 195 countries and territories. Lancet 392, 2052–2090, doi:10.1016/s0140-6736(18)31694-5 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(18)31694-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30340847&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 3. Global, regional, and national burden of chronic kidney disease, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet 395, 709–733, doi:10.1016/s0140-6736(20)30045-3 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(20)30045-3&link_type=DOI) 4. Naik, R. P. et al. Association of sickle cell trait with chronic kidney disease and albuminuria in African Americans. Jama 312, 2115–2125, doi:10.1001/jama.2014.15063 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2014.15063&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25393378&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 5. Brück, K. et al. CKD prevalence varies across the European general population. Journal of the American Society of Nephrology 27, 2135, doi:10.1681/ASN.2015050542 (2016). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6OToiMjcvNy8yMTM1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDQvMjMvMjAyMy4wNC4xNS4yMzI4ODU0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 6. Köttgen, A. & Pattaro, C. The CKDGen Consortium: ten years of insights into the genetic basis of kidney function. Kidney Int 97, 236–242, doi:10.1016/j.kint.2019.10.027 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.kint.2019.10.027&link_type=DOI) 7. Stanzick, K. J. et al. Discovery and prioritization of variants and genes for kidney function in >1.2 million individuals. Nature Communications 12, 4350, doi:10.1038/s41467-021-24491-0 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-021-24491-0&link_type=DOI) 8. Chong, Jessica X. et al. The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities. The American Journal of Human Genetics 97, 199–215, doi:[https://doi.org/10.1016/j.ajhg.2015.06.009](https://doi.org/10.1016/j.ajhg.2015.06.009) (2015). 9. Montasser, M. E. et al. Genetic and functional evidence links a missense variant in B4GALT1 to lower LDL and fibrinogen. Science 374, 1221–1227, doi:10.1126/science.abe0348 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/science.abe0348&link_type=DOI) 10. Hamet, P. et al. PROX1 gene CC genotype as a major determinant of early onset of type 2 diabetes in slavic study participants from Action in Diabetes and Vascular Disease: Preterax and Diamicron MR Controlled Evaluation study. J Hypertens 35 **Suppl 1**, S24–s32, doi:10.1097/hjh.0000000000001241 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/HJH.0000000000001241&link_type=DOI) 11. Humphries, E. M. et al. Genome-wide significant risk loci for mood disorders in the Old Order Amish founder population. Mol Psychiatry, doi:10.1038/s41380-023-02014-1 (2023). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41380-023-02014-1&link_type=DOI) 12. Pattaro, C. et al. The Cooperative Health Research in South Tyrol (CHRIS) study: rationale, objectives, and preliminary results. Journal of Translational Medicine 13, 348, doi:10.1186/s12967-015-0704-9 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12967-015-0704-9&link_type=DOI) 13. Pattaro, C. et al. Genome-wide linkage analysis of serum creatinine in three isolated European populations. Kidney International 76, 297–306, doi:[https://doi.org/10.1038/ki.2009.135](https://doi.org/10.1038/ki.2009.135) (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ki.2009.135&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19387472&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000267972800011&link_type=ISI) 14. Pattaro, C. et al. A meta-analysis of genome-wide data from five European isolates reveals an association of COL22A1, SYT1, and GABRR2with serum creatinine level. BMC Medical Genetics 11, 41, doi:10.1186/1471-2350-11-41 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2350-11-41&link_type=DOI) 15. Gögele, M., Pattaro, C., Fuchsberger, C. & Pramstaller, P. P. Fertility pattern and family structure in three Alpine settlements in South Tyrol (italy): marriage cohorts from 1750 to 1949. J Biosoc Sci 41, 697–701, doi:10.1017/s0021932009003423 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0021932009003423&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19645089&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000269022700010&link_type=ISI) 16. Bolzano, A. o. t. P. o. Prevalence of the Chronic Diseases, <[https://www.provincia.bz.it/salute-benessere/osservatorio-salute/diffusione-delle-malattie-croniche.asp](https://www.provincia.bz.it/salute-benessere/osservatorio-salute/diffusione-delle-malattie-croniche.asp)> (2019). 17. Wuttke, M. et al. A catalog of genetic loci associated with kidney function from analyses of a million individuals. Nat Genet 51, 957–972, doi:10.1038/s41588-019-0407-x (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0407-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31152163&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 18. Noce, D. et al. Sequential recruitment of study participants may inflate genetic heritability estimates. Hum Genet 136, 743–757, doi:10.1007/s00439-017-1785-8 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00439-017-1785-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28374192&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 19. Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software 67, 1–48, doi:10.18637/jss.v067.i01 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18637/jss.v067.i01&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23757445&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 20. Levey, A. S. et al. A new equation to estimate glomerular filtration rate. Ann Intern Med 150, 604–612, doi:10.7326/0003-4819-150-9-200905050-00006 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/0003-4819-150-9-200905050-00006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19414839&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000265903800004&link_type=ISI) 21. Pattaro, C. et al. Estimating the glomerular filtration rate in the general population using different equations: effects on classification and association. Nephron Clin Pract 123, 102–111, doi:10.1159/000351043 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000351043&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23797027&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 22. O’Connell, J. et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet 10, e1004234, doi:10.1371/journal.pgen.1004234 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1004234&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24743097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 23. Das, S. et al. Next-generation genotype imputation service and methods. Nat Genet 48, 1284–1287, doi:10.1038/ng.3656 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3656&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27571263&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 24. Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88, 76–82, doi:10.1016/j.ajhg.2010.11.011 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2010.11.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21167468&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 25. Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42, 348–354, doi:10.1038/ng.548 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.548&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20208533&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000276150500016&link_type=ISI) 26. Nolte, I. M. Metasubtract: an R-package to analytically produce leave-one-out meta-analysis GWAS summary statistics. Bioinformatics 36, 4521–4522, doi:10.1093/bioinformatics/btaa570 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btaa570&link_type=DOI) 27. Zhao, H. et al. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics 30, 1006–1007, doi:10.1093/bioinformatics/btt730 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btt730&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24351709&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000334078300017&link_type=ISI) 28. Quick, C. et al. emeraLD: rapid linkage disequilibrium estimation with massive datasets. Bioinformatics 35, 164–166, doi:10.1093/bioinformatics/bty547 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty547&link_type=DOI) 29. Kamat, M. A. et al. PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics 35, 4851–4853, doi:10.1093/bioinformatics/btz469 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btz469&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31233103&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 30. Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res 51, D977–d985, doi:10.1093/nar/gkac1010 (2023). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkac1010&link_type=DOI) 31. Teumer, A. et al. Genome-wide analyses identify a role for SLC17A4 and AADAT in thyroid hormone regulation. Nature Communications 9, 4455, doi:10.1038/s41467-018-06356-1 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-018-06356-1&link_type=DOI) 32. Köttgen, A. et al. Multiple loci associated with indices of renal function and chronic kidney disease. Nat Genet 41, 712–717, doi:10.1038/ng.377 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.377&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19430482&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000266411700021&link_type=ISI) 33. Köttgen, A. et al. New loci associated with kidney function and chronic kidney disease. Nat Genet 42, 376–384, doi:10.1038/ng.568 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.568&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20383146&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000277179500006&link_type=ISI) 34. Tang, W. et al. Genetic associations for activated partial thromboplastin time and prothrombin time, their gene expression profiles, and risk of coronary artery disease. Am J Hum Genet 91, 152–162, doi:10.1016/j.ajhg.2012.05.009 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2012.05.009&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22703881&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 35. Meyer, T. E. et al. Genome-wide association studies of serum magnesium, potassium, and sodium concentrations identify six Loci influencing serum magnesium levels. PLoS Genet 6, doi:10.1371/journal.pgen.1001045 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1001045&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20700443&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 36. Volpato, C. B. et al. Linkage and association analysis of hyperthyrotropinaemia in an Alpine population reveal two novel loci on chromosomes 3q28-29 and 6q26-27. J Med Genet 48, 549–556, doi:10.1136/jmg.2010.088583 (2011). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiam1lZGdlbmV0IjtzOjU6InJlc2lkIjtzOjg6IjQ4LzgvNTQ5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDQvMjMvMjAyMy4wNC4xNS4yMzI4ODU0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 37. Pattaro, C. et al. The genetic study of three population microisolates in South Tyrol (MICROS): study design and epidemiological perspectives. BMC Med Genet 8, 29, doi:10.1186/1471-2350-8-29 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2350-8-29&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17550581&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 38. Gögele, M. et al. Heritability analysis of life span in a semi-isolated population followed across four centuries reveals the presence of pleiotropy between life span and reproduction. J Gerontol A Biol Sci Med Sci 66, 26–37, doi:10.1093/gerona/glq163 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/gerona/glq163&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20884848&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286655300004&link_type=ISI) 39. Riegler, A., Marroni, F., Pattaro, C., Gueresi, P. & Pramstaller, P. P. Isolation and marriage patterns in four South Tyrolean villages (Italy) during the nineteenth century. J Biosoc Sci 40, 787–791, doi:10.1017/s0021932007002568 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/s0021932007002568&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18677805&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000258735800009&link_type=ISI) 40. Yeo, N. C. et al. Shroom3 contributes to the maintenance of the glomerular filtration barrier integrity. Genome Res 25, 57–65, doi:10.1101/gr.182881.114 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjc6IjI1LzEvNTciO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wNC8yMy8yMDIzLjA0LjE1LjIzMjg4NTQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 41. Prokop, J. W. et al. Characterization of Coding/Noncoding Variants for SHROOM3 in Patients with CKD. J Am Soc Nephrol 29, 1525–1535, doi:10.1681/asn.2017080856 (2018). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6OToiMjkvNS8xNTI1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDQvMjMvMjAyMy4wNC4xNS4yMzI4ODU0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 42. Ledo, N. et al. Functional genomic annotation of genetic risk loci highlights inflammation and epithelial biology networks in CKD. J Am Soc Nephrol 26, 692–714, doi:10.1681/asn.2014010028 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiam5lcGhyb2wiO3M6NToicmVzaWQiO3M6ODoiMjYvMy82OTIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wNC8yMy8yMDIzLjA0LjE1LjIzMjg4NTQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 43. Larsson, S. C., Drca, N. & Michaëlsson, K. Serum Magnesium and Calcium Levels and Risk of Atrial Fibrillation. Circulation: Genomic and Precision Medicine 12, e002349, doi:doi:10.1161/CIRCGEN.118.002349 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1161/CIRCGEN.118.002349&link_type=DOI) 44. Kitchen, S., McCraw, A. & Echenagucia, M. Diagnosis of hemophilia and other bleeding disorders. World Federation of Hemophilia (2010). 45. Köttgen, A. et al. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations. Nat Genet 45, 145–154, doi:10.1038/ng.2500 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2500&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23263486&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F23%2F2023.04.15.23288540.atom) 46. Mandal, A. K. et al. Genetic and Physiological Effects of Insulin-Like Growth Factor-1 (IGF-1) on Human Urate Homeostasis. J Am Soc Nephrol 34, 451–466, doi:10.1681/asn.0000000000000054 (2023). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1681/asn.0000000000000054&link_type=DOI) 47. Park, J. et al. Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease. Science 360, 758–763, doi:10.1126/science.aar2131 (2018). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNjAvNjM5MC83NTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wNC8yMy8yMDIzLjA0LjE1LjIzMjg4NTQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 48. Zhao, F. et al. Expression, function and clinical application of stanniocalcin-1 in cancer. J Cell Mol Med 24, 7686–7696, doi:10.1111/jcmm.15348 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/jcmm.15348&link_type=DOI)