Genetic regulation of the plasma proteome and its link to cardiometabolic disease in Greenlandic Inuit ====================================================================================================== * Sara E. Stinson * Renzo F. Balboa * Mette K. Andersen * Frederik F. Stæger * Shixu He * Anne Cathrine Baun Thuesen * Long Lin * Niels Grarup * Marit E. Jørgensen * Ida Moltke * Anders Albrechtsen * Torben Hansen ## Summary **Background** Circulating proteins play essential roles in numerous complex diseases, yet our understanding of the genetic influences on inflammation and cardiovascular-related proteins in non-European isolated populations remains sparse. Here, we aimed to characterise the genetic architecture of plasma protein biomarkers in a Greenlandic population. **Methods** Using combined data from Greenlandic population cohorts (n=3,707 individuals), including genotypes and plasma proteomics (177 proteins) from Olink Target 96 Inflammation and Cardiovascular II panels, we performed a protein quantitative trait loci (pQTL) study using a linear mixed model accounting for relatedness and population structure. **Findings** Mapping of 177 plasma proteins in 3,707 Greenlandic individuals (mean age 47.9; 54.5% female) reveal 235 additive (92 cis and 143 trans) and 16 recessive (1 cis and 15 trans) primary pQTLs, 48 secondary pQTLs, and 70 novel pQTLs (28%). We demonstrate a higher proportion of variance in protein levels explained in Inuit compared to European individuals from the UK Biobank (e.g. IL-27, IgGFcRII-b, MCP-2, IL-16, and Gal-9). We describe changes in protein expression associated with known high impact Greenlandic-specific variants, including in *CPT1A*, *TBC1D4*, *HNF1A*, *LDLR*, and *PCSK9* loci on inflammation and cardiovascular-related biomarkers. **Interpretation** These findings highlight the importance of genome-wide plasma proteomic analyses in Greenland Inuit and diverse populations in general, with implications for biomarker and therapeutic target development. **Funding** Novo Nordisk Foundation, The Independent Research Fund Denmark, and Karen Elise Jensen Foundation. **Evidence before this study** Recent affinity-based proteomic studies have been performed in large biobank-scale cohorts such as the UK Biobank (54,219 individuals, Olink) and deCODE (35,559 individuals, SomaScan). Several smaller scale studies have also been performed in isolated European populations, including MANOLIS and Pomak (Hellenic), Orkney (Scotland), and Vis (Croatia). Studies in non-European populations are also beginning to emerge, including in the China Kadoorie biobank. Studies performed in diverse populations have the ability to identify population-specific variants in genes implicated in regulating the expression of proteins which may be causally linked to cardiovascular disease and inflammation. In particular, small and historically isolated populations, such as Greenlandic Inuit are more likely to harbour common variants with larger effect sizes that may contribute to health and disease. **Added value of this study** This study reports 251 protein quantitative trait loci (pQTLs) associated with the abundance of 177 plasma proteins, 28% of which have not been reported in prior studies. We identified 48 additional pQTLs in a secondary conditional analysis. We identified novel pQTLs that were common in Greenland, but rare globally (including ST1A1 and DCN). We found pQTLs which explained a substantial proportion of variance (>30%) in protein abundance (including IL-27, IgGFcRIIb, MCP2, IL-16, and Gal-9), when compared to Europeans, due to the combination of large effects and high frequencies within the Greenlandic population. We examined differences in protein abundance in carriers of Arctic-specific variants (e.g. *CPT1A, TBC1D4*, *HNF1A*, *LDLR*, and *PCSK9*) implicated in lipid metabolism or cardiometabolic disease, revealing underlying biological mechanisms. **Implications of all the available evidence** Given that both genetics and the environment affect proteins that may be causally linked to disease, it is crucial to perform genome-wide association studies in smaller populations of diverse genetic ancestry, to ensure equity in genetic discovery. Investigating the effect of previously identified Inuit-specific variants on protein expression revealed links to therapeutic targets for metabolic disease, which could have implications for the health care system in Greenland, including access to treatment. Keywords * Proteomics * Genomics * Diverse populations * Population isolates * Greenland Inuit * Cardiovascular disease * Type 2 diabetes ## Introduction Circulating proteins measured in the blood represent intermediate phenotypes for disease states and are strongly linked to both causative genetic variants and disease outcomes1. Thus, differences in protein expression make ideal candidates to study. Proteomics data has only recently become accessible to large-scale genome-wide association approaches, largely due to analytical limitations specific to the composition of the plasma proteome2. Advances in high-throughput proteomics methods (e.g. Olink, SomaScan, and mass-spectrometry approaches) now enable measurements of plasma levels of hundreds to thousands of proteins concurrently3,4. Unlike mass-spectrometry, the Olink Proximity Extension Assay captures less abundant proteins, allowing for the quantification of cytokines and chemokines relevant to the pathophysiology of cardiometabolic disease5. Identification of genetic variants associated with altered plasma protein levels (protein quantitative trait loci, pQTLs) can enhance our understanding of the role of proteins in health and disease6. Discovery of genetic determinants of protein levels help elucidate the causal role of given proteins on disease risk, which may allow for improved risk stratification to enhance early diagnosis and identification of potential drug targets for treatment or prevention of such diseases7. To date, the vast majority of studies have restricted analyses of the genetic architecture of the plasma proteome to European (EUR) ancestries8–13; this includes large-scale biobanks such as the UK Biobank Pharma Proteomics Project (UKB-PPP)14 and deCODE Genetics. For example, the discovery analysis in UKB-PPP was limited to EUR individuals, with replication in a limited number of non-EUR individuals14. Although separate efforts have been made to analyse pQTLs in non-EUR, including East Asian (EAS)15,16 and Middle Eastern (MID)17 ancestries, diverse populations are in comparison, highly underrepresented. Two studies have been performed in isolated European populations including MANOLIS, Pomark, and ORCADES in combination with whole-genome sequencing18,19. Studying isolated populations has the power to enhance the discovery of disease associated loci due to the enrichment of certain rare variants with larger effects20,21. The Greenland Inuit population, comprising ∼57,000 individuals, is a small and historically isolated founder population, and has experienced increases in cardiovascular disease and type 2 diabetes (T2D) prevalence in the last 25 years22. Previous studies have revealed that complex diseases, including T2D, often manifest differently and have population-specific causes when compared to other global populations. How such variants are associated with differences in protein expression within Inuit, and its comparison with other populations, however, have yet to be studied. An investigation into these relationships will allow for a better understanding of the mechanism of disease within this population, and other diverse populations in general. Here, we examined the genetic architecture of the targeted plasma proteome in 3,707 Greenland Inuit individuals. The aims of this study were to: (i) perform a GWAS analyses of 177 plasma proteins measured by the Olink Target 96 Inflammation and Cardiovascular II panels; (ii) compare the genetic architecture of the targeted proteome between GL and EUR populations through comparisons with the UKB-PPP, and (iii) assess the impact of known high impact GL specific variants on circulating protein levels. ## Methods ### Ethics statement The study has received ethical approval from the Science Ethics Committee in Greenland (project 505-42, project 505-95, project 2011–13 [ref. no. 2011–056978], project 2017-5582, project 2015–22 [ref. no. 2015-16426], and project 2021-09 [16669546]). The study was conducted in accordance with the Declaration of Helsinki, second revision. All participants gave written informed consent. ### Study populations A total of 3,707 participants were included in this study, pooled from two cross-sectional cohorts: the Greenland Population Study (B99) (n = 15, 1998–2001)23 and the Inuit Health in Transition (IHIT) cohort (n = 2,130, 2005–2010)24. We further included B2018, a follow-up study of a subset of B99 and IHIT participants (n = 1,587, 2017-2019)25. ### Plasma proteomics Relative levels of 184 plasma proteins were measured in 3,707 participants using the Olink Target 96 Inflammation and Cardiovascular II panels across two batches. Seven proteins were common to both panels, yielding a total of 177 unique proteins that were measured. The Olink batches were bridged and normalised using 16 controls using the *OlinkAnalyze* R package ([https://cran.r-project.org/web/packages/OlinkAnalyze/index.html](https://cran.r-project.org/web/packages/OlinkAnalyze/index.html)). Normalised protein expression values (NPX) on a log2 scale were used for downstream analyses. Protein values below the lower limit of detection were included in analyses, while those failing Olink’s internal quality control warnings were excluded. ### Genotyping and imputation All samples were genotyped using the Multi-Ethnic Genotyping Array (MEGA chip, Illumina), composed of ∼2M variants across the human genome. This chip was used due to the inclusion of variants selected from Indigenous Americans and East Asians, maximising variant capture in Inuit. Genotypes were subsequently jointly called using GenCall within GenomeStudio (Illumina). Duplicate samples, individuals with >0.05 missing genotypes, and variants with >0.01 missing genotypes were excluded from analyses. After quality control, our variant set comprised 1.6M variants within this study. The same reference panel described in Senftleber et al.26 was utilised in this study. Briefly, whole genome sequencing data from 447 Greenlanders were genotyped using GATK and phased using SHAPEIT227, using available duo/trio information. These data were then combined with 1000 Genomes28 data and utilised as a reference for imputing common (MAF >0.5%) variants using IMPUTE229. The imputed data was used to estimate a genetic relationship matrix (GRM) and admixture proportions for inclusion in genome-wide association analyses, described below. ### Genome-wide association analyses For association analyses, a linear mixed model (LMM) was implemented, accounting for relatedness and population structure within Inuit (GEMMA30 version 0.98.5). The GRM was estimated from SNPs with <1% missingness, a MAF >5%, info-score >0.95. PCAngsd31 was used to calculate per-site inbreeding coefficient (site-F) accounting for population structure. Sites with site-F>|0.05| and likelihood-ratio test P-value <1×10−6 were also excluded from the GRM estimation. Association tests of additive and recessive models were performed using a score test. Effect sizes and standard errors were estimated using a restricted maximum likelihood approach. Normalised protein expression (NPX) values were quantile transformed to a normal distribution using a rank-based inverse normal transformation. *P*-values are from analyses of transformed data and effect size estimates are reported in standard deviations as βSD. In all analyses, sex, age, and cohort were included as covariates. ### Definition and refinement of significant loci Primary pQTLs were first defined by distance-based clumping of ± 10Mb, reporting one representative SNP-protein association with the lowest P-value within this distance to reduce LD-related false positives downstream. Should a protein be present in both the cardiovascular and inflammation panels, we took pQTLs with the lowest P-value in either panel to avoid duplication. We reported the total number of identified significant associations using the conventional genome-wide threshold of 5×10−8. Some SNP-protein pairs were genome-wide significant when tested in both additive and recessive models. To avoid double counting and to retain independence, if the same SNP-protein pair was genome-wide significant in both additive and recessive models, we reported the lowest p-value from the two, defined as ‘mostly additive’ if Padd < Prec and ‘mostly recessive’ if Prec < Padd. SNPs within 1Mb significant for the same phenotype were also merged to the SNP with the lowest p-value, even if SNPs were detected with different models. To test whether a SNP exhibited a ‘true’ additive/recessive mode of inheritance, we tested whether the full model with full flexibility to the mode of inheritance (i.e. the alternative mode of inheritance is added as a covariate in the LMM) was significantly better than when using either an additive or recessive model, respectively. We subsequently defined the following categories for this test using an alpha threshold of 0.05: a) recessive/reject additive – where the full model is significantly better than the additive model, and the full model is not significantly better than the recessive model; b) additive/reject recessive – where the full model is not significantly better than the additive model, and the full model is significantly better than the recessive model; c) co-dominant/reject both – where the full model is significantly better than both the additive and recessive models, respectively. To perform *cis/trans* classifications, we extracted the start/end information for all proteins by mapping UniProt IDs or gene names to Ensembl BioMart. Using these annotations, pQTLs were subsequently classified into four categories; for each protein, a *cis*-pQTL was defined when a SNP resided within 100 Kb upstream or downstream (± 100 Kb) of either the start or end of the gene encoding the associated protein, taking whichever was closer to the SNP; a *semi-cis*-pQTL was defined when a SNP resided within ± 100kb-5Mb of the gene encoding the associated protein; a *semi-trans*-pQTL was defined when a SNP resided within the same chromosome but >5Mb; and a *trans*-pQTL was defined when a SNP was located in a different chromosome with respect to the corresponding protein-coding gene. ### Secondary associations and conditional analyses For conditional analyses, primary SNPs reported to have P<1×10−6 were tested. For each test, the same GWAS procedure was performed as above, but adding the SNP as a covariate in the full model. When performing these analyses, we discovered that some SNPs in between two independent primary peaks retained significance for one peak, but when tested against another, lost significance. These SNPs were subsequently filtered out to ensure independence. Similar to our filters for primary SNPs, to avoid double counting and to retain independence, if the same SNP retained genome-wide significance in both additive and recessive models, we reported the lowest p-value from the two models, defined as ‘mostly additive’ if Padd< Prec and ‘mostly recessive’ if Prec < Padd. Secondary associations were then reported with their primary SNPs. ### Proportion of variance explained by pQTLs To ensure a fairer comparison when investigating effect sizes and proportion of variance explained (PVE) by pQTLs, we performed the same association analysis described above using a random subset of 3,707 individuals from the UKB-PPP publication14. Of the 177 unique proteins assayed within this study, 176 were matched using UniProtIDs (VSIG2 was not assayed in the dataset described by Sun et al.14). The following equation was used to estimate PVEs by pQTLs from GWAS summary statistics for each protein: ![Formula][1] Where ß represents the effect size, SE represents the standard error of the effect size, and N represents the number of individuals assayed. Comparisons were then made between our Greenland Inuit dataset (n=3,707), the UKB-PPP subset (n=3,707), and the UKB-PPP summary dataset (n=54,219; described in Supplementary Table 10 in Sun et al.14). ### Identification of novel pQTLs To assess the novelty of the identified pQTLs in this study, we compared our results at genome-wide significance level to 34 previously published pQTL studies, including the most recent UKB-PPP publication14. For all studies, we retained the pQTLs at the reported significance levels. We initially defined the novelty of variant-protein pairs if no variants residing within ± 1Mb of the primary pQTLs in this study had been reported in previous plasma/serum pQTL studies for the corresponding primary protein, and otherwise a replication as in Niu et al. 32. However, after closer examination of –log10 P values between the UKB-PPP subset (n=3,707) and our Greenland Inuit dataset (n=3,707) (Supplementary Figures 1 and 2; Supplementary Table 4), we increased our distance-based threshold to ± 10Mb either side of the sentinel variant to reduce false positives for novel pQTLs; this threshold was also previously used to define novel pQTLs in Thareja et al.17. Comparisons were done at protein level by matching the reported gene name from each study. Gene names were mapped based on Uniprot IDs through Uniprot ID mapping in case of missing values. ### Analysis of known high-impact cardiometabolic-related Greenlandic variants Summary data describing the associations between known high-impact Greenlandic specific variants in *CPT1A* (rs80356779), *TBC1D4* (rs61736969), *HNF1A* (rs2135845768), *LDLR* (rs730882082), and *PCSK9* (rs4609471) on the abundance of 177 Olink proteins measured in this study were examined. False discovery rate (FDR) was calculated to account for multiple testing. ### Role of funding source None of the funding agencies had any role in the study design or collection, analysis, or interpretation of the data. ## Results ### Genetic architecture of circulating plasma proteome We performed pQTL mapping for 177 proteins measured using the Olink Target Inflammation and Cardiovascular II panels (Supplementary Table 1) in four Greenlandic cohorts, totalling 3,707 participants from 19 different localities in Greenland and Denmark (Supplementary Table 2). We identified a total of 251 genome-wide significant (*P*<5×10−8) associations between 225 genomic regions and 133 proteins (Figure 1, Supplementary Table 3). Of the pQTLs, 235 were classified as additive and 16 as recessive. We found relatively low inflation, reporting a lambda factor of 0.985 ± 0.0246 (mean ± SD) for pQTLs discovered using an additive model and 0.989 ± 0.0250 for pQTLs discovered using a recessive model (Supplementary Figure 3). As expected, we found that the majority of SNPs were associated with a single protein, yet a large portion of proteins (56.4%) were associated with more than one SNP. To date, 181 of the identified associations have been previously reported, in almost equal proportions in *cis* and *trans* (Figure 1f). Conversely, 70 (28%) associations have not previously been reported in peer-reviewed articles (± 10Mb from the nearest reported variant), in which the vast majority of these associations were reported in *semi-trans* and *trans* (96%; Figure 1f; Supplementary Table 4). Interestingly, two of the three novel associations reported in *cis* that did not overlap with any of the examined published studies were extremely rare in European populations, yet were common in Inuit (chr16:28596376:T:C for ST1A1 – MAF=0.064 in Inuit and MAF=0.001 in Europeans; chr12:91158410:T:C for DCN – MAF=0.230 in Inuit and MAF<0.001 in Europeans; Supplementary Tables 3 & 4). Of the significant associations, 93 (37%) were local-acting (85 *cis*, 8 *semi-cis*; Figure 1k; Methods) and 158 (63%) were distant-acting (14 *semi-trans* and 144 *trans*; Figure 1k; Methods). Of these, 51 unique proteins were identified as acting both as *cis* and *trans* (defined as <±5Mb of the gene encoding the associated protein and >±5Mb of the gene encoding the associated protein or on a different chromosome, respectively). In addition, conditional analyses further revealed 48 secondary pQTLs, of which, 47 were additive (36 *cis*, 3 *semi-cis;* 8 *trans*) and one recessive (in *cis*) (Supplementary Table 5). ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/05/2024.07.03.24309577/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/07/05/2024.07.03.24309577/F1) Figure 1. Genetic architecture of pQTLs in Greenland. **a** Significant pQTLs across the genome using additive (red) and recessive (blue) models using GEMMA. A genome-wide significance threshold of 5×10−8 is indicated in red. P-values are capped at 1×10−50 for both additive and recessive models, as indicated by dotted lines. Numbers between Manhattan plots describe chromosomes 1-22. **b** pQTLs classified according to model test significance. ‘Additive’ and ‘recessive’ indicate pQTLs that reject the recessive model in favour of the full model, and the additive model in favour of the full model, respectively (Methods). ‘Co-dominant’ indicates pQTLs that reject both the additive and recessive models in favour of the full model. **c** Number of pQTLs detected per chromosome. **d** Number of associated proteins per variant. **e** Number of associated variants per protein. **f** Number of known and novel associations in *cis*, *semi-cis*, *semi-trans* and *trans* (defined in k). Known indicates associations within ± 10Mb of any significant pQTL reported in previous studies (Methods). Novel indicates associations >± 10Mb of any significant pQTL reported in previous studies (Methods). **g** Number of proteins with pQTLs in *cis*, *trans* or both. *Trans* pQTLs here are defined as >5Mb from the start/end positions of the primary gene relevant for each Olink protein. **h** Dot plot indicating pQTL positions across the genome. *Trans* pQTLs are defined as >5Mb from the start/end positions of the primary gene relevant for each Olink protein. **i** Number of primary pQTLs defined as *cis*, *semi-cis*, *semi-trans* and *trans* (defined in k). **j** Number of secondary pQTLs defined as *cis*, *semi-cis*, *semi-trans* and *trans* (defined in k). **k** Definitions of *cis*, *semi-cis*, *semi-trans* and *trans* pQTLs. *Cis* pQTLs are defined as SNP-protein pairs which are within 100kb of the start/end of the gene encoding the associated protein. *Semi-cis* pQTLs are defined as SNP-protein pairs which are between 100-kb-5Mb of the start/end of the gene encoding the associated protein. *Semi-trans* pQTLs are defined as SNP-protein pairs which are >5Mb of the start/end of the encoding gene, but on the same chromosome. *Trans* pQTLs are defined as SNP-protein pairs which are on different chromosomes from the gene encoding the associated protein. ### Common, high impact pQTLs in Greenland Inuit We next explored the proportion of variance explained (PVE) for pQTLs detected within Greenland Inuit and performed similar comparisons with Olink data generated within UK Biobank individuals14. UK Biobank individuals tended to have a larger proportion of significant pQTLs with a lower PVE, and exhibited a lower number of high-impact pQTLs compared to Greenlandic individuals (Figure 2a-b, Supplementary Table 6). In Greenlandic individuals, inflammation-related proteins, including IL-27, IgGFcRII-b, MCP-2 and IL-16 and Gal-9 exhibited PVEs >30% (Figure 2a). PQTLs related to the same proteins in UK Biobank exhibited PVEs <20% and were generally less common in allele frequency when compared to Greenlanders (Figure 2bi-iii). Furthermore, two of these variants, rs142653327 associated with Gal-9 and rs1801274 associated with IgGFcRII-b, are missense variants. Overall, these results suggest that Greenlanders exhibit more common and high-impact variants when compared to UK Biobank individuals, and that some of these variants may carry a functional impact. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/05/2024.07.03.24309577/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/07/05/2024.07.03.24309577/F2) Figure 2. Proportion of variance explained by pQTLs in Greenland Inuit vs UK Biobank individuals14. **a** Proportion of variance explained (PVE) for Inuit (GRL; green) and the UK Biobank cohort 14 (UKB; blue) for 176 proteins common between both studies, ordered by highest PVEs in Inuit (top) and highest PVEs in UKB (bottom). Dashed lines indicate the equivalent PVE for each protein as reported in Inuit (top) and UKB (bottom), respectively. **b** PVE vs minor allele frequency (MAF) for genome-wide significant pQTLs detected in (i) Greenland Inuit using GEMMA (n=3,707); (ii) a random subset of UKB individuals using GEMMA (n=3,707); and (iii) using summary statistics from Sun et al. (2024) (n=54,219). pQTLs with PVE >30% for either Greenland Inuit, UKB or both in (i) and (ii) are highlighted in green, blue and turquoise, respectively. PVE >30% for Greenland Inuit, and PVE >15% for UKB individuals or meeting both criteria in (iii) are highlighted in green, blue and turquoise, respectively. ### Associations with known high impact cardiometabolic-related Greenlandic variants Previous studies have identified causal, high-impact variants related to population-specific effects for lipid metabolism, T2D and CVD within Greenland Inuit, including *CPT1A*33, *TBC1D4*34, *HNF1A*35, *LDLR*36, and *PCSK9*26. As such, we investigated the protein expression for these known variants to gain a better understanding of potential mechanisms leading to these trait differences (Figure 3). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/05/2024.07.03.24309577/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/07/05/2024.07.03.24309577/F3) Figure 3. Known high-impact cardiometabolic-related Greenlandic variants. **a** Significant associations between known high-impact cardiometabolic-related Greenlandic variants – *CPT1A* (rs80356779), *HNF1A* (rs2135845768), *TBC1D4* (rs61736969), *LDLR* (rs730882082) and *PCSK9* (rs4609471) – and inflammatory and cardiovascular-related plasma proteins (FDR<0.05). Bubble sizes describe significance levels after multiple testing; small bubbles indicate pQTLs with FDR<0.05 and large bubbles FDR<0.01. Red – higher abundance; blue – lower abundance. **b** Forest plots for known high-impact variants within the same five genes and the top 20 proteins with the lowest FDR. The centre estimate and error bars on the x-axis describe the effect size (ß) ± 95% CI. A known variant in *CPT1A* (rs80356779)33, previously found to modulate fatty acid composition, with large effect sizes on weight and height, was associated with lower plasma levels of 55 proteins at a significance level of FDR<0.05, including eight proteins significant at FDR<0.001. Stem cell factor, also termed kit ligand (SCF/KITLG) and follistatin (FS) represented the most significant proteins associated with this variant. The known high impact variant in *TBC1D4* (rs61736969)34, explaining up to 18% of variance in diabetes cases in Greenland, was associated with lower levels of nine plasma proteins after a 5% FDR correction (i.e. IL-6, IL-18, CCL-19, CCL-28, CD40L, ITGB1BP2, PD-L1, STAMBP and SIRT2). A variant in the maturity onset diabetes of the young (MODY) gene, *HNF1A* (rs2135845768)35, which explains up to 2.5% of variance in diabetes in Greenland, was associated with lower plasma levels of two proteins (angiotensin converting enzyme 2 [ACE2] and chymotrypsin-C [CTRC]) and higher levels of two proteins (oncostatin-M [OSM] and protransforming growth factor alpha [TGF-alpha]). A known variant in *LDLR* (rs730882082)36 which has a large impact on familial hypercholesterolemia (FH), affecting nearly 30% of Greenlanders, was associated with lower plasma levels of one protein (fibroblast growth factor 21 [FGF-21]) and higher levels of two proteins (glial cell line-derived neurotrophic factor [GDNF] and interleukin-4 [IL-4]). The *PCSK9* loss-of-function variant (p.Arg46Leu; rs11591147) originally identified in European GWAS has led to PCSK9-inhibitors as a treatment for dyslipidemia37. We examined whether any proteins were associated with the recently identified *PCSK9* (rs4609471)26 variant in Greenland, but found no significant differences after correction for multiple testing using FDR. ## Discussion Here, we present the first pQTL study in a diverse, isolated population with 177 circulating inflammation– and cardiovascular-related proteins, measured using the Olink Proximity Extension Assay in 3,707 participants from Greenland. We identified 251 significant associations (93 *cis*, 158 *trans*), involving 251 independent loci for 133 unique proteins. Our results provide validation of previous pQTL studies performed in European ancestry, with 181 (∼72.1%) reported previously in peer-reviewed journals and 70 pQTLs (3 *cis*, 6 *semi-trans*, 61 *trans*) that have not been identified previously. We identify 16 pQTLs which are purely recessive, 15 out of 16 are in *semi-trans/trans* (e.g. PRSS27, FABP2 and TF). Thus, the novel contributions made by our study highlight the importance of conducting analyses in diverse populations, including alternative modes of inheritance. A previous mass-spectrometry based proteomics study conducted in a population of 2,958 Han Chinese participants, wherein proteins which are positively associated with BMI in EUR populations, showed opposite direction of effects in EAS, highlighting the importance of cross-ancestry differences.15 An additional study found that up to 89% of pQTLs identified in EUR studies can be replicated in an Arab population of 2,935 individuals, when using the SOMAscan platform.17 In the present study, we found that 72% of the pQTLs identified in Greenland can be replicated in previous studies, the majority of which have been performed in EUR populations, regardless of the platform (Olink, SOMAscan, or mass spectrometry). So far, the pQTL studies performed in isolated populations are of EUR ancestry. An earlier study used a unique approach of whole-genome sequencing and three Olink panels (CVDI, CVDII, metabolism) across three isolated European cohorts (MANOLIS, Pomak and ORCADES), identifying 5 rare variant pQTLs.18 To examine whether high impact pQTLs were more common in Greenland, we performed an additional GWAS for a subset of UK Biobank individuals, matched by the sample size of our analysis for Greenland Inuit (n=3,707 in both populations) and using both additive and recessive models. We compared this to the full summary statistics reported in UK Biobank (n=54,219) for overlapping plasma levels (n=176 proteins). The three pQTLs with the highest PVE in Greenlanders (IL-27, IgGFcRII-b, MCP-2) explained 44.3%, 43.7%, and 43.2%, compared to the top three pQTLs in the sample-size matched UK Biobank (CD6, MCP-2, IL-18R1) GWAS, which explained 33.9%, 30.0%, and 26.9%. Certain genetic variants have undergone selection as an adaptation to the extreme conditions of the Arctic and the specialised diet, which is rich in protein and fatty acids, particularly polyunsaturated fatty acids (PUFAs). An example of this is the Arctic-specific variant in *CPT1A* (rs80356779), which is a known modulator of fatty acid composition (including decreased levels of circulating omega-6 PUFAs), which may affect the regulation of growth hormones, with large effect sizes on weight and height33. However, it was previously unknown how this variant impacts systemic inflammation, as protein expression is a direct reflection of both genetics and environment influences. We demonstrate that this variant has pleiotropic effects on plasma protein levels, with a general pattern of reduced inflammatory and cardiovascular protein markers, which may be a result of reduced circulating PUFAs, primarily omega-6. *CPT1A* carriers had significantly lower plasma levels of SCF, a protein that has been shown to play a central role in inflammation (cell survival, proliferation, and hematopoiesis) and is a suggested therapeutic target for inflammatory disease38. *CPT1A* carriers also had significantly lower plasma levels of FS, a hepatokine which is elevated in patients with T2D and is associated with adipose tissue insulin resistance39. The common loss-of-function variant in *TBC1D4*34 (rs61736969) is responsible for muscle insulin resistance and explains up to 18% of variance in diabetes in Greenland, and in the present study was associated with lower expression of nine plasma proteins, most of which are interleukins (IL-6, IL-18), chemokines (CCL-19, CCL-28), other immune mediators (CD40L, PD-L1), integrins (ITGB1BP2), metalloproteases (STAMBP), and transferases (SIRT2). IL-6 is often elevated with obesity, but it is also secreted as a myokine from muscles during exercise, exerting anti-inflammatory and anti-diabetogenic effects. Physical activity has been shown to reduce postprandial hyperglycemia in *TBC1D4* carriers40. Lower expression of ITGB1BP2 (integrin beta-1-binding protein 2) was also observed in *TBC1D4* carriers, a protein that plays a role in the maturation and organisation of muscle cells41, which may be driven by the impaired insulin resistance in the skeletal muscle. It remains unclear whether *TBC1D4* carriers are at an increased risk of cardiovascular disease42. In the present study, we see elevations in several cardiovascular-disease related biomarkers such as ITGB1BP241, STAMBP (has been linked to congenital malformations of the heart)43, and SIRT2 (functions in inhibiting pathological cardiac hypertrophy)44. Known Inuit-specific variant in the MODY gene *HNF1A*35 (rs2135845768), explains up to 2.5% of variance in diabetes in Greenland, associated with altered expression of four (ACE2, CTRC, OSM, TGF-alpha) proteins in the present study. Plasma abundance of ACE2 was lower in *HNF1A* carriers. HNF1A is known to induce ACE2 expression in the pancreatic islet cells. ACE2 is important in maintaining glycemia and beta-cell function and has been suggested as a therapeutic target to treat diabetes45. Another known arctic-specific variant in *LDLR*36 (rs730882082) responsible for FH in nearly 30% of Greenlanders, associated with altered expression of three plasma proteins (FGF-21, GDNF, and IL-4) in the current study. Specifically, FGF-21 levels were lower in carriers of the *LDLR* variant, a protein responsible for fatty acid oxidation and regulation of systemic glucose homeostasis and insulin sensitivity. FGF-21 analogs are being developed to beneficially modulate lipid and lipoprotein levels, and therefore could serve as treatment for patients with FH46, and clinical trials are currently ongoing for treatment of metabolic-dysfunction-associated steatotic liver disease. ### Limitations Our study has several limitations. Firstly, our pQTL analysis is restricted to the 177 proteins measured by two Olink panels. An untargeted approach would allow for identification of more and potentially novel pQTLs, yet targeted approaches allow for the study of biologically relevant protein biomarkers. Inclusion of orthogonal methods such as mass spectrometry-based proteomics, would resolve the possibility of epitope effects, yet this kind of validation is difficult to perform at scale. Tissue– and cell-type specific pQTL studies will be valuable in future studies to understand differences in genetic signals across tissues. We did not examine ratios between protein levels which may identify novel associations when multiple proteins are associated with the same variant47. Furthermore, the raw abundance of proteins does not take into consideration protein-protein interactions, mapping the interactome can lead to additional biological insights48. ### Strengths Identification of pQTLs aid in the understanding of mechanisms of disease and can identify promising targets for pharmacological interventions. Yet, genetic evidence for drug targets needs to be generalisable across diverse populations. The majority of all pQTL studies performed to date have been conducted in EUR populations, limiting the generalizability of findings across populations. Although an attempt was made to identify pQTLs in non-EUR ancestry in the UKB-PPP (953 with South Asian and 1,513 with African ancestry), the sample size is smaller than the present study49. Here, we explore pQTLs in an isolated Greenlandic population, identify associations related to alternative modes of inheritance, and highlight the need to study diverse populations through comparisons with the UK Biobank. ## Conclusions The pQTLs identified and analysed in this study provide a resource to unravel the genetic architecture of the targeted plasma proteome in the Greenlandic population. There was a larger proportional variance explained in Inuit when compared to European ancestries, which has relevant implications for disease outcomes. Variants with large population impact on lipid metabolism (*CPT1A*) and risk of metabolic and cardiovascular disease in Greenland (*TBC1D4*, *HNF1A*, and *LDLR*), show distinct patterns with inflammatory cytokines and cardiovascular disease biomarkers, pointing towards potential therapeutic targets. ## Contributors * ● Conceptualization: SES, RFB, MKA, NG, MEJ, IM, AA, TH * ● Data curation: SES, MKA, ACBT, NG, MEJ, IM, AA, TH * ● Formal analysis: SES, RFB, FFS, SH, LL * ● Funding acquisition: PB, CVLL, NG, MEJ, IM, AA, TH * ● Investigation: SES, RFB, FFS, PB, CVLL, NG, MEJ, IM, AA, TH * ● Methodology: SES, RFB, FFS, NG, MEJ, IM, AA, TH * ● Project administration: SES, RFB, MKA, PB, CVLL, NG, MEJ, IM, AA, TH * ● Resources: PB, CVLL, NG, MEJ, IM, AA, TH * ● Software: AA * ● Supervision: NG, IM, AA, TH * ● Visualisation: SES, RFB, FFS, IM, AA, TH * ● Writing – original draft: SES, RFB, IM, AA, TH * ● Writing – review & editing: MKA, FFS, SH, ACBT, LL, PB, CVLL, NG, MEJ ## Data sharing statement Summary data from the study may be available by reasonable request to the authors. ## Declaration of interests The population surveys of Greenland were funded by the Department of Health, Greenland, The Novo Nordisk Foundation (NNF17OC0028136 & NNF17SH0027192), The Independent Research Fund Denmark, and Karen Elise Jensen Foundation. The Novo Nordisk Foundation Center for Basic Metabolic Research is an independent research center at the University of Copenhagen partially funded by an unrestricted donation from the Novo Nordisk Foundation (NNF18CC0034900). SES received funding from The Novo Nordisk Foundation, Copenhagen Bioscience PhD Program (grant number: NNF18CC0033668). RFB and IM are supported by a Villum Young Investigator grant (VIL19114), awarded to IM. RFB, SH, LL and AA are funded by the Novo Nordisk Foundation (NNF20OC0061343). AA is supported by the Independent Research Fund Denmark (8021-00360B). None of the funding agencies had any role in study design or collection, analysis, or interpretation of data. NG is currently employed at Novo Nordisk. All other authors have no further conflicts of interest to declare. ## Supporting information Supplementary Tables [[supplements/309577_file03.xlsx]](pending:yes) Description of Supplementary Figures [[supplements/309577_file04.docx]](pending:yes) Supplementary Figure 1 [[supplements/309577_file05.jpg]](pending:yes) Supplementary Figure 2 [[supplements/309577_file06.jpg]](pending:yes) Supplementary Figure 3 [[supplements/309577_file07.jpg]](pending:yes) ## Data Availability Summary data from the study may be available by reasonable request to the authors. ## Appendix A. Supplementary Data ## Appendix B. Supplementary Figures ## Acknowledgements We would like to acknowledge all the participants and the staff of the Greenland Population Health Surveys (Befolkningsundersøgelsen i Grønland). We would also like to thank Annemette Forman and Tina Hvidtfeldt Lorentzen from Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen for their technical assistance. * Received July 3, 2024. * Revision received July 3, 2024. * Accepted July 5, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. Aebersold R, Mann M. Mass-spectrometric exploration of proteome structure and function. Nature 2016; 537: 347–55. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature19949&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27629641&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 2. Deutsch EW, Omenn GS, Sun Z, et al. Advances and Utility of the Human Plasma Proteome. J Proteome Res 2021; 20: 5241–63. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1021/acs.jproteome.1c00657&link_type=DOI) 3. Katz DH, Robbins JM, Deng S, et al. Proteomic profiling platforms head to head: Leveraging genetics and clinical traits to compare aptamer– and antibody-based methods. Sci Adv 2022; 8: eabm5164. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/sciadv.abm5164&link_type=DOI) 4. Meissner F, Geddes-McAlister J, Mann M, Bantscheff M. The emerging role of mass spectrometry-based proteomics in drug discovery. Nat Rev Drug Discov 2022; 21: 637–54. 5. Smith JG, Gerszten RE. Emerging Affinity-Based Proteomic Technologies for Large-Scale Plasma Profiling in Cardiovascular Disease. Circulation 2017; 135: 1651–64. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMzUvMTcvMTY1MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzA1LzIwMjQuMDcuMDMuMjQzMDk1NzcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 6. Suhre K, McCarthy MI, Schwenk JM. Genetics meets proteomics: perspectives for large population-based studies. Nat Rev Genet 2020; 22: 19–37. 7. Aguet F, Alasoo K, Li YI, et al. Molecular quantitative trait loci. Nature Reviews Methods Primers 2023; 3: 1–22. 8. Sun BB, Maranville JC, Peters JE, et al. Genomic atlas of the human plasma proteome. Nature 2018; 558: 73–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0175-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29875488&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 9. Pietzner M, Wheeler E, Carrasco-Zanini J, et al. Mapping the proteo-genomic convergence of human diseases. Science 2021; 374: eabj1541. 10. Folkersen L, Gustafsson S, Wang Q, et al. Genomic and drug target evaluation of 90 cardiovascular proteins in 30,931 individuals. Nature Metabolism 2020; 2: 1135–48. 11. Emilsson V, Ilkov M, Lamb JR, et al. Co-regulatory networks of human serum proteins link genetics to disease. Science 2018; 361: 769–73. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNjEvNjQwNC83NjkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNy8wNS8yMDI0LjA3LjAzLjI0MzA5NTc3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 12. Enroth S, Johansson A, Enroth SB, Gyllensten U. Strong effects of genetic and lifestyle factors on biomarker variation and use of personalized cutoffs. Nat Commun 2014; 5: 4684. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ncomms5684&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25147954&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 13. Suhre K, Arnold M, Bhagwat AM, et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nat Commun 2017; 8: 14357. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ncomms14357&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28240269&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 14. Sun BB, Chiou J, Traylor M, et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 2023; 622: 329–38. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37794186&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 15. Xu F, Yu EY-W, Cai X, et al. Genome-wide genotype-serum proteome mapping provides insights into the cross-ancestry differences in cardiometabolic disease susceptibility. Nat Commun 2023; 14: 1–12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-022-35677-5&link_type=DOI) 16. Said S, Pozarickij A, Lin K, et al. Ancestry diversity in the genetic determinants of the human plasma proteome and associated new drug targets. medRxiv. 2023;: 2023.11.13.23298365. 17. Thareja G, Belkadi A, Arnold M, et al. Differences and commonalities in the genetic architecture of protein quantitative trait loci in European and Arab populations. Hum Mol Genet 2023; 32: 907–16. 18. Gilly A, Klaric L, Park Y-C, et al. Gene-based whole genome sequencing meta-analysis of 250 circulating proteins in three isolated European populations. Mol Metab 2022; 61: 101509. 19. Png G, Barysenka A, Repetto L, et al. Mapping the serum proteome to neurological diseases using whole genome sequencing. Nat Commun 2021; 12: 1–12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-20241-w&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 20. Wojcik GL, Graff M, Nishimura KK, et al. Genetic analyses of diverse populations improves discovery for complex traits. Nature 2019; 570: 514–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-019-1310-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 21. Easteal S, Arkell RM, Balboa RF, et al. Equitable Expanded Carrier Screening Needs Indigenous Clinical and Population Genomic Data. Am J Hum Genet 2020; 107: 175–82. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.06.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 22. Jørgensen ME, Bjeregaard P, Borch-Johnsen K. Diabetes and impaired glucose tolerance among the inuit population of Greenland. Diabetes Care 2002; 25: 1766–71. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czoxMDoiMjUvMTAvMTc2NiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzA1LzIwMjQuMDcuMDMuMjQzMDk1NzcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 23. Bjerregaard P, Curtis T, Borch-Johnsen K, et al. Inuit health in Greenland: a population survey of life style and disease in Greenland and among Inuit living in Denmark. Int J Circumpolar Health 2003; 62 **Suppl 1**: 3–79. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3402/ijch.v62i0.18212&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14527126&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 24. Jørgensen ME, Borch-Johnsen K, Stolk R, Bjerregaard P. Fat distribution and glucose intolerance among Greenland Inuit. Diabetes Care 2013; 36: 2988–94. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czoxMDoiMzYvMTAvMjk4OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzA1LzIwMjQuMDcuMDMuMjQzMDk1NzcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 25. Bjerregaard P, Larsen CVL, Olesen I, et al. The Greenland population health survey 2018 – methods of a prospective study of risk factors for lifestyle related diseases and social determinants of health amongst Inuit. Int J Circumpolar Health 2022; published online June 16. DOI:10.1080/22423982.2022.2090067. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/22423982.2022.2090067&link_type=DOI) 26. Senftleber NK, Andersen MK, Jørsboe E, et al. GWAS of lipids in Greenlanders finds association signals shared with Europeans and reveals an independent PCSK9 association signal. Eur J Hum Genet 2023; published online Oct 30. DOI:10.1038/s41431-023-01485-8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41431-023-01485-8&link_type=DOI) 27. Delaneau O, Zagury J-F, Marchini J. Improved whole-chromosome phasing for disease and population genetic studies. Nat Methods 2013; 10: 5–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nchembio.1414&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23269371&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000312810100003&link_type=ISI) 28. A global reference for human genetic variation. Nature 2015; 526: 68–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature15393&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26432245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 29. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 2009; 5: e1000529. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1000529&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19543373&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 30. Zhou X, Stephens M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet 2012; 44: 821–4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2310&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22706312&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 31. Meisner J, Albrechtsen A. Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data. Genetics 2018; 210: 719–31. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXRpY3MiO3M6NToicmVzaWQiO3M6OToiMjEwLzIvNzE5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMDUvMjAyNC4wNy4wMy4yNDMwOTU3Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 32. Niu L, Stinson SE, Holm LA, et al. Plasma proteome variation and its genetic determinants in children and adolescents. bioRxiv. 2023; published online March 31. DOI:10.1101/2023.03.31.23287853. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1101/2023.03.31.23287853&link_type=DOI) 33. Fumagalli M, Moltke I, Grarup N, et al. Greenlandic Inuit show genetic signatures of diet and climate adaptation. Science 2015; 349: 1343–7. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNDkvNjI1NC8xMzQzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMDUvMjAyNC4wNy4wMy4yNDMwOTU3Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 34. Moltke I, Grarup N, Jørgensen ME, et al. A common Greenlandic TBC1D4 variant confers muscle insulin resistance and type 2 diabetes. Nature 2014; 512: 190–3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature13425&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25043022&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 35. Thuesen ACB, Stæger FF, Kaci A, et al. A novel splice-affecting HNF1A variant with large population impact on diabetes in Greenland. Lancet Reg Health Eur 2023; 24: 100529. 36. Jørsboe E, Andersen MK, Skotte L, et al. An LDLR missense variant poses high risk of familial hypercholesterolemia in 30% of Greenlanders and offers potential of early cardiovascular disease intervention. HGG Adv 2022; 3: 100118. 37. Shapiro MD, Tavori H, Fazio S. PCSK9: From Basic Science Discoveries to Clinical Trials. Circ Res 2018; 122: 1420–38. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImNpcmNyZXNhaGEiO3M6NToicmVzaWQiO3M6MTE6IjEyMi8xMC8xNDIwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMDUvMjAyNC4wNy4wMy4yNDMwOTU3Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 38. Stem cell factor and its receptor c-Kit as targets for inflammatory diseases. Eur J Pharmacol 2006; 533: 327–40. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ejphar.2005.12.067&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16483568&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000236265400031&link_type=ISI) 39. Wu C, Borné Y, Gao R, et al. Elevated circulating follistatin associates with an increased risk of type 2 diabetes. Nat Commun 2021; 12: 1–10. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-20241-w&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 40. Schnurr TM, Jørsboe E, Chadt A, et al. Physical activity attenuates postprandial hyperglycaemia in homozygous TBC1D4 loss-of-function mutation carriers. Diabetologia 2021; 64: 1795–804. 41. Brancaccio M, Fratta L, Notte A, et al. Melusin, a muscle-specific integrin beta1-interacting protein, is required to prevent cardiac failure in response to chronic pressure overload. Nat Med 2003; 9: 68–75. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nm805&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12496958&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000180209900024&link_type=ISI) 42. Jørgensen ME, Tvermosegaard M, Rønn PF, et al. Do Greenlandic Carriers of the TBC1D4 p.Arg684Ter Variant Have Increased Risk of Cardiovascular Disease? Diabetes 2018; 67: 439 – P. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2337/db18-439-P&link_type=DOI) 43. Spielmann N, Miller G, Oprea TI, et al. Extensive identification of genes involved in congenital and structural heart disorders and cardiomyopathy. Nature Cardiovascular Research 2022; 1: 157–73. 44. Wu B, You S, Qian H, et al. The role of SIRT2 in vascular-related and heart-related diseases: A review. J Cell Mol Med 2021; 25: 6470–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/jcmm.16618&link_type=DOI) 45. Pedersen KB, Chhabra KH, Nguyen VK, Xia H, Lazartigues E. The transcription factor HNF1α induces expression of angiotensin-converting enzyme 2 (ACE2) in pancreatic islets from evolutionarily conserved promoter motifs. Biochim Biophys Acta 2013; 1829: 1225–35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.bbagrm.2013.09.007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) 46. Besseling J, Ong KL, Huijgen R, et al. The impact of LDLR function on fibroblast growth factor 21 levels. Atherosclerosis 2015; 241: 322–5. 47. Suhre K. Genetic associations with ratios between protein levels detect new pQTLs and reveal protein-protein interactions. bioRxiv. 2023; published online July 20. DOI:10.1101/2023.07.19.549734. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMy4wNy4xOS41NDk3MzR2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzA1LzIwMjQuMDcuMDMuMjQzMDk1NzcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 48. Joshi A, Rienks M, Theofilatos K, Mayr M. Systems biology in cardiovascular disease: a multiomics approach. Nat Rev Cardiol 2021; 18: 313–30. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41569-020-00477-1&link_type=DOI) 49. Eldjarn GH, Ferkingstad E, Lund SH, et al. Large-scale plasma proteomics comparisons through genetics and disease associations. Nature 2023; 622: 348–58. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37794188&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F05%2F2024.07.03.24309577.atom) [1]: /embed/graphic-1.gif