Proteomic associations with forced expiratory volume – a Mendelian randomisation study ======================================================================================== * GT Axelsson * T Jonmundsson * YJ Woo * EA Frick * T Aspelund * JJ Loureiro * AP Orth * LL Jennings * G Gudmundsson * V Emilsson * V Gudmundsdottir * V Gudnason ## Abstract A decline in forced expiratory volume (FEV1) is a hallmark of obstructive respiratory diseases, an important cause of morbidity among the elderly. While some data exist on biomarkers that are related to FEV1, we sought to do a systematic analysis of causal relations of biomarkers with FEV1. Data from the general population-based AGES-Reykjavik study were used. Proteomic measurements were done using 4,782 DNA aptamers (SOMAmers). Data from 1,648 participants with spirometric data were used to assess the association of SOMAmer measurements with FEV1 using linear regression. Bi-directional Mendelian randomisation (MR) analyses were done to assess causal relations of observationally associated SOMAmers with FEV1, using genotype and SOMAmer data from 5,368 AGES-Reykjavik participants and genetic associations with FEV1 from a publicly available GWAS (n = 400,102). In observational analyses, 473 SOMAmers were associated with FEV1 after multiple testing adjustment. The most significant were R-Spondin 4, Alkaline Phosphatase, Placental Like 2 and Retinoic Acid Receptor Responder 2. Of the 235 SOMAmers with genetic data, eight were associated with FEV1 in MR analyses. Three were directionally consistent with the observational estimate, Thrombospondin 2 (THBS2), Endoplasmic Reticulum Oxidoreductase 1 Beta and Apolipoprotein M. THBS2 was further supported by a colocalization analysis. Analyses in the reverse direction, testing whether changes in SOMAmer levels were caused by changes in FEV1, were performed but no significant associations were found after multiple testing adjustments. In summary, this large scale proteogenomic analyses of FEV1 reveals protein markers of FEV1, as well as several proteins with potential causality to lung function. Keywords * Lung function tests * Forced expiratory volume * Proteomics * Mendelian randomisation ## Introduction Chronic respiratory diseases such as chronic obstructive pulmonary disease (COPD) are a leading global cause of mortality and morbidity, with their relative importance increasing in the last decades (1). Diagnosis of COPD is based on obstruction on pulmonary function testing, by a low forced expiratory volume in one second (FEV1) relative to the forced vital capacity (FVC), and a progressive decline in pulmonary function is a hallmark of the disease (2). While it is undisputed that exposure to external harmful stimuli such as cigarette smoke and biomass fumes are substantial risk factors for pulmonary function decline, intrinsic factors such as genetics and gene-environment interactions play a significant part as well (2–4). Genome wide association studies (GWAS) have found several genetic polymorphisms that are associated with COPD and lung function decline, including polymorphisms in or near genes encoding matrix metalloproteinase 12, nicotinic acetylcholine receptor, hedgehog interacting protein, glutathione S-transferase, C-terminal domain–containing protein (5–9) and the antiprotease alpha-1-antitrypsin (10). In addition to genetic polymorphisms, biomarkers that predict COPD or lung function have been discovered. Among them are inflammatory markers, soluble receptor for advanced glycoprotein end products (AGER), club cell secretory protein 16 (SCGB1A1) and surfactant protein D (SFTPD) (11–14). For several of these biomarkers, estimations of potential causality have been made. AGER, SCGB1A1 and SFTPD have been suggested to be causally associated with COPD or with lung function, while analyses have pointed against such an association for the inflammatory markers CRP and IL-6 (15–19). Still, while substantial epidemiologic data exist regarding single genetic markers and biomarkers that predict COPD or lung function, a systematic large-scale analysis of observational and causal associations of protein markers with lung function has not been undertaken to our knowledge. Proteomics have emerged as a way of exploring biologic predictors of disease, especially with the advent of methods that allow for measurement and evaluation of thousands of proteins in biological samples from participants of large cohort studies (20). In addition to predicting disease and disease-related outcomes, integration of genetic data allows one to make assumptions regarding causality with proteomic markers. Mendelian randomization (MR) is such a method and utilizes genetic polymorphisms as instrumental variables to assess the relationship of an exposure with an outcome. As chromosomal alleles are randomly allocated during gamete formation, this methodology allows one to avoid the effect of confounders and to infer causality from epidemiologic data (21). The aim of the study was to systematically explore the associations of a multitude of protein markers with lung function and then to assess the potential causal relationships of these protein markers with lung function by use of bi-directional Mendelian randomization. ## Methods ### Study phenotyping The Age/Gene Environment Susceptibility (AGES)-Reykjavik study is a population-based cohort study of 5,764 elderly Icelanders that was carried out between 2002 and 2006. The participants, aged from 66 to 96 years (mean 76 years), were all prior participants of the Reykjavik Study done decades earlier. As part of AGES-Reykjavik, the participants underwent extensive phenotyping by questionnaires, physiological measurements, imaging studies and laboratory measurements, during a single clinic visit. The study was approved by the Icelandic National Bioethics Committee (VSN-00-063) and the Institutional Review Board of the Intramural Program of the National Institute for Aging, with informed consent obtained from all participants. Further details of the study design are previously published (22). A subset of the study participants underwent lung function testing in a standardised manner. The device used was a Vitalograph Gold Standard Plus (Vitalograph Ltd., UK). Each participant completed three attempts. Participants with spirometry of acceptable quality were included (n = 1,660). Spirometry measures from participants that completed at least two attempts with a no more than 300 ml difference between the attempts and exhalation for at least 6 seconds were deemed acceptable (23). Smoking history was ascertained from questionnaires while anthropometric measurements were done during the clinic visit. Protein measurements were done in serum samples from participants using a high throughput proteomics technology, the SOMAscan (SomaLogic, Boulder, CO) in which DNA aptamers (Slow-Off Rate Modified Aptamers (SOMAmers)) bind to target protein epitopes and are then quantified with the help of fluorescence after wash-out of unbound aptamers and proteins. Measurements in AGES-Reykjavik were done with a 5,034 SOMAmer platform in serum from 5,457 AGES-Reykjavik participants, targeting 4,135 individual human proteins. For the analyses, 4,782 SOMAmers targeting human proteins were used. Measurement data were transformed using Box-Cox transformation and extreme outliers were excluded, as previously described (24). ### Statistical analyses A flow chart of study design is shown in Figure 1. Descriptive statistics were compiled for participants with acceptable pulmonary function tests, demographic covariate data and protein measurements available (n = 1,648). Using data from these participants, the association of all measured human SOMAmers (n = 4,782) with FEV1 was assessed using linear regression modelling. These analyses were adjusted for variables that are commonly used to predict FEV1 in clinical practice, age, sex, height, age squared, and height squared (25). Adjustment for multiple testing was done using the Benjamini-Hochberg False Discovery Rate (FDR). To understand how tobacco smoking affected these associations, the analyses were repeated with participants stratified by smoking history (ever-smokers versus never-smokers). Genes encoding proteins significantly associated with the outcome after adjustment for multiple testing were subjected to over-representation analysis of Gene Ontology terms (26). In secondary analyses we explored the protein associations with additional outcomes, including FVC, FEV1/FVC ratio and COPD. Additional covariates included smoking status, pack years of smoking, weight and eGFR. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F1) Figure 1. A flow chart of the study design For SOMAmers that were associated with FEV1 after adjustment for multiple testing, a bi-directional two-sample MR analysis was performed for the association of SOMAmers with FEV1. MR analyses were done for SOMAmers that were observationally associated with FEV1 and had genetic instruments available. Genetic instruments for SOMAmers were defined as follows. The associations of single-nucleotide polymorphisms (SNPs) with SOMAmers were calculated using data from 5,368 participants of AGES-Reykjavik, as previously published and described in detail (24). The possible instrumental variables for each SOMAmer were defined as SNPs located within a cis-window near the gene coding for the protein measured by the SOMAmer, defined as within 500 kb up- and downstream of the gene. SNPs with a window-wide significant association (p < 0.05/number of SNPs in cis-window) with a given SOMAmer were considered as potential instruments. For each gene, SNPs were filtered based on linkage disequilibrium (LD; r2 < 0.2) or distance (>1 mb) using Plink v1.9. Instruments were considered valid if F > 10 for the association of instruments with SOMAmers. The associations of the instrumental variables with FEV1 were obtained from a GWAS of lung function in which data from two cohorts, UKBiobank and SpiroMeta Consortium, were meta-analysed. The total number of participants in that analysis was 400,102 (27). A MR analysis in the reverse direction was performed using all observationally associated SOMAmers as outcomes. Genetic instruments for FEV1 were defined based on the same GWAS on lung function as described above (27). For these analyses, SNPs associated with FEV1 (p < 5×10−8) were used as instruments after clumping within a cis-window 5 mb up- and downstream of the gene, by an LD threshold of 0.01. The associations of the instrumental variables with SOMAmer levels were obtained from AGES-Reykjavik data (24). For the MR analyses, SNPs were harmonised using the TwoSampleMR R package. The MR estimates for FEV1 were obtained using the generalized weighted least squares (GWLS) method, accounting for correlation between instruments (28), except for instances where only one instrument was available, in which Wald ratios were calculated. For each MR analysis with more than two instruments, sensitivity analyses using the weighted median and Egger estimators were done to assess the validity of instruments and limit the effect of pleiotropic associations, respectively. Results were considered to pass these sensitivity analyses when the following conditions were met. For the weighted median estimator, the weighted median estimate had to be significant and directionally consistent with the GWLS estimate and for Egger, the Egger estimate had to be directionally consistent with the GWLS estimate and the intercept not significant. In analyses in the reverse direction, estimates were obtained using the inverse variance weighted method. Adjustment for multiple testing was done using the Benjamini-Hochberg False Discovery Rate (FDR). A colocalization analysis was performed to provide additional causal support for analytes associated with FEV1 in MR analyses. In that aim, AGES-Reykjavik serum pQTLs (24), summary statistics for FEV1 (29) and plasma pQTLs from deCODE (30) were harmonized to account for strand orientation and differences in genome builds. All studies were lifted over to build GRCh38 for colocalization when needed. For the QTLs, only putative cis-regulatory variants were examined defined by 500kb away from the gene body. All regions with variants that had associations of P < 1×10−5 were fine-mapped using Sum of Single Effects (SuSiE) (31) based on 1000 genome population reference (32). 95% credible-sets were filtered and colocalization between GWAS and QTLs was performed using fastENLOC using the posterior inclusion probabilities estimated in SuSiE (31, 33). Regional colocalization probability (RCP) was calculated by summing the colocalization probability within each 95% credible-set and filtered colocalization results at RCP > 80% for further examination. RCP represents the probability that a given genomic region contains a single colocalized variant (34). Visualization was done using Julia 1.7 and libraries within (35). ## Results ### The protein profile of pulmonary function Descriptive statistics for participants that had pulmonary function testing data are shown in Table 1. A majority of participants were female (56%) and participants were on average 76 years old. Most participants had a history of smoking (59%). View this table: [Table 1.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/T1) Table 1. Overview of study participants Of the 4,782 SOMAmers tested, 473 were observationally associated with FEV1 after adjustment for multiple testing (Table 2, Table S1, Figure 2). The most significantly associated SOMAmers measured were R-Spondin 4 (RSPO4, β = −0.099, p = 1.13×10−13), Alkaline Phosphatase, Placental Like 2 (ALPPL2, β = −0.084, p = 4.76×10−11), Retinoic Acid Receptor Responder 2 (RARRES2, β = −0.093, p = 7.03×10−11), Sushi, Von Willebrand Factor Type A, EGF And Pentraxin Domain Containing 1 (SVEP1, β = −0.085, p = 1.67×10−10) and Polymeric Immunoglobulin Receptor (PIGR, β = −0.080, 1.80×10−10). Results for proteins that have been previously suggested as biomarkers of FEV1 (11) are shown in Table S2. Prior associations of SFTPD (beta = −0.05, p = 0.0002), CRP (beta = −0.06, p = 1.2×10−7), fibrinogen (beta = −0.04, p = 0.0008 for the stronger associated SOMAmer), IL6 (beta = −0.05, p = 9.6×10−5 for the stronger associated SOMAmer) and eotaxin (beta = −0.05, p = 0.0002) were reproduced in the AGES-Reykjavik data while associations for other suggested biomarkers of FEV1 were not. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F2) Figure 2. A volcano plot showing the observational associations of all 4782 SOMAmers with FEV1 View this table: [Table 2.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/T2) Table 2. Observational associations of highly-associated proteins with FEV1 The proteins most significantly associated with FEV1 overall also had strong associations with FEV1 among ever-smokers, such as RSPO4 (β = −0.106, p = 4.83×10−9), RARRES2 (β = −0.11, p = 1.19×10−8) and ALPPL2 (β = −0.092, p = 5.73×10−8). Trefoil Factor 2 (TFF2, β = −0.095, p = 3.84×10−8), Complement 9 (C9, β = −0.095, p = 5.65×10−8) and TIMP Metallopeptidase Inhibitor 1 (TIMP1, β = −0.122, p = 1.18×10−7) also had notably strong associations with the outcome. However, associations of these proteins were much weaker among never-smokers, with the strongest associations observed for two SOMAmers measuring SVEP1 (β = −0.081, p = 4.52×10−6 for the stronger associated SOMAmer) (Table S3) in this subgroup. The proteins associated with FEV1 were most strongly enriched for Gene Ontology (GO) terms related to humoral immune response, regulation of neurogenesis, compliment activation and extracellular matrix organization (Figure S1, Table S4). Secondary analyses showed a large overlap between the protein profile of FEV1 compared to that of FVC, FEV1/FVC ratio and COPD, with some additional proteins detected (204 significant SOMAmers in primary analysis were among 497 significant SOMAmers in secondary analyses; Tables S8-S15). ### Mendelian randomization analysis Of 473 SOMAmers associated with FEV1, 235 (50%) had genetic instruments available and were included in MR analyses of the association of SOMAmers with FEV1. The instruments are shown in Table S5 for significant SOMAmers. Of the 235 SOMAmers tested, eight were significantly associated with FEV1 in MR analyses (Table 3), suggesting they may have a causal effect on lung function. Of those, three, Thrombospondin 2 (THBS2, β = −0.037, p = 9.53×10−5), Endoplasmic Reticulum Oxidoreductase 1 Beta (ERO1B, β = −0.025, p = 8.05×10−4) and Apolipoprotein M (APOM, β = 0.053, p = 9.72×10−4) were directionally consistent with the observational analyses. The other five SOMAmers with significant causal estimates, R-Spondin-2 (RSPO2), TIMP Metallopeptidase Inhibitor 4 (TIMP4), CD14, Heparin Binding Growth Factor (HDGF) and Parkinsonism Associated Deglycase (PARK7) were directionally inconsistent. Table 3 and Figure 4 show the 29 SOMAmers that were nominally associated with FEV1 (unadjusted p < 0.05) using MR and passed weighted median sensitivity testing. Among nominally associated SOMAmers, the directional consistency between causal and observational estimates was low, or 38%, suggesting that the potentially causal effects of the protein are generally not reflected in the observational estimates. Of the five previously suggested biomarkers of FEV1 listed in Table S2 that were observationally associated with FEV1, only three proteins had genetic instruments available, the acute phase reactants CRP and fibrinogen as well as eotaxin. None of these proteins were significantly associated with FEV1 in MR analyses (Table S6). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F3) Figure 3. A forest plot showing the observational and mendelian randomisation estimates for FEV1 for causally associated SOMAmers ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F4) Figure 4. Colocalization plot for *THBS2*. FEV1 GWAS (29), AGES pQTL (24), and DECODE pQTL (30) data are shown from top to bottom. Grey circles are individual SNPs from each study. Both purple diamond and red vertical line represent the lead variant (rs3253, 3’ UTR variant). X-axis is genomic position within chromosome 6 and y-axis is-log10 transformed P-values. Purple horizontal lie delineates GWAS threshold at 5×10−8 and yellow vertical lines represent gene boundary for *THBS2*. Visualization is restricted to 150,000 bp upstream and downstream of THBS2. Linkage disequilibrium (r2) within the region is plotted at the bottom in green. View this table: [Table 3.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/T3) Table 3. Results of Mendelian randomisation analysis of the association of proteins with FEV1 The 29 proteins identified from the MR analysis (Table 3) were further examined for colocalization evidence. Among the 8 proteins with significant MR association, a 3’ UTR variant (rs3253) from the FEV1 GWAS colocalized with THBS2 protein expression in AGES-Reykjavik (P = 1.4×10−19, β = −0.18, RCP=0.9976). The same variant was associated with THBS2 protein expression in the deCODE cohort (P = 2.1×10−186, β =-0.26) (Table 4, Figure 4). Among 22 proteins with nominal association with MR analysis, an upstream variant (rs4968200) from the FEV1 GWAS colocalized with TNFSF12 protein levels in AGES-Reykjavik (P = 5.6×10−133, β = 0.63, RCP = 0.9995). View this table: [Table 4.](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/T4) Table 4. Statistics for the variants with strong colocalization (RCP > 0.9) Finally, all 473 SOMAmers observationally associated with FEV1 were included in a MR analysis in the reverse direction, i.e., to evaluate if the changes in SOMAmer levels are downstream of changes in lung function. FEV1 was not causally associated with levels of any SOMAmers after adjustment for multiple testing. Data for the 18 SOMAmers that FEV1 was nominally associated with (p < 0.05) in the reverse-MR analysis are shown in Table S7. ## Discussion We present findings from a proteomic analysis of pulmonary function with more candidate protein analytes than previously published to our knowledge (11), highlighting several proteins as strong markers of FEV1. Stratification by smoking shows that most associations are driven by ever-smokers. Mendelian randomisation was systematically applied to the candidate markers, revealing proteins whose levels may have a causal effect on lung function. Reverse causation analyses failed to demonstrate that protein level changes associated with lung function occur downstream of the phenotype change, although this could partly be due to insufficient power in the FEV1 GWAS. Among the proteins identified through MR analysis, probabilistic colocalization pointed towards *THBS2* and *TNFSF12* as putative upstream targets for COPD. Eight proteins were suggested to be causally implicated in lung function based on the MR analysis. However, only three (THBS2, ERO1B and APOM) had consistent direction of effect for the observational and causal estimates. Such discrepancy has been observed when comparing causal and observational estimates (36, 37) for serum proteins. Based on probabilistic quantification, a 3’ UTR variant within *THBS2* was identified as a putative causal variant for FEV1 changes and this lead variant colocalized with THBS2 protein expression in AGES-Reykjavik (Table 4, Figure 4). The effect of this variant was replicated in an independent cohort for THBS2 protein levels (Table 4, Figure 4). Because directions of effects were consistent across the datasets and THBS2 had non-revertible causal association to FEV1, supported by colocalization, THBS2 may be a candidate target for COPD. THBS2 is an extracellular matrix protein that has been implicated in various cardiovascular disorders and is also a candidate biomarker for non-small cell lung cancer (38, 39). *THBS2* is involved in tissue repair and interacts with many different ligands in the extracellular matrix, among them matrix metalloproteases and elastase (40). Although mechanism of *THBS2* needs further experimental validation, our work suggests that protein levels of *THBS2* may be a target for extracellular matrix and regenerative properties associated with COPD. Meanwhile, ERO1B is a disulfide oxidase in the endoplasmic reticulum that is shown to predict survival in pancreatic and pulmonary cancers (41–43) and APOM, an apolipoprotein that is mainly a component of high density lipoproteins and has been associated with COPD severity (44). Genetic variants flanking this gene have been associated with obstructive spirometry measurements (45). However, it must be kept in mind that the association of APOM is based on a single SNP, rs2736163, which is intronic to *PRRC2A*. Despite not reaching the study threshold for statistical significance, some proteins that were nominally associated in MR analyses are of interest. First, alpha-1-antitrypsin (SERPINA1) is the best-known protein known to cause COPD as severe deficiency of alpha-1-antitrypsin results in obstructive lung disease (46). However, in our data, serum levels of alpha-1-antitrypsin are inversely associated with FEV1 in both observational and MR analyses, contrary to what would possibly be expected, although this directionality is known from previous observational analyses of FEV1 and explained by alpha-1-antitrypsin’s role as an acute phase reactant (47). Also, polymorphisms that cause mild or intermediate alpha-1-antitrypsin deficiency are not consistently associated with decreased lung function, suggesting that levels of the protein may only affect lung function below a threshold level (48). Second, TNFSF12 was a protein with a colocalizing pQTL variant with the FEV1 GWAS (Table 4, Figure 4). TNFSF12 is a member of the Tumor Necrosis Factor (TNF) superfamily, of which one key cytokine, TNF-alpha, is a well-known protein target disrupted in COPD patients (49). Despite a concrete association between TNF-alpha levels and COPD in both human and murine models (49–51), TNF-alpha monoclonal antibody was a failure as a COPD therapy (52). This is likely due to COPD being a highly complex inflammatory disease in which single cytokine blockage may not be sufficient for treating the disease. These additional TNF family proteins with MR based causal evidence and probabilistic colocalization may be novel targets for COPD patients impacted by TNF-alpha associated pathobiology. Notable other nominally significant or directionally inconsistent proteins in the findings are matrix metalloproteinase 8 (MMP-8), one of the proteases observationally associated with lung function and implicated in the pathogenesis of COPD (53, 54), TIMP4, an inhibitor of matrix metalloproteinases that has been shown to be upregulated in COPD patients (55) and CD14, levels of which have been shown to be elevated in lungs of smokers (56). While only 8 proteins were significantly associated with FEV1 in MR analyses, FEV1 was not associated with any proteins in analyses in the reverse direction. The data were therefore unable to support prior causal analyses involving inflammatory markers that have suggested reverse causality of FEV1 with inflammatory markers (18) (Table S6). The study reveals novel markers with strong observational relations to lung function such as RSPO4, a signalling molecule that is part of the Wnt signalling pathway (57), the tumor marker ALPPL2 (58), the adipokine chemerin (RARRES2), and SVEP1, a protein that is thought to play a role in inflammation in atherosclerosis (59). In addition, the findings validate the observational associations of some of the previously suggested protein markers of FEV1, such as SFTPD, fibrinogen, eotaxin and CRP (Table S2) (13), while the associations of other previously suggested biomarkers of FEV1 such as AGER were not corroborated in this study (13, 60). Finally, the findings show that proteins that take part in immune responses and extracellular matrix modulation are over-represented among the proteins related to FEV1. This work is subject to a number of limitations. First, the study is based on SOMAmer technology, a relatively novel aptamer-based technology for protein measurements. While many of these SOMAmers have been validated with encouraging results, it has been pointed out that a minority of SOMAmers could be subject to cross-reactivity with related or homologous proteins (20). Second, both the AGES-Reykjavik cohort and the UKBiobank and SpiroMeta Consortium are of European ancestry (27). Therefore, the observed associations could not be generalizable to other populations. Additionally, the AGES-Reykjavik cohort is older than the UKBiobank and SpiroMeta Consortium which could distort comparisons between observational and genetic findings. Third, the measure of lung function used in this paper, FEV1, is disproportionally impaired in obstructive lung disease and is a key diagnostic parameter for such diseases. While COPD is the archetypal obstructive lung disease, especially among the elderly, the presence of other respiratory diseases such as asthma could influence the findings, especially in younger cohorts. Fourth, subtle differences in genetic structure between AGES, UKBiobank, and SpiroMeta cohorts is present and may be contributing to the lack of colocalization for the other 6 MR identified proteins. For instance, very few MR instruments used overlapped with 95% credible-sets identified (Supplementary Table 16). In comparisons to UKBioBank, missing variants in AGES-Reykjavik cohort may contribute to colocalization false negatives. Lastly, many causal estimates are directionally inconsistent with observational estimates. While this phenomenon is known from previous proteogenomic studies, its reasons are unclear. In conclusion, this proteogenomic analysis reveals several proteins that are potentially causally related to lung function, most notably THBS2, ERO1B and APOB. ## Supporting information Supplement Legends and Tables S2, S3, S6, S7 [[supplements/292035_file08.docx]](pending:yes) Supplemental Tables S1, S4, S5 [[supplements/292035_file09.xlsx]](pending:yes) Supplemental Table S8 to S15 [[supplements/292035_file10.xlsx]](pending:yes) Supplemental Table S16 [[supplements/292035_file11.xlsx]](pending:yes) ## Data Availability All data produced in the present study are available upon reasonable request to the authors ![Figure5](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F5.medium.gif) [Figure5](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F5) ![Figure6](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/01/2023.06.30.23292035/F6.medium.gif) [Figure6](http://medrxiv.org/content/early/2023/07/01/2023.06.30.23292035/F6) ## Footnotes * Funding. The AGES-Reykjavik study was funded by Icelandic Heart Association, National Institute on Aging contract N01-AG-12100 and HHSN271201200022C, and Althingi (the Icelandic Parliament). GTA was supported by the Eimskip University Fund from the University of Iceland. GG was supported by the University of Iceland and by Landspitali University Hospital. TJ and VaG were supported by the University of Iceland, VaG by its postdoctoral fund. All protein measurements were funded by Novartis. * Competing interests: The study was supported by the Novartis Institute for Biomedical Research, and protein measurements for the AGES-Reykjavik cohort were performed at SomaLogic. JJL, YJW. and LLJ are employees and stockholders of Novartis. GTA and GG report travel support from Boehringer-Ingelheim for work not related to the study. No other potential conflicts of interest relevant to this article were reported. * Received June 30, 2023. * Revision received June 30, 2023. * Accepted July 1, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. 2020;396(10258):1204-22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(20)30925-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 2. 2.Vestbo J, Hurd SS, Agustí AG, Jones PW, Vogelmeier C, Anzueto A, et al. Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Pulmonary Disease. Am J Respir Crit Care Med. 2013;187(4):347–65. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/rccm.201204-0596PP&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22878278&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000315075800006&link_type=ISI) 3. 3.Rennard SI, Vestbo J. COPD: the dangerous underestimate of 15%. The Lancet. 2006;367(9518):1216-9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(06)68516-4&link_type=DOI) 4. 4.Gordon SB, Bruce NG, Grigg J, Hibberd PL, Kurmi OP, Lam KB, et al. Respiratory risks from household air pollution in low and middle income countries. The Lancet Respiratory medicine. 2014;2(10):823–60. 5. 5.Hunninghake GM, Cho MH, Tesfaigzi Y, Soto-Quiros ME, Avila L, Lasky-Su J, et al. MMP12, Lung Function, and COPD in High-Risk Populations. N Engl J Med. 2009;361(27):2599–608. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa0904006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20018959&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000273181300007&link_type=ISI) 6. 6.Cho MH, Boutaoui N, Klanderman BJ, Sylvia JS, Ziniti JP, Hersh CP, et al. Variants in FAM13A are associated with chronic obstructive pulmonary disease. Nat Genet. 2010;42(3):200–2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.535&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20173748&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000274912400006&link_type=ISI) 7. 7.Pillai SG, Ge D, Zhu G, Kong X, Shianna KV, Need AC, et al. A Genome-Wide Association Study in Chronic Obstructive Pulmonary Disease (COPD): Identification of Two Major Susceptibility Loci. PLOS Genetics. 2009;5(3):e1000421. 8. 8. Soler Artigas M, Wain LV, Repapi E, Obeidat M, Sayers I, Burton PR, et al. Effect of five genetic variants associated with lung function on the risk of chronic obstructive lung disease, and their joint effects on lung function. Am J Respir Crit Care Med. 2011;184(7):786–95. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/rccm.201102-0192OC&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21965014&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000295407300011&link_type=ISI) 9. 9.Cho MH, McDonald M-LN, Zhou X, Mattheisen M, Castaldi PJ, Hersh CP, et al. Risk loci for chronic obstructive pulmonary disease: a genome-wide association study and meta-analysis. The Lancet Respiratory Medicine. 2014;2(3):214–25. 10. 10.Wood AM, Stockley RA. Alpha One Antitrypsin Deficiency: From Gene to Treatment. Respiration. 2007;74(5):481–92. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000105536&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17671403&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 11. 11.Serban KA, Pratte KA, Bowler RP. Protein Biomarkers for COPD Outcomes. Chest. 2021;159(6):2244–53. 12. 12.Faner R, Tal-Singer R, Riley JH, Celli B, Vestbo J, MacNee W, et al. Lessons from ECLIPSE: a review of COPD biomarkers. Thorax. 2014;69(7):666. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToidGhvcmF4am5sIjtzOjU6InJlc2lkIjtzOjg6IjY5LzcvNjY2IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMDEvMjAyMy4wNi4zMC4yMzI5MjAzNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 13. 13.Zemans RL, Jacobson S, Keene J, Kechris K, Miller BE, Tal-Singer R, et al. Multiple biomarkers predict disease severity, progression and mortality in COPD. Respir Res. 2017;18(1):117. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12931-017-0597-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 14. 14.Gan WQ, Man SFP, Senthilselvan A, Sin DD. Association between chronic obstructive pulmonary disease and systemic inflammation: a systematic review and a meta-analysis. Thorax. 2004;59(7):574. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToidGhvcmF4am5sIjtzOjU6InJlc2lkIjtzOjg6IjU5LzcvNTc0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMDEvMjAyMy4wNi4zMC4yMzI5MjAzNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 15. 15.Keefe J, Yao C, Hwang S-J, Courchesne P, Lee GY, Dupuis J, et al. An Integrative Genomic Strategy Identifies sRAGE as a Causal and Protective Biomarker of Lung Function. Chest. 2022;161(1):76–84. 16. 16.Milne S, Li X, Hernandez Cordero AI, Yang CX, Cho MH, Beaty TH, et al. Protective effect of club cell secretory protein (CC-16) on COPD risk and progression: a Mendelian randomisation study. Thorax. 2020;75(11):934. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToidGhvcmF4am5sIjtzOjU6InJlc2lkIjtzOjk6Ijc1LzExLzkzNCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA3LzAxLzIwMjMuMDYuMzAuMjMyOTIwMzUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 17. 17.Obeidat M, en, Li X, Burgess S, Zhou G, Fishbane N, et al. Surfactant protein D is a causal risk factor for COPD: results of Mendelian randomisation. Eur Respir J. 2017;50(5):1700657. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjEyOiI1MC81LzE3MDA2NTciO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wNy8wMS8yMDIzLjA2LjMwLjIzMjkyMDM1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 18. 18.Dahl M, Vestbo J, Zacho J, Lange P, Tybjærg-Hansen A, Nordestgaard BG. C reactive protein and chronic obstructive pulmonary disease: a Mendelian randomisation approach. Thorax. 2011;66(3):197. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToidGhvcmF4am5sIjtzOjU6InJlc2lkIjtzOjg6IjY2LzMvMTk3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMDEvMjAyMy4wNi4zMC4yMzI5MjAzNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 19. 19.van Durme YMTA, Lahousse L, Verhamme KMC, Stolk L, Eijgelsheim M, Loth DW, et al. Mendelian Randomization Study of Interleukin-6 in Chronic Obstructive Pulmonary Disease. Respiration. 2011;82(6):530–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000332336&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21997388&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 20. 20.Emilsson V, Ilkov M, Lamb JR, Finkel N, Gudmundsson EF, Pitts R, et al. Co-regulatory networks of human serum proteins link genetics to disease. Science. 2018;361(6404):769-73. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNjEvNjQwNC83NjkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMy8wNy8wMS8yMDIzLjA2LjMwLjIzMjkyMDM1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 21. 21.Emdin CA, Khera AV, Kathiresan S. Mendelian Randomization. JAMA. 2017;318(19):1925–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2017.17219&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29164242&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 22. 22.Harris TB, Launer LJ, Eiriksdottir G, Kjartansson O, Jonsson PV, Sigurdsson G, et al. Age, Gene/Environment Susceptibility – Reykjavik Study: Multidisciplinary Applied Phenomics. Am J Epidemiol. 2007;165(9):1076–87. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwk115&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17351290&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000245896900014&link_type=ISI) 23. 23.Gudmundsson G, Margretardottir OB, Sigurdsson MI, Harris TB, Launer LJ, Sigurdsson S, et al. Airflow obstruction, atherosclerosis and cardiovascular risk factors in the AGES Reykjavik study. Atherosclerosis. 2016;252:122–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.atherosclerosis.2016.07.919&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27522264&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 24. 24.Gudjonsson A, Gudmundsdottir V, Axelsson GT, Gudmundsson EF, Jonsson BG, Launer LJ, et al. A genome-wide association study of serum proteins reveals shared loci with common diseases. Nat Commun. 2022;13(1):480. 25. 25.Hankinson JL, Odencrantz JR, Fedan KB. Spirometric reference values from a sample of the general U.S. population. Am J Respir Crit Care Med. 1999;159(1):179–87. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/ajrccm.159.1.9712108&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9872837&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000077987600027&link_type=ISI) 26. 26.Boyle EI, Weng S, Gollub J, Jin H, Botstein D, Cherry JM, et al. GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics. 2004;20(18):3710–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bth456&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15297299&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000225786600064&link_type=ISI) 27. 27.Shrine N, Guyatt AL, Erzurumluoglu AM, Jackson VE, Hobbs BD, Melbourne CA, et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat Genet. 2019;51(3):481–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0321-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30804560&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 28. 28.Burgess S, Dudbridge F, Thompson SG. Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods. Stat Med. 2016;35(11):1880–906. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/sim.6835&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26661904&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 29. 29.Shrine N, Guyatt AL, Erzurumluoglu AM, Jackson VE, Hobbs BD, Melbourne CA, et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat Genet. 2019;51(3):481–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0321-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30804560&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 30. 30.Ferkingstad E, Sulem P, Atlason BA, Sveinbjornsson G, Magnusson MI, Styrmisdottir EL, et al. Large-scale integration of the plasma proteome with genetics and disease. Nat Genet. 2021;53(12):1712–21. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00978-w&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 31. 31.Zou Y, Carbonetto P, Wang G, Stephens M. Fine-mapping from summary data with the “Sum of Single Effects” model. PLoS Genet. 2022;18(7):e1010299. 32. 32.Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68-74. 33. 33.Wen X, Pique-Regi R, Luca F. Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization. PLoS Genet. 2017;13(3):e1006646. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1006646&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28278150&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 34. 34.Hukku A, Pividori M, Luca F, Pique-Regi R, Im HK, Wen X. Probabilistic colocalization of genetic variants from complex and molecular traits: promise and limitations. Am J Hum Genet. 2021;108(1):25–35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2020.11.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33308443&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 35. 35.Kim M, Vo DD, Kumagai ME, Jops CT, Gandal MJ. GeneticsMakie.jl: a versatile and scalable toolkit for visualizing locus-level genetic and genomic data. Bioinformatics. 2023;39(1). 36. 36.Sun BB, Maranville JC, Peters JE, Stacey D, Staley JR, Blackshaw J, et al. Genomic atlas of the human plasma proteome. Nature. 2018;558(7708):73-9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0175-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29875488&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 37. 37.Gudmundsdottir V, Zaghlool SB, Emilsson V, Aspelund T, Ilkov M, Gudmundsson EF, et al. Circulating Protein Signatures and Causal Candidates for Type 2 Diabetes. Diabetes. 2020;69(8):1843–53. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2337/db20-1843-P&link_type=DOI) 38. 38.Zhang K, Li M, Yin L, Fu G, Liu Z. Role of thrombospondin–1 and thrombospondin–2 in cardiovascular diseases (Review). Int J Mol Med. 2020;45(5):1275–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3892/ijmm.2020.4507&link_type=DOI) 39. 39.Jiang YM, Yu DL, Hou GX, Jiang JL, Zhou Q, Xu XF. Serum thrombospondin-2 is a candidate diagnosis biomarker for early non-small-cell lung cancer. Biosci Rep. 2019;39(7). 40. 40.Calabro NE, Kristofik NJ, Kyriakides TR. Thrombospondin-2 and extracellular matrix assembly. Biochim Biophys Acta. 2014;1840(8):2396–402. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.bbagen.2014.01.013&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24440155&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 41. 41.Zito E, Chin KT, Blais J, Harding HP, Ron D. ERO1-beta, a pancreas-specific disulfide oxidase, promotes insulin biogenesis and glucose homeostasis. J Cell Biol. 2010;188(6):821–32. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamNiIjtzOjU6InJlc2lkIjtzOjk6IjE4OC82LzgyMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA3LzAxLzIwMjMuMDYuMzAuMjMyOTIwMzUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 42. 42.Asada K, Kobayashi K, Joutard S, Tubaki M, Takahashi S, Takasawa K, et al. Uncovering Prognosis-Related Genes and Pathways by Multi-Omics Analysis in Lung Cancer. Biomolecules. 2020;10(4). 43. 43.Zhu T, Gao YF, Chen YX, Wang ZB, Yin JY, Mao XY, et al. Genome-scale analysis identifies GJB2 and ERO1LB as prognosis markers in patients with pancreatic cancer. Oncotarget. 2017;8(13):21281–9. 44. 44.Li H, Liu Y, Wang L, Shen T, Du W, Liu Z, et al. High apolipoprotein M serum levels correlate with chronic obstructive pulmonary disease. Lipids Health Dis. 2016;15:59. 45. 45.Burkart KM, Manichaikul A, Wilk JB, Ahmed FS, Burke GL, Enright P, et al. APOM and high-density lipoprotein cholesterol are associated with lung function and per cent emphysema. Eur Respir J. 2014;43(4):1003–17. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjk6IjQzLzQvMTAwMyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA3LzAxLzIwMjMuMDYuMzAuMjMyOTIwMzUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 46. 46.Strnad P, McElvaney NG, Lomas DA. Alpha1-Antitrypsin Deficiency. N Engl J Med. 2020;382(15):1443–55. 47. 47.Senn O, Russi EW, Schindler C, Imboden M, von Eckardstein A, Brändli O, et al. Circulating alpha1-antitrypsin in the general population: Determinants and association with lung function. Respir Res. 2008;9(1):35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1465-9921-9-35&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18439253&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 48. 48.Thun G-A, Ferrarotti I, Imboden M, Rochat T, Gerbase M, Kronenberg F, et al. SERPINA1 PiZ and PiS Heterozygotes and Lung Function Decline in the SAPALDIA Cohort. PLoS One. 2012;7(8):e42728. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0042728&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22912729&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 49. 49.Tanni SE, Pelegrino NR, Angeleli AY, Correa C, Godoy I. Smoking status and tumor necrosis factor-alpha mediated systemic inflammation in COPD patients. J Inflamm (Lond). 2010;7:29. 50. 50.Churg A, Wang RD, Tai H, Wang X, Xie C, Wright JL. Tumor necrosis factor-alpha drives 70% of cigarette smoke-induced emphysema in the mouse. Am J Respir Crit Care Med. 2004;170(5):492–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/rccm.200404-511OC&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15184206&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000223605000006&link_type=ISI) 51. 51.Yao Y, Zhou J, Diao X, Wang S. Association between tumor necrosis factor-α and chronic obstructive pulmonary disease: a systematic review and meta-analysis. Ther Adv Respir Dis. 2019;13:1753466619866096. 52. 52.Rennard SI, Fogarty C, Kelsen S, Long W, Ramsdell J, Allison J, et al. The safety and efficacy of infliximab in moderate to severe chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2007;175(9):926–34. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/rccm.200607-995OC&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17290043&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000246229500011&link_type=ISI) 53. 53.Koo HK, Hong Y, Lim MN, Yim JJ, Kim WJ. Relationship between plasma matrix metalloproteinase levels, pulmonary function, bronchodilator response, and emphysema severity. Int J Chron Obstruct Pulmon Dis. 2016;11:1129–37. 54. 54.Churg A, Zhou S, Wright JL. Matrix metalloproteinases in COPD. Eur Respir J. 2012;39(1):197. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjg6IjM5LzEvMTk3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMDEvMjAyMy4wNi4zMC4yMzI5MjAzNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 55. 55.Navratilova Z, Zatloukal J, Kriegova EVA, Kolek V, Petrek M. Simultaneous up-regulation of matrix metalloproteinases 1, 2, 3, 7, 8, 9 and tissue inhibitors of metalloproteinases 1, 4 in serum of patients with chronic obstructive pulmonary disease. Respirology. 2012;17(6):1006-12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1440-1843.2012.02197.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22591289&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000306907000017&link_type=ISI) 56. 56.Regueiro V, Campos MA, Morey P, Sauleda J, Agustí AGN, Garmendia J, et al. Lipopolysaccharide-binding protein and CD14 are increased in the bronchoalveolar lavage fluid of smokers. Eur Respir J. 2009;33(2):273. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiZXJqIjtzOjU6InJlc2lkIjtzOjg6IjMzLzIvMjczIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMDEvMjAyMy4wNi4zMC4yMzI5MjAzNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 57. 57.de Lau WB, Snel B, Clevers HC. The R-spondin protein family. Genome Biol. 2012;13(3):242. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/gb-2012-13-3-242&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22439850&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F01%2F2023.06.30.23292035.atom) 58. 58.Su Y, Zhang X, Bidlingmaier S, Behrens CR, Lee NK, Liu B. ALPPL2 Is a Highly Specific and Targetable Tumor Cell Surface Antigen. Cancer Res. 2020;80(20):4552–64. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1158/1538-7445.AM2020-4552&link_type=DOI) 59. 59.Jung IH, Elenbaas JS, Alisio A, Santana K, Young EP, Kang CJ, et al. SVEP1 is a human coronary artery disease locus that promotes atherosclerosis. Sci Transl Med. 2021;13(586). 60. 60.Merali S, Barrero CA, Bowler RP, Chen DE, Criner G, Braverman A, et al. Analysis of the plasma proteome in COPD: Novel low abundance proteins reflect the severity of lung remodeling. COPD: Journal of Chronic Obstructive Pulmonary Disease. 2014;11(2):177–89.