ABSTRACT
Rationale Emphysema is a COPD phenotype with important prognostic implications. Identifying blood-based biomarkers of emphysema will facilitate early diagnosis and development of targeted therapies.
Objectives Discover blood omics biomarkers for chest CT-quantified emphysema and develop predictive biomarker panels.
Methods Emphysema blood biomarker discovery was performed using differential gene expression, alternative splicing, and protein association analyses in a training set of 2,370 COPDGene participants with available whole blood RNA sequencing, plasma SomaScan proteomics, and clinical data. Validation was conducted in a testing set of 1,016 COPDGene subjects. Since low body mass index (BMI) and emphysema often co-occur, we performed a mediation analysis to quantify the effect of BMI on gene and protein associations with emphysema. Elastic net models were also developed in the training sample sequentially using clinical, complete blood count (CBC) cell proportions, RNA sequencing, and proteomic biomarkers to predict quantitative emphysema. Model accuracy was assessed in the testing sample by the area under the receiver-operator-characteristic-curves (AUROC) for subjects stratified into tertiles of emphysema severity.
Measurements and Main Results 4,913 genes, 1,478 isoforms, 386 exons, and 881 proteins were significantly associated with emphysema (FDR 10%) and yielded 109 biological pathways. 75% of the genes and 77% of the proteins associated with emphysema showed evidence of mediation by BMI. The highest-performing predictive model used clinical, CBC, and protein biomarkers, distinguishing the top from the bottom tertile of emphysema with an AUROC of 0.92.
Conclusions Blood transcriptome and proteome-wide analyses reveal key biological pathways of emphysema and enhance the prediction of emphysema.
Scientific Knowledge on the Subject Differential gene expression and protein analyses have uncovered some of the molecular underpinnings of emphysema. However, no studies have assessed alternative splicing mechanisms and analyzed proteomic data from recently developed high-throughput panels. In addition, although emphysema has been associated with low body mass index (BMI), it is still unclear how BMI affects the transcriptome and proteome of the disease. Finally, the effectiveness of multi-omic biomarkers in determining the severity of emphysema has not yet been investigated.
What This Study Adds to the Field We performed whole-blood genome-wide RNA sequencing and plasma SomaScan proteomic analyses in the large and well-phenotyped COPDGene study. In addition to confirming earlier findings, our differential gene expression, alternative splicing, and protein analyses identified novel biomarkers and pathways of chest CT-quantified emphysema. Our mediation analysis detected varying degrees of transcriptomic and proteomic mediation due to BMI. Our supervised machine learning modeling demonstrated the utility of incorporating multi-omics data in enhancing the prediction of emphysema.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by NHLBI K08 HL141601, K08 HL146972, K08 HL136928, K01 HL157613, R01 HL124233, U01 HL089897, R01 HL147326, R01 HL133135, P01 HL114501, U01 HL089897, and U01 HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprised of AstraZeneca, Bayer Pharmaceuticals, Boehringer-Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer, and Sunovion.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The approval of the Institutional Review Board was obtained (Brigham and Women Hospital/2007P000554).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Authors’ email addresses: Rahul Suryadevara (rahul.suryadevara{at}channing.harvard.edu), Andrew Gregory (andrew.gregory{at}channing.harvard.edu), Robin Lu (robin.lu{at}channing.harvard.edu), Zhonghui Xu (zhonghui.xu{at}channing.harvard.edu), Aria Masoomi (masoomi.a{at}northeastern.edu), Sharon M. Lutz (smlutz{at}hsph.harvard.edu), Seth Berman (seth.berman{at}channing.harvard.edu), Jeong H. Yun (jeong.yun{at}channing.harvard.edu), Aabida Saferali (aabida.saferali{at}channing.harvard.edu), Craig P. Hersh (craig.hersh{at}channing.harvard.edu), Edwin K. Silverman (ed.silverman{at}channing.harvard.edu), Jennifer Dy (jdy{at}ece.neu.edu), Katherine A. Pratte (prattek{at}njhealth.org), Russell P. Bowler (bowlerr{at}njhealth.org), Peter J. Castaldi (peter.castaldi{at}channing.harvard.edu), Adel Boueiz (adel.boueiz{at}channing.harvard.edu)
Funding Sources: This work was supported by NHLBI K08 HL141601, K08 HL146972, K08 HL136928, K01 HL157613, R01 HL124233, U01 HL089897, R01 HL147326, R01 HL133135, P01 HL114501, U01 HL089897, and U01 HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprised of AstraZeneca, Bayer Pharmaceuticals, Boehringer-Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer, and Sunovion.
This article has an online data supplement, which is accessible from this issue’s table of content online at www.atjsjournals.org
Updated results, tables, figures, and discussion
Data Availability
All data produced in the present study are available upon reasonable request to the authors