SUMMARY
Background This study assessed whether deep learning applied to routine outpatient chest X-rays (CXRs) can identify individuals at high risk for incident chronic obstructive pulmonary disease (COPD).
Methods Using cancer screening trial data, we previously developed a convolutional neural network (CXR-Lung-Risk) to predict lung-related mortality from a CXR image. In this study, we externally validated CXR-Lung-Risk to predict incident COPD from routine CXRs. We identified outpatients without lung cancer, COPD, or emphysema who had a CXR taken in 2013-2014 at a Mass General Brigham site in Boston, Massachusetts. The primary outcome was 6-year incident COPD. Discrimination was assessed using the area under the receiver operating characteristic curve (AUC) and compared with that of the TargetCOPD clinical risk score. All analyses were stratified by smoking status. A secondary analysis was conducted in the Project Baseline Health Study (PBHS) to test associations of CXR-Lung-Risk with pulmonary function and protein abundance.
Findings The primary analysis consisted of 12,550 ever-smokers (mean age 62·4±6·8 years, 48·9% male, 12·4% rate of 6-year COPD) and 15,298 never-smokers (mean age 63·0±8·1 years, 42·8% male, 3·8% rate of 6-year COPD). CXR-Lung-Risk had additive predictive value beyond the TargetCOPD score for 6-year incident COPD in both ever-smokers (CXR-Lung-Risk + TargetCOPD AUC: 0·73 [95% CI: 0·72-0·74] vs. TargetCOPD alone AUC: 0·66 [0·65-0·68], p<0·01) and never-smokers (CXR-Lung-Risk + TargetCOPD AUC: 0·70 [0·67-0·72] vs. TargetCOPD alone AUC: 0·60 [0·57-0·62], p<0·01). In secondary analyses of 2,097 individuals in the PBHS, CXR-Lung-Risk was associated with worse pulmonary function and with abundance of SCGB3A2 (secretoglobin family 3A member 2) and LYZ (lysozyme), proteins involved in pulmonary physiology.
Interpretation In external validation, a deep learning model applied to a routine CXR image identified individuals at high risk for incident COPD, beyond known risk factors.
Funding The Project Baseline Health Study and this analysis were funded by Verily Life Sciences, San Francisco, California.
ClinicalTrials.gov Identifier NCT03154346
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
All authors acknowledge institutional research grants from Verily Life Sciences. KM reports grants from Verily, Afferent, the American Heart Association (AHA), Cardiva Medical Inc, Gilead, Luitpold, Medtronic, Merck, Eidos, Ferring, Apple Inc, Sanifit, and St. Jude; grants and personal fees from Amgen, AstraZeneca, Bayer, CSL Behring, Johnson & Johnson, Novartis, and Sanofi; and personal fees from Anthos, Applied Therapeutics, Elsevier, Inova, Intermountain Health, Medscape, Mount Sinai, Mundi Pharma, Myokardia, Novo Nordisk, Otsuka, Portola, SmartMedics, and Theravance outside the submitted work. AH reports grants from Verily; grants and personal fees from AstraZeneca, Amgen, Bayer, Merck, and Novartis; and personal fees from Boston Scientific outside the submitted work. JW reports grants from the National Academy of Medicine and the German Research Foundation and personal fees from Onc.AI outside the submitted work. VKR reports grants from the National Academy of Medicine, Norn Group, the American Heart Association, and the NHLBI and has common stock in Alphabet, Apple, NVIDIA, and Meta. HJWLA reports grants from the National Cancer Institute and the European Union, consulting fees and stock from Onc.AI, Love Health, Sphera, and Ambient outside the submitted work. MTL reports grants from the National Academy of Medicine, American Heart Association, AstraZeneca, Ionis, Johnson & Johnson Innovation, Kowa, Medimmune, NHLBI, and the Risk Management Foundation of the Harvard Medical Institutions Inc outside the submitted work. DCC reports research grants from the NIH: U01CA209414.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study was approved by the Mass General Brigham Institutional Review Board with a waiver of informed consent for retrospective analysis of deidentified data.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
DATA SHARING STATEMENT
The deidentified PBHS data corresponding to this study are available upon request for the purpose of examining its reproducibility. Requests are subject to approval by PBHS governance. Due to institutional policy to protect patient privacy, MGB data cannot be shared.