PT - JOURNAL ARTICLE AU - Maiorino, Enrico AU - De Marzio, Margherita AU - Xu, Zhonghui AU - Yun, Jeong H. AU - Chase, Robert P. AU - Hersh, Craig P. AU - Weiss, Scott T. AU - Silverman, Edwin K. AU - Castaldi, Peter J. AU - Glass, Kimberly TI - JOINT CLINICAL AND MOLECULAR SUBTYPING OF COPD WITH VARIATIONAL AUTOENCODERS AID - 10.1101/2023.08.19.23294298 DP - 2024 Jan 01 TA - medRxiv PG - 2023.08.19.23294298 4099 - http://medrxiv.org/content/early/2024/01/10/2023.08.19.23294298.short 4100 - http://medrxiv.org/content/early/2024/01/10/2023.08.19.23294298.full AB - Chronic Obstructive Pulmonary Disease (COPD) is a complex, heterogeneous disease. Traditional subtyping methods generally focus on either the clinical manifestations or the molecular endotypes of the disease, resulting in classifications that do not fully capture the disease’s complexity. Here, we bridge this gap by introducing a subtyping pipeline that integrates clinical and gene expression data with variational autoencoders. We apply this methodology to the COPDGene study, a large study of current and former smoking individuals with and without COPD. Our approach generates a set of vector embeddings, called Personalized Integrated Profiles (PIPs), that recapitulate the joint clinical and molecular state of the subjects in the study. Prediction experiments show that the PIPs have a predictive accuracy comparable to or better than other embedding approaches. Using trajectory learning approaches, we analyze the main trajectories of variation in the PIP space and identify five well-separated subtypes with distinct clinical phenotypes, expression signatures, and disease outcomes. Notably, these subtypes are more robust to data resampling compared to those identified using traditional clustering approaches. Overall, our findings provide new avenues to establish fine-grained associations between the clinical characteristics, molecular processes, and disease outcomes of COPD.Competing Interest StatementJHY received consulting fees from Bridge BioTherapeutics. PJC received consulting fees from Verona Pharmaceuticals and research support from Bayer and Sanofi, both outside of this work. In the past three years, EKS received grant support from Bayer and Northpond Laboratories. STW receives royalties from UpToDate and is on the Board of Histolix, a digital pathology company. Funding StatementThis work was supported by NHLBI R01HL124233, U01 HL089897, R01 HL147326, and U01 HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Board that has included AstraZeneca, Bayer Pharmaceuticals, Boehringer-Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer, and Sunovion. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Brigham and Women's Hospital IRB Protocol #: 2007P000554/BWHI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data used in this study are available in dbGaP phs000179.v6.p2