RT Journal Article SR Electronic T1 Nuclear magnetic resonance-based metabolomics with machine learning for predicting progression from prediabetes to diabetes JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.05.14.24307378 DO 10.1101/2024.05.14.24307378 A1 Li, Jiang A1 Yu, Yuefeng A1 Sun, Ying A1 Fu, Yanqi A1 Shen, Wenqi A1 Cai, Lingli A1 Tan, Xiao A1 Cai, Yan A1 Wang, Ningjian A1 Lu, Yingli A1 Wang, Bin YR 2024 UL http://medrxiv.org/content/early/2024/09/06/2024.05.14.24307378.abstract AB Background Identification of individuals with prediabetes who are at high risk of developing diabetes allows for precise interventions. We aimed to determine the role of nuclear magnetic resonance (NMR)-based metabolomic signature in predicting the progression from prediabetes to diabetes.Methods This prospective study included 13,489 participants with prediabetes who had metabolomic data from the UK Biobank. Circulating metabolites were quantified via NMR spectroscopy. Cox proportional hazard (CPH) models were performed to estimate the associations between metabolites and diabetes risk. Supporting vector machine, random forest, and extreme gradient boosting were used to select the optimal metabolite panel for prediction. CPH and random survival forest (RSF) models were utilized to validate the predictive ability of the metabolites.Results During a median follow-up of 13.6 years, 2,525 participants developed diabetes. After adjusting for covariates, 94 of 168 metabolites were associated with risk of progression to diabetes. A panel of nine metabolites, selected by all three machine learning algorithms, was found to significantly improve diabetes risk prediction beyond conventional risk factors in the CPH model (area under the receiver operating characteristic curve [AUROC], 1-year: 0.823 for risk factors + metabolites vs 0.759 for risk factors, 5-year: 0.830 vs 0.798, 10-year: 0.801 vs 0.776, all P <0.05). Similar results were observed from the RSF model. Categorization of participants according to the predicted value thresholds revealed distinct cumulative risk of diabetes.Conclusions Our study lends support for use of the metabolite markers to help determine individuals with prediabetes who are at high risk of progressing to diabetes and inform targeted and efficient interventions.Funding Shanghai Municipal Health Commission (2022XD017). Innovative Research Team of High-level Local Universities in Shanghai (SHSMU-ZDCX20212501). Shanghai Municipal Human Resources and Social Security Bureau (2020074). Clinical Research Plan of Shanghai Hospital Development Center (SHDC2020CR4006). Science and Technology Commission of Shanghai Municipality (22015810500).Competing Interest StatementThe authors have declared no competing interest.Funding StatementShanghai Municipal Health Commission (2022XD017). Innovative Research Team of High-level Local Universities in Shanghai (SHSMU-ZDCX20212501). Shanghai Municipal Human Resources and Social Security Bureau (2020074). Clinical Research Plan of Shanghai Hospital Development Center (SHDC2020CR4006). Science and Technology Commission of Shanghai Municipality (22015810500).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was approved by the Northwest Multicenter Research Ethics Committee (REC reference for UK Biobank 11/NW/0382), and all participants provided informed consent.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe data analyzed during this study are available at https://www.ukbiobank.ac.uk/. This research has been conducted using the UK Biobank Resource under application number 77740.