PT - JOURNAL ARTICLE AU - Li, Jiang AU - Yu, Yuefeng AU - Sun, Ying AU - Fu, Yanqi AU - Shen, Wenqi AU - Cai, Lingli AU - Tan, Xiao AU - Wang, Ningjian AU - Lu, Yingli AU - Wang, Bin TI - Nuclear magnetic resonance-based metabolomics with machine learning for predicting progression from prediabetes to diabetes AID - 10.1101/2024.05.14.24307378 DP - 2024 Jan 01 TA - medRxiv PG - 2024.05.14.24307378 4099 - http://medrxiv.org/content/early/2024/05/15/2024.05.14.24307378.short 4100 - http://medrxiv.org/content/early/2024/05/15/2024.05.14.24307378.full AB - Background Identification of individuals with prediabetes who are at high risk of developing diabetes allows for precise interventions. We aimed to determine the role of nuclear magnetic resonance (NMR)-based metabolomic signature in predicting the progression from prediabetes to diabetes.Methods This prospective study included 13,489 participants with prediabetes who had metabolomic data from the UK Biobank. Circulating metabolites were quantified via NMR spectroscopy. Cox proportional hazard (CPH) models were performed to estimate the associations between metabolites and diabetes risk. Supporting vector machine, random forest, and extreme gradient boosting were used to select the optimal metabolite panel for prediction. CPH and random survival forest (RSF) models were utilized to validate the predictive ability of the metabolites.Results During a median follow-up of 13.6 years, 2,525 participants developed diabetes. After adjusting for covariates, 94 of 168 metabolites were associated with risk of progression to diabetes. A panel of nine metabolites, selected by all three machine learning algorithms, was found to significantly improve diabetes risk prediction beyond conventional risk factors in the CPH model (area under the receiver operating characteristic curve [AUROC], 1-year: 0.823 for risk factors + metabolites vs 0.759 for risk factors, 5-year: 0.830 vs 0.798, 10-year: 0.801 vs 0.776, all P <0.05). Similar results were observed from the RSF model. Categorization of participants according to the predicted value thresholds revealed distinct cumulative risk of diabetes.Conclusions Our study lends support for use of the metabolite markers to help determine individuals with prediabetes who are at high risk of progressing to diabetes and inform targeted and efficient interventions.Funding Shanghai Municipal Health Commission (2022XD017). Innovative Research Team of High-level Local Universities in Shanghai (SHSMU-ZDCX20212501). Shanghai Municipal Human Resources and Social Security Bureau (2020074). Clinical Research Plan of Shanghai Hospital Development Center (SHDC2020CR4006). Graphical abstract.CPH, Cox proportional hazard; NMR, nuclear magnetic resonance; RF, random forest; RSF, Random survival forest; SVM, supporting vector machine; XGBoost, extreme gradient boosting.Competing Interest StatementThe authors have declared no competing interest.Funding StatementShanghai Municipal Health Commission (2022XD017). Innovative Research Team of High-level Local Universities in Shanghai (SHSMU-ZDCX20212501). Shanghai Municipal Human Resources and Social Security Bureau (2020074). Clinical Research Plan of Shanghai Hospital Development Center (SHDC2020CR4006).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was approved by the Northwest Multicenter Research Ethics Committee (REC reference for UK Biobank 11/NW/0382), and all participants provided informed consent.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe data analyzed during this study are available at https://www.ukbiobank.ac.uk/. This research has been conducted using the UK Biobank Resource under application number 77740.