ABSTRACT
OBJECTIVE Our aim was to apply state-of-the-art machine learning algorithms to predict the risk of future progression to diabetes complications, including diabetic kidney disease (≥30% decline in eGFR) and diabetic retinopathy (mild, moderate or severe).
RESEARCH DESIGN AND METHODS Using data in a cohort of 537 adults with type 1 diabetes we predicted diabetes complications emerging during a median follow-up of 5.4 years. Prediction models were computed first with clinical risk factors at baseline (17 measures) and then with clinical risk factors and blood-derived metabolomics and lipidomics data (965 molecular features) at baseline. Participants were first classified into two groups: type 1 diabetes stable (n=195) or type 1 diabetes with progression to diabetes complications (n=190). Furthermore, progression of diabetic kidney disease (≥30% decline in eGFR; n=79) and diabetic retinopathy (mild, moderate or severe; n=111) were predicted in two complication-specific models. Models were compared by 5-fold cross-validated area under the receiver operating characteristic (AUROC) curves. The Shapley additive explanations algorithm was used for feature selection and for interpreting the models. Accuracy, precision, recall, and F-score were used to evaluate clinical utility.
RESULTS During a median follow-up of 5.4 years, 79 (21 %) of the participants (mean±SD: age 54.8 ± 13.7 years) progressed in diabetic kidney disease and 111 (29 %) of the participants progressed to diabetic retinopathy. The predictive models for diabetic kidney disease progression were highly accurate with clinical risk factors: the accuracy of 0.95 and AUROC of 0.92 (95% CI 0.857;0.995) was achieved, further improved to the accuracy of 0.98 and AUROC of 0.99 (95% CI 0.876;0.997) when omics-based predictors were included. The predictive panel composition was: albuminuria, retinopathy, estimated glomerular filtration rate, hemoglobin A1c, and six metabolites (five identified as ribitol, ribonic acid, myo-inositol, 2,4- and 3,4-dihydroxybutanoic acids).
Models for diabetic retinopathy progression were less predictive with clinical risk predictors at, AUROC of 0.81 (95% CI 0.754;0.958) and with omics included at AUROC of 0.87 (95% CI 0.781;0.996) curve. The final retinopathy-panel included: hemoglobin A1c, albuminuria, mild degree of retinopathy, and seven metabolites, including one ceramide and the 3,4-dihydroxybutanoic acid).
CONCLUSIONS Here we demonstrate the application of machine learning to effectively predict five-year progression of complications, in particular diabetic kidney disease, using a panel of known clinical risk factors in combination with blood small molecules. Further replication of this machine learning tool in a real-world context or a clinical trial will facilitate its implementation in the clinic.
Competing Interest Statement
The authors declare no potential conflicts of interests relevant to this manuscript. Outside this manuscript PR reports consultancy and/or speaking fees to Steno Diabetes Center Copenhagen from Astellas, AstraZeneca, Bayer, Boehringer Ingelheim, Gilead, Eli Lilly, MSD, Novo Nordisk Vifor, and Sanofi Aventis and research grants from AstraZeneca and Novo Nordisk.
Funding Statement
This project was funded by the Novo Nordisk Foundation grant NNF14OC0013659 (PROTON Personalizing treatment of diabetic nephropathy). Internal funding was provided by Steno Diabetes Center Copenhagen, Gentofte, Denmark.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study involving human participants were approved by The Ethics Committee E, Region Hovedstaden, Denmark. The participants /patients provided their written informed consent to participate in this study.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.