Abstract
Objective We aimed to develop and evaluate a non-invasive deep learning algorithm for screening type 2 diabetes in UK Biobank participants using retinal images.
Research Design and Methods The deep learning model for prediction of type 2 diabetes was trained on retinal images from 50,077 UK Biobank participants and tested on 12,185 participants. We evaluated its performance in terms of predicting traditional risk factors (TRFs) and genetic risk for diabetes. Next, we compared the performance of three models in predicting type 2 diabetes using 1) an image-only deep learning algorithm, 2) TRFs, 3) the combination of the algorithm and TRFs. Assessing net reclassification improvement (NRI) allowed quantification of the improvement afforded by adding the algorithm to the TRF model.
Results When predicting TRFs with the deep learning algorithm, the areas under the curve (AUCs) obtained with the validation set for age, sex, and HbA1c status were 0.931 (0.928-0.934), 0.933 (0.929-0.936), and 0.734 (0.715-0.752), respectively. When predicting type 2 diabetes, the AUC of the composite logistic model using non-invasive TRFs was 0.810 (0.790-0.830), and that for the deep learning model using only fundus images was 0.731 (0.707-0.756). Upon addition of TRFs to the deep learning algorithm, discriminative performance was improved to 0.844 (0.826-0.861). The addition of the algorithm to the TRFs model improved risk stratification with an overall NRI of 50.8%.
Conclusions Our results demonstrate that this deep learning algorithm can be a useful tool for stratifying individuals at high risk of type 2 diabetes in the general population.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by the National Research Foundation of Korea Grant funded by the Korean Government (NRF-2016R1C1B1009262) and the National Research Foundation of Korea Grant (NRF-2019R1A2C1006608) funded by the Korea government. This work was also supported by NLM R01 NL012535 and NIGMS R01 GM138597.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The UK Biobank has ethical approval from the National Research Ethics Committee (June 17, 2011 [RES reference 11/NW/0382]), which was further extended (May 10, 2016 [RES reference 16/NW/0274]). Use of the UK Biobank Resource in the current study was approved under Application Number 67855.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
We used the UK Biobank dataset to develop and validate a deep learning algorithm for prediction of type 2 diabetes using retinal fundus photographs. The UK Biobank project is a prospective observational study that recruited 505,025 UK participants, aged 40-69 years at baseline, between 2006 and 2010. Each participant provided informed consent, completed a touchscreen and in-person interview with trained staff, and underwent a series of physical examinations. Extensive information was collected, including lifestyle, sociodemographic factors, medical history, biologic samples, imaging, and genome-wide genotype data. Detailed protocols for obtaining the data are available on the UK Biobank website at www.ukbiobank.ac.uk.
Non-Standard Abbreviations and Acronyms
- AI
- artificial intelligence
- AUC
- area under the curve
- CE
- Cross-entropy
- CVD
- Cardiovascular disease
- NPV
- negative predictive value
- PPV
- positive predictive value
- R2
- R-squared
- TRF
- traditional risk factor