Development and validation of a deep-learning model to predict 10-year ASCVD risk from retinal images using the UK Biobank and EyePACS 10K datasets

Ehsan Vaghefi; David Squirrell; Song Yang; Songyang An; Li Xie; Mary K. Durbin; Huiyuan Hou; John Marshall; Jacqueline Shreibati; Michael V McConnell; Matthew Budoff

doi:10.1101/2023.09.20.23295870

Abstract

Background Atherosclerotic Cardiovascular Disease (ASCVD) is a leading cause of death globally, and early detection of high-risk individuals is essential for initiating timely interventions. The authors aimed to develop and validate a deep learning (DL) model to predict an individual’s elevated 10-year ASCVD risk score based on retinal images and limited demographic data.

Methods The study used 89,894 retinal fundus images from 44,176 UK Biobank participants (96% non-Hispanic White, 5% diabetic) to train and test the DL model. The DL model was developed using retinal images plus age, race/ethnicity, and sex at birth to predict an individual’s 10-year ASCVD risk score using the Pooled Cohort Equation (PCE) as the ground truth. This model was then tested on the US EyePACS 10K dataset (5.8% Non-Hispanic white 99.9% diabetic), composed of 18,900 images from 8,969 diabetic individuals. Elevated ASCVD risk was defined as a PCE score of ≥7.5%.

Results In the UK Biobank internal validation dataset, the DL model achieved area under the receiver operating characteristic curve (AUROC) of 0.89, sensitivity 84%, and specificity 90%, for detecting individuals with elevated ASCVD risk scores. In the EyePACS 10K and with the addition of a regression-derived diabetes modifier, it achieved sensitivity 94%, specificity 72%, mean error −0.2%, and mean absolute error 3.1%.

Conclusion This study demonstrates that DL models using retinal images can provide an additional approach to estimating ASCVD risk, and the value of applying DL models to different external datasets and opportunities about ASCVD risk assessment in patients living with diabetes.

Introduction

Atherosclerotic Cardiovascular Disease (ASCVD) is the most common cause of hospitalization and premature death in the US ¹. The risk of an individual experiencing an ASCVD event includes both non-modifiable variables (age, sex, and race/ethnicity), and modifiable variables such as diabetes ², hypertension ³, dyslipidemia ⁴, and smoking ⁵. Across a population, the risk of experiencing an ASCVD event varies greatly. Risk-based equations have therefore been developed to identify those who are at greatest risk of ASCVD so that preventive treatments can be initiated appropriate to the individual’s risk ⁶. The landmark Framingham Heart Study was the first to demonstrate that multivariable equations could identify an individual’s ASCVD risk with far greater accuracy than the existing metrics based solely on blood pressure and cholesterol ⁷. Since the Framingham-based equations were first published, other equations have been developed to serve different and more diverse populations with refined accuracy ^8–16.

The retina is unique in being the only part of the human vasculature where the microvascular system is visible at micron-level resolution by non-invasive means. The automated detection of components within retinal images to predict ASCVD risk has been utilized with moderate degrees of success previously ^17–19, but in recent years there has been an exponential increase in the number of studies that have used artificial intelligence (AI), and deep learning (DL) in particular, to extract data from retinal images ^20–22. There is now growing interest in using the retinal image data generated by DL models to augment the traditional means of estimating ASCVD risk ²³. In this study we used retinal photographs and limited demographic data from the UK Biobank to develop and validate a DL model designed to predict an individual’s elevated 10-year ASCVD risk, based on the US-derived Pooled Cohort Equation (PCE) ²⁴.

The primary aim of this study was to develop and test a DL model to predict an individual’s 10-year ASCVD risk based on their retinal photographs plus age, race/ethnicity, and sex at birth and then further validate the findings in an external database. This model was built upon our previous work on detecting and grading retinopathy, maculopathy, macular degeneration, and effects of smoking in retinal images ^25–28. While the Framingham risk score is recommended to perform cardiovascular risk assessment in some countries ²⁹, the 2018 American Heart Association (AHA) Cholesterol Clinical Practice Guidelines recommend using the US-derived PCE to estimate the 10-year risk for hard ASCVD events (coronary heart disease death, nonfatal myocardial infarction, fatal or nonfatal stroke) ²⁴. Our DL model was trained on and validated against the 10-year ASCVD risk as calculated by the PCE from individuals in the UK Biobank dataset (Level 3 in Figure 1, Appendix 2). Further external validation was provided by testing on a US-based dataset, EyePACS 10K, with substantially greater racial diversity and predominantly from patients living with diabetes.

Figure 1:

The process of training and validation of the DL prediction model using the UK Biobank dataset

Methods

Datasets

The composition of datasets used in this study is shown in Table 1. The UK Biobank was used for training and internal validation. The validation subset represented 20% of the data, selected randomly prior to development. The data from the UK Biobank can be accessed via a direct request to the UK Biobank; and was obtained using approved data management and data transfer protocols. 89,894 fundus images from 44,176 unique participants from the UK Biobank were used in this study. Participants in the UK Biobank were recruited from a UK general population with only approximately 5% of the UK Biobank population self-identified as having diabetes “diagnosed by doctor”.

View this table:

Table 1: The demographic and risk factor makeup of the UK Biobank derived training and internal test datasets and the EyePACS 10K external validation dataset used in this study.

As described below, a dataset from the US-based EyePACS study, with multiple differences from UK Biobank, was used for external validation. The dataset used for this analysis (EyePACS 10K) consisted of a subset of 9,947 individuals who had sufficient clinical data to calculate a traditional PCE risk score. Of these, 978 were excluded as they had established ASCVD prior to the date of retinal imaging. The external validation dataset thus comprised 18,900 images from 8,969 individuals.

The mean age of individuals in the EyePACS 10K dataset was 56±10 years (compared to 57±8.3 years in the UK Biobank). They predominantly self-identified as Hispanic whilst the UK Biobank population was predominantly non-Hispanic White. As the EyePACS 10K population consisted almost exclusively of people living with diabetes presenting for diabetic retinopathy screening, the mean Hemoglobin A1c (HbA1c) level of this population was high (8.1±1.7%). Additional demographics from both datasets are summarized in Table 1.

Significance test was performed between UK Biobank training and test, as well as Biobank training and EyePACS 10K external validation (* P<0.01 z test. **P<0.001 z test. † P<0.01 Chi square). The UK Biobank did not include Hispanic ethnicity as an option and the White participants were predominantly of British and Irish origin. EyePACS 10K included choices of Hispanic, Black, or White, so separating Hispanic Black participants from Hispanic White participants was not possible.

Inclusion/ Exclusion Criteria

Individuals in both datasets who had established ASCVD (Appendix 1) prior to the acquisition of the retinal images were excluded. A previously trained DL-based image quality model was used to screen all retinal images for acceptable image quality ²⁷. Only the earliest acceptable images and the individual’s accompanying biometrics obtained on that same visit were used from the UK Biobank data. For the EyePACS 10K dataset, the earliest acceptable images which had accompanying biodata, taken no earlier than one year before image acquisition, were used. A detailed description of the DL model’s development is provided in Appendix 2.

Model assessment

The performance of the DL model was assessed on both the 20% of UK Biobank that had been set aside for internal validation (8,606 individuals), and on all eligible patients in the EyePACS 10K external validation dataset (8,969 individuals).

Receiver operator curves (ROC), sensitivity, and specificity metrics

To assess the DL model’s ability to predict an elevated ASCVD score we compared the score it issued with that generated by the PCE equation. The DL model’s performance was based on a PCE score of ≥ 7.5% being the ground truth for elevated risk. The receiver operator characteristic curve (ROC), the precision recall curve, and the overall performance of the DL model on the UK Biobank and EyePACS 10K datasets were determined. The sensitivity and specificity of the model to detect individuals with an elevated ASCVD risk in the UK Biobank and EyePACS 10K datasets were also calculated. To investigate the performance of the model in different demographic groups in the UK Biobank, the performance of the model across sex, age, and race/ethnicity were also calculated. To investigate the impact the retinal image had on the DL model’s performance, and to ascertain whether the retinal data contributed positively to the output, we re-ran the DL model withholding the retinal image input data and recalculated the AUROC, sensitivity, and specificity.

Comparison of the performance of the DL model and the PCE equation to predict 10-year ASCVD risk and events

The performance of the DL model vs. PCE was also assessed by way of a data binning technique ³⁰. Patients in the UK Biobank and EyePACS 10K datasets were first arranged in ascending order by their DL model predicted scores and then by their PCE-calculated 10-year ASCVD risk scores. Both datasets were then divided into 20 bins with equal population numbers by an ascending order of their ASCVD risk scores, i.e., from the lowest 5%, every 5% to the top 5%. The mean 10-year ASCVD risk scores generated by the DL model and the PCE equation were then plotted for each bin in both the UK Biobank and EyePACS 10K datasets. The magnitude of the deviation between the two sets of results generated for each dataset was then assessed by measuring the mean error and the mean absolute error per case.

For the UK Biobank the exercise was then repeated using actual observed ASCVD events, categorized by risk as determined by the DL model and the PCE equation. This exercise was not possible for the EyePACS 10K dataset as there were no future ASCVD events recorded after the retinal images were acquired. In accordance with ACC/AHA Work Group guidelines, an ASCVD event for the purposes of outcomes was defined as the first non-fatal acute myocardial infarction, fatal or non-fatal stroke, or fatal coronary heart artery disease ¹¹. The ICD codes used to define a hard ASCVD event are provided in Appendix 1.

Finally, to assess the probabilistic performance of the two methods for predicting 10-year ASCVD risk, the Brier score loss was calculated for both the DL model and the PCE equation.

Results

The ability of the DL model to identify elevated-risk individuals (PCE-generated ASCVD score ≥7.5%), compared to the PCE equation, for both the UK Biobank and EyePACS 10K datasets are shown in Table 2. In UK Biobank, the DL model achieved AUROC: 0.89, a sensitivity of 83%, and a specificity of 90%. Withholding the retinal image data from the DL model resulted in lower performance, with AUROC 0.84, sensitivity 70% specificity 88%.

View this table:

Table 2: Confusion matrices comparing the DL model-predicted ASCVD score v PCE-calculated ASCVD scores. The numbers in each cell are numbers of people.

In EyePACS10K, the DL model achieved a similar AUROC: 0.90, with a lower sensitivity of 52% and a higher specificity of 95%. The ROC and the accompanying precision-recall curve plots are shown in Figure 2.

Figure 2:

(left) the receiver operating characteristic (ROC) curve, (right) the precision-recall curve, for UK Biobank (top) and for EyePACS 10K (bottom row)

In post-hoc analysis, applying a regression-based diabetes modifier to the DL model output, the sensitivity and specificity to detect individuals with elevated risk against PCE in the EyePACS10K dataset were 94% and 72% respectively.

The AUROC values were then calculated for each demographic segment of both the UK Biobank and EyePACS 10K datasets [Tables 3a, 3b and 3c]. The DL model performed well across both sexes and all races/ethnicities, but its performance in younger individuals (≤ 60yrs) in UK Biobank was lower than older individuals (AUROC 0.72 v 0.84).

View this table:

Table 3a: Performance of DL model (AUROC) across different race/ethnicities and demographics in the UK Biobank test dataset.

View this table:

Table 3b: Performance of DL model (AUROC score) across different race and demographics in EyePACS 10K dataset

View this table:

Table 3c: Performance of DL model (AUROC score) across different ethnicities/races and demographics in EyePACS 10K dataset when diabetes modifier is applied.

Relationship between ASCVD risk scores generated and observed ASCVD event rates using the traditional PCE score, and the DL model predicted score

1. UK Biobank

The predicted 10-year ASCVD risk scores generated by the PCE equation and the DL model, plotted for each bin in the UK Biobank, are shown in Figure 3a. The predicted 10-year ASCVD risk scores rose equally and proportionally for all risk categories in both cohorts. Across the UK Biobank, the predicted 10-year ASCVD risk scores from PCE and the DL model were very similar (mean error 0.3%, mean absolute error 2.4%). The actual ASCVD event rates observed in both the PCE and DL model, when categorized by ascending order of predicted risk, are also shown in Figure 3a. The actual ASCVD event rate rose steadily from the non-elevated to elevated-risk categories in both the PCE and DL models across all individuals in the UK Biobank. The magnitude of the actual ASCVD event rates were again very similar to the predicted 10-year ASCVD risk score produced by the PCE and DL models for all risk categories. The actual ASCVD event rates observed when the results generated by the PCE and DL models were subdivided into the binary classification (predicted risk score <7.5 % or ≥7.5%) are shown in Table 4. The ASCVD event rate observed in those individuals from the UK Biobank allocated a “non-elevated” score by the traditional PCE method was the same as those allocated a “non-elevated” score by the DL prediction model (2.2% v 2.0%). The same finding was observed in the “elevated” risk groups; (ASCVD event rate: traditional PCE 7.5%, DL model 7.5%). Analysis with the Point Biserial Correlation Coefficient (PBCC) revealed that the 10-year ASCVD risk score produced by the PCE and the DL model were both significantly correlated with actual ASCVD events. Calculated PCE vs. ASCVD events: 0.144 (P<0.01); DL model predicted risk score vs. ASCVD events: 0.152 (P< 0.01). The accuracy of the PCE equation to correctly predict an ASCVD event was identical to that of the DL model, with the Brier score loss for both 0.067.

Figure 3a:

10-year ASCVD risk scores (crosses) and ASCVD event rates (dots) when categorized according to the traditional PCE-calculated risk score (red) or DL model predicted risk score (green) in the UK Biobank.

View this table:

Table 4. UK Biobank ASCVD event rates in those with and without elevated ASCVD risk scores by PCE-based ASCVD risk score compared to DL model-predicted ASCVD risk score. The numbers in each cell are #people (#ASCVD events)/% events/cases.

2. EyePACS 10K dataset

The ASCVD risk scores generated by the PCE equation and the DL model, plotted for each bin in the EyePACS 10K dataset, are shown in Figure 3b. For all risk categories the predicted 10-year ASCVD risk score, as measured by the PCE equation, was substantially higher than that produced by the DL model (mean error 3.6%, mean absolute error 4.4%). Comparison between Figures 3a and 3b reveals that the performance of the DL model was very consistent across both the UK Biobank and EyePACS 10K. However, the PCE equation predicted consistently higher scores across all risk profiles in the EyePACS dataset, unlike in UK Biobank.

Figure 3b:

10-year ASCVD risk when categorized according to the traditional PCE-calculated risk score (red) or DL model-predicted risk score (green) or DL model-predicted score after application of the diabetes modifier (blue) in the EyePACS 10K dataset.

As the ROC curves indicated that the performance of the DL model was very similar in both the internal (AUROC 0.89) and external (AUROC 0.90) validation datasets, but with differing sensitivities and specificities, we hypothesized that there was a systematic difference between the 10-year ASCVD risk score produced by the PCE and our DL model when it was applied to people living with diabetes. If so it should be possible to produce a correction factor; a “diabetes modifier,” which would translate the 10-year ASCVD risk score produced by our DL model to that produced by the PCE, across all risk profiles ³¹. The regression equation that solved this systematic difference was: DM-modified ASCVD risk score = DL model ASCVD risk score + max (diabetes_adjustment, 0), where diabetes_adjustment= −10.4156+(0.3560*age)+(−4.5422*gender)+(−0.5410*pce_pred). For sex, Male = 0, Female = 1. pce_pred range [0-100]. The ASCVD risk score generated by DL model with the diabetes modifier applied, plotted for each bin in the EyePACS 10K dataset, is shown in Figure 3b. The results produced by the DL model with the diabetes modifier applied were now very similar to PCE (mean error −0.2%, mean absolute error 3.1%). The sensitivity and specificity of the new diabetes-modified DL Model to detect individuals with elevated risk against PCE in the EyePACS10K dataset were 94% and 72% respectively.

Discussion

In this study we developed and validated a novel DL model to calculate a 10-year ASCVD risk score using over 90,000 retinal images from the UK Biobank dataset and then externally validated it on over 18,000 retinal images from the EyePACS 10K dataset. Our DL model reliably detected those individuals with elevated ASCVD risk scores (≥7.5%) using only the retinal image and the individuals’ age, race/ethnicity, and sex with an AUROC of 0.89, a sensitivity and specificity of 84% and 90% respectively in UK Biobank and an AUROC of 0.90, a sensitivity and specificity of 52% and 95% respectively in EyePACS 10K.

Participants in the UK Biobank were recruited from a UK general population and were mainly non-Hispanic White. Individuals in the US-based EyePACS 10K dataset predominantly identified as Hispanic and the majority were people living with diabetes presenting for diabetic retinopathy screening. This makes direct comparison with respect to ASCVD risk predictions between the two datasets challenging. As such it is important to also examine the performance of our DL model and the PCE in the context of actual ASCVD events. The actual observed ASCVD event rate in UK Biobank was very similar between the DL model and PCE; both had ∼2% ASCVD event rate observed in those allocated “non-elevated” risk, and 7.5% in those allocated to the “elevated”-risk group. There was also a significant correlation between the risk scores produced by both models and actual ASCVD events (PCE-based risk score v ASCVD events 0.144, P<0.01; DL model-predicted risk score v ASCVD events 0.152, P< 0.01), and the probabilistic accuracy of the DL model to correctly predict an ASCVD event was identical to that of the PCE (Brier score loss 0.067).

There was a clear difference between the PCE and the DL model-predicted 10-year ASCVD risk scores in the EyePACS 10K dataset. When assessed by the PCE, 48% of individuals in the EyePACS 10K dataset were deemed to be “non-elevated” and 52% were deemed “elevated” risk. The DL model apportioned the risk differently: 70% “non-elevated”; 30% “elevated” risk. As the PCE score was regarded as the ground truth in the confusion matrix, this difference in how the two models apportioned risk explains the apparently low sensitivity of the DL model. Data binning is a method to group continuous values into a smaller number of “bins’’ and is commonly used by data scientists to reduce the effects of observation errors [30]. This approach was used to further investigate the difference in how the PCE and our DL model apportioned risk in both the UK Biobank and EyePACS 10K datasets, plotting the 10-year ASCVD risk when categorized according to the traditional PCE-calculated risk score or DL model predicted risk score. Unlike what was observed in the UK Biobank, when the 10-year ASCVD risk scores generated by the PCE and the DL model from the EyePACS 10K dataset were grouped into bins of ascending risk scores, the magnitude and distribution of the predicted 10-year ASCVD risk scores generated by the PCE and the DL model were quite different; mean error 3.6%, mean absolute error as measured by each index case was 4.4% [Figures 3b]. As the performance of the DL model assessed by the AUROC was very similar in both the UK Biobank (0.89) and EyePACS 10K (0.90) datasets, and the principle difference in the two datasets was low vs. high rate of diabetes, we hypothesized that it should be possible to regress a correction factor; a “diabetes modifier,” to transform the ASCVD risk score produced by our DL model to that produced by the PCE. Applied to the EyePACS 10K dataset, this post-hoc analysis showed that 10-year ASCVD risk scores generated by the DL model were substantially better aligned to the PCE after the diabetes modifier was applied: mean error −0.2% and mean absolute error 3.1% [Figure 3b]. The subsequent sensitivity and specificity of the “diabetes-modified” DL model to detect individuals with “elevated” risk in the EyePACS10K dataset were also improved at 94% and 72% respectively. Although further validation in other datasets is required, these findings indicate that by applying an appropriate “diabetes modifier”, our DL model can more accurately match the 10-year ASCVD risk produced by the PCE when used in people living with diabetes.

Regardless of diabetes type and status, the PCE equation currently treats all people living with diabetes as a uniform group and it effectively adds a modifier to the regression equation which serves to elevate their risk score compared to non-diabetics. Recently it has been suggested that the traditional regression-based equations, like the PCE, may overestimate ASCVD risk in many people living with diabetes ^32,33. An individual living with longstanding poorly controlled type 1 diabetes for example may have a very different vascular risk profile to an older person who has just been diagnosed with mild type 2 diabetes, and yet the PCE does not discriminate between the two; both “have diabetes”. Tools that better discriminate ASCVD risk in people living with diabetes are therefore required. Recently it has been demonstrated that retinopathy could be used to further refine ASCVD risk in people living with diabetes. Modjtahedi et al demonstrated that not only is diabetic retinopathy associated with future risk of ASCVD events and death, but the magnitude of this risk correlates with the level of retinopathy ³⁴. The authors hypothesize that retinopathy could be representative of wider macrovascular disease and conclude that future calculators, designed to stratify ASCVD risk in people living with diabetes, should incorporate detailed retinopathy input information to more accurately model macrovascular risk. Presently this retinopathy data can only be made available to developers and users of ASCVD risk calculators if the subject has a recent retinal image which has been appropriately graded. DL models like ours, designed to predict ASCVD risk based on retinal images and trained on datasets which contain detailed retinopathy results, could provide the ideal opportunity to capture this retinopathy data and incorporate these findings into ASCVD risk prediction. Further studies, designed to corroborate retinopathy findings and ASCVD risk scores with actual ASCVD events in individuals living with diabetes are therefore required. Until newer risk calculators that incorporate retinopathy data into the risk prediction tool are made available, we have demonstrated that it is possible to train a DL model on a general population and overlay a diabetes modifier which then enables it to more accurately and consistently match the ASCVD risk scores issued by the PCE to people with diabetes.

The application of DL algorithms to predict ASCVD risk-related outcomes from retinal images has been comprehensively reviewed by Hu et al. ³⁵. This review revealed that to date a heterogeneous array of models have been developed using a variety of different inputs: retinal images only, retinal images + various biodata, and reporting against different cardiac-related outcomes. Direct comparison between different models is therefore challenging, but one common theme to emerge was that few of these models have been validated on external datasets. Even fewer have utilized an external dataset derived from individuals with a very different racial/ethnic background. In the current study we trained and tested a DL model to predict the 10-year risk of ASCVD events in the UK Biobank and then further validated the results in an external database which had a very different racial/ethnic make up. To date only two other groups have reported the results of a DL model trained and then externally validated to predict the ASCVD 10-year risk from retinal photographs ^22,36,37. Our results support the accumulating evidence that indicates DL algorithms can use retinal images to accurately predict ASCVD risk and they compare favorably with the one other DL model that has been validated on the UK Biobank; which had sensitivity, specificity of 83%, 88%, respectively ^22,36,37.

Strengths and Limitations

The primary strengths of this study include: we have trained and validated a DL model designed to predict an individual’s ASCVD 10-year risk based on nothing more than a retinal photograph and limited demographic data; the DL model was not only able to reliably match the PCE scores, but also showed similar prediction to PCE of actual ASCVD events; and the DL model was externally validated on a very different dataset. The EyePACS 10K dataset, which although very similar in age, differed substantially in diabetes prevalence, ethnic/racial makeup, and geography (US vs. UK). Testing of the model across race/ethnicity and demographics within the test dataset revealed that the DL model performed consistently well across most groups. However, its ability to predict ASCVD risk was less accurate for individuals of “other races” and those individuals aged less than 60. The former probably reflects the fact that there are a relatively low number of individuals of other races represented in the UK Biobank dataset. The latter may reflect an age bias in that ASCVD risk increases with, and is strongly correlated, to age. Thus, in common with all DL models, one fundamental weakness of the current study is that our model may have limited generalizability to other datasets whose racial and demographic makeup differ to those seen in the UK Biobank and our results demonstrate the importance of ensuring that DL models designed for health care settings should be trained and validated in the designated target population. Our DL model performed similarly (by AUROC) on the external EyePACS10K dataset as it did on the UK Biobank, and we investigated the application of a “diabetes modifier” to the output of our DL model to better match the sensitivity and specificity of the PCE when applied to individuals living with diabetes. However, as there were no future hard ASCVD events in the EyePACS 10K dataset, we could not validate the DL model on this key metric. Our results will therefore need to be replicated on other external datasets before it can be considered a clinically useful tool.

In the development of our DL model we relied on a systolic hypertension DL model as the sole measure of blood pressure. It is feasible that a model that combined both SBP and diastolic blood pressure may have yielded more accurate results than SBP alone. Finally, it is highly probable that the ground truth for the smoking DL model, namely self-reported smoking status, lacks robustness as it is widely accepted that individuals tend to underreport their smoking habit ^38,39. As the PCE equation places a high weighting on an individual’s smoking status, any deficiencies in the smoking model could have a significant impact on the final predicted ASCVD risk.

Conclusion

In conclusion, our results show that it is possible to train a DL model that can assess ASCVD risk as well as the traditional PCE method, using nothing more than a retinal photograph and limited demographic data. We have also shown that the application of a “diabetes modifier” to our DL model is a promising approach to matching the PCE in people living with diabetes. If these results can be replicated in other large population-based datasets, DL models like ours may offer the potential to significantly improve access to ASCVD risk detection strategies as the risk predictions these models produce do not require multiple clinical and laboratory assessments to generate an individual’s ASCVD risk score. As retinal photographs are routinely captured in optometry, ophthalmology, and some pharmacy clinics, it means these DL models can be deployed without significant additional investment, making this technology particularly relevant to lower-resource settings. As an additional ASCVD risk assessment screening tool, a DL prediction model has the potential to make initial ASCVD risk assessment more affordable and accessible to all, with the goal of improving patient outcomes through earlier detection and increasing the public awareness of ASCVD risk and prevention.

Data Availability

Databases used in this study are obtainable from their original data guardian, UK Biobank and EyePACS.

Disclosures

Ehsan Vaghefi and David Squirrell are the co-founders and employees of Toku Eyes. Li Xie, Song Yang, and Songyang An are employees of Toku Eyes. Mary K. Durbin and Huiyuan Hou are employees of Topcon. John Marshall, Jacqueline Shreibati, Michael V. McConnell and Matthew Budoff are consultants for Toku Eyes.

Sources of funding

Toku Eyes has funded this study.

Appendix 1

Definition of “clinical” ASCVD event used to identify events prior to the acquisition of the retinal image. This list defines exclusion criteria for the validation datasets.

Acute myocardial infarction was defined by ICD-9-CM codes 410.* and ICD-10-CM codes I21.*, I22.*, I23.3, I24.0, I24.9, I25.9, or I51.3

Angina: was defined by ICD-9-CM codes 413.* and ICD-10-CM codes I20.*, I23.7, I25.7, I125.11x

CABG/ PCI: were defined based on ICD-9-CM codes 414.02, 414.03, 414.04, 414.05, V45.81, V45.82 and ICD-10-CM codes Z95.1, T82.21.21*, I25.7* (exclude I25.75), I25.810, I25.812, Z98.61, Z95.5

Stroke events were defined based on ICD-9-CM codes 430.*, 431.*, 432.*, 433.*1, 434.*1, or 436.0 and ICD-10-CM codes G45, G46.*, I63.*, I67.85, I69.30, I77.89, P91.0, or Z86.73.

Definition of hard ASCVD event used to detect events subsequent to acquisition of the retinal image in the validation test datasets

Acute myocardial infarction was defined by ICD-9-CM codes 410.* and ICD-10-CM codes I21.*, I22.*, I23.3, I24.0, I24.9, I25.9, or I51.3

Stroke events were defined based on ICD-9-CM codes 430.*, 431.*, 432.*, 433.*1, 434.*1, or 436.0 and ICD-10-CM codes G46.*, I63.*, I67.85, I69.30, I77.89, P91.0, or Z86.73.

Fatal coronary artery disease was defined by the presence of an ICD-9-CM code 411.*, 413.*, or 414.* or an ICD-10-CM code I20.*, I23.7, I24.*, I25.*, or T82.85 code followed by death within a year.”

Appendix 2

Model development

The DL model pipeline encompasses approximately 50 distinct DL models, classified into two primary categories: image-based models and non-image-based models. The former uses image data as input, while the latter relies on vectorized interpretations of participant information including their biometrics. This dichotomy results in a diversity of input data and target formats within the pipeline. For instance, the systolic blood pressure (SBP) model utilizes a retinal image as input, with the corresponding SBP value serving as the target. This model’s function is independent of additional factors, thereby enabling it to be trained on samples missing the HbA1c value, provided the SBP data are accessible. Another model of note is the PCE ASCVD prediction model. It uses outputs from preceding image-based models, with the calculated PCE results serving as the target. However, the PCE equation requires multiple variables, including SBP, HbA1c, total cholesterol, and high-density lipoprotein (HDL) cholesterol. Only participants with all of the above-mentioned components available were used in this study.

Model ensemble

The DL prediction model ensemble used is demonstrated in Figure 2 and comprises 3 different levels:

Figure 1:

The structure of DL prediction model developed in this manuscript

Level 1

The first level includes an image quality check CNN (described above), a laterality (Left eye/Right eye) detector CNN, and an image location (fovea / non-fovea) detector CNN. The input of this layer is retinal images only. This process ensures that only foveal-centered images that are of sufficient quality are accepted into the model. Identifying the laterality of the image ensures that only a single fovea-centered image for each eye for each individual is used during the analysis. The final available dataset after requiring high- or medium-quality, fovea-centered images, one per eye, consisted of 89,894 images, representing 44,176 patients. This was divided into an 80/20 split for training and validation, respectively. The training dataset thus consisted of 76,321 images (representing 35,570 patients). Meanwhile, the test dataset comprised 19,080 images representing 8,606 patients who had valid biometrics. The demographics of the final dataset did not significantly differ from those that included images of low quality. The patient demographics of the test and training datasets were statistically compared with a two-sample Kolmogorov-Smirnov test to ensure that the demographic distribution of the training and test group were similar. There was no statistically meaningful difference between the two datasets, comparing the age, gender and race makeup of the two groups [Supplementary material]. The training of these models is explained elsewhere, and these models were not tuned or retrained for this study [26].

Level 2

The second level includes nine ensembles of AIs, each consisting of five CNNs (45 in total). The retinopathy, maculopathy, drusen, pigmentary abnormality, advanced AMD, and smoking CNNs were previously trained on other datasets [25-28]. The rest of the CNNs were trained using the unique UK Biobank labels in the fundus images:′ ′hba1c_result′, ′tchdl_result′, ′systolic_bp′, ′systolic_bp2′, ′smoking_status.′ Systolic_bp′ and ′systolic_bp2′ are two consecutive blood pressure measurements in the UK Biobank and in this study we used the mean of the two.

These CNNs follow modified versions of the Inception-Resnet-V2 or ResNet50 structures. Taking the single retinopathy CNN model as an example, the model has a deep structure, consisting of 164 layers, and uses a combination of inception and residual blocks. The inception blocks use a combination of convolutional layers with different filter sizes, while the residual blocks use skip connections to enable the model to learn from previous layers. We also employed batch normalization and bottleneck layers to improve training efficiency. Overall, the model architecture is designed to extract features at multiple scales and capture fine-grained details in images, making it well-suited to detect the level of retinopathy or other biomarkers. For each CNN in Level 2, the image plus biomarkers dataset was split for training, validation, and testing: 70%, 15%, 15% respectively. The excessive background of the fundus images was cropped, and the resulting image was resized to 800×800 pixels. A batch size of 8 was chosen to optimize GPU memory during training. Adam optimizer was adopted with a learning rate 1*10e-3 to update parameters towards the minimization of the loss. Dropout was enabled with a rate p = 0.2, and the model was trained for at least 100 EPOCHs. All codes related to this work were implemented using Python 3.7.

Additionally, we developed a complex jury system to arrive at the ultimate prediction for each biomarker. To elaborate, using the retinopathy model as an example, there exist six distinct levels of retinopathy (R0-R5). Five jury models were employed to assess each eye, resulting in 30 probability values per eye. These probabilities were merged and consolidated for both eyes, thereby yielding a final value for each patient.

Level 3

The third level is a Multi-Layer Perceptron (MLP), which uses the output of the second level CNNs, plus the patient’s chronological age, gender, and race to estimate their PCE ASCVD risk score. This PCE-derived ASCVD risk score is the ground-truth label, calculated from the relevant 9 fields in the UK Biobank dataset for each participant (age, sex, race, smoking status, blood pressure, diabetes, serum total cholesterol, HDL cholesterol, blood pressure lowering medication https://tools.acc.org/asASCVD-risk-estimator-plus/#!/calculate/estimate/). The architecture of the model comprises an input layer, followed by five dense layers that exhibit a gradual decrease in neuron counts, namely 1024, 512, 256, 128, and 32. These layers are interspersed with batch normalization and LeakyReLU activation functions with a leaky rate of 0.1. To address overfitting concerns, dropout layers with a rate of 0.3 were incorporated after the third, fourth, and fifth dense layers. The ultimate layer, encompassing a single neuron and a linear activation function, predicts the target value. For optimization purposes, an Adam optimizer is utilized with an exponentially decaying learning rate schedule, initialized at 3e-3 and decaying by a factor of 0.95 every 1000 steps. The Huber loss function was employed to guide the model parameters updating. To curb overfitting and ensure efficient training, early stopping was implemented.

Appendix 3

The summary list of ethnicities for the UK Biobank and EyePACS 10k studies

View this table:

References

1.↵
Control CfD, Prevention %J CDC WONDER Online Database. Atlanta GCfDC, Prevention. Underlying cause of death, 1999–2018. 2018.
2.↵
Almourani R, Chinnakotla B, Patel R, Kurukulasuriya LR, Sowers J. Diabetes and Cardiovascular Disease: an Update. Curr Diab Rep. 2019;19:161. doi: 10.1007/s11892-019-1239-x
OpenUrl CrossRef
3.↵
Kjeldsen SE. Hypertension and cardiovascular risk: General aspects. Pharmacol Res. 2018;129:95–99. doi: 10.1016/j.phrs.2017.11.003
OpenUrl CrossRef PubMed
4.↵
Alloubani A, Nimer R, Samara RJCCR. Relationship between hyperlipidemia, cardiovascular disease and stroke: a systematic review. Current Cardiology Reviews. 2021;17:52–66.
OpenUrl
5.↵
Kondo T, Nakano Y, Adachi S, Murohara T. Effects of Tobacco Smoking on Cardiovascular Disease. Circ J. 2019;83:1980–1985. doi: 10.1253/circj.CJ-19-0323
OpenUrl CrossRef
6.↵
Lloyd-Jones DM, Braun LT, Ndumele CE, Smith Jr SC, Sperling LS, Virani SS, Blumenthal RSJC. Use of risk assessment tools to guide decision-making in the primary prevention of atherosclerotic cardiovascular disease: a special report from the American Heart Association and American College of Cardiology. Circulation. 2019;139:e1162–e1177.
OpenUrl CrossRef PubMed
7.↵
Mahmood SS, Levy D, Vasan RS, Wang TJ. The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective. Lancet. 2014;383:999–1008. doi: 10.1016/S0140-6736(13)61752-3
OpenUrl CrossRef PubMed
8.↵
Kavousi M, Leening MJ, Nanchen D, Greenland P, Graham IM, Steyerberg EW, Ikram MA, Stricker BH, Hofman A, Franco OH. Comparison of application of the ACC/AHA guidelines, Adult Treatment Panel III guidelines, and European Society of Cardiology guidelines for cardiovascular disease prevention in a European cohort. JAMA. 2014;311:1416–1423. doi: 10.1001/jama.2014.2632
OpenUrl CrossRef PubMed Web of Science
9.
DeFilippis AP, Young R, Carrubba CJ, McEvoy JW, Budoff MJ, Blumenthal RS, Kronmal RA, McClelland RL, Nasir K, Blaha MJJAoim. An analysis of calibration and discrimination among multiple cardiovascular risk scores in a modern multiethnic cohort. Annals of Internal Medicine. 2015;162:266–275.
OpenUrl CrossRef PubMed
10.
Comín E, Solanas P, Cabezas C, Subirana I, Ramos R, Gené-Badia J, Cordón F, Grau M, Cabré-Vila JJ, Marrugat JJREdC. Estimating cardiovascular risk in Spain using different algorithms. Revista Española de Cardiología (English Edition). 2007;60:693–702.
OpenUrl
11.↵
Goff Jr DC, Lloyd-Jones DM, Bennett G, Coady S, D’agostino RB, Gibbons R, Greenland P, Lackland DT, Levy D, O’donnell CJJC. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2014;129:S49–S73.
OpenUrl FREE Full Text
12.
D’Agostino RB, Sr., Grundy S, Sullivan LM, Wilson P, Group CHDRP. Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. JAMA. 2001;286:180–187. doi: 10.1001/jama.286.2.180
OpenUrl CrossRef PubMed Web of Science
13.
Beswick A, Brindle P, Fahey T, Ebrahim S. A systematic review of risk scoring methods and clinical decision aids used in the primary prevention of coronary heart disease (supplement). europepmc. 2011.
14.
Brindle P, Beswick A, Fahey T, Ebrahim S. Accuracy and impact of risk assessment in the primary prevention of cardiovascular disease: a systematic review. Heart. 2006;92:1752–1759. doi: 10.1136/hrt.2006.087932
OpenUrl Abstract/FREE Full Text
15.
Eichler K, Puhan MA, Steurer J, Bachmann LM. Prediction of first coronary events with the Framingham score: a systematic review. Am Heart J. 2007;153:722–731, 731 e721-728. doi: 10.1016/j.ahj.2007.02.027
OpenUrl CrossRef PubMed Web of Science
16.↵
Grundy SM, Stone NJ, Bailey AL, Beam C, Birtcher KK, Blumenthal RS, Braun LT, de Ferranti S, Faiella-Tommasino J, Forman DE, et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA Guideline on the Management of Blood Cholesterol: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation. 2019;139:e1046–e1081. doi: 10.1161/CIR.0000000000000624
OpenUrl CrossRef PubMed
17.↵
Wang S, Yin Y, Cao G, Wei B, Zheng Y, Yang G. Hierarchical retinal blood vessel segmentation based on feature and ensemble learning. Neurocomputing. 2015;149:708–717. doi: 10.1016/j.neucom.2014.07.059
OpenUrl CrossRef
18.
Roy PK, Nguyen UT, Bhuiyan A, Ramamohanarao K. An effective automated system for grading severity of retinal arteriovenous nicking in colour retinal images. Paper/Poster presented at: 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society; 2014;
19.↵
Mishra V, Samuel C, Sharma SJIJRAME. Use of machine learning to predict the onset of diabetes. Int J Recent Adv Mech Eng. 2015;4.
20.↵
Poplin R, Varadarajan AV, Blumer K, Liu Y, McConnell MV, Corrado GS, Peng L, Webster DR. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng. 2018;2:158–164. doi: 10.1038/s41551-018-0195-0
OpenUrl CrossRef
21.
Rim TH, Lee CJ, Tham YC, Cheung N, Yu M, Lee G, Kim Y, Ting DSW, Chong CCY, Choi YS, et al. Deep-learning-based cardiovascular risk stratification using coronary artery calcium scores predicted from retinal photographs. Lancet Digit Health. 2021;3:e306–e316. doi: 10.1016/S2589-7500(21)00043-1
OpenUrl CrossRef
22.↵
Rim TH, Lee G, Kim Y, Tham YC, Lee CJ, Baik SJ, Kim YA, Yu M, Deshmukh M, Lee BK, et al. Prediction of systemic biomarkers from retinal photographs: development and validation of deep-learning algorithms. Lancet Digit Health. 2020;2:e526–e536. doi: 10.1016/S2589-7500(20)30216-8
OpenUrl CrossRef
23.↵
Miyazawa AA. Artificial intelligence: the future for cardiology. Heart. 2019;105:1214. doi: 10.1136/heartjnl-2018-314464
OpenUrl FREE Full Text
24.↵
Grundy SM, Stone NJJJc. 2018 American Heart Association/American College of Cardiology/Multisociety Guideline on the Management of Blood Cholesterol–Secondary Prevention. Circulation. 2019;4:589–591.
OpenUrl
25.↵
Vaghefi E, Yang S, Hill S, Humphrey G, Walker N, Squirrell D. Detection of smoking status from retinal images; a Convolutional Neural Network study. Sci Rep. 2019;9:7180. doi: 10.1038/s41598-019-43670-0
OpenUrl CrossRef
26.
Vaghefi E, Yang S, Xie L, Han D, Yap A, Squirrell D. A multi-centre prospective trial of THEIAT to detect diabetic retinopathy and diabetic macular oedema in the New Zealand screening program. Paper/Poster presented at: CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY; 2022;
27.↵
Vaghefi E, Yang S, Xie L, Hill S, Schmiedel O, Murphy R, Squirrell DJDM. THEIA™ development, and testing of artificial intelligence-based primary triage of diabetic retinopathy screening images in New Zealand. Diabetic Medicine. 2021;38:e14386.
OpenUrl
28.↵
Xie L, Yang S, Squirrell D, Vaghefi E. Towards implementation of AI in New Zealand national diabetic screening program: Cloud-based, robust, and bespoke. PLoS One. 2020;15:e0225015. doi: 10.1371/journal.pone.0225015
OpenUrl CrossRef
29.↵
Pearson GJ, Thanassoulis G, Anderson TJ, Barry AR, Couture P, Dayan N, Francis GA, Genest J, Gregoire J, Grover SA, et al. 2021 Canadian Cardiovascular Society Guidelines for the Management of Dyslipidemia for the Prevention of Cardiovascular Disease in Adults. Can J Cardiol. 2021;37:1129–1150. doi: 10.1016/j.cjca.2021.03.016
OpenUrl CrossRef PubMed
30.↵
What is Binning in Data Mining? Javat Point https://www.javatpoint.com/what-is-binning-in-data-mining#:~:text=Binning%2C%20also%20called%20discretization%2C%20is,the%20number%20of%20distinct%20values. 2023.
31.↵
Tsur A, Batsry L, Toussia-Cohen S, Rosenstein MG, Barak O, Brezinov Y, Yoeli-Ullman R, Sivan E, Sirota M, Druzin ML, et al. Development and validation of a machine-learning model for prediction of shoulder dystocia. Ultrasound Obstet Gynecol. 2020;56:588–596. doi: 10.1002/uog.21878
OpenUrl CrossRef
32.↵
Pylypchuk R, Wells S, Kerr A, Poppe K, Harwood M, Mehta S, Grey C, Wu BP, Selak V, Drury PLJTL. Cardiovascular risk prediction in type 2 diabetes before and after widespread screening: a derivation and validation study. The Lancet. 2021;397:2264–2274.
OpenUrl
33.↵
Arnett DK. Widespread diabetes screening for cardiovascular disease risk estimation. Lancet. 2021;397:2228–2230. doi: 10.1016/S0140-6736(21)00764-9
OpenUrl CrossRef
34.↵
Modjtahedi BS, Wu J, Luong TQ, Gandhi NK, Fong DS, Chen W. Severity of Diabetic Retinopathy and the Risk of Future Cerebrovascular Disease, Cardiovascular Disease, and All-Cause Mortality. Ophthalmology. 2021;128:1169–1179. doi: 10.1016/j.ophtha.2020.12.019
OpenUrl CrossRef
35.↵
Hu W, Yii FSL, Chen R, Zhang X, Shang X, Kiburg K, Woods E, Vingrys A, Zhang L, Zhu Z, et al. A Systematic Review and Meta-Analysis of Applying Deep Learning in the Prediction of the Risk of Cardiovascular Diseases From Retinal Images. Transl Vis Sci Technol. 2023;12:14. doi: 10.1167/tvst.12.7.14
OpenUrl CrossRef
36.↵
Yi JK, Rim TH, Park S, Kim SS, Kim HC, Lee CJ, Kim H, Lee G, Lim JSG, Tan YY, et al. Cardiovascular disease risk assessment using a deep-learning-based retinal biomarker: a comparison with existing risk scores. Eur Heart J Digit Health. 2023;4:236–244. doi: 10.1093/ehjdh/ztad023
OpenUrl CrossRef
37.↵
Ma Y, Xiong J, Zhu Y, Ge Z, Hua R, Fu M, Li C, Wang B, Dong L, Zhao X, et al. Deep learning algorithm using fundus photographs for 10-year risk assessment of ischemic cardiovascular diseases in China. Sci Bull (Beijing). 2022;67:17–20. doi: 10.1016/j.scib.2021.08.016
OpenUrl CrossRef
38.↵
Boyd NR, Windsor RA, Perkins LL, Lowe JB. Quality of measurement of smoking status by self-report and saliva cotinine among pregnant women. Matern Child Health J. 1998;2:77–83. doi: 10.1023/a:1022936705438
OpenUrl CrossRef PubMed
39.↵
Curry LE, Richardson A, Xiao H, Niaura RS. Nondisclosure of smoking status to health care providers among current and former smokers in the United States. Health Educ Behav. 2013;40:266–273. doi: 10.1177/1090198112454284
OpenUrl CrossRef PubMed

View the discussion thread.

Posted September 22, 2023.

Download PDF

Data/Code

Citation Tools

Subject Area

Cardiovascular Medicine

Subject Areas

All Articles

Addiction Medicine (403)
Allergy and Immunology (712)
Anesthesia (207)
Cardiovascular Medicine (2969)
Dentistry and Oral Medicine (336)
Dermatology (253)
Emergency Medicine (445)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1049)
Epidemiology (12807)
Forensic Medicine (12)
Gastroenterology (830)
Genetic and Genomic Medicine (4621)
Geriatric Medicine (423)
Health Economics (732)
Health Informatics (2941)
Health Policy (1073)
Health Systems and Quality Improvement (1092)
Hematology (393)
HIV/AIDS (932)
Infectious Diseases (except HIV/AIDS) (14143)
Intensive Care and Critical Care Medicine (853)
Medical Education (430)
Medical Ethics (116)
Nephrology (475)
Neurology (4408)
Nursing (238)
Nutrition (649)
Obstetrics and Gynecology (817)
Occupational and Environmental Health (739)
Oncology (2295)
Ophthalmology (652)
Orthopedics (260)
Otolaryngology (327)
Pain Medicine (281)
Palliative Medicine (84)
Pathology (502)
Pediatrics (1200)
Pharmacology and Therapeutics (509)
Primary Care Research (502)
Psychiatry and Clinical Psychology (3799)
Public and Global Health (7004)
Radiology and Imaging (1544)
Rehabilitation Medicine and Physical Therapy (920)
Respiratory Medicine (921)
Rheumatology (444)
Sexual and Reproductive Health (445)
Sports Medicine (386)
Surgery (491)
Toxicology (60)
Transplantation (212)
Urology (182)

[1] 1.↵
Control CfD, Prevention %J CDC WONDER Online Database. Atlanta GCfDC, Prevention. Underlying cause of death, 1999–2018. 2018.

[2] 2.↵
Almourani R, Chinnakotla B, Patel R, Kurukulasuriya LR, Sowers J. Diabetes and Cardiovascular Disease: an Update. Curr Diab Rep. 2019;19:161. doi: 10.1007/s11892-019-1239-x
OpenUrl CrossRef

[3] 3.↵
Kjeldsen SE. Hypertension and cardiovascular risk: General aspects. Pharmacol Res. 2018;129:95–99. doi: 10.1016/j.phrs.2017.11.003
OpenUrl CrossRef PubMed

[4] 4.↵
Alloubani A, Nimer R, Samara RJCCR. Relationship between hyperlipidemia, cardiovascular disease and stroke: a systematic review. Current Cardiology Reviews. 2021;17:52–66.
OpenUrl

[5] 5.↵
Kondo T, Nakano Y, Adachi S, Murohara T. Effects of Tobacco Smoking on Cardiovascular Disease. Circ J. 2019;83:1980–1985. doi: 10.1253/circj.CJ-19-0323
OpenUrl CrossRef

[6] 6.↵
Lloyd-Jones DM, Braun LT, Ndumele CE, Smith Jr SC, Sperling LS, Virani SS, Blumenthal RSJC. Use of risk assessment tools to guide decision-making in the primary prevention of atherosclerotic cardiovascular disease: a special report from the American Heart Association and American College of Cardiology. Circulation. 2019;139:e1162–e1177.
OpenUrl CrossRef PubMed

[7] 7.↵
Mahmood SS, Levy D, Vasan RS, Wang TJ. The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective. Lancet. 2014;383:999–1008. doi: 10.1016/S0140-6736(13)61752-3
OpenUrl CrossRef PubMed

[8] 8.↵
Kavousi M, Leening MJ, Nanchen D, Greenland P, Graham IM, Steyerberg EW, Ikram MA, Stricker BH, Hofman A, Franco OH. Comparison of application of the ACC/AHA guidelines, Adult Treatment Panel III guidelines, and European Society of Cardiology guidelines for cardiovascular disease prevention in a European cohort. JAMA. 2014;311:1416–1423. doi: 10.1001/jama.2014.2632
OpenUrl CrossRef PubMed Web of Science

[9] 9.
DeFilippis AP, Young R, Carrubba CJ, McEvoy JW, Budoff MJ, Blumenthal RS, Kronmal RA, McClelland RL, Nasir K, Blaha MJJAoim. An analysis of calibration and discrimination among multiple cardiovascular risk scores in a modern multiethnic cohort. Annals of Internal Medicine. 2015;162:266–275.
OpenUrl CrossRef PubMed

[10] 10.
Comín E, Solanas P, Cabezas C, Subirana I, Ramos R, Gené-Badia J, Cordón F, Grau M, Cabré-Vila JJ, Marrugat JJREdC. Estimating cardiovascular risk in Spain using different algorithms. Revista Española de Cardiología (English Edition). 2007;60:693–702.
OpenUrl

[11] 11.↵
Goff Jr DC, Lloyd-Jones DM, Bennett G, Coady S, D’agostino RB, Gibbons R, Greenland P, Lackland DT, Levy D, O’donnell CJJC. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2014;129:S49–S73.
OpenUrl FREE Full Text

[12] 12.
D’Agostino RB, Sr., Grundy S, Sullivan LM, Wilson P, Group CHDRP. Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. JAMA. 2001;286:180–187. doi: 10.1001/jama.286.2.180
OpenUrl CrossRef PubMed Web of Science

[13] 13.
Beswick A, Brindle P, Fahey T, Ebrahim S. A systematic review of risk scoring methods and clinical decision aids used in the primary prevention of coronary heart disease (supplement). europepmc. 2011.

[14] 14.
Brindle P, Beswick A, Fahey T, Ebrahim S. Accuracy and impact of risk assessment in the primary prevention of cardiovascular disease: a systematic review. Heart. 2006;92:1752–1759. doi: 10.1136/hrt.2006.087932
OpenUrl Abstract/FREE Full Text

[15] 15.
Eichler K, Puhan MA, Steurer J, Bachmann LM. Prediction of first coronary events with the Framingham score: a systematic review. Am Heart J. 2007;153:722–731, 731 e721-728. doi: 10.1016/j.ahj.2007.02.027
OpenUrl CrossRef PubMed Web of Science

[16] 16.↵
Grundy SM, Stone NJ, Bailey AL, Beam C, Birtcher KK, Blumenthal RS, Braun LT, de Ferranti S, Faiella-Tommasino J, Forman DE, et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA Guideline on the Management of Blood Cholesterol: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation. 2019;139:e1046–e1081. doi: 10.1161/CIR.0000000000000624
OpenUrl CrossRef PubMed

[17] 17.↵
Wang S, Yin Y, Cao G, Wei B, Zheng Y, Yang G. Hierarchical retinal blood vessel segmentation based on feature and ensemble learning. Neurocomputing. 2015;149:708–717. doi: 10.1016/j.neucom.2014.07.059
OpenUrl CrossRef

[18] 18.
Roy PK, Nguyen UT, Bhuiyan A, Ramamohanarao K. An effective automated system for grading severity of retinal arteriovenous nicking in colour retinal images. Paper/Poster presented at: 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society; 2014;

[19] 19.↵
Mishra V, Samuel C, Sharma SJIJRAME. Use of machine learning to predict the onset of diabetes. Int J Recent Adv Mech Eng. 2015;4.

[20] 20.↵
Poplin R, Varadarajan AV, Blumer K, Liu Y, McConnell MV, Corrado GS, Peng L, Webster DR. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng. 2018;2:158–164. doi: 10.1038/s41551-018-0195-0
OpenUrl CrossRef

[21] 21.
Rim TH, Lee CJ, Tham YC, Cheung N, Yu M, Lee G, Kim Y, Ting DSW, Chong CCY, Choi YS, et al. Deep-learning-based cardiovascular risk stratification using coronary artery calcium scores predicted from retinal photographs. Lancet Digit Health. 2021;3:e306–e316. doi: 10.1016/S2589-7500(21)00043-1
OpenUrl CrossRef

[22] 22.↵
Rim TH, Lee G, Kim Y, Tham YC, Lee CJ, Baik SJ, Kim YA, Yu M, Deshmukh M, Lee BK, et al. Prediction of systemic biomarkers from retinal photographs: development and validation of deep-learning algorithms. Lancet Digit Health. 2020;2:e526–e536. doi: 10.1016/S2589-7500(20)30216-8
OpenUrl CrossRef

[23] 23.↵
Miyazawa AA. Artificial intelligence: the future for cardiology. Heart. 2019;105:1214. doi: 10.1136/heartjnl-2018-314464
OpenUrl FREE Full Text

[24] 24.↵
Grundy SM, Stone NJJJc. 2018 American Heart Association/American College of Cardiology/Multisociety Guideline on the Management of Blood Cholesterol–Secondary Prevention. Circulation. 2019;4:589–591.
OpenUrl

[25] 25.↵
Vaghefi E, Yang S, Hill S, Humphrey G, Walker N, Squirrell D. Detection of smoking status from retinal images; a Convolutional Neural Network study. Sci Rep. 2019;9:7180. doi: 10.1038/s41598-019-43670-0
OpenUrl CrossRef

[26] 26.
Vaghefi E, Yang S, Xie L, Han D, Yap A, Squirrell D. A multi-centre prospective trial of THEIAT to detect diabetic retinopathy and diabetic macular oedema in the New Zealand screening program. Paper/Poster presented at: CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY; 2022;

[27] 27.↵
Vaghefi E, Yang S, Xie L, Hill S, Schmiedel O, Murphy R, Squirrell DJDM. THEIA™ development, and testing of artificial intelligence-based primary triage of diabetic retinopathy screening images in New Zealand. Diabetic Medicine. 2021;38:e14386.
OpenUrl

[28] 28.↵
Xie L, Yang S, Squirrell D, Vaghefi E. Towards implementation of AI in New Zealand national diabetic screening program: Cloud-based, robust, and bespoke. PLoS One. 2020;15:e0225015. doi: 10.1371/journal.pone.0225015
OpenUrl CrossRef

[29] 29.↵
Pearson GJ, Thanassoulis G, Anderson TJ, Barry AR, Couture P, Dayan N, Francis GA, Genest J, Gregoire J, Grover SA, et al. 2021 Canadian Cardiovascular Society Guidelines for the Management of Dyslipidemia for the Prevention of Cardiovascular Disease in Adults. Can J Cardiol. 2021;37:1129–1150. doi: 10.1016/j.cjca.2021.03.016
OpenUrl CrossRef PubMed

[30] 30.↵
What is Binning in Data Mining? Javat Point https://www.javatpoint.com/what-is-binning-in-data-mining#:~:text=Binning%2C%20also%20called%20discretization%2C%20is,the%20number%20of%20distinct%20values. 2023.

[31] 31.↵
Tsur A, Batsry L, Toussia-Cohen S, Rosenstein MG, Barak O, Brezinov Y, Yoeli-Ullman R, Sivan E, Sirota M, Druzin ML, et al. Development and validation of a machine-learning model for prediction of shoulder dystocia. Ultrasound Obstet Gynecol. 2020;56:588–596. doi: 10.1002/uog.21878
OpenUrl CrossRef

[32] 32.↵
Pylypchuk R, Wells S, Kerr A, Poppe K, Harwood M, Mehta S, Grey C, Wu BP, Selak V, Drury PLJTL. Cardiovascular risk prediction in type 2 diabetes before and after widespread screening: a derivation and validation study. The Lancet. 2021;397:2264–2274.
OpenUrl

[33] 33.↵
Arnett DK. Widespread diabetes screening for cardiovascular disease risk estimation. Lancet. 2021;397:2228–2230. doi: 10.1016/S0140-6736(21)00764-9
OpenUrl CrossRef

[34] 34.↵
Modjtahedi BS, Wu J, Luong TQ, Gandhi NK, Fong DS, Chen W. Severity of Diabetic Retinopathy and the Risk of Future Cerebrovascular Disease, Cardiovascular Disease, and All-Cause Mortality. Ophthalmology. 2021;128:1169–1179. doi: 10.1016/j.ophtha.2020.12.019
OpenUrl CrossRef

[35] 35.↵
Hu W, Yii FSL, Chen R, Zhang X, Shang X, Kiburg K, Woods E, Vingrys A, Zhang L, Zhu Z, et al. A Systematic Review and Meta-Analysis of Applying Deep Learning in the Prediction of the Risk of Cardiovascular Diseases From Retinal Images. Transl Vis Sci Technol. 2023;12:14. doi: 10.1167/tvst.12.7.14
OpenUrl CrossRef

[36] 36.↵
Yi JK, Rim TH, Park S, Kim SS, Kim HC, Lee CJ, Kim H, Lee G, Lim JSG, Tan YY, et al. Cardiovascular disease risk assessment using a deep-learning-based retinal biomarker: a comparison with existing risk scores. Eur Heart J Digit Health. 2023;4:236–244. doi: 10.1093/ehjdh/ztad023
OpenUrl CrossRef

[37] 37.↵
Ma Y, Xiong J, Zhu Y, Ge Z, Hua R, Fu M, Li C, Wang B, Dong L, Zhao X, et al. Deep learning algorithm using fundus photographs for 10-year risk assessment of ischemic cardiovascular diseases in China. Sci Bull (Beijing). 2022;67:17–20. doi: 10.1016/j.scib.2021.08.016
OpenUrl CrossRef

[38] 38.↵
Boyd NR, Windsor RA, Perkins LL, Lowe JB. Quality of measurement of smoking status by self-report and saliva cotinine among pregnant women. Matern Child Health J. 1998;2:77–83. doi: 10.1023/a:1022936705438
OpenUrl CrossRef PubMed

[39] 39.↵
Curry LE, Richardson A, Xiao H, Niaura RS. Nondisclosure of smoking status to health care providers among current and former smokers in the United States. Health Educ Behav. 2013;40:266–273. doi: 10.1177/1090198112454284
OpenUrl CrossRef PubMed

Development and validation of a deep-learning model to predict 10-year ASCVD risk from retinal images using the UK Biobank and EyePACS 10K datasets

Abstract

Introduction

Methods

Datasets

Inclusion/ Exclusion Criteria

Model assessment

Receiver operator curves (ROC), sensitivity, and specificity metrics

Comparison of the performance of the DL model and the PCE equation to predict 10-year ASCVD risk and events

Results

Relationship between ASCVD risk scores generated and observed ASCVD event rates using the traditional PCE score, and the DL model predicted score

1. UK Biobank

2. EyePACS 10K dataset

Discussion

Strengths and Limitations

Conclusion

Data Availability

Disclosures

Sources of funding

Appendix 1

Appendix 2

Model development

Model ensemble

Level 1

Level 2

Level 3

Appendix 3

References

Citation Manager Formats

Subject Area