Abstract
Atherosclerotic cardiovascular disease, the leading cause of global mortality, is driven by lipid accumulation and plaque formation within arterial walls. Carotid plaques, detectable via ultrasound, are a well-established marker of subclinical atherosclerosis. In this study, we trained a deep learning model to detect plaques in 177,757 carotid ultrasound images from 19,499 UK Biobank (UKB) participants (aged 47-83 years) to assess the prevalence, risk factors, prognostic significance, and genetic architecture of carotid atherosclerosis in a large population-based cohort. The model demonstrated high performance metrics with accuracy, sensitivity, specificity, and positive predictive value of 89.3%, 89.5%, 89.2%, and 82.9%, respectively, identifying carotid plaques in 45% of the population. Plaque presence and count were significantly associated with future cardiovascular events over a median follow-up period of up to 7 years, leading to improved risk reclassification beyond established clinical prediction models. A genome-wide association study (GWAS) meta-analysis of carotid plaques (29,790 cases, 36,847 controls) uncovered two novel genomic loci (p < 5×10−8) with downstream analyses implicating lipoprotein(a) and interleukin-6 signaling, both targets of investigational drugs in advanced clinical development. Observational and Mendelian randomization analyses showed associations between smoking, low-density-lipoprotein (LDL) cholesterol, and high blood pressure and the odds of carotid plaque presence. Our study underscores the potential of carotid plaque assessment for improving cardiovascular risk prediction, provides novel insights into the genetic basis of subclinical atherosclerosis, and offers a valuable resource for advancing atherosclerosis research at the population scale.
Main
Atherosclerosis, characterized by lipid accumulation and plaque formation within arterial walls1, is the primary condition underlying cardiovascular disease (CVD), the leading cause of global mortality and morbidity2,3. Despite significant advancements in pharmacotherapies for lipid lowering and the management of other vascular risk factors such as diabetes and hypertension, the alarmingly high and rising prevalence of CVD highlight the need for novel risk assessment and preventive strategies4,5. Atherosclerosis is a chronic disease with subclinical atherosclerotic lesions developing silently over decades. Current CVD risk assessment tools used in clinical practice, including the Pooled Cohort Equations (PCE)6, Framingham Risk Score7, and Systematic Coronary risk Evaluation (SCORE8 and SCORE2)9,10, rely on demographic, clinical, and biochemical factors but do not account for the presence of subclinical atherosclerosis11,12. Imaging studies that enable screening for subclinical atherosclerotic lesions in asymptomatic individuals suggest that atherosclerotic plaques are highly prevalent, even among individuals traditionally considered at low CVD risk13–18. Coronary artery calcium (CAC) scoring on computed tomography (CT) has gained traction for assessing subclinical atherosclerosis; however, it faces limitations as a screening tool due to ionizing radiation exposure and its relatively high costs for widespread application19,20.
In contrast, carotid ultrasound offers a non-invasive, radiation-free, and widely accessible modality for assessing subclinical atherosclerotic lesions19,21. While traditional assessment of carotid intima-media thickness (cIMT) does not reliably predict incident CVD risk22, the detection of carotid atherosclerotic plaques is associated with an increased risk of future events21,23,24. Despite promising results, it remains uncertain whether screening for carotid plaques could reclassify asymptomatic individuals into higher-risk categories that justify the initiation of preventive pharmacotherapies25. Many published studies are constrained by relatively small sample sizes or insufficient follow-up24,26–30.
Several large-scale population-based cohorts have incorporated carotid ultrasound imaging into their data collection processes,31–33 but evaluating plaque presence across thousands of images remains a labor-intensive task. Recent advancements in deep learning have enhanced medical imaging analysis, enabling greater precision and the automation of processing large volumes of imaging data34,35. Although previous deep learning models for ultrasound images have shown potential in various tasks, including carotid wall and plaque segmentation36–38, these studies have primarily focused on specific populations, such as stroke patients or individuals with known carotid artery disease, which limits the generalizability of their findings to the broader population. Automating carotid plaque assessment in large population-based cohorts could allow the integration of this phenotype with genetic, omics, other imaging modalities, and clinical data collected in the context of these studies. This would facilitate in-depth research into the biology of subclinical atherosclerosis, thus enabling explorations into the natural history of the disease and potentially uncovering novel drug targets.
Here, we introduce a computer vision model designed to detect atherosclerotic plaques, applied to the largest dataset of carotid artery ultrasound images to date. We utilized ultrasound images from 19,499 deeply phenotyped participants in the UK Biobank (UKB), a large-scale population-based cohort. This study marks the first application of deep learning at a population level for assessing subclinical atherosclerosis using carotid ultrasound, offering a valuable resource for exploring the biology of atherosclerosis with implications for CVD risk assessment. Our model demonstrated high performance in both detecting plaques and quantifying their counts. We leveraged the model’s predictions to: (1) estimate the prevalence of carotid atherosclerotic lesions; (2) identify predictors of carotid plaque presence; (3) examine the associations of plaque presence and count with the risk of future CVD events; (4) assess potential improvements in CVD risk prediction and reclassification compared to traditional clinical tools; and (5) investigate the genetic underpinnings of carotid atherosclerosis (Figure 1).
Results
Study population
A total of 177,757 images from 19,499 participants who underwent carotid ultrasound during the first imaging visit of the UKB, were available for analysis (Supplementary Figure 1). The protocol for carotid artery examination has been described previously39. For the current study, we used 38,732 images obtained in the longitudinal axis of the left and right distal common carotid artery and the bifurcation, allowing for the assessment of plaque presence along the vessel wall. The demographic and medical characteristics of the study participants are presented in Table 1. The mean age at the time of the carotid ultrasound examination was 64.6 years (SD = 7.59), and 50.8% of the participants were female. A total of 1,381 (7.1%) study participants had a baseline diagnosis of atherosclerotic CVD. A comparison between UKB participants with carotid ultrasound and the rest of the UKB cohort revealed a lower prevalence of CVD risk factors (Supplementary Table 1).
Plaque detection model
To train a deep learning model for the detection of atherosclerotic plaques, we manually annotated plaques in 680 randomly selected carotid ultrasound images. Plaques were defined as focal protrusions in the arterial lumen with a thickness greater than 50% of the surrounding carotid intima-media thickness40. A plaque was present in 253 of these images. We performed transfer learning with fine-tuning by employing a pre-trained YOLOv841 object detector as the foundation for developing the plaque detection model (Figure 2A). The YOLOv8 object detection algorithm generates bounding boxes to indicate the locations of objects of interest. The images with manually annotated plaques were randomly divided into training, validation, and test sets in a 0.725/0.125/0.15 ratio. This distribution maximized the training set while ensuring a sufficient number of images for assessing model performance in the test set. We evaluated the model’s performance by training it on several subsets of our input development dataset (training + validation) in 5-fold cross-validation. The consistent performance metrics, with minimal variations in precision and recall, indicated no significant signs of overfitting (Supplementary Figure 2).
After training, the performance of the model was evaluated in a blind test set of 103 images, of which 38 contained at least one plaque (53 plaques in total). The model achieved high classification metrics for plaque presence at the image level, with an accuracy, sensitivity, specificity, and Positive Predictive Value (PPV) of 89.3%, 89.5%, 89.2%, and 82.9% (Figure 2B), respectively, at an iteratively tuned confidence score threshold of 13%. The confidence score measures the model’s certainty that a box contains an object of interest and was tuned to optimize and balance accuracy, sensitivity, and specificity. The Mean Average Precision at an Intersection over Union (IoU) threshold of 50% (mAP@50) was 68.4%, indicating the precision with which the model can localize objects with at least 50% overlap with the ground truth. The model’s detection precision and recall were 70.3% and 71.7%, respectively. Prediction examples are illustrated in Figure 2C and Supplementary Figure 3.
Prevalence and risk factors of carotid plaques in the UK Biobank
Next, we deployed the model on all available long-axis carotid ultrasound images from the UKB cohort (38,732 images, 19,499 individuals, Supplementary Figure 1). This deployment allowed us to extract data on plaque phenotypes, including plaque presence and the count of plaques in either artery for each individual. The count of plaques was determined by the number of predicted bounding boxes in each image.
Overall, the model detected at least one plaque in 45% of the UKB participants who underwent a carotid ultrasound examination. In 14% of the participants, the model detected at least two plaques, and in 3.1% of the participants, three or more plaques across both arteries. The prevalence of plaques in the left and right carotid arteries is presented in Figure 3A. As a quality control step for the model predictions, we explored whether plaque presence was associated with cIMT, which was quantified and documented for each individual at the time of the imaging assessment. Indeed, cIMT was consistently higher for individuals for whom our model predicted a plaque (Wilcoxon test p < 10−60 for maximum, mean, and mean of maximum cIMT measurements). Similar results were obtained for the left and right arteries separately (Supplementary Figure 4).
Plaques were more common in male participants (47.5% vs. 42.6%; two-proportions z-test p = 5.7 × 10−12) and plaque prevalence was significantly associated with older age, increasing from 31.1% in participants aged 45–54 years to 62% in participants aged 75 years or older (Cochran-Armitage trend test p = 1.12 × 10−88, Figure 3B). In a multivariable logistic regression model, male sex, older age, current smoking, higher systolic blood pressure (SBP), history of hypertension, pre-existing CVD, use of statins, and higher Low-Density Lipoprotein (LDL) cholesterol levels were all significantly associated with plaque presence (Figure 3C).
Associations of plaque phenotypes with the risk of future cardiovascular events
To estimate the effects of the presence and count of plaques in carotid ultrasound on the risk of future major adverse cardiovascular events (MACE), we conducted a survival analysis. Following the first imaging visit, the UKB participants with available ultrasound images were followed up for a median of 55 months (range 1-80 months). During this time interval, a total of 430 individuals experienced a MACE, defined as myocardial infarction, stroke or death due to any cardiovascular cause. Of these, 335 were first-ever events among 18,110 participants (1.8%) without a history of CVD at the time of the ultrasound examination, while 95 were secondary events among 1,389 participants (6.8%) with an existing history of CVD.
Kaplan-Meier estimates indicated a higher incidence rate of MACE among individuals with carotid plaques compared to those without plaques, demonstrating a dose-response pattern of higher incidence with an increasing plaque count (log-rank test p-value for all pairwise comparisons < 0.05, Figure 4). After adjusting for conventional cardiovascular risk factors in Cox regression models, plaque presence was significantly associated with future risk of MACE (Hazard Ratio (HR) 1.42, 95% CI: 1.16–1.73). The plaque count per individual showed a dose-dependent association with future CVD risk (HR for 1 plaque vs. no plaque = 1.30, 95% CI: 1.04–1.63; HR for 2 or more plaques vs. no plaques = 1.62, 95% CI: 1.27–2.07). These associations were consistent in the subgroup of individuals without an existing history of CVD, as well as in those who had neither a history of CVD nor statin use (Supplementary Figure 5). There was no evidence of an interaction with sex (p = 0.139). The HRs were comparable when analyzing individual MACE components, albeit with wider 95% CIs, probably due to lower statistical power (195 myocardial infarction and 172 stroke cases, Supplementary Figure 5).
Predictive power of plaque phenotypes
To examine whether assessing carotid plaque phenotypes could improve cardiovascular risk prediction, we compared the fitness, reclassification, and discrimination of prediction models that included conventional vascular risk factors with those that also considered plaque presence and count. Both plaque presence and count significantly improved the overall goodness of fit of a Cox regression model (P < 0.05), as assessed by the log-likelihood ratio test43. We observed a reclassification improvement for plaque presence (category-free net reclassification improvement (cfNRI): 0.331, 95% CI, 0.217–0.445) and plaque count (cfNRI: 0.369, 95% CI, 0.260–0.478, Table 2). Briefly, NRI quantifies model improvement by the difference between the net proportion of cases for which the new model correctly increases predicted risks and net proportion of controls for which the new model correctly decreases predicted risks44,45. Sensitivity analyses confirmed these improvements in individuals without a history of CVD and statin use (Table 2). Despite the strong associations with the future risk of MACE, none of the plaque phenotypes added to conventional risk factors significantly improved model discrimination, as measured by the C-index. Specifically, adding plaque presence to conventional risk factors only slightly changed C-index from 0.745 (95% CI, 0.723–0.767) to 0.747 (95% CI, 0.725–0.769), while adding plaque count changed it to 0.748 (95% CI, 0.726–0.770). However, minor yet significant improvements were observed in the integrated discrimination improvement (IDI): 0.0022 (95%CI, 0.0002–0.0050) for plaque presence and 0.0023 (95%CI, 2×10−5–0.0063) for plaque count. All the models demonstrated good calibration, indicating a strong alignment between observed outcomes and predicted risk estimates (Supplementary Figure 6).
To assess whether plaque phenotyping would improve the reclassification of individuals when complementing established clinical risk prediction models, we calculated the PCE risk scores for UKB participants eligible for assessment according to ACC/AHA guidelines6. Due to the PCE’s tendency to overestimate risk in the UKB population, the model was recalibrated (Supplementary Figure 7). Incorporating plaque presence and count into the PCE demonstrated significant reclassification improvement, with a categorical NRI of 0.034 (95% CI, 0.006–0.062) and 0.04 (95% CI, 0.010–0.070), respectively, at the threshold of 7.5% 10-year cardiovascular risk, which defines intermediate risk and justifies preventive initiation of statin therapy according to current guidelines. Overall, adding plaque presence correctly reclassified 17 out of 318 patients who developed MACE into a higher risk category. Including plaque count correctly reclassified 20 patients into a higher risk category (Table 3). In both cases, plaque presence and plaque count,4 patients were incorrectly reclassified into a lower risk category.
Genome-wide association study of carotid atherosclerotic plaque
As the next step, we investigated the genetic architecture of carotid atherosclerosis, defined by plaque presence. To detect single nucleotide variants (SNVs) associated with presence of a carotid atherosclerotic plaque, we conducted a genome-wide association study (GWAS) and subsequently meta-analyzed our data with the largest available GWAS for carotid plaque from the cohorts of the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium46. After quality control and excluding individuals without genetic data, the UKB GWAS included 18,203 White British individuals, comprising 8,250 cases with carotid plaque and 9,953 controls. The pooled sample from the UKB and CHARGE cohorts included 66,637 individuals (29,790 cases; 36,847 controls). We identified seven independent genomic loci significantly associated with the presence of carotid plaque, two of which were novel. Five of the loci (mapped to the genes EDNRA, LINC02577, CDKN2B-AS1, CFDP1, LDLR) replicated known associations, as the lead SNVs were in high linkage disequilibrium (r2 > 0.9) with SNVs previously reported to be associated with carotid plaque presence in the CHARGE study (Figure 5, Supplementary Table 2). The sixth locus included the LPA gene, which encodes lipoprotein(a) (Lp(a)) and is a known locus for atherosclerotic cardiovascular disease47–51. The lead variant at this locus (rs56393506), associated with higher odds for an atherosclerotic plaque (OR for T allele: 1.12, 95%CI: 1.07–1.16), is an intronic variant in the LPA gene that has been previously strongly associated with higher Lp(a) levels52. The lead SNV at the seventh locus is located in a non-coding region (rs1893250, OR for A allele: 0.91, 95%CI: 0.88–0.93) and was previously associated with angina pectoris53.
Mendelian randomization analyses
Finally, we performed Mendelian Randomization (MR) to explore whether genetically proxied risk factors and biomarkers of CVD are associated with carotid atherosclerotic plaque. We used the largest to-date publicly available GWAS summary statistics for vascular risk factors to generate genetic instruments for the risk variables under study (Supplementary Table 3). The inverse-variance weighted (IVW) MR analyses revealed associations between higher genetically proxied SBP, diastolic blood pressure (DBP), LDL cholesterol, interleukin-6 (IL-6) signaling activity, and genetic predisposition to smoking initiation and type 2 diabetes (T2D) with the odds of carotid plaque presence (Figure 6). Additionally, higher genetically proxied HDL cholesterol levels were associated with lower odds of carotid plaque (Figure 6). There was evidence of directional pleiotropy, as assessed by a significant Egger intercept (p = 0.036), for the association between smoking initiation and carotid plaque, with the MR estimate derived by MR Egger regression being in the opposite direction, even after excluding potential outlier instruments detected with Mendelian Randomization Pleiotropy RESidual Sum and Outlier (MR-PRESSO) (Supplementary Table 4). The results for the remaining significant IVW associations were generally highly consistent in sensitivity analyses, including MR Egger regression and the weighted median estimator (Supplementary Table 4).
Discussion
In this study, we developed a computer vision model that accurately detects atherosclerotic plaques in carotid ultrasound images and applied it to a population-based cohort of 19,499 participants from the UKB. The model demonstrated strong performance in detecting carotid plaques and classifying plaque-positive carotid ultrasound images, achieving approximately 90% in accuracy, sensitivity, and specificity. Consistent with previous studies in comparable demographics (mean age 64.6±7.6 years, 53% female)14,55, our model identified at least one carotid plaque in 45% of participants. The presence of plaques was associated with conventional vascular risk factors and was predictive of future adverse cardiovascular events over a follow-up period of up to 7 years. Importantly, both plaque presence and count led to improved risk reclassification for future adverse cardiovascular events beyond the established PCE risk assessment tool. Leveraging the phenotypic depth of the UKB, we conducted the largest genomic analysis of carotid atherosclerosis to date, identifying two novel loci and risk pathways, including ones targeted by emerging cardiovascular therapeutics, such as Lp(a) and IL-6 signaling.
While atherosclerosis can develop long before clinical symptoms appear56, modern risk assessment tools do not consider subclinical atherosclerotic pathology. Many studies have highlighted the potential of integrating imaging biomarkers of subclinical atherosclerosis into conventional risk assessment tools28. CAC assessed through CT is a well-established predictor of CVD risk57. However, CAC has limitations, including insensitivity to early stages of atherosclerosis and exposure to radiation19,20,58. In contrast, carotid ultrasound is an inexpensive, well-tolerated, radiation-free tool capable of detecting early-stage atherosclerosis14,59. Our results, based on a population of almost 20,000 individuals, suggest that both the presence and count of plaques are strongly associated with the risk of future acute CVD events. Incorporating plaque information into Cox regression models led to significant reclassification improvements, which were robust across the full cohort, those without a history of CVD statin-naive participants. The reclassification metrics in our study indicate that adding plaque information to conventional risk factors and directly incorporating it into the PCE has the potential to improve patient stratification. Specifically, incorporating plaque count reassigned 6.3% (20 out of 318) of individuals who went on to develop cardiovascular events from a low to a higher risk category, making them eligible for preventative statin therapy.
Despite growing evidence that carotid ultrasound-derived plaque phenotypes—such as total plaque area and vulnerability features—are independently associated with future CVD events 26,60,61, its use in clinical practice remains underutilized. The primary challenges for wider adoptions include the time required for assessment and reliance on operator skill23. However, an efficient artificial intelligence model could significantly streamline these labor-intensive tasks. The model developed in this study demonstrates high performance in identifying individuals with carotid atherosclerosis and localizing plaques from a single screenshot of the carotid bifurcation. Further model advancements could not only address the issue of labor intensity but also enable the extraction of more detailed plaque features, thus enhancing the predictive performance of carotid ultrasound.
Importantly, our model enhances the phenotypic depth of the UKB to subclinical atherosclerosis, facilitating integration with the unique resources available in this population, such as multi-omics and other imaging data62. Moving in this direction, we leveraged the available genomic data to perform the largest exploration to date of the genetic architecture of carotid plaque. We replicated five genomic loci previously associated with atherosclerosis endophenotypes46 and also found two new loci related to clinical cardiovascular outcomes. Furthermore, downstream MR confirmed the effect of genetic predisposition to known vascular risk factors, such as smoking, high blood pressure, and LDL cholesterol, on carotid plaque presence. Importantly, the GWAS and MR results showed that genetic variation leading to elevated Lp(a) levels and higher IL-6 signaling activity is associated with higher odds of carotid plaque. Both pathways are believed to play key roles in atheroprogression and are the targets of investigational drugs in advanced clinical development47,48,63–65. These results suggest that drugs targeting these pathways could be promising, particularly in the preclinical stages of atherosclerosis. Integrating carotid plaque phenotypes with additional omics layers may provide further insights into novel drug targets for atherosclerotic CVD.
Our study has several limitations. First, during model development, the low quality of some images necessitated contrast enhancement and noise reduction techniques. These adjustments may have introduced bias, especially in cases where high noise levels complicated plaque detection. However, our model achieved high classification and detection metrics, which could potentially be improved further by annotating more images. Moreover, the consistency of predicted carotid plaque prevalence with previously reported plaque prevalence in similar demographics, along with the associations of predicted plaque presence with known risk factors and future CVD risk, supports the model’s reliability55. Second, the UKB is a cohort of healthy volunteers with a lower incidence rate of CVD than the general population, particularly within its imaging subsample (Supplementary Table 1)66. This discrepancy contributes to an overestimation of CVD risk calculated by clinical risk models, such as the PCE tool. To improve the reliability of our reclassification evaluation, we recalibrated the model using data from the study population to better align predicted risk with observed outcomes. Due to the low prevalence of risk factors and events, PCE-derived absolute risk estimates remain notably low. It is worth noting that deriving categorical NRI metrics comparable to those from other studies with different incidence rates may be problematic67. We addressed this issue by calculating continuous NRI estimates using bootstrap estimates, which are threshold-independent and less sensitive to event rates68. Third, the carotid ultrasound examination took place 2 to 15 years after the baseline visit and the assessment of cardiovascular risk factors. This gap introduces bias into the effect estimates. To account for changes in baseline risk factors over this period, we used, wherever available, data from the primary care records of participants collected at the closest date to the ultrasound exam. Fourth, our study has a shorter follow-up period of 7 years compared to most clinical risk assessment tools, including the PCE, which typically calculate 10-year risk estimates for CVD. Fifth, we observed significant heterogeneity in the results of IVW MR analyses for several risk factors (Supplementary Table 4), which could indicate the presence of pleiotropy. To explore whether the derived estimates could be biased by directional pleiotropy, we conducted several sensitivity analyses to test the robustness of the estimates against different MR assumptions. Sixth, there was some population overlap between the exposure and outcome GWAS datasets used in our MR analyses, which could introduce weak instrument bias into the derived effect estimates. We addressed this concern by using the largest available summary statistics for the exposure data. Given that the effective population overlap was less than 5% for all exposure-outcome pairs, we estimated any bias in the effect estimates to be under 5%69. Lastly, this study analyzed a sample from the UKB, which predominantly consists of White ancestry volunteers, who are healthier than the general population. Therefore, replicating these findings in more diverse populations and real-world settings is crucial for improved generalizability and informed decision-making.
In conclusion, we have successfully developed and implemented a deep learning model for plaque detection within the population-based UKB, significantly enhancing the phenotypic characterization of this cohort. This model sets the stage for automating carotid plaque assessment in other large-scale cohorts, thereby enabling broader population-based research in subclinical atherosclerosis. Our results highlight the potential of carotid plaque assessment for refining cardiovascular risk prediction, offer insights into the genetic architecture of atherosclerosis, and provide a valuable resource for advancing atherosclerosis research at the population scale.
Methods
Study population
In this study, we utilized data from the UKB, a large-scale prospective cohort study that recruited between 2006 and 2010 502,422 individuals aged 40 to 69 at baseline from across the United Kingdom70. Participants underwent detailed assessments, which included comprehensive data collection through questionnaires, physical measurements, and biological sample collections. All participants provided electronic informed consent. Ethical approval was granted by the National Health Service North West Multicenter Research Ethics Committee.
Following a baseline visit between 2006 and 2010, a total of 82,340 individuals returned for a follow-up imaging visit starting 2014, which included a carotid ultrasound. A total of 177,757 raw images from 19,768 individuals were released by the UKB and used in this study (Supplementary Figure 1). Four anatomic views of the distal common carotid artery and the bifurcation were available for each side for nearly every UKB participant who underwent a carotid ultrasound: images along the main longitudinal axis, images along the short axis, and images along the main longitudinal axis at two different angles for each artery, which were used by the analysts for cIMT quantification. Our study focused on images from 19,507 UKB participants derived along the main longitudinal axis. Eight individuals withdrew from the study post-recruitment (field 190), resulting in a total sample size of 19,499. For participants with repeat imaging visits, only the ultrasound data from the first visit were retained for analysis.
Pre-processing
The flowchart for extracting the carotid ultrasound imaging data for analysis is summarized in Supplementary Figure 1. After developing an algorithm (Supplementary Figure 8) that automatically detects images along the longitudinal axis, we extracted 45,210 long-axis images without cIMT measurements from the left and right arteries for 19,507 individuals. The obtained images were cropped to a size of 480×448 pixels to retain only the ultrasound image while maintaining the original resolution. After excluding participants who withdrew from the study and keeping only the images from the first ultrasound visit, a total of 19,362 left and 19,370 right carotid images from 19,499 individuals remained for analysis.
In order to enhance contrast and reduce noise in the images, we applied two functions from the OpenCV v. 4.7.0 library71: median blur filtering (ksize=5) and Contrast Limited Adaptive Histogram Equalization (clipLimit=2.0, tileGridSize=(8,8)), respectively, to facilitate the manual segmentation process. These processing steps were applied to the full sample of images in this study. Plaques were manually annotated by two medical doctors with postgraduate training in vascular imaging and subsequently validated by a doctor certified in carotid ultrasound imaging. Label Studio version 1.8.2 (https://github.com/HumanSignal/label-studio) was used to segment plaques on the ultrasound images. The edge coordinates of the segmentation masks were used to obtain the bounding boxes. Plaques were defined according to standards, as focal protrusions in the arterial lumen with a thickness >50% of the surrounding carotid intima-media thickness40. If multiple longitudinal images were available for the same artery, those where the model detected a plaque were prioritized, or a random image was used if plaques were found in more than one.
Model Development and Deployment
We performed transfer learning with fine-tuning which involves selecting a model pre-trained on a large dataset of natural images and then re-training it on a new dataset. This approach allows for adjusting the model weights and biases to better suit the task related to the new dataset. Here, we employed the YOLOv8l41 model for object detection, pre-trained on over 330,000 images, and re-trained it on our dataset of 680 carotid ultrasound images. The dataset was divided into training, validation, and test sets at a ratio of 0.725/0.125/0.150, resulting in 103 images allocated to the test set. To enhance model generalizability and predictive power, we randomly selected 50% of the training set (490 images) and applied various augmentation techniques from Albumentations72 Python library: GridDistortion (p=0.15), RandomBrightnessContrast (((0,0.5),(0,0.5)), HorizontalFlip(p=0.2), GaussNoise(p=0.15), and RandomSizedCrop (min_max_height=(384, 384), p=0.4). These augmentations increased the variability of the training set, making the model more invariant to noise and other distortions. This augmented dataset, along with the rest of the images, was processed in batches for further augmentation within the YoloV8 framework, as detailed below.
The model was trained with a batch size of 44 images. Early stopping was set to 5 epochs, and training concluded after 14 epochs, with peak performance observed at epoch 9. The following loss function parameters were selected based on a grid search, as detailed in Supplementary Table 5: distribution focal loss (DFL) = 2.5, box loss = 10, and binary cross entropy loss (CLS) = 1.1. Default augmentation techniques in the YOLOv8 framework were partially suppressed due to prior augmentation efforts; specifically, mosaic, copy-paste, shear, close mosaic, flip up-down, and mix-up were disabled. However, flip left-right (p=0.1), degrees (10), HSV-Saturation (hsv_s: 0.05), HSV-Value (hsv_v: 0.05), translate (0.1), and scale (0.1) were retained. Training was conducted using an NVIDIA QUADRO RTX 5000 GPU (16GB). PyTorch73 version 1.12.1 and the Ultralytics41 framework version 8.1.16 were used for model development.
Model’s classification metrics for detecting plaque-positive images were estimated in a confusion matrix by comparing ground truth annotations with bounding box predictions. A true positive was recorded when both annotation and prediction contained a bounding box.
Demographic and Clinical Variables
Age at carotid ultrasound assessment was derived by subtracting the participant’s date of birth (field 33) from the carotid ultrasound assessment visit date (extracted from the manifest files linked to the ultrasound images). Ethnicity for the PCE risks score calculation was determined by field 21000. Observations with missing data or responses of “Do not know” or “Prefer not to answer” were encoded as “Other”. Pre-existing CVD was defined based on self-reported history (UKB field 20002, with 1075 - heart attack/myocardial infarction, 1081 - stroke, 1583 - ischemic stroke, 1491 - brain haemorrhage); general practice records (131298, 131300, 131302, 131368, 131366); hospital data records (defined using ICD-10 codes I20-I25, I60-I61, and ICD-9 codes 410-412, 429-431, 434, 436, as well as operation codes K40-46, K49, K471, K49, K50, K75, L294 and L295 observed before the initial ultrasound visit). SBP was quantified using fields 4080 and 93 by averaging the observations from each field, followed by taking the mean of the resulting values. Smoking status was categorized as “current” or “other”, with missing data (<0.5%) treated as “other” (field 20116). Total cholesterol, LDL and HDL cholesterol levels were obtained from fields 30690, 30780 and 30760, respectively. When available, values from the assessment closest to the ultrasound visit were extracted; otherwise, baseline measurements were used. Missing values were imputed using the multivariate imputation by chained equations method74, affecting 12% of HDL-cholesterol values and 6% of total and LDL-cholesterol values, based on other CVD risk factors. Diabetes was defined by self-reported cases (UKB field 20002 – codes: 1220, 1222, 1223), use of glucose-lowering medications (field 20003), and hospital records prior to the first carotid ultrasound exam (ICD-9 codes: 250* and ICD-10 codes E10, E11). Information on the use of antihypertensive drugs, diabetes medications, and statins was obtained from field 20003 (Supplementary Table 6).
Assessment of major adverse cardiovascular events
MACE endpoints were defined as follows: myocardial infarction (ICD-10 codes I21, I22 from hospital inpatient data, UKB data-fields 131298 and 131300), stroke (ICD-10 codes I60-64 from hospital inpatient data, UKB data-fields 131368 and 131366), or death due to any cardiovascular cause (defined as the cause of death with an ICD-10 code starting with ‘I’ extracted from the death registry). The date of the first episode observed after the carotid ultrasound assessment was considered as the date of the event of interest.
Carotid intima media thickness
We calculated three types of cIMT measurement characteristics between groups with and without carotid plaque, as predicted by the model. Measurements of mean and maximum cIMT were obtained using UKB data fields 22670-22681, as described by Strawbridge et al75.
cIMT mean: average of the mean values from four mean cIMT measurements (two angles for each carotid artery: left and right).
cIMT max: The maximum value of the cIMT measurements across both arteries.
cIMT mean-max: The mean of the maximum cIMT values per artery.
All the obtained values were log-transformed for the analysis. Individuals with more than one missing cIMT measurement were excluded, resulting in a total of 18,497 individuals for this analysis. The analysis was repeated separately for the right and left arteries.
Logistic regression for plaque presence and Cox regression for future events
To estimate the associations between model-derived plaque presence and CVD risk factors, we applied logistic regression, using plaque presence as the outcome variable. A history of hypertension was defined by the use of antihypertensive treatment at the time of the carotid ultrasound assessment. Definitions of other clinical and demographic variables are described above.
For each subset of the cohort (full sample, primary events with and without individuals on statin therapy), two separate Cox regression models were constructed: one with plaque presence as a binary variable and another with the count of plaques as a categorical variable with three levels (no plaques as the reference, one plaque, and two or more plaques). The time variable in the survival analysis was calculated as the duration from the first ultrasound visit to the event of interest or to the censoring date, which included date at death from causes other than CVD or the date at the last observed event (2022-10-30) available at the time of the analysis. Controls were censored at the time point of the latest observed event. The fitted Cox regression models included all vascular risk factors included in the PCE (age, sex, SBP, smoking status, history of diabetes, antihypertensive therapy, cholesterol, and HDL cholesterol), as well as statin usage. Ancestry was not included due to its very low variance in our sample (Table 1). The proportional hazard assumptions were tested with the scaled Schoenfeld residuals, and no violation of the assumptions was detected.
All analyses were conducted using R software, version 4.4.0. We considered two-sided p-values less than 0.05 to be statistically significant. The category-free NRI was calculated using the nricens v.1.676 package with confidence intervals and p-values based on 1000-fold bootstrap standard errors. The IDI metric was calculated using the survIDINRI library v.1.1-277. Harrell’s C-statistic, along with its 95% confidence interval, was used to evaluate the discriminative ability of the time-to-event models78. Comparisons between models based on the C-statistic were conducted using the CsChange package79.
PCE risk estimation
The calculation of the PCE risk score was performed using published equations6. PCE eligibility included the following criteria:
1) 40≤Age≤79,
2) 130 ≤ Total cholesterol ≤320 mg/dL;
3) 20 ≤ HDL-cholesterol ≤ 100;
4) 90 ≤ Systolic blood pressure ≤ 200 mmHg
To enable fair model comparison and justify the recommended threshold usage, the original PCE was calibrated to the UKB population. This recalibration was achieved by fitting calculated log-hazards from the published PCE coefficients in a Cox regression stratified by sex to obtain recalibrated probabilities. The calibration of the predicted risk values from the original and recalibrated models was assessed using Greenwood-Nam-D’Agostino statistics and the integrated calibration index80,81. Calibration plots for the PCE are presented in Supplementary Figure 7.
The plaque variables were incorporated into the PCE, with plaque presence as a binary variable and plaque count as an ordinal variable, as previously described82. Briefly, the recalculated risk is derived from the relative risk estimate for the novel risk factor, the baseline risk, and the prevalence of the novel risk factor. As an example, the presence of plaque was incorporated using equation (1), adapted from Kooter et al82: where [r] represents the weighted mean risk; p is the plaque prevalence; HR is the hazard ratio for plaque presence; MF(+) is the multiplication factor; R(+) is recalculated risk in the presence of plaque; Rbl is the baseline risk estimated from the baseline hazard and log-HRs estimated from the fitted Cox model ; and R(−) is the recalculated risk in the absence of plaque.
HRs for both plaque presence and plaque count, used to enhance the recalibrated PCE risk model with plaque information, were estimated from a Cox regression model fitted on the sub-cohort eligible for PCE risk assessment. The model was adjusted for sex, age, HDL and total cholesterol levels, antihypertensive drug use, smoking status, SBP, and statin use at the time of the initial ultrasound assessment. Plaque information was incorporated separately for males and females, with prevalence estimates calculated for each cohort.
Model calibration was estimated using the survival.calib83 package in R. Reclassification tables and NRI statistics were calculated using the PredictABEL v.1.2-484 library in R.
Genome-wide association study and meta-analysis
Genomic quality control in the UKB was performed as previously described85. For the GWAS analysis Regenie86 v3.3 was employed. In step 1, we used directly genotyped variants with MAF > 1%, <10% missingness, Hardy–Weinberg equilibrium test P > 1 × 10−15, and a minor allele count > 100. Age at the time of the ultrasound exam, along with sex, the first 10 genetic principal components, and the genotyping chip were used as covariates. Fixed-effect meta-analysis was conducted with METAL87, using effect size estimates and standard errors (option SCHEME STDERR). Results were clumped using the clump_data function of TwoSampleMR88,89 R package version 0.5.6) at an r2<0.001 based on the European 1000 Genomes Project reference panel with 10,000 kb window90. Results were then visualized in a Manhattan plot, which was constructed using the gwaslab Python package91.
Mendelian randomization
Summary-level data sources for exposures, along with their descriptions, are provided in Supplementary Table 3. Two-sample MR was conducted using the TwoSampleMR package in R. The instrumental variable for each exposure was constructed by selecting SNVs from summary statistics files with a significant association (p < 5×10⁸), followed by clumping for linkage disequilibrium at an r² < 0.001 threshold within a 10,000 kb window. For IL-6 receptor-mediated signaling activity, the genetic instrument was constructed as previously described92, using beta estimates derived from a cohort that excluded the UKB to minimize bias. Main association estimates were derived using random-effects IVW analysis. To account for potential bias due to horizontal pleiotropy, sensitivity analyses were performed using both MR-Egger and the weighted median estimator, as these methods are known to be more robust against pleiotropic effects93,94. Heterogeneity and horizontal pleiotropy of the genetic instruments were estimated using the mr_heterogeneity and mr_pleiotropy_test functions from the TwoSampleMR package. When MR-Egger indicated significant pleiotropy, we utilized the MR-PRESSO method from the MRPRESSO95 package in R to identify and exclude significant pleiotropic instrumental variables (P < 0.05), which were considered as “outliers”. Subsequently, IVW, weighted median, and MR-Egger analyses were carried out on the outlier-corrected models.
Data availability
The UKB provides an accessible research resource, available to researchers upon submitting a research proposal at https://www.ukbiobank.ac.uk. The GWAS meta-analysis data obtained in this study will be uploaded to the GWAS Catalog (https://www.ebi.ac.uk/gwas/home) upon the publication of the current manuscript. Additionally, the carotid plaque phenotypes derived from the developed model will be returned to UKB for use in future studies.
Code availability
The code used for this study will be available on GitHub upon publication of this manuscript.
Funding
This work was supported by the Fritz-Thyssen Foundation (grant ref. 10.22.2.024MN to MKG), the German Research Foundation (DFG; Emmy Noether grant GZ: GE 3461/2-1, ID 512461526 to MKG; Munich Cluster for Systems Neurology EXC 2145 SyNergy, ID 390857198 to MKG), and the Hertie Foundation (Hertie Network of Excellence in Clinical Neuroscience, ID P1230035 to MKG). V.K.R. is supported by Norn Longevity Impetus grant. C.D.A. is supported by NIH R01NS103924, U01NS069673, RF1NS139183 AHA 18SFRN34250007, AHA-Bugher 21SFRN812095, and the MGH McCance Center for Brain Health.
Ethics declarations
The UK Biobank has obtained approval from the Northwest Multi-Center Research Ethics Committee. All participants have provided written informed consent.
Competing interests
M.K.G reports consulting fees from Tourmaline bio, Inc. unrelated to this work. V.K.R has common stock in NVIDIA, Alphabet, Apple and Amazon. P.N. reports research grants from Allelica, Amgen, Apple, Boston Scientific, Genentech / Roche, and Novartis, personal fees from Allelica, Apple, AstraZeneca, Blackstone Life Sciences, Creative Education Concepts, CRISPR Therapeutics, Eli Lilly & Co, Esperion Therapeutics, Foresite Capital, Foresite Labs, Genentech / Roche, GV, HeartFlow, Magnet Biomedicine, Merck, Novartis, TenSixteen Bio, and Tourmaline Bio, equity in Bolt, Candela, Mercury, MyOme, Parameter Health, Preciseli, and TenSixteen Bio, and spousal employment at Vertex Pharmaceuticals, all unrelated to the present work. C.D.A. has received sponsored research support from Bayer AG and has consulted for ApoPharma. The other authors declare no competing interests.
Acknowledgements
UKB data were accessed through applications 36993, 7089, and 151281.
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.
- 16.
- 17.
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.
- 28.↵
- 29.
- 30.↵
- 31.↵
- 32.
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.
- 50.
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.↵
- 95.↵