Machine Learning to Predict 10-year Cardiovascular Mortality from the Electrocardiogram: Analysis of the Third National Health and Nutrition Examination Survey (NHANES III)
============================================================================================================================================================================

* Chang H. Kim
* Sadeer Al-Kindi
* Yasir Tarabichi
* Suril Gohel
* Riddhi Vyas
* Shankar Srinivasan

## ABSTRACT

**Background** The value of the electrocardiogram (ECG) for predicting long-term cardiovascular outcomes is not well defined. Machine learning methods are well suited for analysis of highly correlated data such as that from the ECG.

**Methods** Using demographic, clinical, and 12-lead ECG data from the Third National Health and Nutrition Examination Survey (NHANES III), machine learning models were trained to predict 10-year cardiovascular mortality in ambulatory U.S. adults. Predictive performance of each model was assessed using area under receiver operating characteristic curve (AUROC), area under precision-recall curve (AUPRC), sensitivity, and specificity. These were compared to the 2013 American College of Cardiology/American Heart Association Pooled Cohort Equations (PCE).

**Results** 7,067 study participants (mean age: 59.2 ± 13.4 years, female: 52.5%, white: 73.9%, black: 23.3%) were included. At 10 years of follow up, 338 (4.8%) had died from cardiac causes. Compared to the PCE (AUROC: 0.668, AUPRC: 0.125, sensitivity: 0.492, specificity: 0.859), machine learning models only required demographic and ECG data to achieve comparable performance: logistic regression (AUROC: 0.754, AUPRC: 0.141, sensitivity: 0.747, specificity: 0.759), neural network (AUROC: 0.764, AUPRC: 0.149, sensitivity: 0.722, specificity: 0.787), and ensemble model (AUROC: 0.695, AUPRC: 0.166, sensitivity: 0.468, specificity: 0.912). Additional clinical data did not improve the predictive performance of machine learning models. In variable importance analysis, important ECG features clustered in inferior and lateral leads.

**Conclusions** Machine learning can be applied to demographic and ECG data to predict 10-year cardiovascular mortality in ambulatory adults, with potentially important implications for primary prevention.

## INTRODUCTION

Atherosclerotic cardiovascular disease (ASCVD) is a major cause of morbidity and mortality in the United States, with an incidence of 580,000 myocardial infarctions and 610,000 strokes occurring each year1. The current standard risk calculator in the U.S., the 2013 American College of Cardiology and American Heart Association Pooled Cohort Equations (PCE)2, utilizes demographic and clinical variables (age, sex, race (white or African American), total cholesterol, high-density lipoprotein cholesterol, systolic blood pressure, treatment for hypertension, diabetes mellitus, and smoking status) to estimate 10-year risk of incident ASCVD events, defined as fatal and nonfatal myocardial infarction or stroke. While the PCE is in routine clinical use in the U.S., it has been criticized for its suboptimal calibration and risk prediction in various patient populations3, and the requirement for blood draws represents a hindrance in its utility as a primary screening tool.

The electrocardiogram (ECG) is widely used in clinical practice to diagnose various cardiac conditions, such as myocardial infarctions, arrhythmia, and others. While prior studies have identified individual ECG components as predictors of adverse cardiovascular events4-7, the value of aggregate screening ECG data for prediction of long-term cardiovascular outcomes is less well understood. Given their highly correlated nature, ECG data are well suited for analysis by machine learning methods, and recent studies have found machine learning on ECG data to be useful in identifying specific cardiac disease states8-12. Considering such background, we sought to examine whether machine learning methods could predict 10-year cardiovascular mortality from aggregate ECG data in an ambulatory population.

## METHODS

### Data source

This study utilized the third iteration of the National Health and Nutrition Examination Survey (NHANES III)13, which consists of healthcare survey data compiled from a nationally representative sample of 39,695 persons from 1988 to 1994. In addition to demographic, historical, and physical items from the questionnaire and examination, biochemical laboratory studies and ECG data are available for a subset of the surveyed population. Mortality outcomes, including the cause of death, are available via linked National Death Index files14. Since NHANES III is a publicly available, de-identified data set, a separate Institutional Review Board review was not required for this study.

This study included NHANES III participants who were 18 and older whose demographic and mortality data were available, and who did not have pre-existing cardiovascular disease, defined as lack of self-reported history of myocardial infarction, congestive heart failure, or stroke. Among this group, 7,067 participants had additional data from a standard 12-lead ECG and cardiovascular risk factors required to compute the PCE, who formed the main study cohort for this analysis. The study flow diagram is shown in **Figure 1**.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/14/2021.09.09.21263327/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/F1)

Figure 1. 
Study flow diagram

CHF: congestive heart failure, CV: cardiovascular, ECG: electrocardiogram, MI: myocardial infarction, ML: machine learning, NHANES: National Health and Nutrition Examination Survey

### Data preparation

All data were imported and analyzed using R 3.5.1 statistical software15. Publicly available R packages were utilized for data preparation (*tidyverse*16, *mice*17, *ROSE*18), general machine learning (*caret*19), deep learning (*Keras*20), ensemble learning (*SuperLearner*21), classification performance evaluation (*precrec*22), and survival analysis and plotting (*survival*23, *survminer*24). Details of data preparation, model training, and performance evaluation are described below.

Demographic and clinical data were recorded from relevant sections of NHANES III. Baseline age and sex were recorded as reported at time of survey. Race categories were simplified to White, Black, or Other to avoid data sparsity. For vital signs, the median value of multiple measurements was recorded. For clinical data, an affirmative answer on the questionnaire or relevant medication and laboratory measurements were utilized (e.g. for history of diabetes, qualifying criteria included answering ‘yes’ to questionnaire or having laboratory values of any fasting glucose ≥126 mg/dL or hemoglobin A1c ≥ 6.5%). For participants taking medications for high cholesterol, laboratory values were adjusted to reflect average statin effect (total cholesterol: 21% reduction, high-density lipoprotein cholesterol: 3.5% increase)25.

For outcomes, death status and cause of death were determined using International Statistical Classification of Diseases and Related Health Problems - Tenth Revision (ICD-10) codes. Cardiovascular death was identified by codes: I00-I09, I11, I13, I20-I51. Other outcomes related to ASCVD, including nonfatal myocardial infarction and stroke, were not available in the NHANES III dataset. Death events were right-censored at 10 years. The full list of clinical variable names and their corresponding NHANES III codes are listed in **Supplemental Table 1**.

For ECG data, 133 features based on direct ECG measurements were used. Due to the high proportion of missing values, preprocessing steps for ECG data included removing rows (participants) and columns (ECG features) which had >50% missing data. Remaining missing values were imputed using the Multivariate Imputation by Chained Equations (*mice*) package17, based on demographic variables and other ECG features. A full list of selected ECG features is highlighted in **Supplemental Table 2**. Further preprocessing steps included converting the rhythm code to a binary variable (sinus vs. non-sinus rhythm) to avoid data sparsity and replacing QT interval with corrected QT interval based on Bazett’s formula. All ECG data were standardized to have a mean of 0 and standard deviation of 1.

Prior to model training, data were split into 80:20 train:test partitions by random sampling. Given the low frequency of outcome events, the training set was augmented by two procedures to improve class imbalance: 1) oversampling of positive events and 2) synthetic data generation using the Random Over-Sampling Examples (*ROSE*) package18, which creates a synthetic data sample of balanced class and parameter distribution by drawing new examples from a conditional kernel density estimate of the majority and minority classes.

### Model training

All machine learning models were trained via 10-fold cross validation on six different training data combinations based on the following schema: Three training data sets (base, oversample, synthetic) in two different parameter combinations (PCE + ECG variables or Demographic (age, sex, race) + ECG variables). Model performance was assessed in a single hold out test set, which was not used for any part of the model training process. As a comparison, the PCE was implemented as a Cox proportional hazards model based on published parameters2,26. For machine learning, various classification models were trained with 10-year cardiovascular mortality as a binary outcome. Logistic regression, random forest, gradient boosting machine, and support vector machine models were trained using the *caret*19 package with automated hyperparameter tuning, while for neural networks, the R implementation for *Keras*20 package was used for deep learning based on multilayer perceptron architecture with three hidden layers and regularization via dropout. Finally, ensemble models were trained using the *SuperLearner*21 package. Briefly, ensemble models compute multiple models (“base learners”) and aggregate their predictions, which improves overall accuracy but increases computational costs. For this study, ensemble models were built using logistic regression, random forest, gradient boosting machine, support vector machine, and neural network models as base learners, and optimal model weighting was determined by maximizing area under receiver operating characteristics curve (AUROC) based on the Nelder-Mead method and 5-fold cross validation.

### Model assessment and comparison

Discriminative performance of classification models and their ensembles were compared using AUROC, area under the precision-recall curve (AUPRC), sensitivity, and specificity metrics. To allow for comparison with the PCE, classification performance metrics were computed for the PCE by assessing the probability of cardiovascular death at or before 10 years, with threshold value set to maximize AUPRC. Machine learning models with superior prediction performance characteristics were further assessed using standard calibration plots.

To assess the prognostic value of individual ECG features, the variable importance rank metric from the *caret**19* package was compared, except for neural network and ensemble models where such variable importance rank metrics were not available. For aggregate assessment of classification models, the cumulative count of top ten important predictors for each model was counted and plotted on a standard 12-lead ECG for visual assessment and clinical interpretation.

## RESULTS

### Study population

Baseline characteristics of the study population are summarized in **Table 1**. The study cohort (N=7,067) had baseline age range of 40 to 90 years, with mean 59.2 and standard deviation of 13.4 years. Notably, while the initial inclusion criteria included all adults age 18 years and above, ECG data were only available for age 40 and above. For sex, there was a slight female majority (52.5%), while for race there was a significant majority in white (73.9%), followed by black (23.3%), and under-representation of other races. There was a wide range in body mass index (mean: 27.6, range: 13.3-64.5) and a significant comorbidity burden, including hypertension (33.8%), hyperlipidemia (23.3%), diabetes mellitus (46.7%), and tobacco use (24.5%). Follow-up period was 18.1±7.6 years, with 10-year all-cause mortality rate of 19.8% and cardiovascular mortality rate of 4.8%.

View this table:
[Table 1.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/T1)

Table 1. 
Characteristics of the study population

### Prediction performance of machine learning models

Best performing machine learning models and their classification metrics are shown in **Table 2**. Compared to the PCE (AUROC: 0.668 [95% confidence interval: 0.625-0.712], AUPRC: 0.125 [0.081-0.170], sensitivity: 0.492, specificity: 0.859), several machine learning models trained on the PCE + ECG variable combination showed improved performance: logistic regression (AUROC: 0.762 [0.723-0.800], AUPRC: 0.143 [0.100-0.185], sensitivity: 0.759, specificity: 0.756), neural network (AUROC: 0.729 [0.646-0.813], AUPRC: 0.153 [0.089-0.216], sensitivity: 0.658, specificity: 0.822), and ensemble model (AUROC: 0.683 [0.648-0.718], AUPRC: 0.146 [0.092-0.199], sensitivity: 0.494, specificity: 0.884). Limiting training data to only demographic and ECG variables did not adversely affect predictive performance of machine learning models: logistic regression (AUROC: 0.754 [95% CI: 0.702-0.806], AUPRC: 0.141 [0.093-0.189], sensitivity: 0.747, specificity: 0.759), neural network (AUROC: 0.764 [0.712-0.815], AUPRC: 0.149 [0.110-0.188], sensitivity: 0.722, specificity: 0.787), ensemble model (AUROC: 0.695 [0.668-0.722], AUPRC: 0.166 [0.116-0.215], sensitivity: 0.468, specificity: 0.912). Comparison plots of AUROC and AUPRC are shown in **Figure 2**. When predicted results were plotted as survival curves, there was clear separation between those predicted to be with and without cardiovascular death occurring at or before 10 years, with the best curve separation achieved by ensemble models (**Figure 3**).

View this table:
[Table 2.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/T2)

Table 2. 
Model performance comparison

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/14/2021.09.09.21263327/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/F2)

Figure 2. 
Receiver operator characteristic (ROC) and Prediction-recall curves of machine learning models

glm2: Logistic regression (PCE+ECG data), glm3: Logistic regression (Demographic+ECG data), nn2: Neural network (PCE+ECG data), nn3: Neural network (Demographic+ECG data), pce: Pooled Cohort Equations, sl2: Ensemble model (PCE+ECG data), sl3: Ensemble model (Demographic+ECG data)

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/14/2021.09.09.21263327/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/F3)

Figure 3. 
Survival curves based on model prediction

(Left-side): Models based on PCE+ECG data. pce: Pooled Cohort Equations, glm2: Logistic regression, nn2: Neural network, sl2: Ensemble model

(Right-side): Models based on Demographic+ECG data. pce: Pooled Cohort Equations, glm3: Logistic regression, nn3: Neural network, sl3: Ensemble model

Training data augmentation generally had beneficial effects on model classification performance, though the degree varied by model family (**Supplemental Table 3**). For logistic regression, both oversampled and synthetic training data markedly improved classification performance. For neural networks, there was sequential improvement in performance from base to oversampled to synthetic training data. For ensemble models, base training data worked well, with only comparable performance seen with synthetic data. Training data augmentation also generally improved model calibration (**Supplemental Figure 1**). In general, models tended to overestimate risk when predicting higher event probability. Calibration curves were haphazard for base logistic regression and ensemble models, compared to smoother calibration curves for models based on oversampled and synthetic data, suggesting improved calibration when using augmented training data. However, training data augmentation did not improve classification performance or calibration of random forest, gradient boosting machine, and support vector machine models, which all had poor predictive value overall.

### Variable importance comparison

Among traditional cardiovascular risk factors, age was the most important predictor of 10-year cardiovascular mortality, occurring as the top variable in all models, followed by systolic blood pressure and treatment for hypertension. Among ECG features, the most important were R amplitude in lead II, J amplitude in lead V5, J amplitude in lead V6, R amplitude in lead aVL, S amplitude in lead III, S amplitude in lead aVF, and S duration in lead V6. Interestingly, the most important ECG features appeared to cluster in inferior (II, III, aVF) and lateral (aVL, V5, V6) leads, as easily seen when plotted on a standard 12-lead ECG (**Figure 4**). Ranking of important predictor variables for individual models and the aggregate counts for the top ten predictors for each model are shown in **Supplemental Table 4**.

![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/14/2021.09.09.21263327/F4.medium.gif)

[Figure 4.](http://medrxiv.org/content/early/2021/09/14/2021.09.09.21263327/F4)

Figure 4. 
Variable importance plot on the 12-lead electrocardiogram

Color schema is based on decreasing frequency of occurrence in machine learning models: Dark green (HSAGEIR: Age), Light green (medianSBP: Systolic blood pressure), Yellow (ECPRA2: R amplitude, lead II), Light orange (ECPJ11: J amplitude, lead V5; ECPJ12: J amplitude, lead V6), Orange (HTN_tx1: Treatment for hypertension; ECPRA5: R amplitude, lead aVL; ECPSA3: S amplitude, lead III; ECPSA6: S amplitude, lead aVF; ECPSD12: S duration, lead V6). Red and blue circles indicate inferior and lateral lead groupings of the 12-lead ECG, respectively.

## DISCUSSION

In this study, we showed that machine learning methods could be pragmatically applied to aggregate ECG data to predict 10-year cardiovascular mortality. Machine learning models, particularly the logistic regression, neural network, and ensemble models, performed favorably in terms of classification metrics compared to the current clinical standard, the PCE. Only demographic and ECG features were required to achieve comparable performance, without requiring traditional cardiovascular risk factors represented in the PCE. Interestingly, individual ECG features with the most prognostic information were seen to cluster in inferior and lateral segments of the ECG.

Prior studies have examined individual ECG components (e.g. P wave duration4, deep terminal negativity of P wave in V15,27, QRS duration6, QT interval28, JT interval29, and isolated ST-segment and T-wave abnormalities30) or groups of ECG components31 for their moderate additive predictive value with respect to standardized cardiovascular risk calculators such as the Framingham Risk Score or the PCE. Beyond individual components, groups of ECG components have been evaluated in the framework of global electrical heterogeneity32, though this is not typically computed or utilized in clinical practice. Despite the potential, the United States Preventive Services Task Force in 2018 recommended against using the ECG to screen adults with low risk of cardiovascular events (grade D), and remained undecided for adults with presumed intermediate or high risk of cardiovascular events (grade I), citing the imbalance between the potential benefits of early disease detection versus harms related to unnecessary invasive testing and overtreatment33.

Subsequent studies employing modern machine learning techniques, however, have shown that aggregate ECG data contain significant predictive information for detection of various cardiac abnormalities such as systolic dysfunction10,34, diastolic dysfunction9, atrial fibrillation12, pulmonary arterial hypertension8, hypertrophic cardiomyopathy8, cardiac amyloid8, mitral valve prolapse8, as well as prognostic information for short term mortality35. Expanding on this trend, this study has demonstrated that machine learning models based on aggregate ECG data can also predict long-term outcomes such as 10-year cardiovascular mortality. In fact, demographic and ECG features appear to contain as much prognostic information as rest of traditional cardiovascular risk factors represented in the PCE, with potentially important implications for primary prevention such as obviating the need to obtain blood draws for long-term cardiovascular risk prediction.

The important individual ECG features that contributed to model performance were not the traditional markers of ischemia or infarction (e.g. abnormalities in Q wave, ST segment, or R wave progression) that would be expected on a clinical basis. These are supportive of the findings of a recent study by Raghunath *et al*.35, where subclinical ECG markers were most predictive of short-term mortality. While it is conceivable that a significant portion of the study population had clinically silent coronary artery disease that manifested in the inferior and lateral leads of the ECG, these findings could also represent an entirely different mechanism that contributes to long-term cardiovascular mortality risk, or simply highly correlated data. Further studies are needed to validate these findings.

### Limitations

There are several important limitations to this study. First, this was a retrospective study based on a single data source, with the usual limitations associated with this study design. Not all data components, specifically ECG data, were available for all participants of NHANES III. Therefore only a subset of the survey participants formed the study cohort, who may not be representative of the target population of undifferentiated ambulatory adults. There was a significant proportion of missing values in ECG data, requiring substantial preprocessing steps and data imputation. It is possible that the ECG data were not missing at random, and possibly impacted by variable lead placement techniques, which may have led to biased results. Machine learning models were trained as classification and not survival models, which can lead biased parameter estimates. However, parameter estimation was not a focus of this study and survival curves based on model prediction showed clear curve separation. Finally, some machine learning models were not at all effective for event prediction in test data, revealing the potential for overfitting in any machine learning algorithm. Despite these limitations, the comparative analysis framework adopted in this study with less focus on individual model parameters but greater emphasis on aggregate findings demonstrated the feasibility and utility of applying machine learning to aggregate ECG data for prediction of long-term cardiovascular mortality.

## CONCLUSIONS

Machine learning can be applied to demographic and ECG data to predict 10-year cardiovascular mortality in ambulatory adults, with potentially important implications for primary prevention of cardiovascular disease. Further studies are needed to validate these findings.

## Supporting information

Supplemental material [[supplements/263327_file02.docx]](pending:yes)

## Data Availability

NHANES III is a publicly available, de-identified data set.

[https://www.n.cdc.gov/nchs/nhanes/nhanes3/default.aspx](https://www.n.cdc.gov/nchs/nhanes/nhanes3/default.aspx) 

## ACKNOWLEDGEMENTS

The primary author would like to express gratitude to Dr. Jarrod Dalton, PhD at Cleveland Clinic, who provided the R code adaptation for the 2013 ACC/AHA Pooled Cohort Equations via personal communication.

## Footnotes

*   **Disclosures:** None of the authors have financial conflicts of interest pertinent to this manuscript.

## ABBREVIATIONS

ASCVD
:   Atherosclerotic cardiovascular disease
AUPRC
:   Area under precision-recall curve
AUROC
:   Area under receiver operating characteristics curve
ECG
:   Electrocardiogram
NHANES
:   National Health and Nutrition Examination Survey
PCE
:   Pooled Cohort Equations

*   Received September 9, 2021.
*   Revision received September 9, 2021.
*   Accepted September 14, 2021.


*   © 2021, Posted by Cold Spring Harbor Laboratory

The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission.

## REFERENCES

1.  1.Benjamin EJ, Blaha MJ, Chiuve SE, et al. Heart Disease and Stroke Statistics-2017 Update: A Report From the American Heart Association. Circulation. 2017;135(10):e146–e603.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMzUvMTAvZTE0NiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA5LzE0LzIwMjEuMDkuMDkuMjEyNjMzMjcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

2.  2.Goff DC, Jr.., Lloyd-Jones DM, Bennett G, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. J Am Coll Cardiol. 2014;63(25 Pt B):2935–2959.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjE3OiI2My8yNV9QYXJ0X0IvMjkzNSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA5LzE0LzIwMjEuMDkuMDkuMjEyNjMzMjcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

3.  3.Arnett DK, Blumenthal RS, Albert MA, et al. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. J Am Coll Cardiol. 2019.
    
    
4.  4.Magnani JW, Gorodeski EZ, Johnson VM, et al. P wave duration is associated with cardiovascular and all-cause mortality outcomes: the National Health and Nutrition Examination Survey. Heart Rhythm. 2011;8(1):93–100.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.hrthm.2010.09.020&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20868770&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000285622700020&link_type=ISI) 

5.  5.Tereshchenko LG, Henrikson CA, Sotoodehnia N, et al. Electrocardiographic deep terminal negativity of the P wave in V(1) and risk of sudden cardiac death: the Atherosclerosis Risk in Communities (ARIC) study. J Am Heart Assoc. 2014;3(6):e001387.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiYWhhb2EiO3M6NToicmVzaWQiO3M6MTE6IjMvNi9lMDAxMzg3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDkvMTQvMjAyMS4wOS4wOS4yMTI2MzMyNy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

6.  6.Badheka AO, Singh V, Patel NJ, et al. QRS duration on electrocardiography and cardiovascular mortality (from the National Health and Nutrition Examination Survey-III). Am J Cardiol. 2013;112(5):671–677.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amjcard.2013.04.040&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23726176&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

7.  7.Badheka AO, Patel NJ, Grover PM, et al. ST-T wave abnormality in lead aVR and reclassification of cardiovascular risk (from the National Health and Nutrition Examination Survey-III). Am J Cardiol. 2013;112(6):805–810.
    
    
8.  8.Tison GH, Zhang J, Delling FN, Deo RC. Automated and Interpretable Patient ECG Profiles for Disease Detection, Tracking, and Discovery. Circ Cardiovasc Qual Outcomes. 2019;12(9):e005289.
    
    
9.  9.Kagiyama N, Piccirilli M, Yanamala N, et al. Machine Learning Assessment of Left Ventricular Diastolic Function Based on Electrocardiographic Features. J Am Coll Cardiol. 2020;76(8):930–941.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjg6Ijc2LzgvOTMwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDkvMTQvMjAyMS4wOS4wOS4yMTI2MzMyNy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

10. 10.Attia ZI, Kapa S, Lopez-Jimenez F, et al. Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med. 2019;25(1):70–74.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-018-0240-2&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

11. 11.Hannun AY, Rajpurkar P, Haghpanahi M, et al. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat Med. 2019;25(1):65–69.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-018-0268-3&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

12. 12.Attia ZI, Noseworthy PA, Lopez-Jimenez F, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet. 2019;394(10201):861–867.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(19)31721-0&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

13. 13.National Health and Nutrition Examination Survey III (NHANES III) (1988-1994). Centers for Disease Control and Prevention. [https://www.n.cdc.gov/nchs/nhanes/nhanes3/Default.aspx](https://www.n.cdc.gov/nchs/nhanes/nhanes3/Default.aspx). Accessed 08/29/2019.
    
    
14. 14.2015 Public-Use Linked Mortality Files. National Center for Health Statistics, Center for Disease Control and Prevention. [https://www.cdc.gov/nchs/data-linkage/mortality-public.htm](https://www.cdc.gov/nchs/data-linkage/mortality-public.htm). Accessed.
    
    
15. 15.R: A language and environment for statistical computing. In. [https://www.R-project.org/](https://www.R-project.org/): R Foundation for Statistical Computing, Vienna, Austria. ; 2019.
    
    
16. 16.Wickham H. Tidyverse package. In:2019.
    
    
17. 17.van Buuren S, Groothuis-Oudshoorn K, Vink G, al. e. mice: Multivariate Imputation by Chained Equations. In: [https://cran.r-project.org/web/packages/mice/index.html](https://cran.r-project.org/web/packages/mice/index.html); 2020.
    
    
18. 18.Lunardon N, Menardi G, Torelli N. ROSE: Random Over-Sampling Examples. In. [https://cran.r-project.org/web/packages/ROSE/ROSE.pdf2014](https://cran.r-project.org/web/packages/ROSE/ROSE.pdf2014).
    
    
19. 19.Kuhn MW, J.; Weston, S.; Williams, A.; Keefer, C.; Engelhardt, A.; Cooper, T.; Mayer, Z.; Kenkel, B.; Benesty, M.; Lescarbeau, R.; Ziem, A.; Scrucca, L.; Tang, Y.; Candan, C.; Hunt, T. caret: Classification and Regression Training. In: [https://cran.r-project.org/web/packages/caret/caret.pdf](https://cran.r-project.org/web/packages/caret/caret.pdf); 2020.
    
    
20. 20.Falbel DA, J.J.; Chollet, F.; Tang, Y.; van der Bifl, W.; Struder, M.; Keydana, S. R Interface to ‘Keras’. In. [https://cran.r-project.org/web/packages/keras/index.html2020](https://cran.r-project.org/web/packages/keras/index.html2020).
    
    
21. 21.Polley E. SuperLearner: Super Learner Prediction. In. [https://cran.r-project.org/web/packages/SuperLearner/SuperLearner.pdf2019](https://cran.r-project.org/web/packages/SuperLearner/SuperLearner.pdf2019).
    
    
22. 22.Saito TR, M. precrec: Calculate Accurate Precision-Recall and ROC (Receiver Operator Characteristics) Curves. [https://cran.r-project.org/web/packages/precrec/index.html](https://cran.r-project.org/web/packages/precrec/index.html). Published 2021. Accessed.
    
    
23. 23.Therneau TML, T.; Atkinson, E.; Crowson, C. survival: Survival Analysis. In. [https://cran.r-project.org/web/packages/survival/index.html](https://cran.r-project.org/web/packages/survival/index.html) 2020.
    
    
24. 24.Kassambara AK, M.; Biecek, P.; Fabian, S. survminer: Drawing Survival Curves using ‘ggplot2’. In. [https://cran.r-project.org/web/packages/survminer/index.html](https://cran.r-project.org/web/packages/survminer/index.html) 2020.
    
    
25. 25.Cholesterol Treatment Trialists C, Mihaylova B, Emberson J, et al. The effects of lowering LDL cholesterol with statin therapy in people at low risk of vascular disease: meta-analysis of individual data from 27 randomised trials. Lancet. 2012;380(9841):581–590.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(12)60367-5&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22607822&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000307511400031&link_type=ISI) 

26. 26.Muntner P, Colantonio LD, Cushman M, et al. Validation of the atherosclerotic cardiovascular disease Pooled Cohort risk equations. Jama. 2014;311(14):1406–1415.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2014.2630&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24682252&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000333935000016&link_type=ISI) 

27. 27.Tereshchenko LG, Shah AJ, Li Y, Soliman EZ. Electrocardiographic deep terminal negativity of the P wave in V1 and risk of mortality: the National Health and Nutrition Examination Survey III. J Cardiovasc Electrophysiol. 2014;25(11):1242–1248.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/jce.12453&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24837486&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

28. 28.Zhang ZM, Rautaharju PM, Prineas RJ, Tereshchenko L, Soliman EZ. Electrocardiographic QRS-T angle and the risk of incident silent myocardial infarction in the Atherosclerosis Risk in Communities study. J Electrocardiol. 2017;50(5):661–666.
    
    
29. 29.Zulqarnain MA, Qureshi WT, O’Neal WT, Shah AJ, Soliman EZ. Risk of Mortality Associated With QT and JT Intervals at Different Levels of QRS Duration (from the Third National Health and Nutrition Examination Survey). Am J Cardiol. 2015;116(1):74–78.
    
    
30. 30.Badheka AO, Rathod A, Marzouka GR, et al. Isolated nonspecific ST-segment and T-wave abnormalities in a cross-sectional United States population and Mortality (from NHANES III). Am J Cardiol. 2012;110(4):521–525.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amjcard.2012.04.023&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22608358&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

31. 31.Shah AJ, Vaccarino V, Janssens AC, et al. An Electrocardiogram-Based Risk Equation for Incident Cardiovascular Disease From the National Health and Nutrition Examination Survey. JAMA Cardiol. 2016;1(7):779–786.
    
    
32. 32.Waks JW, Sitlani CM, Soliman EZ, et al. Global Electric Heterogeneity Risk Score for Prediction of Sudden Cardiac Death in the General Population: The Atherosclerosis Risk in Communities (ARIC) and Cardiovascular Health (CHS) Studies. Circulation. 2016;133(23):2222–2234.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMzMvMjMvMjIyMiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA5LzE0LzIwMjEuMDkuMDkuMjEyNjMzMjcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

33. 33.Screening for Cardiovascular Disease Risk With Electrocardiography: US Preventive Services Task Force Recommendation Statement. JAMA. 2018;319(22):2308–2314.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2018.6848&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29896632&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom) 

34. 34.Attia ZI, Kapa S, Yao X, et al. Prospective validation of a deep learning electrocardiogram algorithm for the detection of left ventricular systolic dysfunction. J Cardiovasc Electrophysiol. 2019;30(5):668–674.
    
    
35. 35.Raghunath S, Ulloa Cerna AE, Jing L, et al. Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network. Nat Med. 2020;26(6):886–891.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F14%2F2021.09.09.21263327.atom)