Abstract
Background Electrocardiographic (ECG) Heart Age conveying cardiovascular risk has been estimated by both Bayesian and artificial intelligence approaches. We hypothesized that explainable measures from the 10-second 12-lead ECG could successfully predict Bayesian ECG Heart Age.
Methods Advanced analysis was performed on ECGs from healthy subjects and patients with cardiovascular risk or proven heart disease. Regression models were used to predict a Bayesian 5-minute ECG Heart Age from the standard resting 10-second 12-lead ECG. The difference between 10-second ECG Heart Age and chronological age was compared.
Results In total, 2,771 subjects were included (n=1682 healthy volunteers, n=305 with cardiovascular risk factors, n=784 with cardiovascular disease). Overall, 10-second Heart Age showed strong agreement with the 5-minute Heart Age (R2=0.94, p<0.001, mean±SD bias 0.0±5.1 years). The difference between 10-second ECG Heart Age and chronological age was 0.0±5.7 years in healthy individuals, 7.4±7.3 years in subjects with cardiovascular risk factors (p<0.001), and 14.3±9.2 years for patients with cardiovascular disease (p<0.001).
Conclusions ECG Heart Age can be accurately estimated from a 10-second 12-lead ECG in a transparent and explainable fashion based on known ECG measures, without artificial intelligence techniques. The difference between ECG Heart Age and chronological age increases markedly with cardiovascular risk and disease.
Background
Cardiovascular disease is a significant contributor to mortality, and pathological processes begin early and can progress silently for many years1-3. Many of the risk factors for developing cardiovascular disease are lifestyle-related4. Fortunately, several of these are modifiable, and risk can therefore be reduced for example by smoking cessation, dietary changes and increased physical activity5,6. To accomplish this, an individual must fully understand their risk and become motivated to act upon it. One way of communicating risks to the patient is to present the risk as a “Heart Age”, which can be contrasted to the patient’s chronological age. A Heart Age can either be determined by translating risk factor scores to what age that score would represent in an individual with no risk factors, or it can be based on electrocardiographic changes7-12. Describing the risk to the patient using Heart Age has been reported to reduce metabolic risk factors and may have the advantage of being easily understood by the patient11,13. A similar approach has been applied when conveying risk to smokers by describing how ‘old’ their lungs are, and such an approach increased the chance of smoking cessation14.
Moving beyond but not excluding basic ECG measurements such as heart rate and waveform amplitudes and durations, the diagnostic output from the ECG can be further substantially improved by using combinations of advanced ECG measures from 12-lead-ECG-derived vectorcardiography and waveform complexity15-17. An accurate ECG Heart Age using advanced, 5-minute, 12-lead ECG was developed in 2014, based on Bayesian statistics9. The Bayesian approach was transparent in so much as the advanced ECG measures contributing to the ECG Heart Age were well-described. After being trained on a set of healthy individuals, this approach yielded increased ECG Heart Ages for subjects at risk of cardiovascular disease and even higher ECG Heart Ages for those with established cardiovascular disease9. However, the reliance on 5-minute, high-fidelity 12-lead ECG recordings lessens the likelihood of widespread clinical use. If standard 10-second ECG recordings could be used instead, the clinical impact might be enhanced. Moreover, artificial intelligence has been used to estimate ECG Heart Age using the 10-second resting 12-lead ECG7,8,10. However, artificial intelligence techniques are limited by their “black box” approach, whereby the clinician does not have transparency as to the exact source(s) of the changes in the ECG that can affect an ECG Heart Age or other output18,19.Therefore, the aim of the study was to predict 5-minute ECG Heart Age from measures available by 10-second 12-lead ECG, and to compare the 10-second ECG Heart Age to chronological age in healthy subjects, subjects with cardiovascular risk factors, and patients with established cardiovascular disease. We hypothesized that 10-second 12-lead ECG recordings could accurately predict Bayesian ECG Heart Ages derived from 5-minute 12-lead ECG recordings.
Methods
A database of de-identified patients with both 5-minute and 10-second 12-lead ECG recordings was utilized for the study9. Within that database, healthy individuals, patients at cardiovascular risk, and patients with established cardiovascular disease were included. All healthy subjects were low-risk asymptomatic volunteers with absence of any cardiovascular or systemic disease, based on clinical history and physical examination. Exclusion criteria for the healthy subjects included increased blood pressure at physical examination (≥140/90 mm Hg), treatment for hypertension or diabetes, or active smoking. Patients with established cardiovascular disease were included based on the presence of either coronary heart disease (determined by coronary angiography with at least one obstructed vessel (≥50%) in at least one major native coronary vessel or coronary graft, or, if coronary angiography was either unavailable or clinically not indicated, one or more reversible perfusion defects on 99m-Tc-tetrofosmin single-photon emission computed tomography (SPECT)20-22), left ventricular hypertrophy (LVH) based on imaging evidence of at least moderate, concentric wall thickening according to guidelines of the American Society of Echocardiography23, left systolic dysfunction (left ventricular ejection fraction ≤50%) at echocardiography, cardiac magnetic resonance imaging (CMR) or SPECT, or findings suggestive of dilated/hypertrophic/ischemic cardiomyopathy at echocardiography or CMR16. Finally, subjects at cardiovascular risk were included based on the presence of cardiovascular risk factors such as hypertension or diabetes but no confirmed established cardiovascular disease9.
Based on the above, three groups of study participants were included: healthy subjects, subjects at cardiovascular risk, and patients with established cardiovascular disease. By methodological design, onlyhealthy subjectswere initially included when considering optimal measures of ECG available from 10-second 12-lead ECG for predicting the 5-minute ECG Heart Age. The 10-second ECG measures considered for the prediction model included: (1) From the conventional ECG: heart rate, R-to-R, P-wave, PR, QRS, QT, QTc, and TQ interval durations, as well as the conventional ECG amplitudes and axes; (2) From the transformation of the 12-lead ECG to the Frank X, Y and Z lead vectorcardiogram (VCG) via Kors’ transform24-28: the spatial means and peaks QRS-T angles, the spatial ventricular gradient and its individual QRS and T components, the spatial QRS- and T-wave axes (azimuths and elevations), waveform amplitudes and areas, including those in the three individual vectorcardiographic planes, and spatial QRS-and T-wave velocities; and (3) measures of QRS-and T-wave waveform complexity based on singular value decomposition after signal averaging29-31.
For all study participants, the difference between the 10-second ECG Heart Age and the chronological age was also calculated. Results were then compared between the three groups defined above. All participants gave written informed consent. All recordings were obtained under Institutional Review Board (IRB) approvals from NASA’s Johnson Space Center and partner hospitals that fall under IRB exemptions for previously collected and de-identified data. The study was performed in accordance with the Declaration of Helsinki.
Statistical analysis
Continuous variables were described using mean and standard deviation (SD). The chi-squared test was used to test for proportional differences between groups. Student’s t test was used to compare group means. The 10-second ECG Heart Age was first derived only in the healthy subject group through a process of feature selection, via optimized stepwise procedures, of measures available from 10-s, standard-fidelity ECG recordings, first using univariable linear regression, and finally as multivariable linear regression, to best predict the 5-minute ECG Heart Age9. The best model was defined as the model with highest R2 value that was also the most parsimonious, i.e., a model with statistically equal performance that incorporated a lesser number of measures was considered more parsimonious due to less proneness to over-fitting. The best model from the healthy volunteers was then applied forward to an expanded population that also included the other two groups (cardiovascular risk and cardiovascular disease, respectively), while not changing any of the included variables, and only allowing optimization of the included coefficients. The final sex-specific model was then applied across all three groups as the 10-secondECG Heart Age, and comparisons with 5-minute ECG Heart Age are presented as scatter plots and Bland-Altman plots. A two-sided p-value of 0.05 was used as to define statistical significance. Statistical analyses were performed using SAS JMP version 11.0, SAS Institute Inc, Cary, NC, USA, and R version 3.5.3R, R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/.
Results
In total, 2,771 patients were included (n=1682 healthy volunteers, n=305 subjects with cardiovascular risk factors, n=784 with cardiovascular disease). Baseline characteristics are presented in Table 1. The ECG measures included in the final prediction models are presented in Table 2 (males) and Table 3 (females).
The 10-second ECG Heart Age showed excellent agreement with the 5-minute Heart Age (R2=0.94, p<0.001, mean±SD bias 0.0±5.2 years), Figure 1. Agreement was strong for both males and females (R2=0.91, p<0.001, and R2=0.92, p<0.001 respectively). In healthy subjects, there was no difference in ECG Heart Age and chronological age (0.0±5.7 years). In subjects with cardiovascular risk factors, the difference was higher (7.4±7.3 years, p<0.001). Patients with cardiovascular disease showed the largest difference between ECG Heart Age and chronological age (14.3±9.2 years, p<0.001 when compared to subjects at cardiovascular risk), Figure 2.
Discussion
We found that ECG Heart Age based on measures of advanced ECG can be accurately predicted from standard resting 10-second 12-lead ECG recordings. This facilitates widespread use of ECG Heart Age in routine clinical settings, since neither specialized ECG machines nor unusually lengthy recordings are necessary. Also, if digital ECG raw data are available and the recording is of acceptable quality, ECG Heart Age can be retrospectively determined. Further, we found ECG Heart Age to be similar to chronological age in healthy individuals, while the ECG Heart Age was increasingly older with increasing cardiovascular disease status. This suggests that the ECG Heart Age is likely to provide accurate cardiovascular risk prediction, although validation in other datasets is necessary.
Explainability and transparency of variables that contribute to ECG Heart Age
For an estimation of Heart Age to be accurate in predicting an age that is similar to the chronological age when the heart is healthy, and increased when the heart is diseased, it is desirable that the included ECG measures change with age, and that the change is augmented with increasing cardiovascular risk or disease severity. Beyond age itself, the two ECG measures that had the strongest influence (highest t ratio) on the model were P-wave duration and spatial QT duration, and these measures fit this description well. P-wave duration increases with age32, and increased P-wave duration can be seen in advanced cardiovascular pathologies, e.g. heart failure and cardiac amyloidosis33. Similarly, QT duration increases with age34. Further, QT prolongation is associated with increased of cardiovascular risk, even beyond the rare long QT syndromes35, and with incrementally increased risk in advanced ages36. These general characteristics are also true for increased heart rate and for leftward shifting of the frontal plane QRS axis37-40. The other measures included in the score track changes in the vectorcardiographic QRS and T, and in T-wave complexity by singular value decomposition. Such changes are also known to occur in conditions associated with increased cardiovascular risk, such as hypertension and diabetes41, and in established cardiovascular disease, in which they often provide strong diagnostic and prognostic information24-27,29-31. Notably, these changes are not easily detectable by visual interpretation of a standard 12-lead ECG. How these ECG measures can affect the ECG Heart Age is exemplified in Figure 3. Taken together, the described ECG measures that contribute in a multivariable fashion to the ECG Heart Age all have physiologically reasonable associations with age and disease in a way that is transparent to the assessing clinician, thus providing important explainability to the model.
Differences compared to Bayesian 5-minute ECG Heart Age
The original, Bayesian 5-minute ECG Heart Age requires information from measures of beat-to-beat heart rate and QT variability,9 and of the root-mean square voltage or other aspects of high-frequency (high fidelity) components of the QRS complex42-44. However, 10-second-duration recordings of standard fidelity do not allow for such measures, and therefore they were not included in the 10-second ECG Heart Age. However, unlike the original Bayesian 5-minute ECG Heart Age, the 10-second ECG Heart Age should be derivable from any standard 12-lead ECG machine, as long as it is sufficiently equipped with software that can measure the included measures and calculate the 10-second ECG Heart Age. The presented 10-second ECG Heart Age might therefore be anticipated to contribute to more widespread clinical penetration and use.
Comparison with other Heart (or Vascular) Ages
Different means of expressing cardiovascular risk by translating it into a heart or vascular age have recently been published13. Attia, et al, showed that by using a deep neural network (DNN) artificial intelligence (AI) technique, a patient’s chronological age could be predicted, and that if the difference between the predicted and actual age was small, prognosis was good 8. When Heart Age by Attia et al’s technique was older than the chronological age, the risk of future death was increased7. This corresponds well to the findings in our study that ECG Heart Age increased with increasing burden of cardiovascular risk. However, the prognostic value of the ECG Heart Age presented in the current study requires additional validation. Furthermore, another AI method similar to that of Attia et al reported similarly encouraging results10. However, although the results of such AI studies are promising, DNN-based AI techniques are inherently problematic in several respects, especially in relation to their lack of transparency and explainability, i.e., the ‘black box’ of AI18,19.Without the ability to know the exact features of the 12-lead ECG that are most important in a given DNN model’s output, both interpretability and ethical accountability are compromised45. Moreover, it is effectively impossible for a clinician to identify, when critically evaluating the diagnostic output of a DNN-based AI model, the possible contribution to the result from methodological artifact or bias merely related to noise or to differing technical specifications between different ECG machines46. Alternatively, it is not possible to know if unanticipated results are possibly just related to excess dependency on the particular characteristics of a given DNN AI model’s training set47. In addition, a major flaw in both DNN-based AI models is that the age predictions were made using datasets including individuals with both cardiovascular risk factors and established disease8,10. For ECG Heart Age to be used as a marker of cardiovascular risk, it is imperative that ECG Heart Age agrees with chronological age in healthy populations, since it is the deviations from the line of identity in this relationship that form the basis of the assumed risk increase8.
Hence, we believe that the pursuit of an ECG Heart Age developed from heart-healthy subjects of varying ages, but without a black-box DNN or related AI methodology is valuable, and that the present results provide sufficient confirmation of accuracy to encourage further development. In addition, models in which the assessments were based on age predictions in healthy subjects will likely outperform models that were not. And finally, the use of more transparent regression models will also increase the ability of clinicians to better understand the origin of any unexpected result, and to thereafter relay it to the patient with a more convincing sense of trust and ethical accountability45.
Limitations
Using the same dataset for training and validation, our results can only be considered as proof-of-concept. Although the difference between ECG Heart Age and the subjects’ chronological age increased with increasing cardiovascular risk and/or disease, the prognostic value is unknown. Also, although ECG Heart Age was highly accurate, the precision cannot be reliably defined in this study. These aspects therefore need to be addressed in future studies.
Conclusion
We show that ECG Heart Age can be accurately, transparently, and explainably estimated from a standard 10-s, resting 12-lead ECG utilizing multiple, discrete conventional and advanced ECG measures. The difference between ECG Heart Age and subjects’ chronological age increases with increasing cardiovascular risk and disease. The prognostic value of our transparent and explainable 10-second ECG Heart Age requires prospective evaluation in future studies.
Data Availability
All data utilized in the present study are available upon reasonable request to the authors.
Footnotes
Fundings: TL is currently under the support of postdoctoral research grants from The Swedish Heart-Lung Foundation (grant no 20200553), the Swedish Cardiac Society, the Royal Swedish Academy of Sciences (grant no LM2019-0013), Women and Health Foundation, Region Kronoberg (grant no 8301), The Swedish Heart and Lung Association (grant no LKH1387), Swedish Association of Clinical Physiology, and the Scandinavian Society of Clinical Physiology; Nuclear Medicine. The study was funded in part by grants (PI Ugander) from New South Wales Health, Heart Research Australia, and the University of Sydney.
Competing interests: TL, IPL: None TTS is owner and founder of Nicollier-Schlegel SARL, which performs ECG interpretation consultancy using software that can quantify the advanced ECG measures used in the current study. TTS and MU are owners and founders of Advanced ECG Systems, a company that is developing commercial applications of advanced ECG technology used in the current study.