IMPROVING CARDIOVASCULAR DISEASE RISK PREDICTION WITH MACHINE LEARNING USING MENTAL HEALTH DATA: A PROSPECTIVE UK BIOBANK STUDY =============================================================================================================================== * Mohsen Dorraki * Zhibin Liao * Derek Abbott * Peter J. Psaltis * Emma Baker * Niranjan Bidargaddi * Hannah R. Wardill * Anton van den Hengel * Jagat Narula * Johan W. Verjans ## ABSTRACT **Background** Robust and accurate prediction of cardiovascular disease (CVD) risk facilitates early intervention to benefit patients. It is well-known that mental disorders and CVD are interrelated. Nevertheless, psychological factors are not considered in existing models, which use either a limited number of clinical and lifestyle factors, or have been developed on restricted population subsets. **Objectives** To assess whether inclusion of psychological data could improve CVD risk prediction in a new machine learning (ML) approach. **Methods** Using a comprehensive, long-term UK Biobank dataset (n=375,145), we examined the correlation between CVD diagnoses and traditional and psychological risk factors. An ensemble ML model containing five constituent algorithms [decision tree, random forest, XGBoost, support vector machine (SVM), and deep neural network (DNN)] was tested for its ability to predict CVD risk based on two training datasets: one using traditional CVD risk factors alone, or a combination of traditional and psychological risk factors. **Results** Our ensemble ML model could predict CVD with 71.31% accuracy using traditional CVD risk factors alone. However, by adding psychological factors to the training data, accuracy dramatically increased to 85.13%. The accuracy and robustness of our ensemble ML model outperformed all five constituent learning algorithms. Re-testing the model on a control dataset to predict bone diseases returned random results, confirming specificity of the training data for prediction of CVD. **Conclusions** Incorporating mental health assessment data within an ensemble ML model results in a significantly improved, highly accurate, state-of-the-art CVD risk prediction. **AUTHOR APPROVAL** All authors have seen and approved the manuscript. **COMPETING INTERESTS** The authors declare no competing interests. **DATA AVAILABILITY STATEMENT** All data needed to evaluate the conclusions in the paper are present in the paper or in the supplementary materials. In addition, we used UK Biobank in this study: [www.ukbiobank.ac.uk](http://www.ukbiobank.ac.uk). **FUNDING** No funding. KEYWORDS * Deep learning * heart * psychological factors * early intervention * artificial intelligence ## INTRODUCTION Coronary heart disease and cerebrovascular disease are the world’s leading cause of death (1). Early intervention is highly effective for people at risk of developing the disease, and a widely used approach is to screen at-risk populations (2). Several Cardiovascular disease (CVD) risk prediction models have been developed during the past four decades. The Reynolds Risk Score in women, used in the US, is based on data from the Women’s Health Study (3). Another multi-marker risk model assesses the contribution of ten genetic markers, including B-type natriuretic peptide and C-reactive protein (4). SCORE, introduced by the European Society of Cardiology, was developed retrospectively from data originating from 12 European cohorts undergoing baseline examination (5), and QRISK is recommended in Europe (6,7). Most risk prediction models follow a statistical regression approach that relates CVD risk with an individual’s characteristics (8), but the number of factors used for prediction is often limited. New approaches, covering a more comprehensive range of markers that do not rely solely on traditional clinical and lifestyle factors, could improve precision in identifying CVD risk. In addition to traditional, recognized risk factors for CVD, such as smoking, high cholesterol levels, hypertension, and obesity, there is also evidence of a strong association between psychological factors and CVD (9). The association is bidirectional, in that psychological factors may be common in certain CVDs and portend worse outcomes, regardless of whether the psychological disorder or CVD occurs first (10). The important role of psychological stress has been investigated in myocardial ischemia (11), coronary artery disease (12,13), acute and reversible cardiomyopathy (14), and ventricular arrhythmias (15). Prospective evidence implies that depression plays a major role CVD development (16,17). The association of depression is also reported with coronary artery disease (18,19), heart rate reactivity (20), myocardial infarction (21), and respiratory sinus arrhythmia fluctuation (22). A considerable number of studies have shown that anxiety is an independent risk factor for cardiac mortality and coronary heart disease (23–25). Although the association between psychological factors and CVD is well documented, no method has previously been devised for exploiting the former in predicting the latter. Until now, the development of such a model has not been possible due to the lack of an appropriate dataset and an automated assessment framework. This study aims to develop an ensemble machine learning (ML) model to predict CVD risk, initially using only traditional risk factors factors [such as body mass index (BMI), hypertension (HT), systolic blood pressure (SBP), diastolic blood pressure (DBP), diabetes, hyperlipidemia, smoking status, age, and gender], then with the addition of psychological data [such as depression, anxiety, and stress], to train and assess the accuracy of our ML model. We develop an ensemble machine learning (ML) model to predict CVD risk, initially using only traditional risk factors, then with the addition of psychological data, to train and assess the accuracy of our ML model. ## METHODS ### Study design The UK Biobank is a large-scale biomedical database and research resource containing in-depth genetic and health information. It comprises a large, long-term (since 2006) dataset from the United Kingdom, recruiting >500,000 participants from the general population—design and participant population are described in (26). Data is collected through a combination of questionnaires, interviews, and blood, urine, and saliva samples and describes these physical measurements, together with lifestyle, medical history, and a series of health risk factors. We considered two categories as training data for our ML model. The first consisted of traditional CVD risk factors, such as gender, age, smoking status, BMI, HT, diabetes, SBP, DBP, and hyperlipidemia. The second category contained data from the UK Biobank touchscreen questionnaire on psychological factors and mental health including diagnosed disorders (bipolar disorder, depression, anxiety, etc.), and symptoms (neuroticism, mood swings, miserableness, irritability, sensitivity, hurt feelings, nervous feelings, loneliness, isolation, guilty feelings, severity of mania, etc.; full details and ICD-10 codes in Supplemental Table 1). CVDs considered in this study included atrial fibrillation/flutter, ventricular arrhythmias, angina pectoris, acute and subsequent myocardial infarctions and subsequent complications, chronic and acute ischemic heart diseases, and hypertensive heart disease (full details in Supplemental Table 2). After preprocessing to remove entries with incomplete data, the data from n=375,145 subjects were analyzed using a Pearson correlation between input features {*X*1, *X*2, *X*3, …, *X**n*} and output label *y**i*, where *i* = 1, 2, 3, …, *n* was mathematically defined as (27): ![Formula][1] where *x* and *y* denote the input variable and the dependent variable or target feature in the dataset, respectively. The median diagnosis age of included patients was 58.0 years (interquartile range: 37–73 years). We separated the data into training and test sets with 90% and 10% of subjects. ### Modeling and predictive analyses We developed an ensemble model containing five ML methods: decision tree, random forest, XGBoost, support vector machine (SVM), and deep neural networks (DNN). Briefly, a decision tree algorithm (28) employs a tree-like model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. Random forest (29) is a learning method operating by constructing a multitude of decision trees at training time and XGBoost (30) is a scalable end-to-end tree boosting system. An SVM (31) is a supervised ML approach that maps training examples to points in space to maximize the width of the gap between the two clusters of points. A neural network recognizes hidden patterns in a dataset by mimicking the human brain, and DNNs (32) possess multiple layers to progressively extract higher-level patterns from raw input. Our proposed ensemble-learning model collected results from each individual learning approach and produced a final output prediction by adopting a majority voting strategy; each model makes a prediction (votes) for each test instance and the final result is the one that receives more than half the votes. The process of UK Biobank data collection and our model overview are shown in the Figure 1. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/10/25/2022.10.23.22281428/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2022/10/25/2022.10.23.22281428/F1) Figure 1: Model overview. A) Data collection by the UK Biobank. B) Our ensemble machine learning approach, including decision tree, random forest, XGBoost, SVM, and DNN models, was developed to predict CVD risk from psychological factors. ## RESULTS ### Risk factor correlation Our correlation analysis (Figure 2A) shows that almost all psychological factors are highly correlated, with the exception of risk taking (code:2040). There is generally a negative correlation between age and psychological disorders, which may imply that psychological illnesses are not a natural part of ageing, and that psychological disorders predominantly affect younger adults. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/10/25/2022.10.23.22281428/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2022/10/25/2022.10.23.22281428/F2) Figure 2: CVD risks for participants from the UK Biobank. A) The correlation between CVD and a selection of psychological factors and traditional CVD risk factors. Descriptions for psychological factor codes are provided in Supplemental Table 1. B) The association of four traditional CVD risk factors by gender: age, BMI, SBP, and DBP. C) The prevalence of CVD by gender and smoking status. Positive associations were observed between CVD and the traditional risk factors, especially for HT (Figure 2A). Figure 2B depicts the distribution of key physical factors (age, BMI, SBP, DBP) by gender. The incidence of CVD in females is lower than in males, and CVD incidence increases for previous and current smokers (Figure 2C). ### Prediction of CVD We explored the ability of our model to predict CVD from two different groups of training data, namely traditional CVD risk factors only, and the combination of traditional and psychological factors. We separately trained the model on each dataset, then evaluated its accuracy with unseen test data. Trained using traditional CVD risk factors only (gender, age, BMI, HT, hyperlipidemia, SBP, DBP, diabetes, and smoking status), our ensemble model was able to predict CVD with an accuracy of 71.31% (precision 74.91%; recall 70.48%), which exceeded the performance of each individual ML model, particularly in precision (Figure 3A; metrics defined in Supplemental Table 3). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/10/25/2022.10.23.22281428/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2022/10/25/2022.10.23.22281428/F3) Figure 3. Results from different ML models for two training data categories. Accuracy, precision, and recall results from our ensemble model and five individual learning approaches are shown for training data comprising A) traditional CVD risk factors only; and B) the combination of traditional CVD risk and psychological factors. All metrics are described in Supplemental Table 3. A second, independent training using combined traditional and psychological risk factors significantly increased CVD prediction accuracy of the ensemble model to 85.13% (precision 79.62%; and recall 89.40%; Figure 3B). The accuracy and recall of our ensemble model outperformed any of the five constituent learning algorithms; intriguingly, the precision of the individual random forest and DNN algorithms were higher than for the ensemble model, suggesting the appropriate data classification of the ensemble model. Thus, adding amassed psychological data to train our ensemble model increased the baseline 71.31% accuracy of CVD risk prediction by over 10%, to be 85.13% accurate. All other performance metrics also increased significantly after adding psychological factors to the training data, indicating their value in improving identification of patients at-risk of CVD. ### Bone disease prediction cross-check To examine whether the predicted results were not generated spuriously, we performed a control experiment using the UK Biobank bone disease dataset. Using our model trained on traditional and psychological CVD risk factors, we tested its accuracy in predicting bone diseases such as tuberculosis of bones and joints, disorders of bone density and structure, and osteopathies (see Supplemental Table 4). The ensemble model returned performance results similar to those of the five individual approaches; none were effective at predicting bone disorders using CVD risk factors (Supplemental Figure 1). ## DISCUSSION In this prospective study, we developed an ensemble ML approach that can accurately predict CVD risk from traditional risk factors alone. The accuracy of CVD prediction substantially improved to more than >85% by including mental health assessment data in the model, which addressed participant symptoms and disorders such as depression, anxiety, and stress. Prediction of CVD risk is a critical domain within CVD prevention. Several CVD risk prediction systems, such as SCORE (5), work with traditional risk factors, e.g., age, sex, smoking, SBP, and total cholesterol. While these factors correlate well with CVD, their number is limited. Further, some prediction systems have been developed on narrow datasets, e.g., subjects with age of 40–65 years for SCORE and 40–69 years for SCORE2, limiting their validity to subjects outside that range. Because of the relatively limited performance, and variation on individual level, new approaches are needed to guide primary prevention with features that do not rely solely on a limited number of clinical and lifestyle factors, and that can be applied to a broader range of individuals. Psychological studies have shown that mental health factors can be used for CVD risk prediction, which was demonstrated via correlation analysis and statistical risk prediction (33). In this study we have used for the first time an ML approach that exploits these relationships to improve CVD risk prediction to impact management and early intervention. ML approaches have the capability to interpret hidden patterns and structures within psychological and CVD data, that cannot be detected by humans or traditional statistical approaches. These approaches have the power to analyze very large quantities of data to generate data-driven recommendations, rather than hypothesis driven answers. These intelligent methods do not discount the value of conventional diagnosis assessments to evaluate CVD risks; rather, they provide opportunities to predict or make diagnoses faster, more convenient, and more accurate by analyzing larger volumes of data than would be possible by humans. Over the past few years, ML researchers have studied ensemble ML schemes that employ multiple approaches to provide more accurate performance than could be returned from any single constituent learning algorithm. In this study, we have developed an ensemble ML model to analyze UK Biobank data that improved the decision robustness and accuracy over any individual approach. For the first time, we show that adding mental health factors to traditional CVD risk factors can improve the accuracy of ML prediction models. Our results may open new avenues for medical researchers and clinicians to predict CVD risk more precisely and apply early intervention strategies for at-risk individuals using a single time point mental health assessment. The principles demonstrated here can be readily applied to more complex ML models with more input data. Comprehensive biobank datasets are necessary for training and validation, and these are becoming more available in healthcare settings through several sources such as hospitals. Our ensemble ML approach can be used to automate analysis and management of these big datasets to derive meaningful information to benefit clinical outcomes. ### Study limitations We have focused on the prediction of CVD using traditional and psychological factors regardless of CVD diagnosis timing. Although the date of first CVD diagnosis and the date of completing mental health questionnaires are available for individual patients, it is difficult to conclude with certainty whether the reported psychological disorders preceded or followed the development of CVD, especially as the mental health questionnaires only began in August 2016 (Supplemental Figure 2). Addressing this question is challenging as many patients reported with psychological disorders have unknowingly already developed CVD or vice versa. Investigations typically assume that the personality structure occurred before the appearance of heart disease (34), however, a longitudinal study of the personality of individuals prior to CVD would be required to address this question, opening future opportunities to investigate causal relationships. ## CONCLUSIONS We have developed a new CVD risk prediction ensemble ML model that improves CVD prediction outcomes relative to all five constituent algorithms when trained on traditional CVD risk factors. The accuracy remarkably increased to more than 85% when the model was trained using mental health data in addition to traditional CVD risk factor data. The findings imply that psychological assessment can become a reliable, easy and affordable contribution to improving CVD risk prediction and management using ML approaches. ## Supporting information Supplemental Material [[supplements/281428_file02.zip]](pending:yes) ## Data Availability All data needed to evaluate the conclusions in the paper are present in the paper or in the supplementary materials. In addition, we used UK Biobank in this study: [www.ukbiobank.ac.uk](https://www.ukbiobank.ac.uk). [https://www.ukbiobank.ac.uk](https://www.ukbiobank.ac.uk) * Received October 23, 2022. * Revision received October 23, 2022. * Accepted October 25, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/) ## REFERENCES 1. 1.Task FM, Piepoli M, Hoes A et al. 2016 European Guidelines on cardiovascular disease prevention in clinical practice: The Sixth Joint Task Force of the European Society of Cardiology and Other Societies on Cardiovascular Disease Prevention in Clinical Practice (constituted by representatives of 10 societies and by invited experts) Developed with the special contribution of the European Association for Cardiovascular Prevention & Rehabilitation (EACPR). Atherosclerosis 2016;252:207. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27664503&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) 2. 2.Casto G, Mastropieri MA. The efficacy of early intervention programs: A meta-analysis. Exceptional children 1986;52:417–424. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/001440298605200503&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2936605&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1986AYD0600002&link_type=ISI) 3. 3.Ridker PM, Buring JE, Rifai N, Cook NR. Development and validation of improved algorithms for the assessment of global cardiovascular risk in women: the Reynolds Risk Score. Jama 2007;297:611–619. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.297.6.611&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17299196&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000244177400024&link_type=ISI) 4. 4.Wang TJ, Gona P, Larson MG et al. Multiple biomarkers for the prediction of first major cardiovascular events and death. New England Journal of Medicine 2006;355:2631–2639. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa055373&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17182988&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000242956200004&link_type=ISI) 5. 5.Conroy RM, Pyörälä K, Fitzgerald Ae et al. Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project. European heart journal 2003;24:987–1003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0195-668X(03)00114-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12788299&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183577800002&link_type=ISI) 6. 6.Hippisley-Cox J, Coupland C, Vinogradova Y, Robson J, May M, Brindle P. Derivation and validation of QRISK, a new cardiovascular disease risk score for the United Kingdom: prospective open cohort study. Bmj 2007;335:136. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjEyOiIzMzUvNzYxMS8xMzYiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMC8yNS8yMDIyLjEwLjIzLjIyMjgxNDI4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 7. 7.Hippisley-Cox J, Coupland C, Vinogradova Y, Robson J, Brindle P. Performance of the QRISK cardiovascular risk prediction algorithm in an independent UK sample of patients from general practice: a validation study. Heart 2008;94:34–39. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiaGVhcnRqbmwiO3M6NToicmVzaWQiO3M6NzoiOTQvMS8zNCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEwLzI1LzIwMjIuMTAuMjMuMjIyODE0MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 8. 8.Altman DG, Royston P. What do we mean by validating a prognostic model? Statistics in medicine 2000;19:453–473. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.0.CO;2-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10694730&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000085613900005&link_type=ISI) 9. 9.Rutledge T, Linke SE, Krantz DS et al. Comorbid depression and anxiety symptoms as predictors of cardiovascular events: results from the NHLBI-sponsored Women’s Ischemia Syndrome Evaluation (WISE) study. Psychosomatic medicine 2009;71:958. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoicHN5Y2htZWQiO3M6NToicmVzaWQiO3M6ODoiNzEvOS85NTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMC8yNS8yMDIyLjEwLjIzLjIyMjgxNDI4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 10. 10.Piña IL, Di Palo KE, Ventura HO. Psychopharmacology and cardiovascular disease. Journal of the American College of Cardiology 2018;71:2346–2359. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjEwOiI3MS8yMC8yMzQ2IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTAvMjUvMjAyMi4xMC4yMy4yMjI4MTQyOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 11. 11.Jiang W, Babyak M, Krantz DS et al. Mental stress—induced myocardial ischemia and cardiac events. Jama 1996;275:1651–1656. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.1996.03530450041030&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8637138&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996UN25200027&link_type=ISI) 12. 12.Bosma H, Peter R, Siegrist J, Marmot M. Two alternative job stress models and the risk of coronary heart disease. American journal of public health 1998;88:68–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2105/AJPH.88.1.68&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9584036&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000073804800012&link_type=ISI) 13. 13.Sheps DS, McMahon RP, Becker L et al. Mental stress–induced ischemia and all-cause mortality in patients with coronary artery disease: results from the Psychophysiological Investigations of Myocardial Ischemia study. Circulation 2002;105:1780–1784. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMDUvMTUvMTc4MCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEwLzI1LzIwMjIuMTAuMjMuMjIyODE0MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 14. 14.Sharkey SW, Lesser JR, Zenovich AG et al. Acute and reversible cardiomyopathy provoked by stress in women from the United States. Circulation 2005;111:472–479. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjk6IjExMS80LzQ3MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEwLzI1LzIwMjIuMTAuMjMuMjIyODE0MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 15. 15.Lampert R, Jain D, Burg MM, Batsford WP, McPherson CA. Destabilizing effects of mental stress on ventricular arrhythmias in patients with implantable cardioverter-defibrillators. Circulation 2000;101:158–164. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjk6IjEwMS8yLzE1OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzEwLzI1LzIwMjIuMTAuMjMuMjIyODE0MjguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 16. 16.Kuper H, Marmot M, Hemingway H. Systematic review of prospective cohort studies of psychosocial factors in the etiology and prognosis of coronary heart disease. Seminars in vascular medicine: Copyright© 2002 by Thieme Medical Publishers, Inc., 333 Seventh Avenue, New …, 2002:267–314. 17. 17.Salomon K, Clift A, Karlsdóttir M, Rottenberg J. Major depressive disorder is associated with attenuated cardiovascular reactivity and impaired recovery among those free of cardiovascular disease. Health psychology 2009;28:157. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/a0013001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19290707&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000264326300003&link_type=ISI) 18. 18.Ford DE, Mead LA, Chang PP, Cooper-Patrick L, Wang N-Y, Klag MJ. Depression is a risk factor for coronary artery disease in men: the precursors study. Archives of Internal Medicine 1998;158:1422–1426. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archinte.158.13.1422&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9665350&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000075095500005&link_type=ISI) 19. 19.York KM, Hassan M, Li Q, Li H, Fillingim RB, Sheps DS. Coronary artery disease and depression: patients with more depressive symptoms have lower cardiovascular reactivity during laboratory-induced mental stress. Psychosomatic medicine 2007;69:521–528. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoicHN5Y2htZWQiO3M6NToicmVzaWQiO3M6ODoiNjkvNi81MjEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMC8yNS8yMDIyLjEwLjIzLjIyMjgxNDI4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 20. 20.Knight BG, McCallum T. Heart rate reactivity and depression in African-American and white dementia caregivers: reporting bias or positive coping? Aging & Mental Health 1998;2:212–221. 21. 21.Pratt LA, Ford DE, Crum RM, Armenian HK, Gallo JJ, Eaton WW. Depression, psychotropic medication, and risk of myocardial infarction: prospective data from the Baltimore ECA follow-up. Circulation 1996;94:3123–3129. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjEwOiI5NC8xMi8zMTIzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTAvMjUvMjAyMi4xMC4yMy4yMjI4MTQyOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 22. 22.Rottenberg J, Clift A, Bolden S, Salomon K. RSA fluctuation in major depressive disorder. Psychophysiology 2007;44:450–458. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1469-8986.2007.00509.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17371497&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) 23. 23.Compare A, Germani E, Proietti R, Janeway D. Clinical psychology and cardiovascular disease: an up-to-date clinical practice review for assessment and treatment of anxiety and depression. Clinical practice and epidemiology in mental health: CP & EMH 2011;7:148. 24. 24.Janszky I, Ahnve S, Lundberg I, Hemmingsson T. Early-onset depression, anxiety, and risk of subsequent coronary heart disease: 37-year follow-up of 49,321 young Swedish men. Journal of the American College of Cardiology 2010;56:31–37. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjc6IjU2LzEvMzEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8xMC8yNS8yMDIyLjEwLjIzLjIyMjgxNDI4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 25. 25.Zafar MU, Paz-Yepes M, Shimbo D et al. Anxiety is a better predictor of platelet reactivity in coronary artery disease patients than depression. European heart journal 2010;31:1573–1582. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehp602&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20097703&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000279472900011&link_type=ISI) 26. 26.Sudlow C, Gallacher J, Allen N et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 2015;12:e1001779. 27. 27.Assegie TA, Sushma S, Bhavya B, Padmashree S. Correlation Analysis for Determining Effective Data in Machine Learning: Detection of Heart Failure. SN Computer Science 2021;2:1–5. 28. 28.Safavian SR, Landgrebe D. A survey of decision tree classifier methodology. IEEE transactions on systems, man, and cybernetics 1991;21:660–674. 29. 29.Ho TK. Random decision forests. Proceedings of 3rd international conference on document analysis and recognition: IEEE, 1995:278–282. 30. 30.Chen T, He T, Benesty M, Khotilovich V, Tang Y, Cho H. Xgboost: extreme gradient boosting. R package version 04-2 2015;1:1–4. 31. 31.Cortes C, Vapnik V. Support-vector networks. Machine learning 1995;20:273–297. 32. 32.Schmidhuber J. Deep learning in neural networks: An overview. Neural networks 2015;61:85–117. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neunet.2014.09.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25462637&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) 33. 33.Cunningham R, Poppe K, Peterson D, Every-Palmer S, Soosay I, Jackson R. Prediction of cardiovascular disease risk among people with severe mental illness: a cohort study. PLoS One 2019;14:e0221521. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F10%2F25%2F2022.10.23.22281428.atom) 34. 34.Fisher SH. Psychological factors and heart disease. Circulation 1963;27:113–117. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjg6IjI3LzEvMTEzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMTAvMjUvMjAyMi4xMC4yMy4yMjI4MTQyOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) [1]: /embed/graphic-1.gif