Framingham risk score prediction at 12 months in the STANDFIRM randomised control trial

Thanh G Phan; Velandai K Srikanth; Dominique A Cadilhac; Mark Nelson; Joosup Kim; Muideen T Olaiya; Sharyn M Fitzgerald; Christopher Bladin; Richard Gerraty; Henry Ma; Amanda G Thrift; the Shared Team Approach between Nurses and Doctors For Improved Risk factor Management (STANDFIRM) Collaborators

doi:10.1101/2023.09.05.23295104

Abstract

Background The Shared Team Approach between Nurses and Doctors For Improved Risk factor Management (STANDFIRM, ACTRN12608000166370) trial was designed to test the effectiveness of chronic disease care management for modifying the Framingham risk score (FRS) among patients with stroke or transient ischemic attack. The primary outcome of change in FRS between baseline and 12 months was not met. We aimed to determine characteristics of participants at baseline that predict reduction in FRS at 12 months and whether future FRS is predetermined at the time of randomization

Method Data included 35 variables encompassing demographics, risk factors, psychological, social and education status, and laboratory tests. Five supervised machine learning (ML) methods were used: random forest (RF), extreme gradient boosting (XGBoost), support vector regression (SVR), multilayer perceptron artificial neural network (MLP) and K-nearest neighbor (KNN). We split data for training (80%, n=406) and testing (20%, n=102).

Results Training and test data were evenly matched for age, sex, baseline and 12-month FRS. Following tuning of the five ML methods, the optimal model for predicting FRS at 12 months was SVR (R²=0.763, root mean squared error or RMSE=8.52). The five most important variables for SVR were: baseline FRS, age, male sex, sodium/potassium excretion and proteinuria. All ML methods were poor at determining change in FRS at 12 months (R²<0.161).

Conclusion Our findings suggest that change in FRS as an endpoint in trials may have limited value as it is largely determined at baseline. In this cohort, Support Vector Regression was the optimal method to predict future but not change in FRS.

Introduction

Stroke is the second leading cause of death and third leading cause of disability worldwide and results in significant economic and societal cost¹. In Australia, the lifetime risk of stroke from the age of 25 years was one in five ². Despite trials providing evidence for the effectiveness of antithrombotic, antihypertensive medication and statins for stroke prevention, a pathway for consistently implementing these findings, as well as individualising therapy, is lacking^3–5. In Australia, a Medicare item known as chronic disease care management plan was implemented to aid general practitioners to manage patients with chronic disease, including stroke. The Shared Team Approach between Nurses and Doctors For Improved Risk Factor Management (STANDFIRM) trial was designed to test the use of chronic disease care management plan to modify risk factors among patients with ischemic and hemorrhagic stroke (trial registration ACTRN12608000166370)⁶. The intervention arm included the use of chronic disease care plans generated by a stroke physician and supported by a nurse educator working in collaboration with patients and their family doctor.

For the primary outcome of this trial, there was no difference in the Framingham risk score (FRS 0.04, 95% CI −1.7 – 1.8) between the randomized groups at 12 months after randomization. This outcome might have occurred because of the high frequency of secondary stroke prevention medications prescribed at hospital discharge⁶. Several similar trials in patients with stroke have also obtained negative results when using change in FRS as outcome ^7–10. The reasons underlying this finding remain unclear, but the baseline variables (at the time of recruitment in hospital or stroke clinic) in our cohort could potentially determine future FRS and if that is the case then it has implications for using the FRS in clinical trials of secondary stroke prevention.

In this exploratory (post-hoc) analysis, and applying a range of machine learning methods, we used baseline variables to predict FRS at 12 months and change in the score from baseline. This approach was performed to test the hypothesis that future FRS is predetermined at baseline or time of randomization. Furthermore, we wanted to evaluate the role of the other covariates in the outcome of FRS. Initially, we had considered using linear regression analysis as clinicians are comfortable with the outputs that are expressed in terms of regression coefficients and p-values¹¹. A potential drawback in regression analyses is that only a small number of covariates collected during clinical trials are used and a large portion of the collected data are often not used. This issue is related to the underlying architecture of the linear regression method versus the non-linear supervised machine learning methods used here (see Supplementary Material for further details)¹². The draw-back to using machine learning tools is their poor standing as ‘black box’ and, by inference, lack of model interpretability¹². Recent addition of explainable machine learning tools, based on cooperative game theory, permit unravelling of the ‘mystery’ of machine learning. This implementation seeks to explain the model in a way analogous to distributing the profit resulting from coalition of workers relative to their individual contribution ^{13, 14}.

Method

Data

The methodology, primary outcome, and secondary analyses of this trial have been published^{6, 15–17}. There were 35 covariates (features) available for analyses including demographic, social, anxiety, depression and education status (9 variables), dietary and alcohol intake (8 variables), stroke risk factors (9 variables), blood and urine electrolytes (8 variables) and FRS. The FRS was calculated using the method by D’Agostino with higher score indicating a greater risk of death from cardiovascular disease¹⁸. Change in FRS was defined as difference between scores at baseline and 12 months.

Statistical Analysis

All analyses were based on scikit-learn library in Python programming language version 3.7.3 ¹⁹. In this section, we provide a very brief overview of Support Vector Regression (SVR) while tree-based methods (extreme gradient boost machine/XGBoost and random forest/RF), multi-later perceptron (MLP) neural network and K-Nearest Neighbour regression are covered in the Supplementary Material ^{12, 19}. SVR is a machine learning method which uses of kernel manipulation and support vectors (around margin of the plane to reduce errors to an acceptable limit) to perform regression. This idea can be interpreted as the hyperplane separating two classes of data in two-dimensional space is a line. By contrast, a plane is better suited to describe transformed data with three or more dimensions. The idea in SVR is that data is easier to separate once transformed (kernel manipulation) into higher dimension space compared to its original low dimension native space. The fitted line arising from the margin of the data is analogous to fitting a regression line based on minimising least squares¹⁹. However, SVR differs from linear least squares regression in that it tries to minimise the errors to within an acceptable range.

An issue with using a large number of covariates for a given dataset is multicollinearity or relatedness of some of the covariates. We evaluate effect of multicollinearity on the outcome in several ways. Firstly, we removed highly correlated variables from the correlation matrix (Supplementary Figure 1) and calculated the variance inflation factors. The correlation matrix of the covariates can provide an indication of the presence of multicollinearity (Supplementary Figure 1)¹². The sensitivity analysis is performed by first reducing variables with high (0.7) and then by those with moderate correlation (0.5) (Supplementary Figure 2).

The pre-processing steps included removal of outliers based on the concept of neighbourhood i.e. keep data points which are adjacent (neighbours) and remove data points which are far from the rest of the group defined by distance. This was followed by scaling of the data to ensure that all the data had the same range. Next, the scaled data were randomly split into training (80%) and test sets (20%). This is a standard approach in machine learning as it enables the model that we develop to be tested in a dataset that has not been used in development of the model. Each method was put through a tuning process including k-fold cross validation of the training data to search for the optimal parameters to run in the analyses (for the steps in the process see the Supplementary Material). The optimal model was chosen based on the minimal sum of squared errors for each method. Variable importance was determined by SHAP (SHapley Additive exPlanation) values using the SHAP library ¹⁴. The Shapley value is a dimensionless average measure of the marginal gain calculated for each of the covariate from all possible permutations of covariates with respect to the model without that covariate ¹³. The SHAP values produced here are additive combination of Shapley values and other local interpretable machine methods and hence we have used the term SHAP values in the paper¹⁴. To enable understanding of the covariates, dependence plots of interactions between covariates and each other are provided. Finally, the covariates identified as important from machine learning can be considered as the first step in feature extraction for linear regression.

Results

The demographics of the STANDFIRM cohort have been published previously⁶. The training group (n=406) and test group (n=102) data were evenly matched in terms of baseline age (68.63±13.04 vs 66.56±13.89), sex, mean systolic blood pressure, creatinine, HDL cholesterol, body mass index, FRS and 12-month FRS (Table 1). There was a trend to difference in the sodium to potassium excretion between the 2 groups (p=0.07). The correlation matrix among the 35 covariates shows that variables having high correlation with baseline FRS were age (0.53), systolic blood pressure (0.52), male sex (0.42; Supplementary Figure 1). Age was positively correlated with systolic blood pressure (0.37) and negatively with creatinine excretion (−0.38). Male sex was correlated with creatinine excretion (0.49), body weight (0.28), alcohol (0.25) and salt intake (0.22). For the sensitivity analysis, we have reduced the number of covariates down to 32 when using a correlation coefficient <0.70 and 23 when using a correlation coefficient <0.50 (Supplementary Figure 2)

View this table:

Table 1: Patient characteristics of the training and test datasets

Choosing Machine learning model

Following tuning, the optimal model was SVR: SVR (R²=0.763, RMSE or root mean squared error=8.52), XGBoost (R²=0.690, RMSE=9.74), MLP (R²=0.707, RMSE=9.46), RF (R²=0.671, RMSE=10.03) and KNN (R²=0.464, RMSE=12.81). Plots of ground truth versus FRS at 12 months show that the observed (actual) and predicted values tended to vary with each other for SVR (Figure 1a). As such SVR was the better method for this data. By contrast, the predicted values for 12-month FRS were much lower than the observed values for XGBoost, RF and KNN (Figure 1). The time taken to tune the models was longest for MLP (577.4 minutes) and shortest for SVR (14.4 sec). These parameters indicate that SVR is the optimal model for this dataset followed by MLP and XGBoost.

Figure 1:

Model fitting: Plot of ground truth versus machine learning predicted values.

The tuning time to optimise each machine learning method is shown on the panel. It was longest for multilayer perceptron (MLP) and shortest for support vector regression (SVR). There was deviation of the predicted from observed FRS values with random forest (RF), extreme gradient boost (XGBoost) and K-nearest neighbor (KNN) compared to SVR and MLP. The tuning time to optimise each machine learning method is shown on the panel. It was longest for multilayer perceptron (MLP) and shortest for support vector regression (SVR). There was deviation of the predicted from observed FRS values with random forest (RF), extreme gradient boost (XGBoost) and K-nearest neighbor (KNN) compared to SVR and MLP.

The five most important variables defined by dimensionless SHAP values for SVR were: baseline FRS (SHAP=8.84, 41.44%), age (SHAP=1.98, 9.28%), male (SHAP=1.87, 8.76%), ratio of urinary sodium to potassium excretion (SHAP=0.93, 4.36%), creatinine excretion (SHAP=0.61, 2.85%) (Figure 2). Figure 2 is the summary plot for SHAP values for all covariates and provides global interpretability. The summary plot (Figure 2a) showed that high baseline FRS, older age, or male sex are associated with high 12-month FRS. The sensitivity analyses for the different thresholds of correlation coefficient are provided in Figure 3. In essence, the R² drop down to 0.728 at a correlation coefficient threshold of 0.50. Plots for the other machine learning methods illustrate the different extent to which the variables are used by each method (Supplementary Figure 3).

Figure 2:

Global interpretability: SHAP values and covariates for support vector regression (SVR)

The SHAP plot show that the covariates with the largest SHAP values are non-modifiable. FRS shift the prediction by 8.77 points, age by 2.17 points, male by 1.78 points, Sodium potassium ratio by 0.91 points. The dimensionless SHAP value is on the X-axis and the Y-axis represent the covariates ordered from highest to lowest importance. The color appears as a range for continuous scale and as 2 colors for binary values. FRS, Framingham risk score; IHD, ischemic heart disease, AF, atrial fibrillation; HbA1c, glycosylated hemoglobin.

Figure 3:

Global interpretability: Sensitivity analysis at correlation coefficient threshold of 0.7 and 0.5

The R² drops from 0.770 for 32 variables data with down to 0.728 with 23 variables. FRS, Framingham risk score; IHD, ischemic heart disease, AF, atrial fibrillation; HbA1c, glycosylated hemoglobin.

Dependence plot – interaction effect

The dependence plots for the 5 variables with the highest SHAP values are shown in Supplementary Figure 4. It shows interaction between FRS with alcohol, age with urinary potassium, male with systolic blood pressure, sodium potassium ratio with fruit consumption and proteinuria with FRS.

Local interpretability

Local interpretability is provided in Figure 4 with the waterfall plot where covariates with the largest SHAP values affecting an individual’s prediction at the top and the covariates with the least prediction at the bottom. This style of plot is provided for the cases in which the difference between predicted and observed FRS is greater than 19 points (extreme points on histogram analysis, data not shown). The insight here is that FRS plays a major role in shifting the expected value to the final prediction for the individual patient on display in the Figure 4. while the next most important covariates rotate between age, male, proteinuria. The next level alternates among male, age, hypercholesterolemia and sodium potassium ratio. Five of these 6 patients were females (indicated by the negative value).

Figure 4:

Local interpretability: Waterfall plot showing individual patient prediction in Test data

illustrates the prediction for individual patients with greater than 19-point difference in predicted and observed FRS values. The expected values from the model for each patient is provided below the x-axis and the observed value is at the top. The plot show how the covariates move towards the prediction for that patient based on the SHAP values of each covariate, A feature of this waterfall plot is that the covariates with the largest contribution is listed at the top and the ones with the least contribution are listed at the bottom. The direction of the shift is provided in the Figure. Red color implies the shift is in the positive direction and blue color for shift in the negative direction.

Linear regression

The five variables with the largest SHAP value from SVR were used in linear regression. The findings were: FRS (β=0.576, 95% CI 0.490-0.662, p <0.001), age (β =0.266, 95% CI 0.177-0.354, p <0.001), male (β=5.690, 95% CI 3.384-7.998, p <0.001), ratio of sodium to potassium excretion (β=1.214, 95% CI 0.227-2.200, p=0.016) and proteinuria (β=3.073, 95% CI 0.254 −5.892, p=0.03). The model R² was 0.631.

Change in FRS

In the trial cohort, all machine learning models performed poorly at predicting change in the FRS, with all R² <0.161 (Supplementary Figure 5). Consequently, the SHAP values of the covariates are not displayed.

Removal of baseline FRS from model

When the baseline FRS is removed from the data, the R² for SVR dropped to 0.523 but the predicted values underestimate the higher observed values (Supplementary Figure 6). The top 5 features for SVR are older age, male, hypercholesterolemia, higher education, and ever-smoking.

Discussion

In this analysis, we used various machine learning approaches to evaluate the large data available at the start of the patient journey to determine how well the 12-month FRS can be predicted. This is an issue critical to the planning of future trials based on using the FRS as an endpoint. The machine learning methods could predict future FRS with accuracy but not predict change in the FRS score. Importantly, three of the five most important variables associated with 12-month FRS were not modifiable. Our findings that the FRS outcome at 12 months is largely predetermined has large implications for conducting clinical trials undertaken to modify the risk of stroke as a main endpoint in trials. Consequently, FRS or change in FRS may not be an optimal choice as an endpoint.

STANDFIRM trial

Practice appears to have changed considerably between planning and completion of the STANDFIRM trial. The trial was predicated on poor uptake of secondary stroke prevention in the community^{20, 21} and that addressing the risk factors through a structured program would reduce the 10-year cardiovascular risk ¹⁵. In one of these prior studies conducted in general practice, only 42% of patients with stroke or TIA were on secondary prevention therapy such as antithrombotic, antihypertensive medications and statins²¹, with only 27% of these patients being rated as being at high risk of stroke or TIA by their general practitioners ²¹.

Investigators from the North East Melbourne Stroke Incidence Study (NEMESIS) reported greater use of antihypertensive drugs at 10 years among those commencing these medications at hospital discharge (256 patients followed up at 10 years from an original cohort of 1,242 patients)²² This supported the notion that long-term adherence would be better in those commencing preventative medications at the time of hospital discharge. Observations from the STANDFIRM trial that >80% of patients were on secondary prevention therapy at the time of recruitment⁶, provides evidence for a significant change in clinical practice between the NEMESIS in 1997-99 and commencement of the STANDFIRM trial in 2013⁶. In accordance with this, selected centres in Australia and around the world have well structure outpatient clinics including TIA clinics and low rate of stroke recurrence²³.

Change in Framingham Risk Score

Patients were included in this study if they had a diagnosis of stroke and thus the FRS¹⁸ and its change were used to identify the effect of secondary prevention. With that in mind, the caveat to the findings from this analysis is that the FRS was used in a secondary prevention setting among hospitals classified as tertiary referral and do not necessarily invalidate the FRS approach in a specific setting of primary care and or for primary prevention of cardiovascular disease¹⁸. We were unable to develop models to predict change in FRS between baseline and 12 months. This finding is consistent with our observation that non-modifiable contributes to most of the total explainable FRS at 12 months (Figure 2). Serum creatinine, its excretion and proteinuria reflect renal function and have been identified as a risk factors for cardiovascular outcome and stroke recurrence^{24, 25}. However, it’s not clear if these markers of renal function are reversible²⁶. Sodium potassium ratio has been identified as risk factor for hypertension and potassium has physiologic role in decreasing sodium reabsorption. A recent metaanalysis of randomised control trials found a non-linear relationship between potassium supplement intake and systolic or diastolic blood pressure²⁷. These investigators identified selected group in which excessive potassium intake can be detrimental. Glycemic control, systolic blood pressure and fruit consumption are potentially modifiable but individually they make up a small percentage of the SHAP values. This SHAP analysis suggest that if risk factor modification is to work, it should simultaneously address these multiple variables. Variables such as social demographics and level of education could be considered to be non-contributory based on the analysis of this cohort.

Explaining machine learning

Machine learning is considered a black box and hence is less often used than classical regression methods. We have tried to circumvent this approach by using an interpretable machine learning (library SHAP) tool to aid interpretation of the model¹⁴. This method is agnostic to the method in question and can be applied to regression or classification analyses³⁵. A caveat is that the explanation by SHAP is still dependent on us choosing the optimal machine learning method. As such we have created a ‘tournament’ style evaluation of the different machine learning method for this data. In this study we have provided global interpretability (average SHAP values in Figures 2, Supplementary Figure 3 and bees warm plot of individual data point in Figure 3) to understand how the model work. The local interpretability (waterfall plots in Figure 4) illustrates how the machine learning model is applied to predict outcome for the individual patient.

Implication for use of FRS in clinical trials

Investigators of a Canadian trial found significant changes in FRS at 1 year for patients in the primary prevention arm but not those in the secondary prevention arm (n=296)⁸. By contrast, Japanese investigators did not observe a change in FRS in their trial of disease management plan (n=321)⁷. These stroke trials have been performed with relatively small sample size compared to cardiac trials ²⁷. Secondary prevention trials for coronary artery disease, targeting lifestyle modification in the form of smoking cessation, increase physical activity and alcohol reduction, have shown a reduction in all-cause mortality ²⁷. These findings potentially suggest that designing trials targeting lifestyle modification in stroke should replicate the cardiovascular trials with large sample size and with clinical endpoints.

Outside the setting of trials targeting change in FRS, other trials have incorporated a variety of methods to target post stroke blood pressure control in the community; these have been negative^{9, 10, 28–30}. The setting of one of these studies was similar to the STANDFIRM trial, in which low-income patients in Southern California received a chronic care plan in a team setting, regardless of insurance status ²⁸. This trial had no difference in blood pressure between the two groups at the end of the trial, but the intervention group had a greater self-reported reduction of salt intake and greater reduction in measured C-reactive protein than the usual care group²⁸. In contrast, selected trials targeted at clinical endpoints (combined cardiovascular endpoints) rather than the FRS or medication compliance have been successful ³¹. Examples include the Austrian STROKE-CARD trial which comprised an e-tool and multidisciplinary team visit at 3 months to address risk factor modification.

Clinical trial data

Similar to the STANDFIRM trial, often only a subset of the data collected in a clinical trial is used in regression analysis resulting in limited exploration of all the available data^{12, 32}. This issue is likely related to the potential for overfitting regression analyses when a large number of covariates are used ¹¹. Such issues are not always resolved by stepwise regression strategy³². In regression analyses, ‘rules’ have been proposed around the trade-off between number of covariates to sample size ³³. Therefore, the possible contribution of variables such as anxiety, depression, level of education, fluency in English and other data regarding urinary electrolytes have not been incorporated into one global test. Because of the negative primary outcome of STANDFIRM, we have undertaken this post-hoc analysis to determine whether future trials should be conducted using the FRS as an outcome and the role of the other variables ⁶. To enable use of all the information available at the onset of the trial a different approach to regression analysis was required to determine future FRS with one solution being to use machine learning. In the medical literature machine learning methods are increasingly used as tool for classification³⁴. These tools can also be used to perform regression tasks when the outcome is a continuous variable¹².

Structured approach to analysis

Among the machine learning methods available for tabular (structured) data, XGBoost is often seen as the go-to machine learning tool as it has been used successfully in competitions¹⁹. In this analysis, we carefully tuned different machine learning models and undertook a comparison between these methods to determine the best tool for the given data¹⁹. For this dataset, the finding that SVR was the best tool following tuning is critical as one should not assume that there is one general go-to machine learning tool for all data^{12, 19}. MLP is an example of neural network with deep learning architecture in having multiple layers but it should not always be assumed to be better than other methods for tabular data such as the one here. In this cohort, MLP was computationally expensive and took a considerably longer time than the other methods²⁸.

Multicollinearity or relatedness affects the accuracy of the coefficients of linear regression and the associated standard errors¹². The authoritative book on machine learning methods cautioned against placing excessive trust on the regularization parameter in modulating multicollinearity as the regularization parameter is directed at all observations and not on the subspace in building of the hyperplane ¹². In this analysis, removal of correlated terms did not improve the performance of tree-based method for the STANDFIRM data. MLP uses non-linear projection method to create multiple weights which are passed through the layers of neural network. In this way, neural network such as MLP are said to be overparameterized or overfitted as they include many variables in order to provide a model that can be generalized over the whole data. This generalization approach would lead to overfitting with linear regression as the latter solves the weights by finding the inverse of the correlation matrix.

Limitations

The main limitation of our analysis is that it is a post-hoc analysis of a trial with negative results on primary outcome⁶. Given the changes in practice over time, our findings should be considered as exploratory in nature and applicable to the setting of the trial⁶. Further, our findings might have been affected by sample size being smaller than the cardiac trials or trials of clinical endpoints²⁷. In our initial sample size calculation, we had cautiously planned 80% power to detect 4.5% difference in future FRS³⁶. Other investigators were more optimistic and planned for 20% difference in FRS⁷. It is likely that any future trial incorporating this methodology should be performed in rural areas or areas of need, at large distances from tertiary teaching hospitals and include a much smaller effect size than the 4.5% effect size that we used. Identification of areas of need can be performed using data driven approaches to assess geographical variation rather than a simple rural metropolitan divide ³⁷.

Conclusion

Future FRS is largely determined at trial entry. Given that the major determinants of future FRS are not modifiable, other approaches to secondary prevention trial such as using clinical endpoints should be explored. Our findings have implications for the use of FRS as the primary outcome in clinical trials targeting stroke prevention.

Data Availability

The data is available on written request to Dr Amanda Thrift

Disclosure

None

Funding

The STANDFIRM trial was supported by a National Health and Medical Research Council (NHMRC) project grant (586605) and fellowship support from the NHMRC for AGT (1042600) and DAC (1063761)

Supplementary Figure 6: SVR with baseline FRS removed. The predicted values are underestimated at higher range of the observed values. FRS, Framingham risk score; IHD, ischemic heart disease, AF, atrial fibrillation; HbA1c, glycosylated hemoglobin.

References

1.↵
Collaborators GBDS. Global, regional, and national burden of stroke and its risk factors, 1990-2019: A systematic analysis for the global burden of disease study 2019. Lancet Neurol. 2021;20:795–820
OpenUrl CrossRef PubMed
2.↵
Collaborators GBDLRoS, Feigin VL, Nguyen G, Cercy K, Johnson CO, Alam T, et al. Global, regional, and country-specific lifetime risks of stroke, 1990 and 2016. N Engl J Med. 2018;379:2429–2437
OpenUrl CrossRef PubMed
3.↵
Amarenco P, Lavallee PC, Monteiro Tavares L, Labreuche J, Albers GW, Abboud H, et al. Five-year risk of stroke after tia or minor ischemic stroke. N Engl J Med. 2018;378:2182–2190
OpenUrl CrossRef PubMed
4.
Olaiya MT, Cadilhac DA, Kim J, Nelson MR, Srikanth VK, Andrew NE, et al. Long-term unmet needs and associated factors in stroke or tia survivors: An observational study. Neurology. 2017;89:68–75
OpenUrl
5.↵
McCarthy J, Yang J, Clissold B, Young MJ, Fuller PJ, Phan T. Hypertension management in stroke prevention: Time to consider primary aldosteronism. Stroke. 2021;52:e626–e634
OpenUrl
6.↵
Olaiya MT, Kim J, Nelson MR, Srikanth VK, Bladin CF, Gerraty RP, et al. Effectiveness of a shared team approach between nurses and doctors for improved risk factor management in survivors of stroke: A cluster randomized controlled trial. Eur J Neurol. 2017;24:920–928
OpenUrl
7.↵
Fukuoka Y, Hosomi N, Hyakuta T, Omori T, Ito Y, Uemura J, et al. Effects of a disease management program for preventing recurrent ischemic stroke. Stroke. 2019;50:705–712
OpenUrl PubMed
8.↵
Wister A, Loewen N, Kennedy-Symonds H, McGowan B, McCoy B, Singer J. One-year follow-up of a therapeutic lifestyle intervention targeting cardiovascular disease risk. CMAJ. 2007;177:859–865
OpenUrl Abstract/FREE Full Text
9.↵
Cheng EM, Cunningham WE, Towfighi A, Sanossian N, Bryg RJ, Anderson TL, et al. Efficacy of a chronic care-based intervention on secondary stroke prevention among vulnerable stroke survivors: A randomized controlled trial. Circulation. Cardiovascular quality and outcomes. 2018;11:e003228
OpenUrl Abstract/FREE Full Text
10.↵
Boden-Albala B, Goldmann E, Parikh NS, Carman H, Roberts ET, Lord AS, et al. Efficacy of a discharge educational strategy vs standard discharge care on reduction of vascular risk in patients with stroke and transient ischemic attack: The deserve randomized clinical trial. JAMA neurology. 2019;76:20–27
OpenUrl
11.↵
Draper NR, Smith H. Applied regression analysis. Danvers: A Wiley-Interscience Publication; 1998.
12.↵
Hastie T, ibshirani R, Firedman J. The elements of statistical learning: Data mining, inference, and prediction. Springer; 2009.
13.↵
Shapley LS. A value for n-person games. Contributions to the theory of games 1953:307–317.
14.↵
Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2:749–760
OpenUrl
15.↵
Thrift AG, Olaiya MT, Phan TG, Cadilhac DA, Nelson MR, Srikanth VK, et al. Statistical analysis plan (sap) for shared team approach between nurses and doctors for improved risk factor management (standfirm): A randomised controlled trial. Int J Stroke. 2015;10:770–772
OpenUrl CrossRef PubMed
16.
Orman Z, Thrift AG, Olaiya MT, Ung D, Cadilhac DA, Phan T, et al. Quality of life after stroke: A longitudinal analysis of a cluster randomized trial. Qual Life Res. 2022;31:2445–2455
OpenUrl
17.↵
Olaiya MT, Cadilhac DA, Kim J, Nelson MR, Srikanth VK, Gerraty RP, et al. Community-based intervention to improve cardiometabolic targets in patients with stroke: A randomized controlled trial. Stroke. 2017;48:2504–2510
OpenUrl Abstract/FREE Full Text
18.↵
D’Agostino RB, Sr., Vasan RS, Pencina MJ, Wolf PA, Cobain M, Massaro JM, et al. General cardiovascular risk profile for use in primary care: The framingham heart study. Circulation. 2008;117:743–753
OpenUrl Abstract/FREE Full Text
19.↵
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in python. Journal of Machine Learning Research. 2011;12:2825–2830
OpenUrl CrossRef
20.↵
Paul SL, Thrift AG. Control of hypertension 5 years after stroke in the north east melbourne stroke incidence study. Hypertension. 2006;48:260–265
OpenUrl CrossRef
21.↵
Heeley E, Anderson C, Patel A, Cass A, Peiris D, Weekes A, et al. Disparities between prescribing of secondary prevention therapies for stroke and coronary artery disease in general practice. Int J Stroke. 2012;7:649–654
OpenUrl CrossRef PubMed Web of Science
22.↵
Thrift AG, Kim J, Douzmanian V, Gall SL, Arabshahi S, Loh M, et al. Discharge is a critical time to influence 10-year use of secondary prevention therapies for stroke. Stroke. 2014;45:539–544
OpenUrl Abstract/FREE Full Text
23.↵
Sanders LM, Srikanth VK, Jolley DJ, Sundararajan V, Psihogios H, Wong K, et al. Monash transient ischemic attack triaging treatment: Safety of a transient ischemic attack mechanism-based outpatient model of care. Stroke. 2012;43:2936–2941
OpenUrl Abstract/FREE Full Text
24.↵
Go AS, Chertow GM, Fan D, McCulloch CE, Hsu CY. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N Engl J Med. 2004;351:1296–1305
OpenUrl CrossRef PubMed Web of Science
25.↵
Ueki K, Matsuo R, Kuwashiro T, Irie F, Wakisaka Y, Ago T, et al. Decreased estimated glomerular filtration rate and proteinuria and long-term outcomes after ischemic stroke: A longitudinal observational cohort study. Stroke. 2023;54:1268–1277
OpenUrl
26.↵
Kelly DM, Ademi Z, Doehner W, Lip GYH, Mark P, Toyoda K, et al. Chronic kidney disease and cerebrovascular disease: Consensus and guidance from a kdigo controversies conference. Stroke. 2021;52:e328–e346
OpenUrl PubMed
27.↵
Iestra JA, Kromhout D, van der Schouw YT, Grobbee DE, Boshuizen HC, van Staveren WA. Effect size estimates of lifestyle and dietary changes on all-cause mortality in coronary artery disease patients: A systematic review. Circulation. 2005;112:924–934
OpenUrl Abstract/FREE Full Text
28.↵
Towfighi A, Cheng EM, Ayala-Rivera M, Barry F, McCreath H, Ganz DA, et al. Effect of a coordinated community and chronic care model team intervention vs usual care on systolic blood pressure in patients with stroke or transient ischemic attack: The succeed randomized clinical trial. JAMA Netw Open. 2021;4:e2036227
OpenUrl
29.
Boehme C, Toell T, Lang W, Knoflach M, Kiechl S. Longer term patient management following stroke: A systematic review. Int J Stroke. 2021;16:917–926
OpenUrl
30.↵
Peng B, Ni J, Anderson CS, Zhu Y, Wang Y, Pu C, et al. Implementation of a structured guideline-based program for the secondary prevention of ischemic stroke in china. Stroke. 2014;45:515–519
OpenUrl Abstract/FREE Full Text
31.↵
Willeit P, Toell T, Boehme C, Krebs S, Mayer L, Lang C, et al. Stroke-card care to prevent cardiovascular events and improve quality of life after acute ischaemic stroke or tia: A randomised clinical trial. EClinicalMedicine. 2020;25:100476
OpenUrl
32.↵
Harrell FE. Regression modeling strategies: With applications to linear models, logistic regression, and survival analysis. New York: Springer-Verlag; 2001.
33.↵
Harrell FE, Jr., Lee KL, Califf RM, Pryor DB, Rosati RA. Regression modelling strategies for improved prognostic prediction. Stat Med. 1984;3:143–152
OpenUrl CrossRef PubMed Web of Science
34.↵
Abedi V, Kawamura Y, Li J, Phan TG, Zand R. Editorial: Machine learning in action: Stroke diagnosis and outcome prediction. Frontiers in neurology. 2022;13:984467
OpenUrl
35.↵
Phan TG, Kooblal T, Matley C, Singhal S, Clissold B, Ly J, et al. Stroke severity versus dysphagia screen as driver for post-stroke pneumonia. Frontiers in neurology. 2019;10:16
OpenUrl
36.↵
Thrift AG, Srikanth VK, Nelson MR, Kim J, Fitzgerald SM, Gerraty RP, et al. Risk factor management in survivors of stroke: A double-blind, cluster-randomized, controlled trial. Int J Stroke. 2014;9:652–657
OpenUrl CrossRef PubMed
37.↵
Kawai A, Hui S, Beare R, Srikanth VK, Sundararajan V, Ma H, et al. Spatiotemporal analysis of regional tia trends. Frontiers in neurology. 2022;13:983512
OpenUrl

View the discussion thread.

Posted September 06, 2023.

Download PDF

Data/Code

Citation Tools

Subject Area

Cardiovascular Medicine

Subject Areas

All Articles

Addiction Medicine (321)
Allergy and Immunology (623)
Anesthesia (162)
Cardiovascular Medicine (2334)
Dentistry and Oral Medicine (283)
Dermatology (205)
Emergency Medicine (373)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (828)
Epidemiology (11693)
Forensic Medicine (10)
Gastroenterology (692)
Genetic and Genomic Medicine (3675)
Geriatric Medicine (345)
Health Economics (630)
Health Informatics (2361)
Health Policy (924)
Health Systems and Quality Improvement (881)
Hematology (339)
HIV/AIDS (771)
Infectious Diseases (except HIV/AIDS) (13256)
Intensive Care and Critical Care Medicine (764)
Medical Education (363)
Medical Ethics (104)
Nephrology (396)
Neurology (3432)
Nursing (194)
Nutrition (519)
Obstetrics and Gynecology (665)
Occupational and Environmental Health (659)
Oncology (1799)
Ophthalmology (532)
Orthopedics (215)
Otolaryngology (285)
Pain Medicine (229)
Palliative Medicine (66)
Pathology (444)
Pediatrics (1020)
Pharmacology and Therapeutics (424)
Primary Care Research (415)
Psychiatry and Clinical Psychology (3140)
Public and Global Health (6082)
Radiology and Imaging (1254)
Rehabilitation Medicine and Physical Therapy (730)
Respiratory Medicine (820)
Rheumatology (375)
Sexual and Reproductive Health (365)
Sports Medicine (320)
Surgery (396)
Toxicology (50)
Transplantation (171)
Urology (144)

[1] 1.↵
Collaborators GBDS. Global, regional, and national burden of stroke and its risk factors, 1990-2019: A systematic analysis for the global burden of disease study 2019. Lancet Neurol. 2021;20:795–820
OpenUrl CrossRef PubMed

[2] 2.↵
Collaborators GBDLRoS, Feigin VL, Nguyen G, Cercy K, Johnson CO, Alam T, et al. Global, regional, and country-specific lifetime risks of stroke, 1990 and 2016. N Engl J Med. 2018;379:2429–2437
OpenUrl CrossRef PubMed

[3] 3.↵
Amarenco P, Lavallee PC, Monteiro Tavares L, Labreuche J, Albers GW, Abboud H, et al. Five-year risk of stroke after tia or minor ischemic stroke. N Engl J Med. 2018;378:2182–2190
OpenUrl CrossRef PubMed

[4] 4.
Olaiya MT, Cadilhac DA, Kim J, Nelson MR, Srikanth VK, Andrew NE, et al. Long-term unmet needs and associated factors in stroke or tia survivors: An observational study. Neurology. 2017;89:68–75
OpenUrl

[5] 5.↵
McCarthy J, Yang J, Clissold B, Young MJ, Fuller PJ, Phan T. Hypertension management in stroke prevention: Time to consider primary aldosteronism. Stroke. 2021;52:e626–e634
OpenUrl

[6] 6.↵
Olaiya MT, Kim J, Nelson MR, Srikanth VK, Bladin CF, Gerraty RP, et al. Effectiveness of a shared team approach between nurses and doctors for improved risk factor management in survivors of stroke: A cluster randomized controlled trial. Eur J Neurol. 2017;24:920–928
OpenUrl

[7] 7.↵
Fukuoka Y, Hosomi N, Hyakuta T, Omori T, Ito Y, Uemura J, et al. Effects of a disease management program for preventing recurrent ischemic stroke. Stroke. 2019;50:705–712
OpenUrl PubMed

[8] 8.↵
Wister A, Loewen N, Kennedy-Symonds H, McGowan B, McCoy B, Singer J. One-year follow-up of a therapeutic lifestyle intervention targeting cardiovascular disease risk. CMAJ. 2007;177:859–865
OpenUrl Abstract/FREE Full Text

[9] 9.↵
Cheng EM, Cunningham WE, Towfighi A, Sanossian N, Bryg RJ, Anderson TL, et al. Efficacy of a chronic care-based intervention on secondary stroke prevention among vulnerable stroke survivors: A randomized controlled trial. Circulation. Cardiovascular quality and outcomes. 2018;11:e003228
OpenUrl Abstract/FREE Full Text

[10] 10.↵
Boden-Albala B, Goldmann E, Parikh NS, Carman H, Roberts ET, Lord AS, et al. Efficacy of a discharge educational strategy vs standard discharge care on reduction of vascular risk in patients with stroke and transient ischemic attack: The deserve randomized clinical trial. JAMA neurology. 2019;76:20–27
OpenUrl

[11] 11.↵
Draper NR, Smith H. Applied regression analysis. Danvers: A Wiley-Interscience Publication; 1998.

[12] 12.↵
Hastie T, ibshirani R, Firedman J. The elements of statistical learning: Data mining, inference, and prediction. Springer; 2009.

[13] 13.↵
Shapley LS. A value for n-person games. Contributions to the theory of games 1953:307–317.

[14] 14.↵
Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2:749–760
OpenUrl

[15] 15.↵
Thrift AG, Olaiya MT, Phan TG, Cadilhac DA, Nelson MR, Srikanth VK, et al. Statistical analysis plan (sap) for shared team approach between nurses and doctors for improved risk factor management (standfirm): A randomised controlled trial. Int J Stroke. 2015;10:770–772
OpenUrl CrossRef PubMed

[16] 16.
Orman Z, Thrift AG, Olaiya MT, Ung D, Cadilhac DA, Phan T, et al. Quality of life after stroke: A longitudinal analysis of a cluster randomized trial. Qual Life Res. 2022;31:2445–2455
OpenUrl

[17] 17.↵
Olaiya MT, Cadilhac DA, Kim J, Nelson MR, Srikanth VK, Gerraty RP, et al. Community-based intervention to improve cardiometabolic targets in patients with stroke: A randomized controlled trial. Stroke. 2017;48:2504–2510
OpenUrl Abstract/FREE Full Text

[18] 18.↵
D’Agostino RB, Sr., Vasan RS, Pencina MJ, Wolf PA, Cobain M, Massaro JM, et al. General cardiovascular risk profile for use in primary care: The framingham heart study. Circulation. 2008;117:743–753
OpenUrl Abstract/FREE Full Text

[19] 19.↵
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in python. Journal of Machine Learning Research. 2011;12:2825–2830
OpenUrl CrossRef

[20] 20.↵
Paul SL, Thrift AG. Control of hypertension 5 years after stroke in the north east melbourne stroke incidence study. Hypertension. 2006;48:260–265
OpenUrl CrossRef

[21] 21.↵
Heeley E, Anderson C, Patel A, Cass A, Peiris D, Weekes A, et al. Disparities between prescribing of secondary prevention therapies for stroke and coronary artery disease in general practice. Int J Stroke. 2012;7:649–654
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Thrift AG, Kim J, Douzmanian V, Gall SL, Arabshahi S, Loh M, et al. Discharge is a critical time to influence 10-year use of secondary prevention therapies for stroke. Stroke. 2014;45:539–544
OpenUrl Abstract/FREE Full Text

[23] 23.↵
Sanders LM, Srikanth VK, Jolley DJ, Sundararajan V, Psihogios H, Wong K, et al. Monash transient ischemic attack triaging treatment: Safety of a transient ischemic attack mechanism-based outpatient model of care. Stroke. 2012;43:2936–2941
OpenUrl Abstract/FREE Full Text

[24] 24.↵
Go AS, Chertow GM, Fan D, McCulloch CE, Hsu CY. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N Engl J Med. 2004;351:1296–1305
OpenUrl CrossRef PubMed Web of Science

[25] 25.↵
Ueki K, Matsuo R, Kuwashiro T, Irie F, Wakisaka Y, Ago T, et al. Decreased estimated glomerular filtration rate and proteinuria and long-term outcomes after ischemic stroke: A longitudinal observational cohort study. Stroke. 2023;54:1268–1277
OpenUrl

[26] 26.↵
Kelly DM, Ademi Z, Doehner W, Lip GYH, Mark P, Toyoda K, et al. Chronic kidney disease and cerebrovascular disease: Consensus and guidance from a kdigo controversies conference. Stroke. 2021;52:e328–e346
OpenUrl PubMed

[27] 27.↵
Iestra JA, Kromhout D, van der Schouw YT, Grobbee DE, Boshuizen HC, van Staveren WA. Effect size estimates of lifestyle and dietary changes on all-cause mortality in coronary artery disease patients: A systematic review. Circulation. 2005;112:924–934
OpenUrl Abstract/FREE Full Text

[28] 28.↵
Towfighi A, Cheng EM, Ayala-Rivera M, Barry F, McCreath H, Ganz DA, et al. Effect of a coordinated community and chronic care model team intervention vs usual care on systolic blood pressure in patients with stroke or transient ischemic attack: The succeed randomized clinical trial. JAMA Netw Open. 2021;4:e2036227
OpenUrl

[29] 29.
Boehme C, Toell T, Lang W, Knoflach M, Kiechl S. Longer term patient management following stroke: A systematic review. Int J Stroke. 2021;16:917–926
OpenUrl

[30] 30.↵
Peng B, Ni J, Anderson CS, Zhu Y, Wang Y, Pu C, et al. Implementation of a structured guideline-based program for the secondary prevention of ischemic stroke in china. Stroke. 2014;45:515–519
OpenUrl Abstract/FREE Full Text

[31] 31.↵
Willeit P, Toell T, Boehme C, Krebs S, Mayer L, Lang C, et al. Stroke-card care to prevent cardiovascular events and improve quality of life after acute ischaemic stroke or tia: A randomised clinical trial. EClinicalMedicine. 2020;25:100476
OpenUrl

[32] 32.↵
Harrell FE. Regression modeling strategies: With applications to linear models, logistic regression, and survival analysis. New York: Springer-Verlag; 2001.

[33] 33.↵
Harrell FE, Jr., Lee KL, Califf RM, Pryor DB, Rosati RA. Regression modelling strategies for improved prognostic prediction. Stat Med. 1984;3:143–152
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Abedi V, Kawamura Y, Li J, Phan TG, Zand R. Editorial: Machine learning in action: Stroke diagnosis and outcome prediction. Frontiers in neurology. 2022;13:984467
OpenUrl

[35] 35.↵
Phan TG, Kooblal T, Matley C, Singhal S, Clissold B, Ly J, et al. Stroke severity versus dysphagia screen as driver for post-stroke pneumonia. Frontiers in neurology. 2019;10:16
OpenUrl

[36] 36.↵
Thrift AG, Srikanth VK, Nelson MR, Kim J, Fitzgerald SM, Gerraty RP, et al. Risk factor management in survivors of stroke: A double-blind, cluster-randomized, controlled trial. Int J Stroke. 2014;9:652–657
OpenUrl CrossRef PubMed

[37] 37.↵
Kawai A, Hui S, Beare R, Srikanth VK, Sundararajan V, Ma H, et al. Spatiotemporal analysis of regional tia trends. Frontiers in neurology. 2022;13:983512
OpenUrl