Abstract
Background Lithium is the most effective treatment in bipolar disorder. Its use is limited by concerns about risk of chronic kidney disease (CKD). We aimed to develop a model to predict risk of CKD following lithium treatment initiation, by identifying individuals with a high-risk trajectory of renal function.
Methods We used United Kingdom Clinical Practice Research Datalink (CPRD) electronic heath records (EHRs) from 2000-2018. CPRD Aurum for prediction model development and CPRD Gold for external validation. We used elastic net to generate a prediction model from potential features. We performed discrimination and calibration assessments in an external validation data set.
We included all patients aged ≥16 with bipolar disorder prescribed lithium. To be included patients had to have ≥1 year of follow-up before lithium initiation, ≥3 estimated glomerular filtration rate (eGFR) measures after lithium initiation (to be able to determine a trajectory) and a normal (≥60 mL/min/1.73m2) eGFR at lithium initiation (baseline). In the Aurum development cohort 1609 fulfilled these criteria. The Gold external validation cohort included 934 patients.
We included 44 potential baseline features in the prediction model, including sociodemographic, mental and physical heath and drug treatment characteristics. We compared a full model with the 3-variable five-year kidney failure risk equation (KFRE) and a 3-variable elastic net model.
We used group-based trajectory modelling to identify latent trajectory groups for eGFR. We were interested in the group with deteriorating renal function (the high-risk group).
Findings The high-risk group included 191 (11.87%) of the Aurum cohort and 137 (14.67%) of the Gold cohort, of these 168 (87.96%) and 117 (85.40%) respectively developed CKD 3a or more severe during follow-up. The model, developed in Aurum, had a ROC area of 0.879 (95%CI 0.853-0.904) in the Gold external validation data set. At the empirical optimal cut-point defined in the development dataset, the model had a sensitivity of 0. 91 (95%CI 0.84-0.97) and a specificity of 0.74 (95% CI 0.67-0.82). However, a 3-variable elastic net model (including only age, sex and baseline eGFR) performed similarly well (ROC area 0.888; 95%CI 0.864-0.912), as did the KFRE (ROC area 0.870; 95%CI 0.841-0.898).
Conclusions Individuals at high-risk of a poor trajectory of renal function can be identified before initiation of lithium treatment by a simple equation including age, sex and baseline eGFR. We did not identify strong predicters of renal impairment specific to lithium treated patients.
Background
Lithium is the most effective maintenance treatment for bipolar disorder, and is first-line in all international clinical practice guidelines (1). However, its use has been declining globally (2). Reasons for this include the required monitoring due to its narrow therapeutic window and concerns about adverse effects, particularly irreversible renal failure (3). In fact, renal failure is rare (4), with end-stage renal failure occurring at similar rates to those treated with other mood stabilisers (5), but more commonly than the general population (0.23% vs 0.11% (6)). Bipolar disorder itself appears to be associated with increased risk of renal failure independent of lithium exposure (7). There are inconsistencies in the existing literature about the association between renal failure and lithium treatment duration and episodes of lithium toxicity (8).
Being able to identify individuals at risk of compromised renal function would have high clinical utility; it would encourage the use of this effective treatment in those at low-risk and so improve outcomes for people with bipolar disorder. In the general population, established risk factors for CKD include age, sex, ethnicity, family history, smoking, obesity, hypertension, diabetes mellitus, excessive alcohol consumption and acute kidney injury (9). Prediction models have been developed for end stage renal failure in groups with a range of underlying risk (10-14). These tend to include a small number of core features including age, gender, ethnicity, eGFR and albuminuria. Models then vary in terms of additional features such as glucose, blood pressure, haemoglobin, lipids, calcium and phosphate. It is unclear if features related to mental health are useful in predicting CKD risk at the point of lithium initiation. It is also likely that risk factors for CKD cluster differently in patients with bipolar disorder prescribed lithium [8] so we cannot assume that risk prediction models for CKD that are of value in the general population would apply to people with bipolar disorder receiving lithium. Because CKD requiring clinical intervention (CKD stage 4 or more severe – eGFR<30 mL/min/1.73m2) is a rare and late-stage outcome we aimed to develop a model to classify individuals into high-risk and low-risk trajectories of renal function following lithium treatment initiation.
Methods
Population
This study used patient data from the Clinical Practice Research Datalink (CPRD) Gold and Aurum databases between 1 January 2000 and 31 December 2018. CPRD contains electronic health records (EHRs) from general practices across the United Kingdom. Combined, these databases include 42 million patient records from over 1,800 primary care practices (www.cprd.com). Both databases contain coded and anonymised data including demographic details, symptoms, diagnoses, prescribed medication, laboratory tests and referrals. CPRD Gold contains data contributed by practices using Vision software and CPRD Aurum contains data contributed by practices using EMIS Web software (15, 16). Contributing practices have different geographical distributions; CPRD Gold contains patients from the whole of the UK, whereas Aurum contains only practices from England and Northern Ireland. Therefore, there are some differences in population structures. We used data from the Aurum database for development of our prediction model and data from the Gold database for external validation. Ethical approval for this study was obtained from the Independent Scientific Advisory Committee of CPRD (protocol no. 18_316). Informed consent was waived because data are anonymised for research purposes. In line with ethical guidance subgroups containing fewer than 5 people are censored in the results section.
Cohort definition
The cohort comprised any patient who; was aged 16 or over, ever received a diagnosis of bipolar disorder in their clinical record, was prescribed lithium (defined as receiving two or more concurrent prescriptions), had at least a year of follow-up before their first lithium prescription and no previous record of being prescribed lithium (to capture patients’ first exposure to lithium), had at least three estimated glomerular filtration rate (eGFR) measures after lithium initiation and had a baseline measure of eGFR ≥60 mL/min/1.73m2 before starting lithium (normal or close to normal renal function).
Renal function trajectories
eGFR values were calculated from recorded creatinine blood tests using the CKD-EPI equation (17). Using the eGFR, and the date the blood test was performed relative to lithium initiation, we conducted group-based trajectory modelling to identify latent subgroups within the cohort (18). We included lithium exposure as a time-varying covariate, as rate of change in eGFR may potentially differ between the lithium exposed period and following lithium cessation. In the process of determining the number of trajectory groups, we initially used a cubic polynomial function for all groups. The final number of groups was determined based on the Bayesian Information Criterion (BIC), trajectory shapes for similarity, and the proportion of cohort members in each class (19). We initially set a 2-group model and increased the number of groups until BIC was minimised, but no group was less than 10% of the total cohort. After identifying the optimal number of groups, the level of the polynomial function for each group was reduced from cubic to zero-order until the BIC was minimised. With this final model, each participant was assigned to one of the subgroups based on maximum posterior probability. We were primarily interested in the group predicted to have the most rapidly declining eGFR trajectory; referred to as the high-risk group.
Prediction model features
We identified features present in a patient’s record before they commenced lithium treatment as potential predictors of being in the high-risk group. These included predictors of impaired renal function in the general population and features related to mental health and its treatment:
Sociodemographics
Age, sex, ethnicity (as Black, Asian and Minority Ethnic (BAME) vs White), relationship status (single vs. in a relationship).
Mental health characteristics
Illness duration before lithium initiation, presentations for depression, mania, anxiety, psychosis, stress, self-harm, disturbed sleep.
Physical health characteristics
Hyper/hypocalcaemia, hypo/hyperthyroidism, high LDL cholesterol, low HDL cholesterol, hypertension, coronary heart disease, a measure of eGFR<60 mL/min/1.73m2 any time before lithium initiation, Type II Diabetes Mellitus, asthma, weight loss, peptic ulcer, iron deficiency anaemia, liver disease, chronic pulmonary disease, neurological disorders.
Health behaviours
Smoking status (never smoked, current smoker, ex-smoker), body mass index group (underweight, healthy weight, overweight, obese), cannabis use, other substance misuse, alcohol misuse.
Other drug treatment
Antipsychotic prescription, other mood stabiliser prescription, antidepressant prescription.
Interactions
Baseline eGFR with sex and age, sex with age and body mass index group.
Statistical Analysis
We described differences in prevalence of binary covariates and medians of continuous covariates between high-risk and low-risk groups using p-values from chi-squared tests. We used probit elastic net regression with 10-fold cross-validation to perform variable selection and penalization of coefficients to generate the prediction model in the Aurum cohort. Elastic net is a regularization method for regression and classification models which comprises the Least Absolute Shrinkage And Selection Operator (LASSO) penalty (L1) and the ridge penalty (L2)(20). The LASSO (L1) penalty function performs variable selection and dimension reduction by shrinking coefficients, while the ridge (L2) penalty function shrinks the coefficients of correlated variables toward their average. The overall elastic net is a function of parameters λ and α (0 ≤ α ≤ 1), with λ being a parameter for the level of penalty, while α being the weight of L1 penalty and (1-α) being that of L2 penalty function. We reported Receiver Operating Characteristic (ROC) area (and 95% confidence interval)(CI), sensitivity and specificity at the empirical optimal cut-off point using Youden’s index and the predictive accuracy. We compared the derived full model with predictions from the 3-variable five year kidney failure risk equation (KFRE) which includes age, sex and eGFR, and an elastic net model containing only these 3 variables (14). We chose the 3-variable KFRE as albumin-to-creatine ratio was poorly recorded before lithium initiation and the 3-variable model performed well in previous validation studies (ROC area 0.79)(21).
External validation
We used patient data from CPRD Gold for external validation of the model generated in the Aurum cohort. To categorise individuals at high-risk of a rapid decline in eGFR, we ran group-based trajectory models of the eGFRs independently of the Aurum patients’ trajectory model. We compared trajectory group membership with the predicted group membership from the Aurum model. We reported the ROC area, sensitivity and specificity at the cut-off point defined in the development data, brier score, predictive accuracy, calibration belt (a graphical approach designed to evaluate the goodness of fit of binary outcome models) (22) and decision curve analysis. We examined how well the model could predict CKD stage 3b or more severe (eGFR<45 mL/min/1.73m2) during follow-up. We also compared the full model with simple models: the 3-variable KFRE and 3-variable elastic net.
Post hoc supplementary analysis
We combined data from the Aurum and Gold datasets for patients who initiated lithium treatment with a baseline eGFR ≥90 mL/min/1.73m2. We adopted the same approach in this smaller cohort: we identified a high-risk trajectory group using group-based trajectory modelling and then used the full model, 3-variable KFRE and 3-variable elastic net to predict group membership. We also examined how well this model could predict CKD stage 3a or more severe (eGFR<60 mL/min/1.73m2) during follow-up. This analysis was completed to address issues arising from the strong association between baseline eGFR and future eGFR measurements in the initial model. All analysis was completed using Stata 16 (23).
Findings
We identified 1609 patients in the development sample (Aurum cohort), with a median of 14 (IQR 7-26) eGFR test results each. The median length of lithium treatment was 1.42 years (IQR 0.53-3.58). Of these patients, 401 (24.92%) developed CKD stage 3a or more severe (eGFR<60 mL/min/1.73m2), 38 (2.36%) CKD stage 3b (eGFR<45 mL/min/1.73m2), but none developed CKD stage 4. In total, 158 (9.82%) died during follow-up.
To categorize risk groups based on eGFR trajectories we chose a 5-group model, all groups with cubic trajectories (BIC=3566.99). This defined 11.87% of the cohort as high-risk. Models with 6 groups had lower BIC but included one group with less than 10% of the cohort.
Trajectories of the high-risk vs other groups (combined group 2-5) are shown in Figure 1 and described in Table 1. Of those in the high-risk group 168 (87.96%) develop CKD stage 3a or more severe, and 25 (13.09%) developed stage 3b or more severe, compared to 233 (16.43%) and 13 (0.92%) respectively in the low-risk group.
Patients in the high-risk group were more likely to be female, younger at lithium initiation, have a lower eGFR before starting lithium and be obese. They were more likely to have a pre-existing diagnosis of migraine. They were more likely to have a record of high LDL cholesterol. They were less likely to have had an eGFR<60 mL/min/1.73m2 any time before baseline.
There was no statistical evidence of a difference in duration of lithium treatment and incidence of lithium toxicity (>1.5mmol/L) between groups. Those in the high-risk group were less likely to die during follow-up and had fewer eGFR tests in total.
We used 44 features known to the clinician prior to lithium initiation to generate a prediction model for being in the high-risk group. Elastic net with 10-fold cross-validation fitted a model with λ=0.014 and α=1.00. The ROC area 0.868 (95%CI 0.844-0.891) (Figure 2). The empirical optimal cut-point was 0.134 with a sensitivity of 0.86 (95%CI 0.78-0.94) and a specificity of 0.73 (95%CI 0.63-0.84). The Youden index was 0.589. This gave a prediction accuracy of 74.54% (Table 2).
Features retained in the model were (in order of coefficient size): baseline eGFR, sex, sex by BMI group interaction, baseline eGFR by age interaction, hypothyroidism, migraine, BMI group, SSRI exposure, high LDL cholesterol, BAME, hyperthyroidism, smoking status, type 2 diabetes mellitus, self-harm. The 3-variable KFRE and the 3-variable elastic net model performed similarly well to the full model: ROC area 0.828 (95%CI 0.801-0.855) and ROC area 0.852 (95%CI 0.827-0.876) respectively (Table 2).
External validation
The external validation data set (Gold cohort) included 934 individuals. We developed new trajectory groups independently for these patients. BIC in the group-based trajectory model was minimised by a 5-group solution, with cubic or quadratic polynomials fitted for each group trajectory; 3, 2, 2, 3, 3 respectively from “highest risk” to “lowest risk” groups (BIC=1919.07). Of the total Gold cohort, 14.67% (n=137) were in the high-risk group. Of the total cohort, 229 (24.52%) developed CKD stage 3a or more severe and 14 (1.50%) CKD stage 3b or more severe.
Patient characteristics by risk group are described in Table 3, and trajectories relative to end of lithium exposure are shown in Figure 3. Of those in the high-risk group, 117 (85.40%) develop CKD stage 3a or more severe, and 14 (10.22%) developed stage 3b or more severe, compared to 112 (14.05%) and <5 respectively in the low-risk group.
As with the Aurum cohort, patients in the high-risk group were more likely to be female, be younger, have a lower eGFR before starting lithium and less likely to have a prior record of eGFR<60 mL/min/1.73m2. High-risk individuals were also more likely to experience migraine. Unlike the Aurum cohort, the high-risk group were more likely to have anaemia and less likely to have hypertension. There was no between group difference for lithium duration and lithium toxicity was potentially more common in the low-risk group.
We predicted high-risk group membership using the model generated in the Aurum Data set. The ROC area was 0.879 (95%CI 0.853-0.904) (Table 2, Figure 4). At the empirical optimal cut-point defined in the development dataset, the model had a sensitivity of 0. 91 (0.84-0.97) and a specificity of 0.74 (95% CI 0.67-0.82) The Brier score was 0.0967. This gave a predictive accuracy of 76.55%. However, the simpler models also predicted high-risk group membership similarly well: 3-variable KFRE ROC area 0.870 (95%CI 0.841-0.898), 3-variable elastic net ROC area 0.888 (95%CI 0.864-0.912)(Box 1). The calibration plot suggested that the model performs well up to a probability of 0.60 at the 95% confidence level (Figure 5).
3-variable equation for prediction of high-risk group membership
Risk score = 4.621 - (0.717 x eGFR) - (0.616 x sex†) - (0.008 x age)
probability of high-risk group membership = exp (risk score)/ (1+exp (risk score))
†where sex=1 if male and sex=0 if female
We also predicted CKD 3b or more severe using these models: ROC area 0.849 (95%CI 0.792-0.905), ROC area 0.865 (95%CI 0.808-0.922), ROC area 0.858 (95%CI 0.792-0.922) using the full model, 3-variable elastic net and 3-variable KFRE respectively (Table 4).
The decision curve analysis showed that all 3 of these models were superior to classifying everyone as high-risk or low-risk between a threshold probability of 0.10 and 0.80 and there was little difference between them (Figure 6).
Post hoc supplementary analysis
In 668 patients with a baseline eGFR ≥90 mL/min/1.73m2 a two-group cubic trajectory model minimised the BIC (642.39) with 120 patients (17.96%) in the high-risk group (Table 5, Figure 7). CKD stage 3a or more severe and stage 3b or more severe were more common in the high-risk group. Individuals in the high-risk group were again more likely to be female, be younger and have a lower eGFR before starting lithium. They were more likely to be current smokers. We did not observe any of the other between group differences present in the Aurum or Gold trajectory groups.
In this reduced dataset our full model was better at predicting high-risk group membership than the 3-variable KFRE model (ROC area 0.725; 95%CI 0.675-0.776 vs ROC area 0.667; 95%CI 0.617-0.716; p-value for equality=0.0018), but not the 3-varible elastic net model (ROC area 0.729; 95%CI 0.679-0.779; p-value for equality 0.5846) (Table 6, Figure 8).
Discussion
As far as we are aware, this is the first model developed to predict high-risk of future impaired renal function in people with bipolar disorder prescribed lithium. We used a large representative sample of people with bipolar disorder initiated on lithium and followed up for up for a median of 7.10 years (IQR 3.85-11.36). It is also the first study to use the two CPRD datasets, covering a large, representative sample of the UK population to develop a prediction model and provide external validation (external validation is a vital step often lacking in such studies).
Because of the rarity of renal failure, and the varied follow-up time and eGFR recording frequency in EHRs, we sought to identify approximately 10% of individuals prescribed lithium who were at highest risk of deteriorating renal function, defined by the trajectory of their serial eGFR measurements. Using group-based trajectory analysis we identified high-risk groups independently in the Aurum and Gold cohorts. In both cases approximately 85% of those categorised as high-risk developed CKD stage 3a or more severe compared to approximately 15% in the low-risk groups. Approximately 10% of those identified as high-risk developed CKD stage 3b or more severe, compared to <1% in the low-risk group. A number of features differed between the high-risk and low-risk groups in both cohorts. Those in the high-risk groups were more likely to be female, younger, more likely to have a lower eGFR before starting lithium, more likely to experience migraine and less likely to have a prior record of eGFR<60 mL/min/1.73m2. These CKD risk factors have been previously identified. CKD is more common in women (24), and this has been shown in lithium users specifically (25). Younger women appear to be at particular risk (25). Low baseline eGFR increases risk of CKD in the general population (26). Migraine is not commonly thought of as a risk factor for CKD, but has been identified as such in one study, especially in younger age groups (27). Migraine is highly comorbid with bipolar disorder (28) and it may also be a proxy for medication use which could impair renal function. In both cohorts, there was no association between duration of lithium treatment or lithium toxicity (which was rare) and high-risk group membership.
Our model, developed in CPRD Aurum to predict whether individuals were in a high-risk group for deteriorating renal function during treatment with lithium for bipolar disorder, had excellent discrimination in the CPRD Gold cohort (ROC area 0.879). However, simple models only including sex, age and baseline eGFR performed similarly well (3-variable KFRE ROC area 0.870, 3-variable elastic net ROC area 0.888), all with similar levels of accuracy (>75%). In the external validation data set, our model designed to predict high-risk trajectory predicted CKD stage 3b or more severe (eGFR <45 mL/min/1.73m2) with a ROC area 0.849. Again, simple models performed well, with the 3-variable KFRE having the highest accuracy (78%).
When we reduced our cohort to those starting lithium with an eGFR ≥90 mL/min/1.73m2 Our full model and 3-variable elastic net model performed better than the KFRE. However, our sample was too small to complete external validation of these new models.
Calculating risk of renal impairment could help clinicians and patients better understand the potential for adverse effects of lithium and guide shared decision making. Given our findings, simple risk calculators should be used in clinical practice at the decision to commence lithium and when eGFR is regularly measured, and should be added to lithium monitoring protocols. Risk could be calculated via the KFRE or our 3-varaible elastic net model (Box 1), which performs better than the 3-variable KFRE when eGFR ≥90 mL/min/1.73m2. We did not find predictors of poor renal function that were specific to lithium treated patients.
Strengths and Limitations
Our large, population-based longitudinal sample avoided selection bias and is generalisable and representative. Our group-based trajectory approach avoided issues with differential follow-up time and potential surveillance bias. Our use of elastic net allowed us to build a parsimonious prediction model from a large number of potential features.
The study has a number of limitations. Instead of using precise eGFR values to define outcome we split individuals into those with a high-risk and low-risk trajectory. We forced the group-based trajectory model to identify >10% of individuals prescribed lithium who were at highest risk of developing impaired renal function. A more useful model clinically would be to predict true renal failure requiring intervention; however, this was too rare in our cohort (2% developed CKD stage 3b or more severe), suggesting it is uncommon in modern clinical practice. We included a large number of potential predictors. However, we may not have included all important features in our elastic net model. Some variables of interest, such as proteinuria, were poorly recorded and biased by diabetes diagnosis.
It is possible that we failed to identify people previously exposed to lithium, but we attempted to limit this by ensuring patients had at least a year of follow-up at the same primary care practice before their first identified lithium prescription. In most cases this would also include the uploading of historical records to the EHR. It may be that we excluded people with longer lithium exposure because of this inclusion criteria, and these excluded patients may be at higher risk of CKD, however our cohort did include indivuduals with up to 17 years of lithium exposure. Patients could also be misclassified in terms of different features included in the model. However, our intention was to build a model based on what is already known about the patient from the EHR. Misclassification may be more likely for some features (for example, in a relationship) than others (for example, chronic obstructive pulmonary disease).
We initially planned to develop a model for individuals with essentially normal renal function (eGFR ≥60 mL/min/1.73m2). Although the discrimination and calibration of the model was good, a simple model based on baseline eGFR, age and sex performed just as well.
Conclusion
We developed a model for predicting, at lithium initiation, individuals at high-risk of a poor trajectory of renal function using serial eGFR measurements. We externally validated this model, which had excellent discrimination and good calibration. CKD stage 3b or more severe occurred in 2% of the population across the two cohorts. Whilst this is worrying, it means that the vast majority of patients treated with lithium do not develop renal failure, and those at risk can be identified prior to initiating lithium using their age, sex and baseline eGFR.
Data Availability
Data is avaiable via the Clinical Practice Research Datalink (https://www.cprd.com/) following appropriate ethical approval.