Abstract
Cognitive and neural mechanisms underlying bipolar disorder (BD) and its treatment are still poorly understood. Here we examined the role of adaptations in risk-taking using a reward- guided decision-making task.
We recruited volunteers with high (n=40) scores on the Mood Disorder Questionnaire, MDQ, suspected of high risk for bipolar disorder and those with low-risk scores (n=37). We also recruited patients diagnosed with BD who were assigned (randomized, double-blind) to six weeks of lithium (n=19) or placebo (n=16) after a two-week baseline period (n=22 for FMRI). Participants completed mood ratings daily over 50 (MDQ study) or 42 (BD study) days, as well as a risky decision-making task and functional magnetic resonance imaging. The task measured adaptation of risk taking to past outcomes (increased risk aversion after a previous win vs. loss, ‘outcome history’).
While the low MDQ group was risk averse after a win, this was less evident in the high MDQ group and least so in the patients with BD. During fMRI, ‘outcome history’ was linked to medial frontal pole activation at the time of the decision and this activation was reduced in the high risk MDQ vs. the low risk MDQ group. While lithium did not reverse the pattern of BD in the task, nor changed clinical symptoms of mania or depression, it changed reward processing in the dorsolateral prefrontal cortex.
Participants’ modulation of risk-taking in response to reward outcomes was reduced as a function of risk for BD and diagnosed BD. These results provide a model for how reward may prime escalation of risk-related behaviours in bipolar disorder and how mood stabilising treatments may work.
Key points Question: Do bipolar disorder and lithium treatment change adaptation of risk-taking over time?
Findings: Across an observational study and a randomized controlled trial, we found that while participants modulate their risk taking in a gambling task over time, this was reduced as a function of risk for bipolar disorder. Neurally, this was accompanied by changes in reward memory traces in medial frontal pole.
Meaning: The results show that bipolar disorder is linked to a reduction in adaptation of risk- taking to the environment, suggesting a possible computational mechanism and treatment target.
Introduction
Bipolar disorder (BD) is typically characterized by episodes of depression or mania, lasting weeks and months. Lithium is the most effective mood stabiliser for management of BD, reducing the frequency of both manic and depressive episodes (1). While fluctuating mood episodes have traditionally be seen as lasting weeks or months, more recent work has shown that, in fact, patients with BD show large day-to-day fluctuations in mood even when symptoms are in the non-clinical range (2) and that this is affected by lithium treatment (3). Understanding the processes underpinning bipolar disorder may help us develop and assess more effective treatment approaches.
From a computational psychiatry perspective, two causes for mood fluctuations in bipolar disorder could be considered. First, mood fluctuations could be the result of either increased and prolonged responses to valanced outcomes. Recent work from the field of reinforcement learning has suggested that destabilizing positive feedback cycles between mood and perceptions of rewards may contribute to BD (4–7): In people with subclinical symptoms of BD, positive or negative surprises were found to affect the neural and behavioural responses to reward and punishments. In particular, symptoms were associated with an increase in reward value after a positive surprise. This kind of reward sensitivity has been linked to later changes in mood, suggesting a route by which escalation of reward responses may translate into clinical symptoms (4). Second, mood fluctuations could be the result of reduced behaviours that stabilize mood. Using momentary ecological monitoring has revealed that in the healthy state, when mood fluctuates, people self-report using strategies to re-establish mood homeostasis such as engaging with aversive activities when they are in a good mood
(8). This strategy is reduced in people with depression or low mood (9). However, it is yet unclear whether regulating behaviour is also reduced in BD. In the lab, adaptations of behaviour to past outcomes have been studied in the field of decision-making, revealing temporal interdependencies. For example, people show ‘biases’ such as ‘loss chasing’ (10) (taking more risks to try and recover losses). Here, we used a lab-based task that allowed us to test the impact of BD and its’ treatment on both putative processes.
Optimal decision making involves interplay between frontostriatal systems, which play a role in motivation, reward value and its regulation. The ventral striatum and the ventromedial prefrontal cortex (vmPFC) are implicated in reward anticipation as well as its hedonic impact (11,12). vmPFC is further implicated in the evaluation of options (13), including tracking of past reward outcomes (14). We would therefore expect that if bipolar disorder affects the adaptation of behaviour to past outcomes, these signals in the vmPFC should be changed. By contrast, activity in the dorsolateral PFC is associated with regulation of behaviour towards reward, including self-regulation of reward craving (15,16). Previous work has linked bipolar disorder to increased reward related striatal signalling, coupled with altered patterns of ventromedial and dorsolateral PFC engagement (17) and interaction (18), while a meta- analysis (19) has highlighted a role for orbitofrontal cortex abutting dlPFC.
Here, we have built on these findings to test whether a gradient across a bipolar disorder spectrum (i.e. from low risk to diagnosed bipolar disorder), was linked to changed behavioural adaptation (risk taking) from trial to trial in response to reward/loss outcomes. For this, we recruited 40 volunteers with high scores on the mood disorder questionnaire (MDQ (20)), at suspected high risk for bipolar disorder, 37 volunteers with low scores, and 35 treatment seeking patients with diagnosed BD (n=22 for FMRI). To assess whether behaviour and naturally occurring daily-life mood fluctuations were related, participants completed up to 50 longitudinal testing sessions at home. To understand the neural mechanisms of risk adaptation behaviour, we measured brain activity with fMRI. To test the causal effect of a commonly prescribed mood-stabilizing drug, lithium, 19 patients were randomly assigned to receive six weeks of lithium treatment (dose titrated individually to plasma levels of 0.6-1 mmol/L) and 16 to placebo treatment in a double-blind design.
We hypothesized that BD and risk for BD would be associated with reduced adaption of risk taking behaviour (i.e. choice being less connected to previous experience of a win or a loss), which would be associated with changes in vmPFC and dlPFC signalling of previous win/loss experiences during fMRI. We also hypothesized that these behavioural and neuroimaging differences would be normalised following six weeks of lithium vs placebo treatment in BD.
METHODS
Participants
Participants were recruited in two separate studies (see below). The non-interventional study was approved by the local ethics committee (MSD-IDREC-C2-2014-023) and the interventional study by the National Research Ethics Service Committee South Central – Oxford A (15/SC/0109) and the Oxford Health NHS Foundation Trust. Participants gave informed consent and were reimbursed for taking part in the study.
Volunteers at suspected high vs low risk of bipolar disorder
Participants were recruited through local advertisement and from pools of previous participants. In an online pre- screening session, participants completed the Mood Disorders Questionnaire (MDQ (20)), a self-report screening instrument to identify risk for bipolar disorder. Participants were only invited for a full screening session if they scored either <5 points (‘low MDQ’ group, n=37 included, at presumed low risk for bipolar disorder); or ≥ 7 (‘high MDQ’ group, n=40 included). The screening verified that several of these symptoms measured with the MDQ happened during the same period of time. Structured clinical interviews with the SCID revealed that 5 of this group met criteria for bipolar disorder, despite not having received a formal diagnosis or seeking treatment. See supplementary method [1A] for detailed exclusion criteria.
Patients with BD
Participants were recruited through the BD Research Clinic (Oxford). All participants met criteria for BD-I (n=7), BD-II (n=27) or BD not otherwise specified (BD-NOS, n=1), based on structured clinical interview. All participants were outside major mood episodes requiring immediate treatment. Full exclusion criteria are provided in the supplementary materials [1B]. Participants were assigned to placebo (n=16) or lithium (n=19), in a randomised double-blind design, see below.
Study design
Volunteers
We measured participants’ mood and behaviour in a cognitive task longitudinally five times a week over ten weeks. Brain activity during the same task was measured during an MRI scan. The data here were part of a larger study (supplementary method [1B]).
Patients with BD
This study was a randomised, 6-week, double-blind, placebo-controlled trial (21). See supplementary method [1B] for full information. All participants underwent a two- week pre-randomization phase (‘baseline’) during which they completed the cognitive task and mood ratings daily at home. Due to logistic challenges, for some participants this phase lasted longer than two weeks. For the next phase (6 weeks), participants were pseudo- randomly assigned to receive either lithium (starting dose of 400mg and then titrated to plasma levels of 0.6-1 mmol/L) or placebo in a double-blind design. Only 22 participants were fMRI compatible. Participants were invited to complete online weekly assessments of depression symptoms with the Quick Inventory of Depressive Symptomatology (QIDS, (22)) and symptoms of Mania with the Altman Self Rating Mania Scale (23).
Throughout, we performed two types of group comparisons. First, we compared across risk of BD (i.e. group as ordered factor (24) in regressions, Low MDQ ≤ High MDQ ≤ patients with BD), subsequently referred to as ‘bipolar disorder gradient’. Ordered factors in regression imply a relationship of order between the groups, this does not have to be a linear relationship (i.e. the difference low MDQ to high MDQ can be larger or smaller [but of same sign] than the difference high MDQ to patients with BD). MDQ was not measured in the patient group. As this involved data from the BD group before assignment to lithium or placebo, all participants were included. Significant results were post-hoc followed up comparisons of the individual groups (t-tests). Second, we tested for the effects of lithium treatment as drug (lithium/placebo) x time point (pre, i.e. baseline/post) interactions.
‘Wheel of fortune’ task
Trial structure
On each trial of the task, participants were given two options shown side-by- side. In the at-home version, these were wheels of fortune (Figure 1A). In the fMRI version, they were instead presented as bars. Each option had three attributes: probability of winning vs. losing (size of green vs. red area), magnitude of possible gain (number on green area, 10 to 200), and magnitude of possible loss (number on red area, also 10 to 200). After participants chose one option, the wheel of fortune started spinning and then randomly landed on either win or loss. Finally, participants were shown their updated total score. The experiment was designed so that most choices were difficult, i.e., the options were very similar in expected value, i.e. relative utility (reward magnitude * probability; 90% of choices were not more than 20 points apart; 76% not more than 5 points apart, Figure 1B, Figure S1).
Timings and number of trials
Each day, participants rated their positive and negative mood using the Positive and Negative Affect Schedule – Short Form, PANAS-SF (25). They also gave an overall rating of their mood (‘How are you feeling’, referred to here as ‘happiness VAS’) using a slider ranging from ‘very unhappy’ (red sad face drawing) to ‘very happy’ (green smiley face). They then played 20 trials of the task. After the task, they repeated the happiness VAS.
In the fMRI scanner, participants played 100 trials. All timings were jittered. From the onset of options until participants could make a choice: 1-2 sec; delay between participants’ response and outcomes shown: 2.7 to 7.7 sec; duration of outcome shown: 1-3 sec; duration of total score shown: 1-9 sec; ITI: 1-9 sec.
Behaviour
Behavioural data were analysed in R (26) (version 4.0.2) and Matlab. R-packages: Stan (27), BRMS (28,29), dplyr (30), ggpubr (31), sjPlot (32), compareGroups (33), emmeans (34), ggsci (35).
Group comparisons
To compare groups, instead of a standard ANOVA procedure which tests for any differences between groups, we tested for a systematic effect, i.e. bipolar disorder gradient (group as ordered factor (24), Low MDQ ≤ High MDQ ≤ patients with BD) in linear regressions, also controlling for age and gender. Models used the BRMS toolbox interface for Stan (supplementary methods 2). For this and all subsequent analyses, we used Bayesian Credible Intervals (36) to establish significance by the 95% CI not including zero.
Computational models
Decision making
We used a computational model to capture participants’ choices. The model first computed the overall expected (‘utility’) of each option, then made a choice (left or right option) depending on which option had the better utility, but also allowing for some random choice behaviour (37,38).
First, the model compared the options’ utilities as displayed at the time of choice on the current trial, i.e., probability (prob) x magnitude (mag). We allowed for individual differences in sensitivity to the loss vs. reward utility (λ). We also included in the model a measure of adaptions of risk taking (i.e. loss vs win sensitivity) to past outcomes (‘outcome history’). Specifically, a parameter (γ) changed the weighting of the loss utility on the current trial depending on whether the previous trial’s outcome was a win or a loss (i.e., γ>0 means increased sensitivity to losses after a win on the previous trial). To decide which option to choose, the model compared the utilities of the left and right options taking into account each participant’s ‘randomness’ (inverse temperature (β), higher numbers indicating higher choice consistency): To allow fitting of individual sessions (20 trials), a Bayesian approach was implemented that allowed specifying priors for each parameter (supplementary methods [2C]). The model was validated using simulations and model comparisons (Table S1-3 and supplementary methods [2A-B]).
Group differences
To assess group differences, we entered the session-wise parameters into hierarchical regressions (using BRMS). This allowed us to take into account that parameters might change over days of testing, as well as individual differences in the means and variability (standard deviation) across sessions. For example:
Mean: invTemp(β) ∼ 1 + day + group+ Age + Gender + (1 + day | ID),
And error term: sigma ∼ 1 + group + Age + Gender + (1|ID))
The effect of lithium (vs. placebo) was tested analogously:
Mean: invTemp(β) ∼ 1 + day + group*pre, i.e. baseline/post+ Age + Gender + number_days_baseline + (1 + day | ID)
These models were used for group comparisons of mean parameters (supplementary methods 2C+D). Variabilities of parameters over days were not compared as model validation (Table S1) suggested poor recovery. Mood data (positive and negative PANAS, happiness VAS) were analysed using similar regressions (supplementary methods 2D) to assess group differences in mood (mean or variability) or the relationship between task outcomes and changes in happiness VAS.
Model-free analyses of behaviour
To test that participants could perform the task, i.e., that their choices were sensitive to expected value, we binned their choices (% left vs. right option) according to the overall utility difference between the two options (i.e., left vs. right reward utility minus loss utility, utility=probability*magnitude).
To test sensitivity to risk of losses, as has been previously reported to be affected in BD (39,40), we refined the binning of choices (as above) by further splitting the data according to win and loss utility (i.e. probability * magnitude).
We next analysed behaviour for adaptions of risk taking to past outcomes by considering how participants change their behaviour – here risk-taking (avoidance of potential losses) – based on win/loss outcomes on previous trials (‘outcome history’ effect). For this, we computed how their choices differed after a win or loss on the previous trial (difference % choosing option with lower potential loss [loss utility] after win minus after a loss). We focused on the most extreme (lowest/highest) loss utility bins from the analyses above (‘taking loss utility more into account’) as adaptation to past trial outcomes by taking loss more into account (i.e. a multiplicative effect) should most strongly affect choices the more dissimilar the loss utilities of the two options.
MRI acquisition
Data from all 77 high and low MDQ volunteers and 13 patients with BD were collected on a 3T Siemens Magneton Trio. Data from 9 patients with BD were collected at a different site using a Siemens Magneton PRISMA. Group comparisons include scanner as a control regressor. Scan protocols were carried out following (14), supplementary methods [3A].
FMRI analysis – whole-brain
General approach
Data were pre-processed using FSL ((41), supplementary methods [3B]). Statistical analysis was performed at two levels, event-related GLM for each participant, followed by group-level mixed-effect model using FSL’s FLAME 1 (42,43) with outlier de- weighting. Whole-brain images are all cluster-corrected (p<0.05 two-tailed, FWE), voxel inclusion threshold: z < 2.3.
Regression designs
At the time of the decision, we looked for neural activity correlating with the utility (reward, loss) of the choice. At the time of the outcome of the gamble, we looked for neural activity related to the processing of the outcome (win/loss as continuous regressor). Decision and outcome-related activity could be dissociated due to jitter used in the experimental timing (14). As a key measure of interest, we looked at whether there was a history effect at the time of the choice (i.e., previous trial’s gamble win/loss outcome (14,44), analogous to the behavioural analyses). Full design information: supplementary methods [3C], Figure S2.
Group-level comparisons
We compared the low vs high MDQ groups (n=77) in whole-brain analyses. As only 22 patients with BD were available, these group comparisons were first performed in regions of interest (ROIs) derived from comparisons of the high/low MDQ groups. As exploratory analyses, BD groups were also compared at the whole-brain level.
ROI analyses
Mean brain activations (z-stats) were extracted for each participant. These were used to illustrate group differences and also to perform independent statistical tests (e.g., ROIs of clusters defined based on group differences of high vs. low MDQ could be used to test group differences between lithium and placebo). For this, non-hierarchical Bayesian regressions were used, also controlling for age and gender. Brain activations for the outcome history effect were correlated with the corresponding behavioural measures. For this, effects of age, gender and group (and for the patients with BD: number of days in the baseline phase pre-randomization, i.e. before the MRI scan) were first removed using regressions from both neural and behavioural measures.
RESULTS
We recruited four groups of participants in two separate studies (Table 1). In the group of patients with BD, based on self-report scores (Altman self-report scale, quick inventory of depressive symptomatology), in the phase before the assignment to placebo or lithium, 30% scored in the mania range and 53% scored at least moderate symptoms of depression (Table 1). Similar numbers persisted throughout treatment with lithium vs. placebo (Table 1, Figure S3). Participants with BD took several medications at study inclusion (Table 2).
General performance
Participants completed longitudinal daily behavioural test sessions at home, consisting of 20 trials of a gambling task and mood self-reports. In the task (Figure 1A), participants needed to choose repeatedly between two gambles (wheels of fortune), considering the probabilities of winning or losing points and the number of points that could be won or lost. Participants in all groups performed the task well (Figure 1B), selecting options with higher values more frequently.
Risk taking (avoidance of potential losses)
To test whether sensitivity to (i.e. avoidance of) potential losses vs. wins when gambling was reduced with a bipolar disorder gradient (low MDQ ≤ high MDQ ≤ patients with BD), we built a stochastic decision-making model that described participants’ choices as being based on the reward and loss utilities of the two options while allowing for individual differences in how people made decisions (see table S2 for model comparisons; model accuracy: 71%). The model captured participants’ sensitivity to losses (vs. wins) as a parameter (λ). We found that the higher the bipolar disorder gradient, the lower the sensitivity to losses vs wins (Figure 2Ai, Table S4A, mean =-0.27, 95%CI = [-0.49; -0.05]). This was driven mainly by a step change decrease in the group of patients with BD compared to the low/high MDQ groups, rather than a continuous linear relationship (table S4A for group comparison and continuous measure of mania symptoms across all groups). Lithium vs. placebo did not affect this (Figure 2Aii, table S4B). To illustrate the effect in a model-free way, we plotted the sensitivity of choices to the win or loss dimensions (i.e., steepness of the curve, Figure 2Aiii). This revealed that the difference between groups (group*win/loss dimension* utility bin: mean=0.33, 95% CI = [0.06; 0.61]) is driven by both an increased sensitivity to wins (group*utility bin: mean 0.24, 95% CI = [0.08; 3.99]) and a decreased sensitivity to losses (group* utility bin: mean = -0.15, 95% CI = [-0.30; -0.01]) with the bipolar disorder gradient. Alternative computational models in Table S5.
Outcome history effects
We next analysed how participants adapted their risk taking across trials based on win or loss outcomes in the previous trial (‘outcome history effect’). In the computational model, outcome history effects were captured as a parameter (γ) that described to what extent participants were more sensitive to (i.e. avoidant of) potential losses after a win on the previous trial. We found that the bipolar disorder gradient reduced outcome history effects (Figure 2Bi, Table S4A+S5 mean=-0.05, 95% CI=[-0.11; -0.0003], showing also a continuous effect with mania symptoms across all groups, table S4A). This was not affected by lithium (Figure 2Bii, Table S4B+S5). We can unpack this effect in the data without a model (Figure 2Biii) by focusing on the most extreme loss utility bins (if the loss utility difference is small, it will not affect choices if it is taken slightly more or less into account). If people show no outcome history effect, their choices should not change depending on the last trial’s outcome. However, the low MDQ group in fact takes the loss dimension more into account after a previous trial win vs. loss (i.e. less likely to pick options with high potential loss and more likely to pick options with low potential loss). This effect decreases with the bipolar disorder gradient (group*last loss/win: mean=0.02, 95% CI = [0.0008; 0.04]).
Mood
Finally, an advantage of the behavioural data being collected at home was that we could relate daily mood ratings to task-based behaviour. As reported previously (3,45) and similar to other studies (2,46,47) groups differed in their instability (standard deviation) of mood: The low MDQ group showed the lowest and the patients with BD the highest mood instability (positive PANAS: mean= 0.22, 95%CI = [0.11; 0.33]; negative PANAS: mean=0.64, 95%CI = [0.45; 0.83], Table S8A, Figure S4A). Lithium did not affect instability when using our measure of standard deviation here (Table S8B, Figure S4B), though note that using a measure of Bayesian volatility, lithium has been found to increase volatility of positive mood (3). Across all groups, happiness VAS at the end of each session, compared to before was increased by overall (summed across the whole session) reward and decreased by loss outcomes (mean = 0.42, 95%CI = [0.31; 0.52]), similar to previous reports (48,49). However, this did not differ by bipolar disorder gradient (mean =-0.06, 95% CI = [-0.15, 0.03], table S8C). While mood instability differed between the groups, the impact on behaviour was distinct, with mood instability affecting the choice noisiness (the more unstable the mood, the more random the choices), without clearly affecting either loss sensitivity or outcome history effects (Figure S4C). The relationship between mood (PANAS) on the day of testing (rather than an overall measure of instability) and behaviour was not robust (table S8D). An exploratory analysis found that in the BD group, positive mood (PANAS) before the session led to reduced choice noisiness (figure S4C, stats on the regression interaction term BD gradient x PANAS predicting choice noisiness: mean: 0.18, 95%CI: [0.01; 0.35]).
Neural results
Neural data were available for 77 volunteers and 22 patients. Across volunteers, brain activations to reward and loss utility during decisions (Figure 3A) and at the receipt of outcomes (Figure 3C) activated brain evaluation networks, including ventromedial prefrontal cortex (vmPFC), ventral striatum, dorsal anterior cingulate cortex (dACC), insula (Table S9). Next, we tested whether, related to the outcome history effect, there was brain activity when participants made a choice that correlated with the previous trial’s outcome. Indeed, we found that activity in a network including the ventral striatum, vmPFC and medial frontal pole (FPm) related to the outcome of the previous trial, i.e. increased activity the more positive (and less negative) the previous trial’s outcome (Figure 3B, Table S9).
Next, we compared the low and high MDQ groups. Activity for the previous trial’s outcome was higher for the low MDQ vs high MDQ group in FPm (Figure 4Ai-ii, Table S10, p=0.038, whole-brain cluster corrected). In other words, while all participants showed activity in vmPFC/FPm, in low MDQ participants the cluster extended further into FPm. Moreover, there was a correlation between the neural signal for the previous trial’s outcome and the behavioural outcome history effect: the stronger the activity for the last trial’s outcome in this area, the stronger the behavioural outcome history effect (Figure 4Aiii, r=0.24, p=0.017, partial correlation after correction for control variables and group; without correction: r=0.28, p=0.005; test performed as robust regression, controlling for outliers: 95% Bayesian CI = [0.03; 1.52]). Lithium vs. placebo participants’ activity did not differ in this area (mean=0.64, 95% CI = [-0.23; 1.44]).
As exploratory analyses (due to low sample sizes in BD groups for the MRI scan), we next compared lithium vs. placebo treatment at the whole-brain level. We found that patients receiving placebo had stronger activity related to the outcome of gambles in an area spanning dorsolateral prefrontal cortex (dlPFC, area 46) and lateral frontal pole (Figure 4B, Table S10B, p=0.009). We also tested whether the gamble outcome activation related to the behavioural outcome history effect, finding that interestingly it did (Figure S5), in an mFP area overlapping with the area of group differences identified above, though not in the dlPFC area of group differences between lithium and placebo (Figure 4A).
DISCUSSION
We designed a study to test the computational and neural correlates of adaptations of risk- taking to gains and losses in bipolar disorder (BD), in risk of bipolar disorder and treatment with lithium. We included participants along a gradient of bipolar disorder ranging from volunteers with low risk of BD (low MDQ group), to volunteers with high risk of BD, to patients with diagnosed BD. In the patients, we tested the effect of lithium treatment in a placebo- controlled double-blind design. We measured how much participants adapted their risk- taking following reward outcomes in a risky decision-making task (‘outcome history effects’). We measured behaviour both longitudinally over up to 50 days and during a brain imaging (FMRI) session. We found that the low MDQ group showed an ‘outcome history effect’. Specifically, after a win on a trial, they were more risk averse (avoiding potential losses). This was reduced across the bipolar disorder gradient (lowest risk aversion adaptation in patients with BD). Neurally, outcome history was related to the representation of past information in a large network including ventromedial prefrontal cortex (vmPFC) and medial frontal pole (FPm). In low MDQ volunteers, this brain signal extended further dorsally into FPm compared to the high MDQ scorers and this was correlated with risk adaption behaviour.
Decreased loss sensitivity and reward hypersensitivity have been suggested as central to BD (39,40) (50,51) and may drive risky or impulsive decision making. Our findings of decreased sensitivity to potential losses (vs wins) with BD gradient are in agreement with this. This effect showed a step change between the volunteer and the patient groups, rather than a continuous effect across the gradient. Then, we went further looking at adaptation of risk taking to past outcomes. We found that volunteers with presumed low risk of bipolar disorder (low MDQ) showed sequential dependencies between their choices and previous trials’ outcomes, avoiding potential losses after a win on the previous trial (‘outcome history effect’), as similarly recently reported in a go/no go decision-making task (52) . This was not strictly rational in our task since outcomes for gambles across trials were independent (10). However, this kind of behaviour observed in the lab may be functionally appropriate in more naturalistic environments (38,53–55) and thus reflect prior beliefs participants have about reward distributions (e.g. non-independence between trials). For example, in natural environments, which are experienced continually rather than in discrete trials and in which different types of rewards (e.g. food, water) need to be accumulated or a homeostatic setpoint needs to be reached, it would make sense to adapt behaviour according to previous outcomes (56–60). The influence of past losses (vs wins) was lower in the high vs low MDQ group and lowest in patients with BD (i.e. the pattern showed a continuous gradient, also captured as a linear relationship to mania scores across all groups, rather than a step change from volunteers to patients). Reduced homeostatic behaviour of this kind could lead to unstable moods since in the healthy population mood has been found to be regulated through behaviour (8). Relatedly, in patients with BD, purposefully regulating behaviour during the prodromal periods has been shown to reduce the risk of relapse (61). However, in our study, links between ratings of mood and behaviour were weak and so this suggestion remains speculative. Future studies could measure mood over longer timescales, more frequently than done here and using a more naturalistic task. We also note that our findings diverge from previous findings (4) of a stronger impact of previous rewards (and associated emotions) on the perception of outcomes in a study including a participant sample not specifically selected for BD diagnosis or risk of bipolar disorder, but completing the Hypomanic Personality scale (62) after the task.
We focused on whole-brain analyses for the low/high MDQ volunteer sample due to the larger sample size compared to the patient study. Decision-making and the processing of outcomes produced a typical pattern of activation (63–66) in areas including dorsal anterior cingulate cortex, striatum and vmPFC. However, there were no group differences in any of these signals, matching our behavioural results of an absence of differences in general ability to make decisions or sensitivity to rewards vs. losses per se in the low vs. high MDQ groups. We next looked for brain activity related to the modulation of risk taking with ‘outcome history’. We found that at the time when people made decisions, there was activity representing the last trial’s outcome in an area spanning vmPFC to FPm. This is similar to previous findings in a learning context of between-trial activities (14,44,67). This gamble outcome activation was related to the behavioural outcome history effect across participants. This signal extended more dorsally into FPm in low MDQ volunteers. Furthermore, the stronger this signal, the stronger the modulation of risk taking by outcome history. As such the influence of outcomes on decision making may be a feature of risk for bipolar disorder which involves the FPm. This adds to previous work linking BD to changes in reward related signals in ventral striatum and OFC (19,68) and changes in connectivity between striatum and PFC (17,18). In this region, lithium did not affect brain activity, suggesting that its mechanism of action may not involve direct modulation of vmPFC value weighting.
In an exploratory analysis, we compared the brain activity of patients with BD randomised to lithium or placebo. Patients given placebo showed larger outcome-related activity in dorsolateral prefrontal cortex. Yet at the same time, lithium did not change behaviour. dlPFC signalling has largely been associated with regulation of mood and reward-related behaviour. Previous work in bipolar disorder has showed altered patterns of both vmPFC and dlPFC activity. In particular, Mason et al. (17) reported that while controls activated dlPFC more to rewards of high probability, patients with bipolar disorder showed greater dlPFC to low probability (more risky) rewards. As such, our preliminary findings suggest that lithium may modulate a key component of frontostriatal circuitry important for effective decision making. Previous work in healthy volunteers also reported an effect of lithium on reward related signals in the ventral striatum which wasn’t detected in the current study(69).
The current work has a number of limitations. Our sample size was low for the comparison between lithium and placebo fMRI responses, which may have affected our statistical power for key comparisons. It is also relevant that we saw no effect of lithium on the clinical questionnaires included in this study. However, this is consistent with the characteristics of the sample recruited here, where current symptoms were largely residual (i.e. outside of an acute episode). Furthermore, lithium is largely used for relapse prevention rather than acute treatment of mania or depression (70) which could not be tested in the short timescale of the current investigation. Data across a large number of tasks and measures were also completed as part of these studies, and analysis is still ongoing. These complete results may shed light on the overall effects of bipolar disorder risk and treatment on different facets of mood and cognition. While we pre-registered our lithium trial (2014-002699-98), we did not pre-register our specific hypotheses for this part of the analysis. While we found an expected value signal (chosen minus unchosen value) in a typical ‘negative value’ network including the dACC, we did not find a ’positive value’ signal in a typically expected area like the vmPFC. This is unlikely to be due to signal drop out as vmPFC showed activation with reward outcome and an outcome history signal at the time of choice. This result is reminiscent of our previous findings (14), where it was interpreted as possibly due to the integration of an aversive dimensions (there: effort) with reward, rather than only integrating two positive dimensions (e.g. reward probability and reward magnitude). Similarly, here, participants were faced with a negative dimension, i.e. monetary loss.
Our results highlight the importance of considering rewarded decision-making and related neural activity to understand symptoms of bipolar disorder and the stabilising effects of lithium.
Contributions
JS: Conceptualization, software, formal analysis, writing, original draft, reviewing and editing
PP: Investigation, writing, reviewing and editing
NN: Conceptualization, investigation, reviewing
NK: Conceptualization, reviewing and editing
LZA: Investigation, writing, reviewing and editing.
MFSR: Conceptualization, design, reviewing and editing.
ACN: Funding acquisition, conceptualization, supervision, design, reviewing and editing.
PJH: Funding acquisition, Conceptualisation, recruitment, reviewing and editing
JG: Funding acquisition, conceptualisation, design, supervision, reviewing and editing
KS: Investigation, project administration, reviewing and editing
CJH: Funding acquisition, conceptualization, design, supervision, reviewing and editing
Financial disclosure
The study was funded by a Wellcome Trust Strategic Award (CONBRIO: Collaborative Oxford Network for Bipolar Research to Improve Outcomes, reference No. 102,616/Z). JRG, CJH, PJH and KEAS are supported by the Oxford Health NIHR Biomedical Research Centre. MFSR is funded by the Wellcome Trust (221794/Z/20/Z). The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z). JS has been funded by the Institut National de la Santé et de la Recherche Médicale, the Biotechnology and Biological Sciences Research Council (BB/V004999/1, Discovery Fellowship) and Medical Research Council (MR/N014448/1, Skills Development Fellowship). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care.
JS, PP, NN, LZA, NK, JG, MFSR report no biomedical financial interests or potential conflicts of interest. CJH has received consultancy payments from P1vital, Lundbeck, Compass Pathways, IESO, Zogenix (now UCB). PJH reports receiving an honorarium for editorial work for Biological Psychiatry and Biological Psychiatry Global Open Science. ACN was non- executive director at the Oxford Health Foundation Trust during a period overlapping with the study. KEAS has received consultancy payment from Yale University.
Code and data availability
Code and anonymized (behaviour, selected demographics, brain activity from regions of interest) or group level (whole-brain FMRI) data will be available on osf.io upon publication. Access for reviewers (link to be replaced with public one upon publication): https://osf.io/ychbf/?view_only=0a6fce5cc7e74e9c88c542b6aa7d8680
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
Supplementary methods
[1] Additional participant information
[1A] Exclusion criteria
Healthy volunteers with low and high mood instability
Common inclusion criteria across both groups were: over 18 years old, no current medication (other than contraceptive pill), no use of antidepressants, antipsychotics, lithium or anticonvulsant medication in the last 6 weeks, no contraindication to MRI or MEG. Participants were recruited based on their scores of mood instability (MDQ). The questionnaire includes first questions describing symptoms of bipolar and second a question about whether the different symptoms occurred at the same time. Participants were included in the low MDQ group if their number of reported symptoms was 5 or less. Participants were included in the high MDQ group if they reported 7 or more symptoms and report that they have happened at the same time. Additionally, in the low mood instability group, we excluded participants with a current or past diagnosis of an axis 1 psychiatric disorder (assessed using DSM-IV interview) or a first degree relative with bipolar disorder. In the high mood instability group, we excluded participants with a current or past diagnosis of an axis 1 psychiatric disorder other than bipolar disorder I or II, major depression or anxiety disorders.
Participants diagnosed with bipolar disorder
Inclusion criteria: over 18 years old, meeting criteria for BDI, BDII or BDNOS as assessed using the Structured Clinical Interview for DSM-V Axis I Disorders (SCID-I), clinically significant mood instability (established through interview), not currently suicidal (currently suicidal assessed as a score of ≥ 4 on the C-SSRS (Columbia Suicide Severity Rating Scale Score (8)), no counterindications to lithium (assessed through pre-treatment tests including renal, cardiac, thyroid and parathyroid functions), not currently taking any psychotropic drugs that could not be withdrawn, not requiring acute treatment so that placebo would be inappropriate, participation in a previous research trial in the past 12 weeks. Participants with counterindications to MRI scanner were included in the behavioural part of the study.
All participants
When inspecting the PANAS daily mood ratings, we noticed that for some participants on some days, every single response (to all positive and negative items was zero). This suggests a technical problem. We set the data from these days to zero. There were 25 participants (across all groups) for who this happened for more than 10% of measurements. Because of using hierarchical models, we could include most participants for all analyses. Specifically, we could include all participants that did not have a standard deviation of zero for the mood measures (this excluded 1 participant for the analyses for negative PANAS, 0 for positive PANAS and 2 for the analyses of changes in mood before and after the task).
[1B] Larger study – full information
Volunteers
The data presented here was part of larger study (CONBRIO, Collaborative Oxford Network for Bipolar Research to Improve Outcomes: The cognitive neuroscience of mood instability; Cognition and Mood Evolution across Time (COMET) – MSD-IDREC-C2-2014-023). Participants who expressed interested in the study were given an electronic version of the information sheet and the Mood Disorder Questionnaire (MDQ). If they scored in either the ‘low MDQ’ or the ‘high MDQ’ category they were invited for a first study visit.
During the first study visit, inclusion and exclusion criteria were checked. Participants were also screened for psychiatric disorders using the Structured Clinical Interview for DSM Disorders (SCID) (9). Participants completed several questionnaires, including Barratt Impulsiveness Scale (10), Sleep Condition Indicator (11), Maclean Screening Instrument (12), Affective Lability Scale (13), Affect Intensity Measure (14). Participants were set up with and instructed how to use devices to measure their activity (GeneActiv watch and FitBit) over the ten weeks study period. They were given an iPad mini and trained on four cognitive tasks: ‘wheel of fortune’ – risky decision making (presented here), ‘guess the gap’ – performance learning, ‘fractals’ – stimulus-outcome learning, ‘whack-a-t’ – implicit spatial learning.
Over a period of ten weeks, they were asked to complete these tasks five times a week and to wear the GeneActiv and FitBit devices as much as possible. They were also asked to complete clinical questionnaires (Quick Inventory of Depressive Symptomatology (15), Altman self-rating mania scale (16), Generalized Anxiety Disorder-7 (17), EuroQol-5 [health-related quality of life] (18)) on the True Colours mood monitoring system (19) once a week. They were also asked to stay within the recommended daily alcohol intake levels throughout.
In the beginning of the ten-week period (weeks one or two) and in the end (weeks nine and ten), participants attended an MRI scan and a MEG scan. Due to MEG scanner downtime, only 24 participants received the MEG scan. During both MRI scans, resting state and structural data was obtained. At the first scan, additional the ‘wheel of fortune’ was measured. At the second MRI scan, ‘fractals’ and ‘guess the gap’ was measured, as well as diffusion tension imaging and fluid-attenuated inversion recovery.
Participants diagnosed with bipolar disorder
Participants first took part in a screening visit in which inclusion criteria were checked and informed consent was taken. Using the SCID-I, a diagnosis check was done. In addition, demographic and clinical information was obtained, including duration of illness, previous use of psychotropic medicines, family history of mood disorders, presence of comorbid borderline personality disorder and attention deficit hyperactivity disorder and current suicidal ideation, concomitant medication and substance use and a physical examination. If blood samples have not been taken as part of routine monitoring, they were taken at this visit. Two sets of samples were taken, one was sent to the pathology lab for analysis and the other was retained as replacement for samples lost/damaged in transit and for storage for future research. Tests included urea and electrolytes, full blood count, fasting blood glucose, glycosylated haemoglobin (HbA1c), blood lipid profile, LFTs, T4, T3, TSH, thyroid antibodies, PTH, vitamin D, eGFR, Cystatin C and NGAL and inflammatory markers CRP and IL-6. A sample was taken to measure calcium level using the InSight™ Electrolyte Analyser located in the NIHR-CRF. Weight/BMI, pulse and blood pressure were also recorded, and an ECG was performed. Participants were given an iPad mini and trained on the same cognitive tasks as the healthy volunteers described above. They were also set up with the True Colours system to rate weekly mood and on Mood Zoom (20) to rate daily mood. Participants were also given activity monitors. They were also given saliva swabs.
Before being randomised to lithium or placebo, all participants completed two weeks of daily cognitive tasks, mood and activity measurements at home (though some participants completed up to 30 days due to logistic challenges). Then, they were randomised and performed six weeks of cognitive tasks, mood and activity measurements.
In the beginning of the six weeks period (week one or two), they completed an MRI and a MEG scan using the same scans as described for the healthy volunteers.
Randomisation: The first 10 participants were fully randomly assigned to avoid predictability, while for subsequent participants, an algorithm was used to minimize differences in age (<25 or >25 years) and gender between the two groups. In the lithium group, participants were titrated to doses producing plasma levels of 0.6-1 mmol/L (see supplementary methods 1C for dosing details).
[1C] Lithium dosing information
In the lithium group, participants were prescribed an initial dose of 400g/day, unless there was a clinical indication to start at a lower dose. During this phase, participants attended brief assessments at 4-days, 8- days and between 2 and 3 weeks post-randomisation to review lithium levels by a psychiatrist (if lithium level ≤ 0.3 mmol/L, dose increased to 800mg/day; if lithium level between 0.4 to 0.5 mmol/L, dose increased to 600mg/day; if lithium level 0.6 -1.0mmol/L, continued current dose; if lithium level ≥ 1.0mmol/L, decrease dose by 200mg/day or 400mg/day as found appropriate by psychiatrist), receive additional supplies of lithium/ placebo as needed and were asked about adverse events. Participants took part in one neuroimaging session in week 3 or 4. During the trial, participants were asked to complete the cognitive tasks daily.
[2] Computational modelling, additional information
[2A] Decision making model validation
We validated our computational models using simulations (21,22). We simulated 400 participants with parameter values (mean and standard deviations) drawn from a uniform distribution in the 95% range of the parameters for real individual participants. For each participant, we simulated 47-50 sessions (uniform distribution). Parameters for single sessions were drawn from normal distributions of simulated participants’ means, standard deviations and linear effects of days. Simulated data was then fitted using same approaches as above. To speed up fitting of data, variational Bayesian approximation (23) was used unless control indices (pareto smoothed importance sampling, khat >0.7 (24)) suggested unsuccessful fitting even after increasing number of samples and decreasing tolerance, in which case sampling was used. When fitting the models, initially, 4 chains, with each 15,000 iterations were drawn and the target acceptance rate (adapt_delta, (25)) was set to 0.85. Whether models had been fit appropriately was checked using a criterion of R-hat (measure of mixing of chains) < 1.1 and absence of divergent samples. If these were not fulfilled, number of iterations were increased by 50% and adapt_delta was increased towards 1 (by 50% of distance from 1). This was repeated until all models converged.
To validate the model, we then checked the correlations between true and fitted values for mean and standard deviation of parameters across individual subjects (table S1).
[2B] Alternative decision-making models and model comparison
In addition to the models in the main text (section Computational models – decision making), we also considered two other classes of models (see table S2 for full list). First models that incorporated probability and magnitude distortions according to prospect theory (26) (class ‘M2’): Where Here, ε is the probability distortion; δr is the distortion of the reward magnitudes; δl is the distortion of the loss magnitudes and λ is the weighting of the loss (scaled mag * scaled prob).
We also fitted further versions of this model, leaving out the probability distortion, the loss scale or the magnitude distortions.
Of note, due to there only being 20 trials available per session, not all of these models could be fitted. Specifically models that contained probability distortion could not be fitted and models with a loss weight in addition to exponential scaling of loss could not be fitted.
In these models, the outcome history effect was initially included in the exponential of the loss magnitude: However, we noted that, potentially due to the difficulty of estimating parameters that are used as an exponent, parameter recovery for the outcome history effect in this model was not very good. Therefore, we also included the outcome history parameter as a linear weight of the exponentially distorted magnitudes.
The second set of models allowed participants to differ in their relative weighting of expected value, variance and skew (27) (class ‘M3’): Where: (given that Magloss is a positive number). Again, we fitted this model also without the weighing for skew or without the weighing for variance. In this model, outcome history was captured as impacting the weighing of variance or skew or both.
To compare models, we used the Akaike Information Criterion (AIC) (28), which combines the log likelihood with the number of model parameters to avoid selection of over-parameterized models. Models were fit across all sessions from each participant, including for each parameter (other than outcome history effects, see table S5) a linear effect of day:
Parametert = Parametert0 + day_effect*current_day
AIC values were summed across participants. Participants for whom not all models could be fit were omitted (n=2).
[2C] Bayesian models – additional information for standard settings
Regression models were computed with the BRMS toolbox (29) which uses the Bayesian programming language Stan (30). The key advantages of the Bayesian approach are: priors can be defined to ease fitting, particularly when little data is available (as here only 20 trials per session); models can be hierarchical and account for individual differences in each parameter (i.e. taking into account data consistent of within and between subject measurements, e.g. several data points per person and several subjects); variability in measurements across people can be taken into account.
Linear non-hierarchical regression
All regression estimates (parameters) were given flat priors for all parameters and 5000 iterations for each of four chains were drawn (target acceptance rate, adapt_delta = 0.9). Model fit was checked using criterion of Rhat <1.1 and the absence of divergent samples (25). If models did not converge, iterations and adapt_delta were increased step-wise, up to a max of 25312 iterations and adapt_delta = 0.991. If fitting was then still not successful (only the case for Prospect Theory models, class M2, listed in [2B] above), the sessions were left out from model comparisons.
Linear hierarchical regressions
To ease fitting (31), all regression estimates were given weakly informative priors, normal(0,5). 6,000 iterations for each of 4 chains were drawn (target acceptance rate, adapt_delta = 0.9). Model fit was checked as above for non-hierarchical models. Significance follows the standard definition of the Bayesian 95% Credible Interval not including zero. To compare individual groups, the same model was fitted with group as an unordered factor and posthoc tests were then done using the emmeans package (32) (again using 95% Credible Intervals to define significance). Results of regressions are illustrated as conditional effects, i.e. all other variables are set to their mean. We computed mean parameters for individual participants to relate to neural activity.
Computational decision-making models – prior settings
For each parameter weakly informative priors were specified for models for the longitudinal or the FMRI data for each session (longitudinal) or person (FMRI): inverse temperature (β): cauchy(5,3), weighting of the loss utility (λ): cauchy(-1,1), the impact of the previous trial’s win/loss on the weighting of the loss utility (γ); group level standard deviations: inverse temperature (β): cauchy(0,3), all other parameters: cauchy(0, 1).
When fitting the decision-making models, number of iterations and adapt_delta were increased until fit indices suggested appropriate fit, as described above.
FMRI
Computational models were fitted as for the longitudinal data, i.e. first separately for each individual participant before then comparing the computational model parameters across groups using non- hierarchical models (as one FMRI session per person).
[2D] Regressions relating mood, task outcomes and behaviour
We used hierarchical regression models to test for group differences in the impact of task outcomes on mood:
Mean: Happiness (post minus pre) ∼ 1 + Task outcomes* group + Task outcomes + group + day + Age + Gender + (1 + day + outcome | ID)
And error term: sigma ∼ 1 + group + age + gender + (1|ID)
Where outcome was either the total wins in the daily task, the total losses or the total wins minus losses. Group was coded as monotonic factor.
In addition to the happiness VAS that was measured before and after the task, we also measured mood using a more detailed questionnaire (PANAS-SF) before the task. We used this to replicate previous findings (1,20,33,34) of mood instability related to bipolar disorder (with hierarchical models):
Mean: PANAS ∼ 1 + day + group + age + gender + (1 + day |ID)
Error term: Sigma ∼ 1 + group + age + gender + (1|ID)
Where PANAS was either the positive or the negative PANAS score.
[3] MRI scan
[3A] MRI acquisition sequences
Scan protocols were similar across both sites and differences are highlighted. T1-weighted structural images were acquired with the settings TR=3 sec, TE=4.71 msec (4.65ms for second site [some bipolar patients]), TI (inversion time) = 1.1 sec, 1x1x1 mm voxel size, 256x176x224 mm grid, flip angle = 8°, phase-encoding direction = R-L, GRAPPA (Generalized autocalibrating partially parallel acquisition) = 2. Functional images were acquired using a Deichmann echo-planar imaging (EPI) sequence with TR=3 s, TE=30 ms, 3x3x3 mm voxel size, 87° flip angle, 30° slice angle and z-shimming to reduce signal distortions as well as dropout in medial orbitofrontal areas (35). A fieldmap with dual echo-time images (TE1 = 5.19 ms, TE2 = 7.65 ms, whole brain coverage, voxel size 3.5 × 3.5 × 3.5 mm) was obtained for each subject to allow for corrections in geometric distortions induced in the functional images.
[3B] FMRI preprocessing
We used FSL (36) version 6.00 for standard image preprocessing and analysis (suppl. Methods [4B]. We used FSL’s BET (37) on the high-resolution structural MRI images and fieldmaps images to separate brain matter from nonbrain matter. We used the structural images to register functional images in MNI space using nonlinear registration as implemented in FNIRT (38). Functional images were corrected for motion using FSL’s MCFLIRT (39), corrected for geometric distortions using FSL’s FUGUE (FMRIB’s Utility for Geometrically Unwarping EPIs) and spatially smoothed with a Gaussian kernel of 5mm full-width half-maximum. Finally, images were then high-pass filtered with a 3 dB cutoff of 100s.
[3C] FMRI analysis
Data were pre-whitened before analysis (40). The fMRI design was as follows (see Figure S2 for design correlation matrix): We included four boxcar regressors capturing the different phases of each trial: the decision phase (aligned to the onset of the decision phase and lasting until participants could make a choice), the spinning phase (aligned to when the indicator on the chosen wheel of fortune started moving and lasting until it stopped), the outcome phase (the time the outcome was shown to participants and lasting until it disappeared from the screen) and the total score phase (aligned to when the screen with the total score was shown and lasting until it disappeared). Here the decision and the outcome phase are the main phases of interest, the others are included as control regressors. We also included parametric boxcar regressors aligned to same onsets as the phases described above, but with duration one second. All regressors were z- score normalized within each participant. In the decision phase, we included separate regressors for the reward and loss utilities (i.e. probability x magnitude) of the chosen minus the unchosen options, a regressor for last trial’s outcome (win loss, including the magnitude therefore, e.g. +10 or -20), as well as participants’ log-transformed reaction time as a control regressor. In the outcome phase, we included a regressor indicating the current trial’s outcome (win/loss, including the magnitude thereof). As control regressor we also included the total score phase with total scores as parametric value. All regressors were convolved with a double-gamma hemodynamic response function.
[4] Bayesian mood instability models
Pulcu et al. (1) proposed a model of mood variations that captures simultaneously variability in mood ratings and drifts (volatility) in the mean mood ratings. We adapted this model here (simplifying due to less data being available than in Pulcu et al., that neither volatility nor standard deviations changed over time and fit to both positive and negative PANAS simultaneously). The same model also captured relationships between PANAS standard deviation and behaviour. The key equations of the model included:
PANASratings[t] ∼ normal(PANASt,PANASsd)
PANASt ∼ normal(PANASt-1, PANASvolatility)
Where volatility and standard deviation of PANAS ratings were shared across positive and negative PANAS. In contrast, PANAS values (PANASt above) were captured separately for positive and negative PANAS. Behaviour ∼ normal(b0 + bPANAS_sd*PANASsd + btesting day*testing_day, behavioursd)
The model was fit to data from all participants who had at least 5 data points available for all measurements and standard deviations of both positive and negative PANAS above 0 (i.e. who did not always report exactly the same mood). Models were fitted as hierarchical models (mixed effects models), with group level parameters (mean and standard deviations) fitted for PANASsd, PANASvolatility, btesting_day and behavioursd. Parameters for individual participants were then drawn from the thus defined normal distributions. PANASt for positive and negative PANAS was fitted as one parameter per person per day (with the temporal order constraints as described in the regressions above). All parameters were given priors normal(0,1), for constraint parameters (i.e. standard deviations and volatility), log transformations were used.
To test how predictive mood instability was of group membership, we first fitted a simpler model of only the PANAS scores, excluding the behaviour, separately for each person. We then used the PANASsd and PANASvolatility to predict group membership in a leave-one-out cross-validation procedure, each time fitting a model of the form:
Group ∼ 1+ PANASsd + PANASvolatility
The model was fit to training data (i.e. all participants apart from one) and used to predict the test data (the left out participant). We trained models separately predicting low vs. high MDQ and predicting all three groups.
Footnotes
Expanded introduction and discussion. Clarified methods. Expand description of sample. Performed new supplementary analyses