Digitally managing depression: a fully remote randomized attention-placebo controlled trial
===========================================================================================

* Aaron Kandola
* Kyra Edwards
* Marie AE Muller
* Bettina Dührkoop
* Bettina Hein
* Joris Straatman
* Joseph F Hayes

## Abstract

**Background** Depression is a common and disabling condition. Digital apps may augment or facilitate care, particularly in under-served populations. We tested the efficacy of juli, a digital self-management app for depression in a fully remote randomized controlled trial.

**Methods** We completed a pragmatic single-blind trial of juli for depression. We included participants aged over 18 who self-identified as having depression and scored 5 or more on the Patient Health Questionnaire-8 (PHQ-8). Participants were randomly assigned (1:1) to receive juli for 8 weeks or a limited attention-placebo control version of the app. Our primary outcome was the difference in PHQ-8 scores at 8 weeks. Secondary outcomes were remission, minimal clinically important difference, worsening of depression, and health-related quality of life. Analyses were per protocol (primary) and modified intention-to-treat (secondary). The trial was registered at the ISRCTN registry ([ISRCTN12329547](http://medrxiv.org/external-ref?link_type=ISRCTN&access_num=ISRCTN12329547)).

**Results** Between May 2021 and January 2023, we randomised 908 participants. 662 completed the week 2 outcome assessment and were included in the modified intention-to-treat analysis, and 456 completed the week 8 outcome assessments (per protocol). The mean baseline PHQ-8 score was consistent with a diagnosis of moderately severe depression. In the per-protocol analysis, the juli group had a lower mean PHQ-8 score (10.78, standard deviation 6.26) than the control group (11.88, standard deviation 5.73) by week 8 (baseline adjusted β-coefficient -0.94, 95%CI -1.87 to -0.22, p=0.045). Remission and minimal clinically important difference were increased in the juli group at 8 weeks (adjusted odds ratio 2.22, 95%CI 1.45-3.39, p<0.001 and adjusted odds ratio 1.56, 95%CI 1.08 to 2.27, p=0.018). There were no between-group differences in health-related quality of life physical or mental component scores or worsening of depression.

**Conclusion** Use of juli reduced symptoms of depression at 8 weeks compared with an attention-placebo control. The juli app is a digital self-management tool that could increase accessibility of evidence-based depression treatments.

## Introduction

Depression is a major contributor to the global burden of disease. Each year, millions of people are diagnosed with depression, and is likely to be the leading cause of disability in high-income countries by 20301. The mainstay of treatment for depression includes psychological intervention and antidepressant medication. Barriers to accessing care include high costs, low availability, long waiting lists and stigma. Digital interventions may remove some of these barriers by offering scalable solutions that are convenient and timely for people with depression2.

Recent meta-analyses of randomised controlled trials (RCTs) of smartphone app-based psychological interventions for depression symptoms find a small to moderate reduction in symptoms compared to placebo, such as waiting list control3,4. However, subgroup analyses find no difference in depressive symptoms in studies with active control groups, which is corroborated by the findings of other systematic reviews5. Reviews also only include small numbers of trials, which are frequently of short duration and/or include few participants. Participants often do not have severe depression3 Many of the apps included in the reviews are no longer (or have never been) commercially available, highlighting that translation from research to clinical impact is difficult in this space. Other challenges include low levels of retention and engagement with digital health apps in trials and real-world use6. The reviews all conclude that further methodologically robust RCTs are required. We sought to address some of the limitations of previous RCTs by pre-registering our trial, powering it adequately to detect between-group differences, following-up users for an adequate duration, efforts to minimise attrition and avoiding recruitment from unrepresentative populations (such as from mood disorders clinics). Our RCT was fully remote, increasing cost-effectiveness, time efficiency and reach.

The digital health app juli, aims to support people with depression via numerous evidence-based approaches, including mood tracking, medication reminders, positive affect journaling, data visualisation of sleep, activity, exercise, heart rate variability and behavioural activation recommendations about how to improve these parameters7. As such, it combines many of the elements that have been found to be effective in research-grade apps for depression. However, there has been less evaluation of consumer-grade apps in real-life practice. This is important as many popular health apps have not been scrutinised in the way new interventions traditionally would be. We hypothesised that individuals randomized to juli would have a greater reduction in depression symptoms at 8 weeks than those in an attention placebo control group.

## Methods

### Study design and participants

We conducted a fully remote pragmatic single-blind, placebo control randomised controlled trial of people with depression from anywhere in the world. Individuals we eligible for inclusion if they were aged 18 to 65, were English speaking, had access to an iPhone, and self-identified as having depression, with a score of 5 or more on the Patient Health Questionnaire 8-item version (PHQ-8) at baseline. We recruited participants via online adverts, social media posts and self-help groups for depression. Recruitment was from May 2021 until January 2023. All participants provided written informed consent via a consent form within the app. Ethical approval was from the University College London Ethics Committee (ID number 19413/001). The trial was registered on the ISRCTN registry ([https://doi.org/10.1186/ISRCTN12329547](https://doi.org/10.1186/ISRCTN12329547)).

### Randomisation and masking

We randomly assigned participants (1:1) to a full version of juli or an attention-placebo control version. Block randomisation was conducted within the app using automated code, with random block sizes between 4 and 8. The researchers and independent statisticians were masked to treatment allocation until the completion of the analysis.

### Procedures

Participants were either allocated to the full version of the juli app or an attention placebo control app group. If allocated to the full version of the app, participants were prompted to open the app each day via an automated alert at the time of their choosing. They were asked to rate how they were feeling on a scale using 5 emoji faces and a circumplex model with mood on the x-axis and energy on the y-axis8. Individuals were also able to track things that they considered as important contributors to their mood9. The app passively gathered information via smartphone and smartwatch sensors on sleep, activity, workouts, menstrual cycle, and heart rate variability on a daily basis and presented this data to the participant, showing associations with mood10. The app then provided recommendations about these parameters to guide healthy behaviours via behavioural activation11. The app includes a medication reminder function that can be set by the participants to improve medication adherence12. Participants were also encouraged to engage in positive affect journaling via the app (Figure 1)13. The juli app was designed by a psychiatrist and experts in gamification14. It is grounded in evidence-based treatments. Participants were guided towards all elements of the app, but did not have to engage with elements they did not like. If allocated to the control arm, participants were prompted to open the app each day via an automated alert and to rate how they were feeling on a scale using 5 emoji faces. Control participants did not receive any further intervention.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/11/2023.04.05.23288184/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2023/04/11/2023.04.05.23288184/F1)

Figure 1. Example juli screen shots

Baseline assessments and follow-up assessments at two, four, six and eight weeks were all completed within the app. Participants completed the PHQ-8 for depression symptoms and the 12-Item Short Form Health Survey (SF-12) for health-related quality of life at baseline. The PHQ is a widely used self-completed depression scale that is recommended by the Common Measures in Mental Health Science Governance board as one of a core list of research questionnaires that should be used by funded researchers. The PHQ-9 closely matches the DSM-IV criteria for a major depressive episode and may be more sensitive to change than other measures of depression, such as the Hamilton Rating Scale for Depression and the Beck Depression Inventory15. The PHQ-8 excludes the question about suicidality, which is preferred in studies where patient contact is remote, such as via digital technologies or telephone. Research indicates that the deletion of this question has little effect on the scale’s psychometric properties because this question is the least frequently endorsed item on the PHQ-9. Subsequently, the PHQ-8 has identical scoring thresholds for depression severity, with higher scores representing more severe depression16,17. The SF-12 is a self-reported health status measure18. Possible scores range from 0 to 100, with higher scores indicating better quality of life.

### Outcomes

The primary outcome was the total score on the PHQ-8 at 8 weeks.

The secondary outcomes were: 1) PHQ-8 score as a continuous outcome at 2, 4, 6 and 8 weeks in a repeated measures analysis, 2) PHQ-8 score as a binary outcome where remission is a score of <10 at 8 weeks, 3) remission at 2, 4, 6 and 8 weeks in a repeated measures analysis, 4) Difference in SF-12 physical and mental component scores at 8 weeks and, 5) SF-12 physical and mental component scores at 4 and 8 weeks in a repeated measures analysis.

We added post-hoc outcomes that included: 1) achieving a minimal clinically important difference (MCID) at 8 weeks defined by the effective dose 50 method, which accounts for baseline severity and is the smallest difference in PHQ-8 scores that are of perceived benefit19, 2) a worsening of depression, defined as a >20% increase in PHQ-8 from baseline.

### Statistical analyses

Our analysis plan was pre-printed ([https://discovery.ucl.ac.uk/id/eprint/10129350/](https://discovery.ucl.ac.uk/id/eprint/10129350/)) and included on the ISRCTN registry. We followed the Consolidated Standards of Reporting Trials (CONSORT) guidelines in reporting and analysing our data 17. Primary and secondary outcomes were described in the published protocol and on the ISRCTN registry before the study started.

The best estimate of a MCID in PHQ-8 is between 11% and 14%, with a standard deviation of 0.32-0.3820,21. 80% power at the two-sided 5% significance level requires a total sample size of 378. Allowing for 26% attrition22, we aimed to recruit 238 participants per arm. Power calculations were carried out using Stata.

The primary outcome was the difference in total PHQ-8 score at 8 weeks between control and intervention groups in a per protocol analysis. This was estimated with a linear regression model adjusted for baseline PHQ-8. We calculated the odds ratio of remission at 8 weeks (PHQ-8<10), achieving MCID and worsening of depression, adjusting for baseline severity using logistic regression. Repeat measures analyses were completed using linear or logistic mixed effect models adjusting for baseline severity.

We also examined all outcomes in a modified intention-to-treat analysis including all randomised participants with a complete baseline and week 2 PHQ-8. Missing outcome data were imputed by last observation carried forward and controlled multiple imputation23.

All analyses were completed by independent statisticians using Stata and R, who have no financial conflict of interest with the company providing the juli app.

## Results

### Baseline characteristics

We recruited 456 individuals who were retained in the trial for 8 weeks and formed the basis of our primary per protocol analysis (Figure 2). The majority of participants were female and had experienced depression for more than five years (Table 1). The majority were diagnosed by a physician and continued to be in regular or occasional contact with a doctor about their depression. The mean PHQ-8 score at baseline was 16.16 (standard deviation (SD) 4.71).

View this table:
[Table 1.](http://medrxiv.org/content/early/2023/04/11/2023.04.05.23288184/T1)

Table 1. Baseline characteristics

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/04/11/2023.04.05.23288184/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2023/04/11/2023.04.05.23288184/F2)

Figure 2. CONSORT diagram

### Per protocol analysis

At 8 weeks, participants in the intervention group had a mean PHQ-8 score of 10.78 (SD 6.26) and participants in the control group had a mean of 11.88 (SD 5.73). After accounting for baseline PHQ-8 score the intervention group had lower depression symptom scores at 8 weeks (−0.94, 95% confidence interval (CI) -1.87 to -0.22, p=0.045). The odds of being in remission by week 8 was higher in the intervention group after accounting for baseline depression severity (adjusted odds ratio 2.22, 95%CI 1.45 to 3.39, p<0.001). Participants in the intervention group were more likely to experience MCID (adjusted odds ratio 1.56, 95%CI 1.08 to 2.27, p=0.018) than those in the control group. Repeat measures analyses of these outcomes at 2, 4, 6 and 8 weeks suggest that this effect was maintained over time (Supplement Table 2). We found no effect of the intervention on SF-12 mental or physical component scores. The odds of experiencing worsening symptoms were similar in the intervention and control groups (adjusted odds ratio 0.83, 95%CI 0.38 to 1.81, p=0.633).

### Intention-to-treat analysis

Modified intention-to-treat analysis included 322 participants in the intervention group and 340 controls who completed baseline and week 2 PHQ-8 (Supplement Table 1). Baseline characteristics of participants in these groups were similar to the per protocol analysis. In the last observation carried forward dataset, there was no clear difference between intervention and control groups after accounting for baseline severity (β-coefficient -0.72, 95%CI -1.49 to 0.06)(Supplement table 2) at week 8. However, the repeated measures analysis of PHQ-8 scores found a positive effect of the intervention on depression symptoms. The odds ratios for remission at 8 weeks and remission in repeated measures were similar to the per protocol analysis, and suggested higher odds of remission in the intervention group. The odds ratios for MCID and worsening of symptoms were consistent with the per protocol results. Results from the multiple imputation dataset were consistent with the last observation carried forward findings.

## Discussion

We found a small reduction in depression symptoms in participants using juli for 8 weeks compared to an attention-placebo control. Both groups had a reduction in PHQ-8 scores over the course the trial. The mean intervention arm PHQ-8 score decreased by 37%, a reduction considerably greater than the MCID24. Participants allocated to juli were more than twice as likely to be in remission by 8 weeks, and more likely meet the threshold for MCID in depression symptoms.

The participants had a mean baseline PHQ-8 score consistent with a diagnosis of moderately severe depression, the majority had longstanding depression and were under the ongoing care of a physician. Therefore these participants were chronically unwell and potentially experiencing depressive symptoms that were difficult to treat. This is important as our participants differ from those included in many digital depression intervention RCTs in terms of severity and duration3.

The improvement in the attention-control group was of a similar magnitude to other placebo controlled trials of depression interventions25. In addition, even basic mood monitoring, as required by our control group, has been found to decrease depression symptoms26. This may have reduced difference observed between intervention and control groups, compared to an inactive control (such as waiting list).

Findings from meta-analyses support the use of digital technologies for the treatment of depression. However, many of the interventions reviewed are not available to patients, often because they are not available commercially or via healthcare providers. juli uses a combination of evidence-based approaches to support symptom reduction in depression and is available globally on Apple and Android formats. Our trial provides methodologically robust evidence of efficacy.

### Strengths and limitations

Our RCT has a number of strengths and limitations. We successfully recruited, screened, randomized, treated, and assessed a geographically dispersed sample of participants. We modified the juli app so that for RCT participants could consent, be randomized and take the baseline assessment within the app. This facilitated global recruitment, with low cost, in a pragmatic manner, with good external validity. However, this also meant that we lacked information on potentially important baseline characteristics, such as social determinants of health, as we did not want to overburden the participants. Despite this, balance in the recorded baseline characteristics after randomisation supports the assumption that randomisation was successful. Additionally, we did not include a large battery of outcome measures, which may have shed further light on our findings. For example, improvements in anxiety symptoms have been found in RCTs of interventions for depression21.

We pre-registered our analysis plan and did not deviate from this. Attrition was higher than we anticipated (49.78% from randomisation to week 8). The majority of the attrition occurred between randomisation and week 2, which is common in RCTs, including for depression apps. A recent meta-analysis of dropout rates found similar attrition after adjusting for publication bias22. We overcame this by recruiting until we had sufficient numbers who had completed the week 8 outcome measures and examined differences in completers vs non-completers.

We analysed participants using the app for 8 weeks in our primary analysis to focus on the effects of maintained use. Baseline characteristics were similar in the modified intention-to-treat and primary analysis, and results were consistent, suggesting little difference between completers and non-completers. Following the publication of our protocol and commencement of our RCT, newer research on the PHQ-8 MCID was published suggesting that we may have underestimated our power calculations19,24. We used these newer methods to derive a post hoc MCID outcome19. We used two imputation methods for the intention-to-treat analysis that make different assumptions23. Results from both methods did not differ.

We found that most participants were diagnosed by a doctor and remained under their care. This highlights the severity of our participants depression, but does not suggest that people were accessing juli because of an absence of traditional care. Individuals with no previous or ongoing healthcare may differentially benefit from digital technologies. Digital apps could be one solution to an overburdened healthcare system, particularly in groups or areas where accessing treatment for mental health conditions is challenging or stigmatising27. However, some people may still be unwilling or unable to use apps for health, despite their relative ease of access. The majority of participants were female, which reflects established differences in sex-specific rates of depression and help-seeking for depression28. Some of our participants (∼7%) identified as transgender or non-binary. This group is under-represented in research but has higher risk of depression and other mental disorders29. Our high uptake suggests that digital technologies may be a better way of engaging these populations.

An even bigger problem than the high attrition in RCTs of digital apps for depression is their lack of real-world retention and engagement. The proportion of 30-day retention is less than 10% across mental health apps6. The 30-day retention of juli users (non-RCT participants) is 25%. This highlights that engagement and the potential clinical benefit can be increased by methods employed by juli. However, it is unclear which specific features increase engagement. For example, a 2021 meta-analysis found no benefit of gamification30. More research is needed to understand which depression app features are integral to improving mental health symptoms.

## Conclusion

The juli app can reduce depressive symptoms within 8 weeks, with increased probability of remission and MCID. As such it represents a low-risk addition to the care package of people with mild to severe depression. Further research is required to determine the most cost-effective technical support processes to enhance engagement, and how juli could be implemented in current public health or clinical care models.

## Supporting information

Supplemental material [[supplements/288184_file02.docx]](pending:yes)

## Data Availability

All data produced in the present study are available upon reasonable request to the authors.

## Author Contributions

JFH conceived the study; JFH, AK, BD, BH and JS designed the study; AK, BD and BH collected the data; KE and MM analysed the data; JFH wrote the initial draft; all authors edited and approved the final manuscript.

## Conflicts of Interest

The current study was funded by juli Health. AK, BD, BH, JS and JFH are shareholders in juli Health. AK has received consultancy fees from juli Health and Wellcome Trust. BD, BH, JS and JFH are a co-founders of juli Health. JFH has received consultancy fees from juli Health and Wellcome Trust. KE and MM have no conflicts of interest. The funders played no part in the analysis of the data.

## Data Availability

All data produced in the present study are available upon reasonable request to the authors.

## Acknowledgements

AK is supported by the UK Research and Innovation (UKRI) Digital Youth Programme award [MRC project reference MR/W002450/1], which is part of the AHRC/ESRC/MRC Adolescence, Mental Health and the Developing Mind programme. JFH is supported by the UK Research and Innovation grant MR/V023373/1, the University College London Hospitals NIHR Biomedical Research Centre, and the NIHR North Thames Applied Research Collaboration.

## Footnotes

*   Additional author added.

*   Received April 5, 2023.
*   Revision received April 11, 2023.
*   Accepted April 11, 2023.


*   © 2023, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/)

## References

1.  Mathers, C. D. & Loncar, D. Projections of global mortality and burden of disease from 2002 to 2030. PLoS medicine 3, e442 (2006).
    
    
2.  Moshe, I. et al. Digital interventions for the treatment of depression: A meta-analytic review. Psychological bulletin 147, 749 (2021).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 

3.  Serrano-Ripoll, M. J., Zamanillo-Campos, R., Fiol-DeRoque, M. A., Castro, A. & Ricci-Cabello, I. Impact of Smartphone App–Based Psychological Interventions for Reducing Depressive Symptoms in People With Depression: Systematic Literature Review and Meta-analysis of Randomized Controlled Trials. JMIR mHealth and uHealth 10, e29621 (2022).
    
    
4.  Astafeva, D. et al. The efficacy of mobile phone-based interventions for the treatment of depression: A systematic meta-review of meta-analyses of randomized controlled trials. Psychiatria Danubina 34, 155–163 (2022).
    
    
5.  Hernández-Gómez, A., Valdés-Florido, M. J., Lahera, G. &Andrade-González, N. Efficacy of smartphone apps in patients with depressive disorders: A systematic review. Frontiers in Psychiatry, 1166 (2022).
    
    
6.  Baumel, A., Muench, F., Edan, S. & Kane, J. M. Objective user engagement with mental health apps: systematic search and panel-based usage analysis. Journal of medical Internet research 21, e14567 (2019).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 

7.  Firth, J. et al. The efficacy of smartphone-based mental health interventions for depressive symptoms: a meta-analysis of randomized controlled trials. World Psychiatry 16, 287–298 (2017).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/wps.20472&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 

8.  Posner, J., Russell, J. A. & Peterson, B. S. The circumplex model of affect: An integrative approach to affective neuroscience, cognitive development, and psychopathology. Development and psychopathology 17, 715–734 (2005).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0954579405050340&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16262989&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000233772000008&link_type=ISI) 

9.  Dubad, M., Winsper, C., Meyer, C., Livanou, M. & Marwaha, S. A systematic review of the psychometric properties, usability and clinical impacts of mobile moodmonitoring applications in young people. Psychological medicine 48, 208–228 (2018).
    
    
10. Colombo, D. et al. Current state and future directions of technology-based ecological momentary assessment and intervention for major depressive disorder: a systematic review. Journal of clinical medicine 8, 465 (2019).
    
    
11. Ekers, D. et al. Behavioural activation for depression; an update of meta-analysis of effectiveness and sub group analysis. PloS one 9, e100100 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0100100&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24936656&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 

12. Hammonds, T. et al. Adherence to antidepressant medications: a randomized controlled trial of medication reminding in college students. Journal of American College Health 63, 204–208 (2015).
    
    
13. Smyth, J. M. et al. Online positive affect journaling in the improvement of mental distress and well-being in general medical patients with elevated anxiety symptoms: A preliminary randomized controlled trial. JMIR mental health 5, e11290 (2018).
    
    
14. Wu, A. et al. Smartphone apps for depression and anxiety: a systematic review and meta-analysis of techniques to increase engagement. NPJ digital medicine 4, 20 (2021).
    
    
15. Löwe, B., Kroenke, K., Herzog, W. &Gräfe, K. Measuring depression outcome with a brief self-report instrument: sensitivity to change of the Patient Health Questionnaire (PHQ-9). Journal of affective disorders 81, 61–66 (2004).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0165-0327(03)00198-8&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15183601&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222281100008&link_type=ISI) 

16. Kroenke, K. et al. The PHQ-8 as a measure of current depression in the general population. Journal of affective disorders 114, 163–173 (2009).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2008.06.026&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18752852&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000264223900018&link_type=ISI) 

17. Shin, C., Lee, S.-H., Han, K.-M., Yoon, H.-K. & Han, C. Comparison of the usefulness of the PHQ-8 and PHQ-9 for screening for major depressive disorder: analysis of psychiatric outpatient data. Psychiatry investigation 16, 300 (2019).
    
    
18. Jenkinson, C. et al. A shorter form health survey: can the SF-12 replicate results from the SF-36 in longitudinal studies? xsJournal of Public Health 19, 179–186 (1997).
    
    
19. Bauer-Staeb, C. et al. Effective dose 50 method as the minimal clinically important difference: Evidence from depression trials. Journal of Clinical Epidemiology 137, 200–208 (2021).
    
    
20. Salaminios, G. et al. A randomised controlled trial assessing the severity and duration of depressive symptoms associated with a clinically significant response to sertraline versus placebo, in people presenting to primary care with depression (PANDA trial): study protocol for a randomised controlled trial. Trials 18, 1–14 (2017).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13063-017-2128-8&link_type=DOI) 

21. Lewis, G. et al. The clinical effectiveness of sertraline in primary care and the role of depression severity and duration (PANDA): a pragmatic, double-blind, placebocontrolled randomised trial. The Lancet Psychiatry 6, 903–914 (2019).
    
    
22. Torous, J., Lipschitz, J., Ng, M. & Firth, J. Dropout rates in clinical trials of smartphone apps for depressive symptoms: a systematic review and meta-analysis. Journal of affective disorders 263, 413–419 (2020).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2019.11.167&link_type=DOI) 

23. Cro, S., Morris, T. P., Kenward, M. G. & Carpenter, J. R. Sensitivity analysis for clinical trials with missing continuous outcome data using controlled multiple imputation: a practical guide. Statistics in medicine 39, 2815–2842 (2020).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F04%2F11%2F2023.04.05.23288184.atom) 

24. Kounali, D. et al. How much change is enough? Evidence from a longitudinal study on depression in UK primary care. Psychological Medicine 52, 1875–1882 (2022).
    
    
25. Jones, B. D. et al. Magnitude of the placebo response across treatment modalities used for treatment-resistant depression in adults: a systematic review and metaanalysis. JAMA Network Open 4, e2125531–e2125531 (2021).
    
    
26. Kauer, S. D. et al. Self-monitoring using mobile phones in the early stages of adolescent depression: randomized controlled trial. Journal of medical Internet research 14, e1858 (2012).
    
    
27. Bucci, S., Schwannauer, M. & Berry, N. The digital revolution and its impact on mental health care. Psychology and Psychotherapy: Theory, Research and Practice 92, 277–297 (2019).
    
    
28. Paykel, E. S. Depression in women. The British Journal of Psychiatry 158, 22–29 (1991).
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1991FM75800004&link_type=ISI) 

29. Newcomb, M. E. et al. High burden of mental health problems, substance use, violence, and related psychosocial factors in transgender, non-binary, and gender diverse youth and young adults. Archives of sexual behavior 49, 645–659 (2020).
    
    
30. Six, S. G., Byrne, K. A., Tibbett, T. P. & Pericot-Valverde, I. Examining the effectiveness of gamification in mental health apps for depression: systematic review and meta-analysis. JMIR mental health 8, e32199 (2021).