Abstract
Studies of developmental trajectories of depression are important for understanding its etiology. Existing studies have been limited by short time frames and no studies have explored a key factor: differential patterns of responding to life events. This paper introduces a novel analytic technique, growth mixture modeling with structured residuals, to examine the course of youth depression symptoms in a large, prospective cohort (N=11,641, ages 4-16.5). Age-specific critical points were identified at ages 10 and 13 where depression symptoms spiked for a minority of children. However, most depression risk was due to dynamic responses to environmental events, drawn not from a small pool of persistently depressed children, but a larger pool of children who varied across higher and lower symptom levels.
Depression is one of the most common, costly, and disabling mental disorders worldwide (GBD Disease and Injury Incidence and Prevalence Collaborators, 2017) with lifetime prevalence estimates of 11.7% among adolescents (Merikangas et al., 2011) and 16.6% among adults in the United States (Kessler et al., 2005). A large proportion of depression cases begin early in development, with one-third of people who experience depression having their first onset before age 21 (Zisook et al., 2007). Youth who experience depression are a particularly vulnerable group, as they are at an increased risk of suicide (Gould, Greenberg, Velting, & Shaffer, 2003), substance abuse (Lai, Cleary, Sitharthan, & Hunt, 2015), cigarette smoking (Chaiton, Cohen, O’Loughlin, & Rehm, 2009), and are more likely to experience recurrent episodes of depression as adults (P.M. Lewinsohn, Rohde, Seeley, Klein, & Gotlib, 2000; Rao et al., 1995). In fact, earlier onsets of depression are associated with worse illness course and outcome into adulthood (Costello, Foley, & Angold, 2006; Zisook et al., 2007). These findings underscore the need to understand the etiology and course of depression over time in order to prevent and treat the disorder as early in the lifespan as possible.
However, efforts to characterize the etiology and course of depression have had limited success due to depression’s heterogenous nature (van Loo, de Jonge, Romeijn, Kessler, & Schoevers, 2012). Here, heterogeneity refers to the nature of depression itself as a phenomenon with a variable symptom profile and variable developmental course regulated by time-invariant and time-varying processes. There are at least three major challenges related to depression heterogeneity that may lead researchers to arrive at erroneous, inconsistent, or conflicting findings regarding the etiology and course of depression.
The first major challenge is that depression is difficult to measure both in childhood and across the life course, because no single definition of “depression” exists. According to the Diagnostic and Statistical Manual of Mental Disorders fifth edition ((American Psychiatric Association, 2013), there are 227 different ways to meet the diagnostic threshold for a major depressive episode, owing to different combinations of affective, cognitive, and behavioral symptoms (I.R. Galatzer-Levy & Bryant, 2013). Most troublingly, depression phenotypes may differ by age because symptom type, frequency, and severity are more or less prominent at different ages (Carlson & Kashani, 1988; Garvey & Schaffer, 1994; Hegeman, Kok, van der Mast, & Giltay, 2012). Even with DSM-5’s wide inclusion criteria, the boundaries for what constitutes “depressed” may be too narrow, as subclinical depressive symptoms have been associated with substantial functional impairment (J. P. Allen, Chango, Szwedo, & Schad, 2014; Kessler, Zhao, & Blazer, 1997). Thus, depression has been alternately viewed not just as a binary construct (where people either have or do not have depression), but as a dimensional construct (everyone has some amount of depressive symptoms), or a combination of the two (Kotov et al., 2017). Reflecting these nuances, the most widely used scales to measure depressive symptoms vary in the affective, cognitive, and behavioral symptoms measured and consequently are only moderately correlated (Fried, 2017), meaning that a person’s level of “depression” could be assessed very differently depending on the instrument used to measure it.
The second challenge is that depression emerges and changes across development; that is, it is an age-dependent trajectory of symptoms. Depression is rare in childhood until age 11, but then has larger yearly increases in prevalence through age 15 (Avenevoli, Swendsen, He, Burstein, & Merikangas, 2015). This age-dependence suggests that age-related events likely influence the onset of the disorder. Such events may span everything from biological changes including puberty (Graber, Lewinsohn, Seeley, & Brooks-Gunn, 1997) to sociocultural factors like psychosocial stress (Hyde, Mezulis, & Abramson, 2008) and factors that lie in between, such as family history of depression (Kendler, Gatz, Gardner, & Pedersen, 2005). Moreover, developmental trajectories of depressive symptoms may be systematically different across children, meaning that different children may develop depression at different times via different pathways (Ellis et al., 2017; Shore, Toumbourou, Lewis, & Kremer, 2018).
The third challenge is that depression varies in how long symptoms last (chronicity) and how frequently symptoms return (recurrence). Some young people have minimal symptoms throughout their lives, whereas others have persistently high symptoms, and others have symptoms that fluctuate more dynamically across levels of severity (Hosenfeld et al., 2015; Lorenzo-Luaces, 2015). Prior studies in adolescents have found that the duration of a depressive episode can vary in length from 2 weeks to 10 years, and that one third of recovered 14 to 18-year-olds had a recurrence within 4 years (Lewinsohn, Clarke, Seeley, & Rohde, 1994). In this paper, we posit that chronicity and recurrence may in part be driven by renitent responding and its opposite, reversing responding (collectively called responding). We define renitent responding as the degree to which a deviation from one’s own average symptom levels perpetuates across time. Renitent responding is different from a similar construct, inertia (Kuppens et al., 2012), because renitent responding captures how a person’s symptoms respond to life events, rather than how symptoms cause themselves from one moment to the next. People with renitent responding may have high chronicity. When something causes them to become depressed, they remain depressed for long periods of time; when they feel better, this too perpetuates. In contrast to renitent responding, reversing responding characterizes people whose symptoms, once moved from their average level, tend to fluctuate up and down. People with reversing responding may transition more frequently from relapse to recurrence. Extreme levels of both renitent and reversing responding may be associated with increased psychopathology (Houben, Van Den Noortgate, & Kuppens, 2015). As posited by developmental psychopathology theories (Curran & Bauer, 2011), patterns of responding to any given depressive episode may be regulated by personal characteristics. For example, young adults with depression have been shown to employ coping behaviors that make their depressive symptoms better or worse (Bolger, 1990; Compas, Orosan, & Grant, 1993; Galatzer-Levy, Burton, & Bonanno, 2012). Such behaviors, including reaching out for social support or withdrawing from social interactions, can shape responding during a depressive episode. Vulnerability characteristics, including early childhood abuse, negative life events, parental psychopathology, and chronic physical disorders (I.R. Galatzer-Levy & Bryant, 2013; Ten Have et al., 2018), may increase people’s tendency for renitent responding, whereas sleep, healthy diet (Cairns, Yap, Pilkington, & Jorm, 2014), social support, and positive coping (Dumont & Provost, 1999) may lead to a balanced, middle ground between renitent and reversing responding. Unpacking how personal characteristics and environmental experiences shape renitent and reversing responding may be a key component to understanding depression heterogeneity (Franklin, Jamieson, Glenn, & Nock, 2015).
To address some of these heterogeneity issues, growth mixture modeling (GMM; (Jung & Wickrama, 2008; Muthen & Muthen, 2000) has emerged as an analytic technique to characterize developmental pathways. In brief, GMM seeks to identify different pathways to developing depression by classifying people into subgroups with similar patterns of symptom change across time. It is a latent variable approach because subgroup membership is not directly observed, but rather indirectly indicated by average symptom levels and trajectories of change over time. A major advantage of GMM over growth curve modeling is that it relaxes the assumption that all individuals are drawn from a single population, so inferences can be made within and across more meaningful subgroups (e.g., different risk factors can be explored for people with minimal symptoms versus those who develop depression at younger ages versus those who develop depression at older ages). In their recent review of the literature, Ellis and colleagues (2017) found 18 studies that examined depressive symptom trajectories in children and adolescents. The studies are difficult to parsimoniously summarize because findings on the numbers of classes and the development of depression over time varied according to the age ranges included (notably, only one study included children younger than 10), and whether community or clinical samples were used. However, despite methodological differences, all studies found support for multiple classes over a single class, and nearly all studies identified a group with persistently minimal symptoms, as well as a group with symptoms that increase over time (typically examining children aged 12 and older).
GMM has also been used to model developmental changes in internalizing symptoms, a closely related construct to depression. Internalizing symptoms consist of negative internal experiences such as anxiety, sadness, and somatic issues. These GMM studies have found that symptom trajectories are best characterized by four to six subgroups (Dekker et al., 2007; Edwards et al., 2014), capturing children with consistently low symptoms, persistently high symptoms, low symptoms that increased during the adolescent years, and high symptoms that somewhat decreased as children aged.
For example, using data from the Avon Longitudinal Study of Parents and Children (ALSPAC), Edwards et al. (2014) examined trajectories of internalizing symptoms from ages 4 through 11.5. They found that 75% of the sample had low symptoms across childhood, with the rest being divided into four subgroups consistent with previous analyses: a group with persistently high symptoms, a group whose symptoms increased across childhood, a group whose symptoms decreased across childhood, and a group whose symptoms rose to a moderate symptom level around age 8 and then tapered off by age 11. In another GMM analysis of ALSPAC, Rice et al. (2018) used latent class growth analysis, a special form of growth mixture modeling that assumes there is no within-class variability in growth factors, and similar to Edwards and colleagues, they characterized 74% of people as having persistently low risk, and the remainder as having early adolescent onset or late adolescent onset depressive symptoms.
Together, GMM studies of depression and internalizing symptoms demonstrate that subtyping depression by its developmental course can paint a clearer picture of qualitatively different courses of depressive symptoms, which can aid in generating hypotheses about their etiological determinants. Despite their utility, existing GMM studies of depression have been limited in three primary ways. First, all studies ignored the first major heterogeneity challenge described previously: the problem of measurement. These studies used only a single instrument over time and assumed that symptoms measured depression equally well at different ages. Given the heterogeneity in both symptoms and measures of depression, better measuring depressive symptoms by using multiple measures, and testing and accommodating how the measurement model changes over time, can strengthen conclusions and lead to more replicable results. Second, existing studies are limited because they mostly focus on a narrow range of ages, with only a few including children younger than 10 years old (Dekker et al., 2007; Edwards et al., 2014; Whalen et al., 2016). However, developmental pathways to depression likely begin much earlier than adolescence, as certain depressive symptoms have been shown to manifest as early as age 5 (Whalen, Sylvester, & Luby, 2017). Furthermore, childhood studies of depressive symptom trajectories have followed participants for an average of 3-6 years (Ellis et al., 2017), limiting the opportunity to track longer-term developmental patterns. Thus, the extent to which developmental profiles of depression symptoms vary from childhood through adolescence remains unknown. Third, with some recent exceptions (see review by Musliner, Munk-Olsen, Eaton, & Zandi, 2016), little work has linked these developmental trajectories to their etiological determinants. Linking discrete developmental trajectories to the social and biological factors that set these trajectories in motion is critical for understanding how risk factors differentially influence the development of depression, and can lead to identifying, as early on as possible, children most at risk. Finally, no existing studies have addressed the challenge that depression varies systematically over time due to things other than age. Because all existing studies have developed latent classes purely as a function of age, they have ignored systematic patterns of renitent and reversing responding. Identifying systematic differences in patterns of responding may help identify groups of people who cope more or less effectively with the life events that lead to their depression, and also elucidate malleable risk factors (Franklin et al., 2015).
Current Study
The primary goal of this study was to use GMM to identify subgroups of children with different depressive symptom trajectories and patterns of relapse and recurrence across two decades of life—from ages 4 through 16.5 years—using data from a unique ongoing longitudinal sample called the Avon Longitudinal Study of Parents and Children (ALSPAC). To simultaneously explore age-dependent trajectories and patterns of responding to changes in symptoms, we introduce a novel form of growth mixture modeling, the growth mixture model with structured residuals (GMM-SR). A secondary goal of this study was to explore how five of the most common and impactful social factors related to depression influenced membership in these subgroups. In the current study, we address the aforementioned limitations of previous GMM studies by modeling depression as a latent variable, measured by multiple instruments assessing depressive symptoms across time, and accounting for the possibility that symptoms functioned differently at different ages. We used this latent depression variable in a GMM-SR to characterize depression trajectories, including patterns of renitent responding and reversing responding, across this 13-year period. By jointly characterizing both age-related change and patterns of responding and identifying etiological factors for both, our aim was to better describe heterogeneity in the etiology of depression.
Methods
Sample and Procedures
ALSPAC sampled children born to mothers who were living in the county of Avon England (120 miles west of London) with estimated delivery dates between April 1991 and December 1992 (Boyd et al., 2012; Fraser et al., 2013). When the oldest children were approximately 7 years old, an attempt was made to bolster the initial sample with eligible cases who had failed to join the study originally. 15,454 eligible pregnant women agreed to participate and were enrolled in the study, which resulted in 15,589 fetuses and a sample size of 14,901 who were alive at 1 year of age. The study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool: www.bristol.ac.uk/alspac/researchers/our-data/. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committee.
The current study is based on an analytic sample of 11,641 children who had data on at least one measure of depressive symptoms, as described below. Children included in our analytic sample did not differ from those who were excluded with respect to sex. However, children who were excluded tended to be non-white (9.4% of excluded participants vs. 4.3% in analytic sample), have lower levels of maternal education (48% of excluded participants were less than O-level vs. 27% of included participants), and were more likely to be unmarried (40% vs. 22%).
Measures
Outcome: Youth depressive symptoms
We used three measures that tapped depressive symptoms, all reported by mothers. These measures were derived from two subscales of the Strengths and Difficulties Questionnaire (SDQ; Goodman, 1997), and the Short Mood and Feelings Questionnaire (SMFQ; Angold et al., 1995). Questionnaires were selected, in part, because they tapped the most relevant depression symptoms at each age. The latent variable, discussed below, is interpreted as the common cause of all symptoms across questionnaires.
Strengths and Difficulties Questionnaire
The current study used the SDQ to assess child emotional and behavioral problems. The SDQ is one of the most commonly used dimensional ratings of child psychopathology in epidemiology studies and has excellent psychometric properties (Ezpeleta, Granero, de la Osa, Penelo, & Domènech, 2013; Muris, Meesters, & van den Berg, 2003). The SDQ has five subscales containing five items each and is rated on a three-point scale (0=not true, 1=somewhat true, or 2=certainly true), capturing the child’s behavior and feelings within the past six months. The current study used two of the five SDQ subscales, child emotional problems and peer difficulties, which are often combined to represent “internalizing symptoms” (R. Goodman, 2001). Mothers completed the SDQ using mailed questionnaires at seven time-points when their child was ages 4, 7, 8, 9.5, 11.5, 13, and 16.5 years.
Short Mood and Feelings Questionnaire
The SMFQ is commonly used in epidemiological studies to assess depressive symptoms in adolescents and older children (Lundervold, Hinshaw, Sorensen, & Posserud, 2016; Patton et al., 2008). The SMFQ has been shown in diverse populations to differentiate clinic from community samples and correlates highly with questionnaire and interview measures of psychopathology and clinician-rated diagnoses of depression in early and late adolescence (Turner, Joinson, Peters, Wiles, & Lewis, 2014). Mothers completed the SMFQ by mailed questionnaire at three time-points when their child was 11.5, 13, and 16.5.
Social factors
We examined the role of five social factors as predictors of trajectory class membership: 1) maternal education (as a measure of socioeconomic position), 2) maternal psychopathology, 3) caregiver physical/emotional cruelty, 4) living in a single parent household, and 5) neighborhood disadvantage. These factors were chosen based on previous studies showing they precede the development of psychopathology in other samples (Petterson & Albers, 2001); (Felitti et al., 1998; S. H. Goodman & Gotlib, 1999), which suggests they also precede the development of related coping behaviors. Each determinant was measured via maternal report on one to four occasions from psychometrically validated standardized measures, a single item via mailed questionnaires, or via an in-person interview and was coded as present versus absent, as described in Appendix A. To ensure these factors preceded the measurement of depressive symptoms, all factors were measured at or before the age of four years. The current analysis focused on etiology rather than factors more dynamically related to inertia, such as sleep, diet, and social support, because including them would have considerably complicated the analysis and interpretation of an already complex statistical model.
Analytic Strategy
Analyses were conducted in four phases: (1) construction of a measurement model, (2) identification of a growth model, (3) enumeration and selection of a mixture model, and (4) exploration of the predictors of latent class membership.
1. Latent Depression Score Measurement Model
Item level confirmatory factor models were used to test and account for the possibility that questionnaire items had different relationships to the latent construct, depression, at different ages (i.e., measurement invariance).Based on results from these preliminary analyses (see Appendix B), which revealed that the SDQ behavioral items and several of the SMFQ items had age-related changes caused by factors other than depression, parcels were constructed, meaning all items were divided into subsets and each subset was averaged (Little, Cunningham, Shahar, & Widaman, 2002). There were five parcels: the SDQ-emotions subscale, SDQ-peer difficulties subscale, and three groups of items from the SMFQ. Because the preliminary analyses found that some symptoms had systematic changes over time unrelated to depression (scalar non-invariance), each parcel was constructed so that its items followed a similar pattern of scalar non-invariance. This accounted for these unrelated, systematic changes by releasing the scalar constraints across noninvariant waves. Factor score estimates, which represent the latent depression score at each time point, were obtained from this model to use in subsequent growth models (Curran et al., 2018).
2. Growth model
To examine the importance of renitent and reversing responding, latent curve models were compared to latent curve models with structured residuals (LCM-SR; (Curran, Howard, Bainter, Lane, & McGinley, 2014). In traditional latent curve models, depression scores are modeled only as a function of age. In LCM-SRs, response to symptom changes is modeled in the form of a structured residual. In this model, the residual is the difference between the observed depression score and the depression score predicted by each child’s growth trajectory. The residual represents the influence of unmodeled life events (i.e., the effect of everything except for age). The structured residual imposes an autoregressive structure whereby each residual was systematically related to the one immediately before it and immediately after it. Thus, when an unmodeled life event caused a child’s depression to go higher or lower than their own average level, the autoregressive terms represents how the child’s symptoms responded to that event. A positive autoregression represents renitent responding, or the tendency of a child’s depression level to remain above or below their predicted score from one time point to the next. A negative autoregression represents reversing responding, or the tendency of a child’s depression to systematically fluctuate across higher and lower scores. An autoregression of zero (the assumption of traditional growth models) means that the child’s depression was well-modeled by the trajectory alone and did not have any systematic carry-over due to unmodeled events.
Three phantom variables (i.e., placeholders with missing values for all participants) were included between the longest time intervals to reduce the influence of unequally spaced measures on the autoregressive parameter. Time was coded as age in months.
3. Growth mixture model class enumeration
Growth mixture models using up to 8 classes were fit to the data, with varying levels of constraints placed on the covariance structure. A wide variety of fit statistics were used to empirically determine the best model. The final model was chosen based on empirical fit to the data (using fit statistics), fit to the individual (visual inspection to ensure that latent classes accurately represented trajectories of the people within them), and the model’s ability to provide theoretically useful distinctions across classes (Masyn, 2013).
4. Predictors of latent class membership
The two-step method (Bakk & Kuha, 2018) was used to examine how the probability of latent class membership varied across social factors. The first of the “two-steps” refers to determining the best growth mixture model (as above). In the second step, the growth mixture portion of the model was held fixed, while class probabilities and the relationships between predictors and latent classes were freely estimated. To manage Type I error, predictors were block-tested using a Wald Test with the Benjamini-Hochberg correction (Benjamini & Hochberg, 1995) to probe contrasts between classes.
Missing data
Data missingness was handled in two stages. In the first stage, missingness on depression scores was adjusted using the saturated correlates method, which included information from baseline auxiliary variables in the measurement model (see Appendix C; (Collins, Schafer, & Kam, 2001). In the second stage, missing predictor variables were imputed based on the mixture model. Mixture model parameters were fixed to the estimates derived from the final mixture model and means and covariances of predictors were free to vary across classes (this procedure is analogous the two-step method; Bakk & Kuha, 2017).
Mplus Version 8.0 was used for all analyses (Muthén & Muthén, 1998-2010).
Results
Sample Characteristics
Descriptive statistics for predictor variables and depression factor scores are presented in Table 1. Mean depression scores did not change much over time, although variability increased with age. Nearly one in four children had mothers who met criteria for psychopathology between birth and 3 years old. Twelve percent of children lived in a household with a single parent for at least part of their first four years, and 13% of children met criteria for neighborhood disadvantage. Just 6% of children met criteria for emotional or physical abuse at least once through 3.5 years of age. Most predictor correlations were small and in the expected direction, suggesting these social factors represented distinct constructs (Table 1).
Measurement model
Fit of the final model with partial scalar invariance was excellent (CFI = .999, TLI = .998, RMSEA = .015) and superior to the fit of the model with full scalar invariance (χ2diff(5) = 970.50, p < .0001), indicating that some symptoms changed systematically over time due to something other than depression (full details of the measurement model are provided in Appendix B). Factor score distributions had moderate levels of skew and kurtosis. Because skew and kurtosis may cause GMMs to extract latent classes that characterize properties of the distribution rather than true population-wide heterogeneity (Curran & Bauer, 2003), factor scores were log-transformed, resulting in approximately normal distributions with skew and kurtosis values all < |1.0|.
Growth model
Fit statistics for growth models are shown in Appendix D. The best fit to the data was provided by a latent curve model with structured residuals. This model used a fourth-degree polynomial (e.g., quartic) with random linear, quadratic, and cubic variances, the quartic variance constrained at zero, three phantom variables to make measure spacing more equidistant, and stationarity of autoregressive parameters (i.e., the autoregressive term was equal over time). This model indicated that the sample mean did not change much from baseline scores (the peak and trough were 0.23 SD apart), but there was considerable variation in trajectories, and that residuals were moderately positively correlated (standardized AR parameter ∼ 0.38), meaning that the average child had a small-to-moderate amount of renitent responding.
Mixture model
Fit statistics for mixture models are presented in Appendix E. Information criteria were unhelpful in model selection because they all decreased without signs of leveling off with additional classes. Consequently, the final model was selected based on likelihood ratio tests (Lo-Mendell-Rubin adjusted likelihood ratio test), interpretability, and visual inspection of fit to individual level data. The best performing model based on these criteria was a six-class model with intercept and slope variances constrained equal across classes, quadratic and higher degree polynomial variances constrained at zero, a diagonal covariance matrix (i.e., the covariation between intercept and slope was fully explained by latent class membership), and an autoregressive parameter that varied across classes.
Latent class trajectories are shown in Figure 1a and prototypical patterns of responding, generated from a parametric bootstrap, are shown in Figure 1b. To ease interpretation, Figure 1a includes a reference line at the 90th-percentile depression score, which is often used as a rough proxy for a clinical diagnosis of depression (Zavos, Rijsdijk, Gregory, & Eley, 2010). Raw trajectories of individuals in each latent class are shown in Figure 2. Classification precision of individuals into latent classes was moderately low (entropy = .67), perhaps because autoregressive terms have considerable error at the individual level with just seven observed timepoints. However, in samples as large as the present one, autoregressive terms are still accurately estimated at the group level (Schultzberg & Muthen, 2018).
Classes were more clearly separated by their trajectories and patterns of responding than by baseline symptom levels. Developmental trajectories varied considerably across classes. Approximately half of all children experienced stable, low symptoms with moderate levels of renitent responding across childhood and adolescence (Minimal Symptoms class, 48.7%, n= 6543, AR parameter=0.43). The next two largest classes experienced moderately high symptoms and were primarily differentiated by their patterns of responding. The High and Renitent class (30.4%, n = 3129) exhibited the strongest levels of renitent responding in the sample (AR parameter = 0.90), meaning that if a life event caused them to move above or below their typical depression level, they tended to stay there for very long periods of time. In contrast to the High and Renitent class, the High and Reversing class (8.1%, n = 618) had nearly the same mean trajectory, but experienced high levels of reversing responding (AR parameter = -0.50), as they experienced sharp oscillations around their estimated mean levels. The Childhood Decrease (7.5%, n = 756) class experienced high symptoms at an early age, but these symptoms came down gradually over the course of early and middle childhood and remained at a low level over adolescence; their symptom changes had minimal carry-over across waves (AR = 0.05). The final two classes were separated by their trajectories over middle childhood and adolescence. The Late Childhood Peak class (3.0%, n = 334) experienced an increase in symptoms, primarily through middle childhood, then a large decrease in symptoms over adolescence; this class was also characterized by reversing responding. The Adolescent Spike class (2.3%, n = 261) remained relatively stable at a low-to-moderate level of symptoms until adolescence, at which point their symptoms spiked to the highest average levels observed in the study.
Predictors of class membership
Predictors of class membership are presented in Table 2, and corresponding probabilities of class membership across predictors are shown in Figure 3. Likelihood ratio tests indicated that each variable was a statistically significant predictor of class membership. Generally, maternal psychopathology, neighborhood disadvantage, childhood abuse, and female sex were associated with increased risk of belonging to a group other than the minimal symptoms group. Maternal psychopathology conveyed the highest risk, with the probability of being in the Minimal Symptoms class dropping by more than half from 57% for those with no maternal psychopathology to 24% for those with maternal psychopathology. The pattern of findings for maternal education were somewhat more complex. Higher levels of maternal education were associated with lower odds of being in the High and Renitent group as compared to any other group, but—counterintuitively—higher levels of maternal education were also associated with increased risk of being in any group as compared to the Minimal Symptoms group (excluding the High and Renitent group).
Two different sets of comparisons were particularly meaningful. The first was between the High and Renitent versus High and Reversing classes, because those classes both had high symptom levels and were distinguished primarily by differences in response to symptom changes. The second was between Late Childhood Peak versus Adolescent Spike classes, because those classes were distinguished primarily by the timing of a spike in depression symptoms. The High and Renitent group was differentiated from the High and Reversing group only by maternal education, with higher levels of maternal education associated with higher probabilities of being in the High and Reversing group. The Late Childhood Peak and Adolescent Spike classes were differentiated only by sex; males had three times the odds of females of spiking in adolescence as compared to late childhood.
Sensitivity analysis
To understand the effect of model specification on latent class structure, the final model was compared to analogous models with the structured residuals held class invariant, and with the structured residuals removed entirely. Similar developmental trajectory patterns across both parameterizations were found, but latent classes without inertia had more extreme scores. Notably, when excluding structured residuals from the model, the group with consistently high symptom levels was smaller (11.1% versus 30.7%) and plateaued at more extreme scores (above the 90th percentile), suggesting that accounting for renitent and reversing responding functioned to explain the most extreme scores. A side-by-side comparison of the two models is available in Appendix F.
Discussion
This study characterized subgroups of children by their differing developmental trajectories of depression—and differing patterns of responding to life events—from age 4 through 16.5, in a large, prospective, population-based sample. To our knowledge, this was the first GMM study to disentangle age-dependent and age-independent patterns of symptom change, via the concepts of renitent and reversing responding. This study also included a 13-year follow-up period, a wider age span than nearly all previous GMM studies of depression. Overall, the primary findings included identification of potentially sensitive periods for developing depression in both early childhood and adolescence as well as the importance of patterns of responding to life events for explaining the highest levels of childhood depression. Results also suggested robust but complex relationships between social factors of depression and latent class.
One striking difference from previous research is that the classes in this study with consistently high symptom levels were larger in size, but had lower average symptom levels than observed elsewhere, indicating that those at risk for severe depression were drawn from a relatively wide pool of children who experienced varying periods of higher and lower symptoms rather than a smaller pool of children with consistently high symptoms. With a few exceptions, average depression scores in even the most extreme latent class were well below the 90th percentile, which is typically used as a rough proxy for a clinical diagnosis of depression (Zavos et al., 2010). A sensitivity analysis using traditional GMM methods, which excluded the autoregressive parameter that captured renitent and reversing responding (Appendix D), found latent classes similar to those identified in previous ALSPAC work (Edwards et al., 2014; Rice et al., 2019), with 75% of children in a minimal symptoms group, and those in the higher symptoms groups showing more extreme scores than the high symptoms groups observed in the main analyses. Thus, explicitly modeling renitent and reversing responding leads to the conclusion that the highest levels of depression were better explained by transient states than persistent traits.
Two extremes of responding were evident in these data. The High and Renitent and High and Reversing groups had nearly identical trajectories but were clearly separated by their autoregressive parameters. For the High and Renitent group, experiencing a perturbation above their typical depression level at any given observation occasion was strongly predictive of remaining above their typical level at the next observation. Consequently, these children spent longer continuous periods of time either above or below their expected averages. The High and Reversing group had more frequent symptom oscillations (AR= -0.50), meaning that if they were below their typical level at one time point, they were likely to be above it at the next time point and vice versa. Together, these two classes can be thought of as comprising children with dynamic risk for depression, with the High and Renitent group more likely to experience longer episodes, and the High and Reversing group more likely to experience frequent episodes. This risk is termed dynamic because it occurred independently of child age, and was instead driven by unmodeled, trait-level processes that could include both biological factors and characteristic ways of coping with life events. These results are consistent with daily monitoring studies that have found that people who experience both highly unstable emotions (strong negative autocorrelations) and highly inert emotions (strong positive autocorrelations) had overall lower psychological well-being (Houben et al., 2015). Such processes have been thought to reflect deficits in coping, to be characteristic of people who experience multiple bouts of depression, and to serve as modifiable targets for intervention (Waugh & Koster, 2015). Given that the average gap between depressive symptoms measures in the current study was 17 months, both renitent and reversing responding may better represent difficulty adjusting to life events than patterns of daily coping, although these two processes are strongly related (Kanner, Coyne, Schaefer, & Lazarus, 1981).
The present study builds on the literature by showing that certain periods of development have increased importance in shaping the course of depressive symptoms. In contrast to the dynamic risk groups describe above, there appear to be two inflection points of age-specific risk for a minority of the sample, one occurring around age 10 and peaking at age 13 (Late Childhood Peak group; 3.0% of the sample), and another beginning after age 13 and peaking at age 16.5 (Adolescent Spike group; 2.3% of the sample). It is notable that these inflection points correspond with a heightened period of internal transitions (i.e., puberty) and external transitions (i.e., social environmental changes, such as high stakes academic testing and more complex friendships and peer expectations) that previous studies have linked to the onset of depression (Graber et al., 1997). Sex differences in the present sample correspond closely to different pubertal timing between girls and boys. Girls, who begin to experience puberty as early as age 10, were much more likely to be in the Late Childhood Peak group that experienced increasing symptoms between ages 10-13. Boys, who begin to experience puberty several years later—as early as age 12—were much more likely to belong to the Adolescent Spike group that experienced increases between ages 13-16. These results are consistent with those of Kwong and colleagues (2019) who, analyzing ALSPAC data with a different methodology, found that ages 13 and 16 had the peak velocity of changes in depression symptoms for boys and girls, respectively (Kwong et al., 2019). The current results suggest that the changes that led to these peak velocities were set in motion 3-5 years prior to the peak velocities being achieved and may have been driven by a small subset of the population. Moreover, the combined size of these two age-specific risk groups was just 5.3% of the sample, whereas 38.5% of the sample experienced dynamic risk, suggesting that most of the risk for depression was driven by dynamic responding to environmental events.
In contrast to the sharp inflection points described above, the Childhood Decrease group had high levels of symptoms early in childhood that decreased over development and into adolescence. Compared to the High and Renitent group, children in this class were more likely to be male, have more educated mothers, and less likely to have maternal psychopathology. The contrast to the High and Renitent group may reflect early risk that was buffered by protective factors of higher maternal education and lower history of maternal psychopathology.
Three of the six social factors examined in this study—maternal psychopathology, neighborhood disadvantage, and child abuse—were strongly related to differences between the lowest symptoms class (Minimal Symptoms) and the highest symptom classes (High and Renitent, High and Reversing), with odds ratios ranging from 2 to 5. Coming from a single parent household generated a similar pattern of effects, but with a much smaller effect size. This pattern of findings is consistent with the stress sensitization hypothesis, that exposure to early childhood adversity increases the probability of developing depression in response to any given stressor (Hammen, Henry, & Daley, 2000; McLaughlin, Conron, Koenen, & Gilman, 2010).
The role of maternal education in determining developmental trajectories of depressive symptoms appears complex. On the one hand, children whose mothers had higher levels of education were less likely to belong to the High and Renitent group than any other group, indicating a protective effect consistent with previous research (e.g., (J. Allen, Balfour, Bell, & Marmot, 2014; Meltzer, Gatward, Goodman, & Ford, 2003). On the other hand, children with higher maternal education also had lower odds of belonging to the Minimal Symptoms group than any other group (except for the High and Renitent group), indicating that although they were protected from the High and Renitent group, they faced higher risks of experiencing more depressive symptoms in each of the sensitive periods identified by this study.
One possibility for explaining the protective effects of education—that is, the movement from the High and Renitent class to other classes with less consistent depressive symptoms—is that education level, an indicator of social class, captures the level of access to and quality of resources families can use to buffer their children’s risk of depressive symptoms. Notably, maternal education was the only social determinant to differentiate the High and Renitent class from the High and Reversing class. Although it is important to avoid overinterpreting this finding because maternal education was generally predictive of belonging to any group other than the High and Renitent group, one possibility is that low socioeconomic status predicts poorer child socio-emotional development and adaptive functioning (Bradley & Corwyn, 2002), and these poor socio-emotional skills lead to maladaptive coping behaviors that increase children’s vulnerability to depression the face of stressful life events and other risk factors.
Why children with higher maternal education were less likely to be in the Minimal Symptoms class than any other class is puzzling. One possibility is that given the relative lack of social mobility and perceived importance of education for class mobility (Goldthorpe, 2016), children of more educated parents feel more pressure to achieve academically from a young age, leading to higher levels of stress. Alternatively, parents with higher levels of economic disadvantage may be facing myriad other stressors and have less opportunity to identify smaller shifts in their children’s emotions and behavior as they emerge over time.
Limitations
The present study was limited by examining maternal report of symptoms only. While latent variable models were used to attempt to combine information across mother and child reports of symptoms, the only wave of overlap—age 16.5—had such low correlations between parent and child measures (r = .36) that they did not appear to measure the same construct. Consequently, the very high odds ratios between maternal psychopathology and risk of being in any class besides Minimal Symptoms must be tempered by the possibility that depressed mothers may have been more likely to rate their children as depressed.
An additional limitation of this study is the length of time between observations. The theory of renitent and reversing responding draws from clinical observation of recurrent major depression. More frequent measurement occasions would be ideal for understanding the duration of depression episodes with more precision. Nonetheless, the present findings illustrate that patterns of responding to life events that transact over longer periods of time have important implications for how depression functions across childhood and adolescence.
Finally, this sample is limited in terms of geographic diversity. Given high levels of study enrollment from families in the Bristol area, it is a strongly representative sample; however, the extent to which the sensitive period findings will generalize outside of the UK, with its particular social class structure and socially-specific developmental milestones, is unclear. Cross-cultural replication may be helpful for disentangling the effects of age-related sensitive periods from those related to socially imposed developmental milestones, such as educational testing.
Conclusions
This study captured the widest age range to-date in a GMM study of childhood and adolescent depression and was the first GMM study to simultaneously explore systematic age-dependent and age-independent patterns of changes in depression. It identified critical points of age-specific risk at ages 8 and 13 when depression symptoms began to grow for a minority of the sample, whose symptoms would peak 3-5 years later. Most importantly, risk for experiencing the highest levels of depression was neither age-specific nor drawn from a small pool of persistently depressed children, but instead drawn from a larger pool of children who moved dynamically between higher and lower symptom levels depending on how they responded to life events. From a prevention perspective, these findings suggest that early interventions targeting broad coping processes rather than indicated interventions targeting age-specific developmental events may be most effective for decreasing the burden of depression across the life course.
Data Availability
Data came from Avon Longitudinal Study of Parents and Children (ALSPAC). To explore data and samples, the study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool: www.bristol.ac.uk/alspac/researchers/our-data/. To request existing data, ALSPAC encourages researchers to: 1. Please read the ALSPAC access policy, which describes the process of accessing the data and samples in detail, and outlines the costs associated with doing so. 2. You may also find it useful to browse ALSPAC's fully searchable research proposals database, which lists all research projects that have been approved since April 2011. 3. Please submit your research proposal for consideration by the ALSPAC Executive Committee. You will receive a response within 10 working days to advise you whether your proposal has been approved.
https://proposals.epi.bristol.ac.uk/?_ga=2.149616694.1895310552.1574350382-1596101750.1574350382
https://proposals.epi.bristol.ac.uk/?_ga=2.149616694.1895310552.1574350382-1596101750.1574350382
Acknowledgments
This research was funded by the National Institute of Mental Health of the National Institutes of Health under Award Numbers K01MH102403 and 1R01MH113930 (Dunn). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. The UK Medical Research Council and the Welcome Trust (Grant ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. A comprehensive list of grants funding is available on the ALSPAC website (http://www.bristol.ac.uk/alspac/external/documents/grant-acknowledgements.pdf). This publication is the work of the authors who will serve as guarantors for the contents of this paper.