What if risk factors influenced the variability of health outcomes as well as the mean? Evidence and implications

David Bann; Tim J Cole

doi:10.1101/2021.03.30.21254645

Abstract

Risk factors may affect the variability as well as the mean of health outcomes. Understanding this can aid aetiological understanding and public health translation, in that interventions which shift the outcome mean and reduce variability are preferable to those which affect only the mean. However, few statistical tools routinely test for differences in variability. We used GAMLSS (Generalised Additive Models for Location, Scale and Shape) to investigate how multiple risk factors (sex, childhood social class and midlife physical inactivity) related to differences in health outcome mean and variability. The 1970 British birth cohort study was used, with body mass index (BMI; N = 6,025) and mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale; N = 7,128) as outcomes. For BMI, males had a 2% higher mean than females yet 28% lower variability. Lower social class and physical inactivity were associated with higher mean and higher variability (6% and 13% respectively). For mental wellbeing, gender was not associated with the mean while males had 4% lower variability. Lower social class and physical inactivity were associated with lower mean yet higher variability (−7% and 11% respectively). This provides empirical support for the notion that risk factors can reduce or increase variability in health outcomes. Such findings may be explained by heterogeneity in the causal effect of each exposure, by the influence of other (typically unmeasured) variables, and/or by measurement error. This underutilised approach to the analysis of continuously distributed outcomes may have broader utility in epidemiological, medical, and psychological sciences.

Introduction

What is health? Contrary to simplistic notions of it being defined as the absence of disease, it is now increasingly understood that most outcomes of public health significance are continuous in nature.¹ This applies to both physical and mental health outcomes.^2,3 The use of binary endpoints, while having utility in clinical applications, should not hinder investigation of the influences of health outcomes which are ultimately continuous. Analysing the determinants of health using continuous rather than binary outcomes is beneficial both practically (with more statistical power and less information loss) and substantively (greater aetiological understanding).

Studies into the effect on continuous outcomes of exposures, be they risk factors in observational studies or interventions in randomised trials,³ typically focus on mean differences in the outcome, using linear regression. However linear regression assumes homoscedasticity, i.e. that the variability of the outcome is unrelated to the exposure, and often this is not the case. It is possible to extend the regression analysis to model the variability as well as the mean, and this has benefits in terms of not only the model’s fit but also its interpretation. If for example the intervention in an intervention trial can be shown to reduce variability in the outcome, this could reasonably be viewed as evidence of intervention success⁴ that is independent of the intervention’s effect on the mean. Treatment for refractive vision errors—glasses, contact lenses, and/or corrective surgery—seeks to improve vision by shifting individuals towards a specified standard (eg, 20/20 vision).⁵ Successful treatments alter the mean refraction, but they are even more successful if they also reduce the substantial variability in refraction arising from the mix of short- and long-sighted individuals.

Similarly, obesity interventions aim to reduce body mass index (BMI) and shift treated individuals from overweight (25-30 kg/m²), obese (>30 kg/m²) or severely obese (>45 kg/m²) to the normal range (20-25 kg/m²). However here the effect of the intervention on variability is often to increase it. Even if not formally tested, visual comparisons of outcome distributions of some influential trials suggest that weight loss interventions increase rather than reduce BMI variability,⁶ presumably since they are effective in some but not all participants.

Understanding if and how risk factors influence variability in health outcomes has aetiological significance, consistent with the goal of epidemiological science to understand the distribution of health.⁷ Risk factors could feasibly affect outcome variability yet not affect the mean—for example, one study found that breastfeeding was not related to mean childhood BMI, yet was related to lower childhood BMI variability.⁸ Identifying associations between risk factors and outcome variability may also be useful to identify the absence or presence of heterogeneity in susceptibility to interventions or risk factors. Indeed, the finding that substantial increases in mean BMI in recent decades have been matched by increases in BMI variability indicates that there may be differential susceptibility to the obesogenic environment.^9,10

Recent studies in biological,^11,12 environmental¹³ and economic science^14,15,16 have begun to examine how risk factors relate to the distribution of the outcome of interest. However, there have been few epidemiological applications of this approach to date;¹⁷ and fewer still that provide explanations for such findings, which are essential if such methods are to have utility. Indeed, one recent study which investigated the association between mental health symptoms and lower income explicitly avoided interpretation of its findings on variability, focusing instead on issues relating to the application of such methods.¹⁵

Regression methods that allow variability to be modelled are uncommon. One particular method, Generalised Additive Models for Location, Scale and Shape (GAMLSS)¹⁸ has become the standard for constructing growth reference centiles,¹⁹ where the aim is to model the outcome’s distribution as a function of age. It defines the distribution in terms of distribution moments, i.e. the mean, variance, and optionally skewness and kurtosis. This allows for factors influencing the higher moments to be identified in just the same way as for the mean, and it provides a simple and elegant interface for modelling variability in epidemiology.

Another arguably underutilised²⁰ and related statistical approach to investigating risk factors for continuous outcomes is quantile regression. Recent epidemiological studies using this method have found that risk factors for higher BMI—particularly lower social class and physical inactivity—have sizably larger effect sizes at higher BMI centiles.^21,22 This has potentially important policy implications—risk factors which have larger effects amongst those at highest health risk are likely to have a more favourable effect on population health than alternatives which do not.²¹ However, the reason for this phenomenon is not yet understood—it may be logically consistent with results of GAMLSS analyses in which risk factors influence outcome means, variability and/or skewness.

Our primary aim is explore factors affecting outcome variability in an epidemiological context. We investigate whether and how several established risk factors—sex, socioeconomic circumstances, and physical inactivity²³—relate to differences in outcome mean and variability. We use two different continuous outcomes as examples, an indicator of adiposity (body mass index, BMI) and mental wellbeing. A second aim is to investigate whether these risk factors are additionally related to the skewness of the outcome distribution. Finally, we investigate the same risk factor and outcome associations using quantile regression models.

Methods

Study sample

The 1970 British birth cohort study (1970c) consists of all 17,196 babies born in Britain during one week of March 1970, with 10 subsequent waves of follow-up from childhood to midlife.²⁴ At the most recent wave (46 years), 12,368 eligible participants (those alive and not lost to follow-up) were invited to be interviewed at home by trained research staff—8,581 participants provided at least some data in this wave. At all waves, informed consent was provided and ethical approval granted.

Health outcomes

We selected two outcomes in midlife which capture different dimensions of health and are continuously distributed: adiposity (BMI), and mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS)). BMI was measured at 46 years, and wellbeing at 42 years.²⁵ WEMWBS consists of 14 positively worded items—such as “I’ve been feeling optimistic about the future” and “…feeling cheerful”—measured on a five-point Likert scale, which are summed to give a total well-being score ranging from 14 to 70 (highest well-being).²⁶

Risk factors

We chose three risk factors across different domains—each of them likely to independently influence health outcomes.²³ They were coded as binary variables to simplify comparison of descriptive and GAMLSS results: sex (female/male), socioeconomic position (social class at birth; coded as non-manual/manual), and a behavioural risk factor (reported physical activity at 42 years; reported days in which the participant took part in exercise for 30 mins or more in a typical week ‘working hard enough to raise your heart rate and break into a sweat’, coded as active (≥1 days)/inactive (0 days)). We examined if the binary split of risk factors influenced the inferences drawn—additional analyses were conducted with them coded instead as categorical variables (social class in 6 categories and physical inactivity from 0-7 days).

Analytical strategy

To visually inspect the outcome distributions and their differences across risk factor groups, we first plotted separate kernel density estimates alongside relevant descriptive statistics (mean, standard deviation, and coefficient of variation (SD/mean)). We then used GAMLSS¹⁸ separately with each outcome, to formally investigate whether risk factors were associated with 1) differences in mean outcome and 2) differences in outcome variability.

GAMLSS requires the underlying distributional family to be specified—in primary analyses we used the normal distribution (NO family), where the mean and standard deviation are modelled. By convention, variability is modelled on the log scale, where differences are fractional and can be multiplied by 100 and interpreted as percentage differences in variability.²⁷ Mean differences were also expressed as percentage differences to aid comparability across outcomes and model estimates.

We also used an alternative distribution family: the Box-Cox Cole and Green (BCCG family).²⁸ This enabled us to examine the median (rather than the mean) and generalised coefficient of variation (rather than the SD), and in addition whether risk factors were associated with the skewness of the outcome distribution. Skewness here is quantified in terms of the Box-Cox power; the power transformation required to transform the outcome distribution to normality, where 1 indicates normal, 0 log-normal and −1 inverse normal. A smaller Box-Cox power corresponds to more right skewness.

To aid comparison of descriptive statistics and model estimation results, we first conducted analyses adjusting for each risk factor alone. We then adjusted for the risk factors jointly.

Separately we fitted conditional quantile regression models to estimate risk factor and BMI associations at the lower, middle and upper quartiles of the outcome distribution, i.e. the 25^th, 50^th and 75^th centiles.

All analyses were conducted using Stata v16 (StataCorp, Texas), and the gamlss package version 5.2-0 in Rstudio 1.4.²⁹

Results

6,025 participants had valid data for BMI and all risk factors, and 7,128 for WEMWEBS. Mean BMI was 28.4 (SD = 5.5), and mean WEMWEBS 49.2 (8.3). Higher BMI was weakly associated with lower wellbeing (r = −0.08). BMI was moderately right-skewed (Figure 1) and WEMWEBS left-skewed (Figure 2). Visual and descriptive comparisons of the BMI and wellbeing distributions by risk factor (Figures 1 and 2) suggest that differences in the outcome mean and variability are not always in the same direction.

Figure 1.

Kernel density plots for body mass index, stratified by risk factor group. Note: CoV = coefficient of variation (SD/mean).

Figure 2.

Kernel density plots for mental wellbeing, stratified by risk factor group. Note: COV = coefficient of variation (SD/mean).

GAMLSS results for the binary risk factors are shown in Tables 1 and 2, with the results using the extra risk factor categories in Supplementary Tables 1 and 2. Associations were similar in the unadjusted and mutually adjusted analyses, so the former are described below.

View this table:

Table 1.

Risk factors in relation to body mass index: differences in mean, variability and skewness estimated by GAMLSS (n = 6,025)

View this table:

Table 2.

Risk factors in relation to mental wellbeing (WEMWEBS): differences in mean, variability and skewness estimated by GAMLSS (n = 7,128)

Body mass index

Males had higher mean BMI yet lower variability than females—see Figure 1 and Table 1. The SD for BMI was lower in males (4.63) than females (6.10) i.e., a 27.6% difference (difference in log(SD) *100). This matches the estimate obtained from GAMLSS—males had 27.6% (SE: 1.8%) lower variability than females (Table 1).

In contrast, lower social class and physical inactivity were both associated with higher mean BMI and higher BMI variability (Figure 1 and Table 1). Those from lower social class households had 3.9% (0.5%) higher BMI than those from non-manual classes, and 6.2% (1.9%) higher variability.

Physically inactive participants had 3.3% (0.6%) higher mean BMI and 13.5% (2.0%) higher variability.

The GAMLSS results were similar with the BCCG distribution family rather than the NO family (Table 1). That is, risk factors associated with higher mean BMI and higher SD were also associated with higher median BMI and higher COV. Male sex and lower social class were both associated with less skewness of the BMI distribution; the Box-Cox power was 0.5 (0.1) higher in males and 0.4 (0.1) higher for manual social class. Physical activity was not associated with outcome skewness.

Mental wellbeing – Warwick-Edinburgh Mental Wellbeing Scale

There was little evidence of sex differences in mean wellbeing, while males had marginally less variability than females—4.0% (1.7%) lower. Lower social class and physical inactivity were both associated with lower mean yet higher variability (Figure 1 and Table 2). Those from lower social class households had a 2.8% (0.4%) lower mean yet 7.3% (1.8%) higher variability. Physically inactive participants had 5.3% (0.48) lower mean yet 10.9% (1.9%) higher variability. These findings were similar in mutually adjusted analyses (Table 2).

The results were similar with the BCCG distribution family (Table 2). There was evidence suggesting that lower social class was associated with less skewness in the wellbeing distribution; sex and physical activity were not associated with outcome skewness.

Comparison with quantile regression findings

For BMI, the associations of lower social class and physical inactivity were stronger at upper quantiles (Table 3; e.g., manual social class had 3.37 (0.35) higher BMI at the 25^th centile, 4.06 (0.54) at the median, and 4.82 (0.69) at the 75^th); estimates at higher centiles were also estimated less precisely than at lower centiles (larger SE). In contrast sex differences were present at lower centiles but absent at the 75^th. These findings corresponded with those from GAMLSS using the BCCG family, in which all BMI centiles are plotted by risk factor group (Figure 3). This comparison highlights the utility of GAMLSS—risk factor differences in the mean, variability, and skewness can each be quantified and thus visually depicted.

View this table:

Table 3.

Risk factors in relation to body mass index (BMI) and mental wellbeing (WEMWEBS): percentage differences at multiple points of the outcome distribution estimated by quantile regression

Figure 3.

Plots of body mass index (BMI) centile by risk factor group. Plotted lines are calculated using GAMLSS estimation results of the entire outcome distribution; points at the 25^th, 50^th, and 75^th centiles are estimated using quantile regression models.

For WEMWEBS, the associations of lower social class and physical inactivity were also stronger at lower quantiles (Table 3), yet had larger SE. Sex was not associated with WEMWEBS at any centile. These findings corresponded with those from GAMLSS (Figure 4).

Figure 4.

Plots of mental wellbeing (Warwick-Edinburgh Mental Wellbeing Scale, WEMWEBS) centiles by risk factor group. Plotted lines are calculated using GAMLSS estimation results of the entire outcome distribution; points at the 25^th, 50^th, and 75^th centiles are estimated using quantile regression models.

Discussion

Using an underutilised and little-known analytical approach (GAMLSS), we present empirical evidence to support the idea that risk factors can relate to sizable differences in outcome variability, in addition to differences in the outcome mean. Females had higher variability in BMI and mental wellbeing than males; lower social class and physical inactivity were each associated with higher variability in both BMI and mental wellbeing, despite having different directions of association with the mean (higher BMI yet lower mental wellbeing).

Our findings add to an emerging literature which has investigated associations between risk factors and outcome variability. Studies^11–17 have reported that risk factors associated with higher means are also associated with higher outcome variability. For example, Beyerlein et al (2008)¹⁷ found that multiple risk factors for high childhood BMI (such as more frequent television viewing and greater rapid infant weight gain) were related to both higher mean BMI and greater variability in BMI. However, previous studies have not utilised multiple outcomes or nationally representative samples, and have not systematically considered explanations for such findings or their implications.

Our findings help to reconcile findings from GAMLSS with those using quantile regression^17,21,22 which have reported stronger effect sizes for BMI risk factors at higher BMI centiles. This finding is both consistent with and helps explain the GAMLSS findings. For instance, lower social class and physical inactivity are related to higher mean BMI, higher BMI variability, yet less BMI skewness; the net result is higher effect estimates at upper centiles which are less precisely estimated, as seen in quantile regression. While both analytical approaches have merit, GAMLSS has a number of attractive features for use in aetiological research: it enables each distribution moment to be separately investigated, and uses predetermined distribution families which enable computation of sparsely distributed variables.

Why are risk factors associated with differences in outcome variability? There are multiple possible explanations. First, it may represent the influence of unmeasured variables which influence the outcome—particularly at upper values—leading to both higher means and variability. Such additional variables could also be unmeasured effect modifiers which increases the strength of the risk factor effect at higher outcome centiles. Unmeasured factors such as genetic propensity to weight gain may for example modify the effect on weight gain of exposure to adverse socioeconomic circumstances.³⁰ Other environmental factors could operate similarly—such that the association between lower social class and higher BMI is weaker amongst those living in a local environment which is less ‘obesogenic’ (i.e., conducive to more physical activity and lower energy intake).^31,32 The net result of such divergent effects would be increased variability since the effects would range from zero to the upper bound of the effect. This explanation may also apply to mental wellbeing, given evidence for the myriad environmental^33,25 and genetic determinants^34,35 which could modify the effects observed in the current study.

Alternatively, between-person differences in confounding and/or measurement error may also lead to risk factors being associated with outcome variability. For example, in the present study physical activity was measured via a single item capturing reported activity of a moderate-vigorous intensity for at least 30 minutes per day; this is an imperfect reflection of the underlying exposure which may have a causal effect (e.g., total energy expenditure (across all intensities of activity) in the case of adiposity;³⁶ or time spent in specific activities conducive to wellbeing in the case of mental wellbeing³⁷). The net result would be higher variability in those reporting higher physical activity levels. A related issue is the extent to which the exposure captures the same ‘dose’ across participants in a given study. The physical activity measure used here counted the number of days that bouts of activity lasted at least 30 minutes; this likely reflects substantial variability in the level of exercise actually undertaken, thus leading to greater differences in outcome variability. This could partly explain the associations of lower social class with greater outcome variability, since social class is one dimension of socioeconomic position, such that there may be substantial between-person variation in other dimensions (eg, parental education, income and/or wealth^38,39) which may each influence outcomes, leading to greater variability.

The study highlights the fact that analyses by GAMLSS and quantile regression lead to very similar results at the selected quantiles of the outcome distribution—see Figures 3 and 4. However GAMLSS, by analysing the whole distribution, can in some cases provide more efficient estimates of the quantiles. Compare for example the standard errors of the median as obtained by the BCCG family (Tables 2 and 3) and quantile regression (Table 4); for BMI the standard errors of around 0.5 are broadly similar the two ways, but for WEMWEBS the GAMLSS standard errors are appreciably smaller.

Strengths and limitations

Strengths of this study include the analytical approach used (GAMLSS) to empirically investigate differences in outcome variability. While differences in variability can be informed by descriptive comparison (e.g., comparing SD values), GAMLSS additionally enables computation of estimates of precision and incorporates multivariable specifications (e.g., confounder or mediator adjustment; and inclusion of interaction terms). The use of the 1970 birth cohort data is an additional strength, enabling investigation of multiple risk factors and two largely orthogonal yet important continuous health outcomes. The national representation of this cohort is also advantageous—highly distorted sample selection can bias conventional epidemiological results (i.e., mean differences in outcomes),⁴⁰ and may also bias comparisons of outcome variability.

The study also has limitations. As in all observational studies, causal inference is challenging despite the use of longitudinal data. Associations of social class at birth with outcomes for example could be explained by unmeasured confounding—this may include factors such as parental mental health. This is challenging to falsify empirically owing to a lack of such data collected before birth. In contrast, sex is randomly assigned at birth, and thus its associations with outcomes are unlikely to be confounded. However, sex differences in reporting may bias associations with mental wellbeing.

Physical activity and mental wellbeing were ascertained at broadly the same age, so that associations between the two could be explained by reverse causality; existing evidence appears to suggest bi-directionality of links between physical activity and both outcomes.^41,42 Finally, attrition led to lower power to precisely estimate smaller effect sizes (e.g., gender differences in mental wellbeing) or confirm null effects. Such attribution could potentially bias associations—those in worse health and adverse socioeconomic circumstances are disproportionately lost to follow-up.^43,44 The focus of principled approaches to handle missing data in epidemiology has been on the main parameter of interest—typically beta coefficients in linear regression models—and further empirical work is required to investigate the potential implications of (non-random) missingness for the variability and other moments of the outcome distribution.

Potential implications

This study used an underutilised approach to empirically investigate associations between risk factors and outcome variability in a single cohort study. Thus, our findings require replication and extension in other datasets across other risk factors and health outcomes. Future studies should also seek to explain their findings, and where possible falsify potential explanations. Understanding how risk factors relate to and/or cause differences in outcome variability is not a standard part of epidemiological training, and it entails additional analytical and conceptual complexity. Thus, with greater application of these tools an emerging consensus on best practice should develop. In the first instance we recommend both descriptive and formal investigation, and that analysts carefully consider the use of both absolute (e.g., SD) and relative (e.g., CoV) differences in variability. Since the CoV is fractional standard deviation (eg, SD/mean or log SD), its suitability of use depends on the a priori anticipated relationship between the mean and variance.

In the context of randomised controlled trials, the finding of variability in treatment effects between individuals has been used to justify individualised approaches to treatment (personalised medicine). It is beyond the scope of the current article to discuss the tractability of this for complex outcomes in which treatment effects are unpredictable.⁴⁵ Trials are designed typically to detect only mean differences in outcomes;⁴⁶ nevertheless, additionally presenting outcome variability before and after treatment would be helpful to better appraise intervention effects.⁴ GAMLSS provides a useful framework with which to formally investigate this, even where the homoscedasticity assumption does not hold (i.e., where risk factors or treatment groups differ in their outcome variance). Where there are multiple potential efficacious interventions, further studies could meta-analyse existing trials to identify the types of intervention which additionally reduce outcome variability.

Conclusion

We provide empirical support for the notion that risk factors or interventions can either reduce or increase variability in health outcomes. This finding is consistent with results from quantile regression analysis where a risk factor vs outcome association is stronger (or lower) at higher outcome centiles. Such findings may be explained by heterogeneity in the causal effect of each exposure, by the influence of other (typically unmeasured) variables, and/or by measurement error. This underutilised approach to the analysis of continuously distributed outcomes may have broader utility in epidemiological, medical, and psychological sciences.

Data Availability

Data are available on the UK Data Archive

https://beta.ukdataservice.ac.uk/datacatalogue/series/series?id=200001

Funding

DB is supported by the Economic and Social Research Council (grant number ES/M001660/1), The Academy of Medical Sciences / Wellcome Trust (“Springboard Health of the Public in 2040” award: HOP001/1025), and Medical Research Council (MR/V002147/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Kernel density plots for BMI and WEMWEBS by risk factor group

View this table:

Supplementary Table 1.

Risk factors in relation to body mass index (BMI): differences in mean, variability and skewness estimated by GAMLSS (n = 6,025)

View this table:

Supplementary Table 2.

Risk factors in relation to mental wellbeing (WEMWEBS): differences in mean, variability and skewness estimated by GAMLSS (n = 7,128)

References

1.↵
Keyes KM, Galea S. Population health science: Oxford University Press 2016.
2.↵
Plomin R, Haworth CM, Davis OS. Common disorders are quantitative traits. Nature reviews genetics 2009;10(12):872–78.
OpenUrl CrossRef PubMed Web of Science
3.↵
Keyes CL. The mental health continuum: From languishing to flourishing in life. J Health Soc Behav 2002:207–22.
4.↵
Subramanian S, Kim R, Christakis NA. The” average” treatment effect: A construct ripe for retirement. A commentary on Deaton and Cartwright. Soc Sci Med 2018
5.↵
Vitale S, Cotch MF, Sperduto RD. Prevalence of visual impairment in the United States. JAMA 2006;295(18):2158–63.
OpenUrl CrossRef PubMed Web of Science
6.↵
Truby H, Baic S, DeLooy A, et al. Randomised controlled trial of four commercial weight loss programmes in the UK: initial findings from the BBC “diet trials”. BMJ 2006;332(7553):1309–14.
OpenUrl Abstract/FREE Full Text
7.↵
Porta M. A dictionary of epidemiology. Oxford, UK: Oxford University Press 2008.
8.↵
Beyerlein A, Toschke AM, Von Kries R. Breastfeeding and childhood obesity: shift of the entire BMI distribution or only the upper parts? Obesity 2008;16(12):2730–33.
OpenUrl
9.↵
Flegal KM, Troiano RP. Changes in the distribution of body mass index of adults and children in the US population. Int J Obes 2000;24(7):807.
OpenUrl CrossRef PubMed Web of Science
10.↵
Johnson W, Li L, Kuh D, et al. How Has the Age-Related Process of Overweight or Obesity Development Changed over Time? Co-ordinated Analyses of Individual Participant Data from Five United Kingdom Birth Cohorts. PLoS Med 2015;12(5):e1001828.
OpenUrl CrossRef PubMed
11.↵
Sun J, Covaci A, Bustnes JO, et al. Temporal trends of legacy organochlorines in different whitetailed eagle (Haliaeetus albicilla) subpopulations: A retrospective investigation using archived feathers. Environ Int 2020;138:105618.
OpenUrl
12.↵
Nakagawa S, Poulin R, Mengersen K, et al. Meta-analysis of variation: ecological and evolutionary applications and beyond. Methods Ecol Evol 2015;6(2):143–52.
OpenUrl CrossRef
13.↵
Pitt D, Trück S, van den Honert R, et al. Modeling risks from natural hazards with generalized additive models for location, scale and shape. J Environ Manage 2020;275:111075.
OpenUrl
14.↵
Hohberg M, Pütz P, Kneib T. Treatment effects beyond the mean using distributional regression: Methods and guidance. PLoS One 2020;15(2):e0226514.
OpenUrl
15.↵
Silbersdorff A, Schneider KS. Distributional Regression Techniques in Socioeconomic Research on the Inequality of Health with an Application on the Relationship between Mental Health and Income. Int J Environ Res Public Health 2019;16(20):4009.
OpenUrl
16.↵
Silbersdorff A, Lynch J, Klasen S, et al. Reconsidering the income-health relationship using distributional regression. Health Econ 2018;27(7):1074–88.
OpenUrl
17.↵
Beyerlein A, Fahrmeir L, Mansmann U, et al. Alternative regression models to assess increase in childhood BMI. BMC Med Res Methodol 2008;8(1):59.
OpenUrl CrossRef PubMed
18.↵
Rigby RA, Stasinopoulos DM. Generalized additive models for location, scale and shape. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2005;54(3):507–54.
OpenUrl CrossRef
19.↵
Cole T, Stanojevic S, Stocks J, et al. Age-and size-related reference ranges: A case study of spirometry through childhood and adulthood. Stat Med 2009;28(5):880–98.
OpenUrl CrossRef PubMed Web of Science
20.↵
Beyerlein A. Quantile regression—opportunities and challenges from a user’s perspective. Am J Epidemiol 2014;180(3):330–31.
OpenUrl CrossRef PubMed Web of Science
21.↵
Bann D, Fitzsimons E, Johnson W. Determinants of the population health distribution: an illustration examining body mass index. Int J Epidemiol 2020;49(3):731–37.
OpenUrl
22.↵
Green MA, Rowe F. Explaining the widening distribution of Body Mass Index: A decomposition analysis of trends for England, 2002–2004 and 2012–2014. Area 2020
23.↵
Stringhini S, Carmeli C, Jokela M, et al. Socioeconomic status and the 25× 25 risk factors as determinants of premature mortality: a multicohort study and meta-analysis of 1·7 million men and women. Lancet 2017;389(10075):1229–37.
OpenUrl CrossRef PubMed
24.↵
Elliott J, Shepherd P. Cohort profile: 1970 British birth cohort (BCS70). Int J Epidemiol 2006;35(4):836–43.
OpenUrl CrossRef PubMed Web of Science
25.↵
Wood N, Hardy R, Bann D, et al. Childhood correlates of adult positive mental well-being in three British longitudinal studies. J Epidemiol Community Health 2021;75(2):177–84.
OpenUrl Abstract/FREE Full Text
26.↵
Tennant R, Hiller L, Fishwick R, et al. The Warwick-Edinburgh mental well-being scale (WEMWBS): development and UK validation. Health and Quality of life Outcomes 2007;5(1):63.
OpenUrl
27.↵
Lewontin RC. On the measurement of relative variability. Syst Zool 1966;15(2):141–42.
OpenUrl CrossRef
28.↵
Cole TJ, Green PJ. Smoothing reference centile curves: the LMS method and penalized likelihood. Stat Med 1992;11(10):1305–19.
OpenUrl CrossRef PubMed Web of Science
29.↵
Stasinopoulos DM, Rigby RA. Generalized additive models for location scale and shape (GAMLSS) in R. Journal of Statistical Software 2007;23(7):1–46.
OpenUrl
30.↵
Tyrrell J, Wood AR, Ames RM, et al. Gene–obesogenic environment interactions in the UK Biobank study. Int J Epidemiol 2017:dyw337.
31.↵
Drewnowski A, Rehm CD, Solet D. Disparities in obesity rates: analysis by ZIP code area. Soc Sci Med 2007;65(12):2458–63.
OpenUrl CrossRef PubMed Web of Science
32.↵
Stafford M, Cummins S, Ellaway A, et al. Pathways to obesity: Identifying local, modifiable determinants of physical activity and diet. Soc Sci Med 2007;65(9):1882–97.
OpenUrl CrossRef PubMed Web of Science
33.↵
Ludwig J, Duncan GJ, Gennetian LA, et al. Neighborhood effects on the long-term well-being of low-income adults. Science 2012;337(6101):1505–10.
OpenUrl Abstract/FREE Full Text
34.↵
Luciano M, Hagenaars SP, Davies G, et al. Association analysis in over 329,000 individuals identifies 116 independent variants influencing neuroticism. Nat Genet 2018;50(1):6–11.
OpenUrl CrossRef PubMed
35.↵
De Moor MH, Van Den Berg SM, Verweij KJ, et al. Meta-analysis of genome-wide association studies for neuroticism, and the polygenic association with major depressive disorder. JAMA psychiatry 2015;72(7):642–50.
OpenUrl
36.↵
Bann D, Kuh D, Wills AK, et al. Physical activity across adulthood in relation to fat and lean mass in early old age: findings from the MRC National Survey of Health and Development, 1946- 2010. Am J Epidemiol 2014;179(10):1197–207.
OpenUrl CrossRef PubMed Web of Science
37.↵
Black SV, Cooper R, Martin KR, et al. Physical activity and mental well-being in a cohort aged 60– 64 years. Am J Prev Med 2015;49(2):172–80.
OpenUrl CrossRef
38.↵
Moulton V, Goodman A, Nasim B, et al. Parental Wealth and Children’s Cognitive Ability, Mental, and Physical Health: Evidence From the UK Millennium Cohort Study. Child Dev 2021;92(1):115–23.
OpenUrl
39.↵
Galobardes B, Shaw M, Lawlor DA, et al. Indicators of socioeconomic position (part 1). J Epidemiol Community Health 2006;60(1):7–12.
OpenUrl Abstract/FREE Full Text
40.↵
Munafò MR, Tilling K, Taylor AE, et al. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol 2018;47(1):226–35.
OpenUrl CrossRef PubMed
41.↵
Pereira SMP, Geoffroy M-C, Power C. Depressive symptoms and physical activity during 3 decades in adult life: bidirectional associations in a prospective cohort study. JAMA psychiatry 2014;71(12):1373–80.
OpenUrl
42.↵
Gibbs BB, Aaby D, Siddique J, et al. Bidirectional 10-year associations of accelerometer-measured sedentary behavior and activity categories with weight among middle-aged adults. Int J Obes 2020;44(3):559–67.
OpenUrl
43.↵
Mostafa T, Wiggins D. The impact of attrition and non-response in birth cohort studies: a need to incorporate missingness strategies. Longitudinal and Life Course Studies 2015;6(2):131–46.
OpenUrl CrossRef
44.↵
Mostafa T, Narayanan M, Pongiglione B, et al. Improving the plausibility of the missing at random assumption in the 1958 British birth cohort: A pragmatic data driven approach. 2020
45.↵
Smith GD. Epidemiology, epigenetics and the ‘Gloomy Prospect’: embracing randomness in population health research and practice. Int J Epidemiol 2011;40(3):537–62.
OpenUrl CrossRef PubMed Web of Science
46.↵
Senn S. Mastering variation: variance components and personalised medicine. Stat Med 2016;35(7):966–77.
OpenUrl PubMed

View the discussion thread.

Posted March 31, 2021.

Download PDF

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (322)
Allergy and Immunology (626)
Anesthesia (162)
Cardiovascular Medicine (2352)
Dentistry and Oral Medicine (286)
Dermatology (206)
Emergency Medicine (377)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (832)
Epidemiology (11740)
Forensic Medicine (10)
Gastroenterology (698)
Genetic and Genomic Medicine (3712)
Geriatric Medicine (347)
Health Economics (632)
Health Informatics (2383)
Health Policy (928)
Health Systems and Quality Improvement (890)
Hematology (340)
HIV/AIDS (776)
Infectious Diseases (except HIV/AIDS) (13293)
Intensive Care and Critical Care Medicine (767)
Medical Education (364)
Medical Ethics (104)
Nephrology (397)
Neurology (3470)
Nursing (197)
Nutrition (521)
Obstetrics and Gynecology (669)
Occupational and Environmental Health (661)
Oncology (1809)
Ophthalmology (534)
Orthopedics (218)
Otolaryngology (286)
Pain Medicine (232)
Palliative Medicine (66)
Pathology (445)
Pediatrics (1026)
Pharmacology and Therapeutics (426)
Primary Care Research (417)
Psychiatry and Clinical Psychology (3163)
Public and Global Health (6116)
Radiology and Imaging (1268)
Rehabilitation Medicine and Physical Therapy (740)
Respiratory Medicine (824)
Rheumatology (379)
Sexual and Reproductive Health (371)
Sports Medicine (320)
Surgery (398)
Toxicology (50)
Transplantation (171)
Urology (145)

[1] 1.↵
Keyes KM, Galea S. Population health science: Oxford University Press 2016.

[2] 2.↵
Plomin R, Haworth CM, Davis OS. Common disorders are quantitative traits. Nature reviews genetics 2009;10(12):872–78.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Keyes CL. The mental health continuum: From languishing to flourishing in life. J Health Soc Behav 2002:207–22.

[4] 4.↵
Subramanian S, Kim R, Christakis NA. The” average” treatment effect: A construct ripe for retirement. A commentary on Deaton and Cartwright. Soc Sci Med 2018

[5] 5.↵
Vitale S, Cotch MF, Sperduto RD. Prevalence of visual impairment in the United States. JAMA 2006;295(18):2158–63.
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
Truby H, Baic S, DeLooy A, et al. Randomised controlled trial of four commercial weight loss programmes in the UK: initial findings from the BBC “diet trials”. BMJ 2006;332(7553):1309–14.
OpenUrl Abstract/FREE Full Text

[7] 7.↵
Porta M. A dictionary of epidemiology. Oxford, UK: Oxford University Press 2008.

[8] 8.↵
Beyerlein A, Toschke AM, Von Kries R. Breastfeeding and childhood obesity: shift of the entire BMI distribution or only the upper parts? Obesity 2008;16(12):2730–33.
OpenUrl

[9] 9.↵
Flegal KM, Troiano RP. Changes in the distribution of body mass index of adults and children in the US population. Int J Obes 2000;24(7):807.
OpenUrl CrossRef PubMed Web of Science

[10] 10.↵
Johnson W, Li L, Kuh D, et al. How Has the Age-Related Process of Overweight or Obesity Development Changed over Time? Co-ordinated Analyses of Individual Participant Data from Five United Kingdom Birth Cohorts. PLoS Med 2015;12(5):e1001828.
OpenUrl CrossRef PubMed

[11] 11.↵
Sun J, Covaci A, Bustnes JO, et al. Temporal trends of legacy organochlorines in different whitetailed eagle (Haliaeetus albicilla) subpopulations: A retrospective investigation using archived feathers. Environ Int 2020;138:105618.
OpenUrl

[12] 12.↵
Nakagawa S, Poulin R, Mengersen K, et al. Meta-analysis of variation: ecological and evolutionary applications and beyond. Methods Ecol Evol 2015;6(2):143–52.
OpenUrl CrossRef

[13] 13.↵
Pitt D, Trück S, van den Honert R, et al. Modeling risks from natural hazards with generalized additive models for location, scale and shape. J Environ Manage 2020;275:111075.
OpenUrl

[14] 14.↵
Hohberg M, Pütz P, Kneib T. Treatment effects beyond the mean using distributional regression: Methods and guidance. PLoS One 2020;15(2):e0226514.
OpenUrl

[15] 15.↵
Silbersdorff A, Schneider KS. Distributional Regression Techniques in Socioeconomic Research on the Inequality of Health with an Application on the Relationship between Mental Health and Income. Int J Environ Res Public Health 2019;16(20):4009.
OpenUrl

[16] 16.↵
Silbersdorff A, Lynch J, Klasen S, et al. Reconsidering the income-health relationship using distributional regression. Health Econ 2018;27(7):1074–88.
OpenUrl

[17] 17.↵
Beyerlein A, Fahrmeir L, Mansmann U, et al. Alternative regression models to assess increase in childhood BMI. BMC Med Res Methodol 2008;8(1):59.
OpenUrl CrossRef PubMed

[18] 18.↵
Rigby RA, Stasinopoulos DM. Generalized additive models for location, scale and shape. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2005;54(3):507–54.
OpenUrl CrossRef

[19] 19.↵
Cole T, Stanojevic S, Stocks J, et al. Age-and size-related reference ranges: A case study of spirometry through childhood and adulthood. Stat Med 2009;28(5):880–98.
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Beyerlein A. Quantile regression—opportunities and challenges from a user’s perspective. Am J Epidemiol 2014;180(3):330–31.
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
Bann D, Fitzsimons E, Johnson W. Determinants of the population health distribution: an illustration examining body mass index. Int J Epidemiol 2020;49(3):731–37.
OpenUrl

[22] 22.↵
Green MA, Rowe F. Explaining the widening distribution of Body Mass Index: A decomposition analysis of trends for England, 2002–2004 and 2012–2014. Area 2020

[23] 23.↵
Stringhini S, Carmeli C, Jokela M, et al. Socioeconomic status and the 25× 25 risk factors as determinants of premature mortality: a multicohort study and meta-analysis of 1·7 million men and women. Lancet 2017;389(10075):1229–37.
OpenUrl CrossRef PubMed

[24] 24.↵
Elliott J, Shepherd P. Cohort profile: 1970 British birth cohort (BCS70). Int J Epidemiol 2006;35(4):836–43.
OpenUrl CrossRef PubMed Web of Science

[25] 25.↵
Wood N, Hardy R, Bann D, et al. Childhood correlates of adult positive mental well-being in three British longitudinal studies. J Epidemiol Community Health 2021;75(2):177–84.
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Tennant R, Hiller L, Fishwick R, et al. The Warwick-Edinburgh mental well-being scale (WEMWBS): development and UK validation. Health and Quality of life Outcomes 2007;5(1):63.
OpenUrl

[27] 27.↵
Lewontin RC. On the measurement of relative variability. Syst Zool 1966;15(2):141–42.
OpenUrl CrossRef

[28] 28.↵
Cole TJ, Green PJ. Smoothing reference centile curves: the LMS method and penalized likelihood. Stat Med 1992;11(10):1305–19.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Stasinopoulos DM, Rigby RA. Generalized additive models for location scale and shape (GAMLSS) in R. Journal of Statistical Software 2007;23(7):1–46.
OpenUrl

[30] 30.↵
Tyrrell J, Wood AR, Ames RM, et al. Gene–obesogenic environment interactions in the UK Biobank study. Int J Epidemiol 2017:dyw337.

[31] 31.↵
Drewnowski A, Rehm CD, Solet D. Disparities in obesity rates: analysis by ZIP code area. Soc Sci Med 2007;65(12):2458–63.
OpenUrl CrossRef PubMed Web of Science

[32] 32.↵
Stafford M, Cummins S, Ellaway A, et al. Pathways to obesity: Identifying local, modifiable determinants of physical activity and diet. Soc Sci Med 2007;65(9):1882–97.
OpenUrl CrossRef PubMed Web of Science

[33] 33.↵
Ludwig J, Duncan GJ, Gennetian LA, et al. Neighborhood effects on the long-term well-being of low-income adults. Science 2012;337(6101):1505–10.
OpenUrl Abstract/FREE Full Text

[34] 34.↵
Luciano M, Hagenaars SP, Davies G, et al. Association analysis in over 329,000 individuals identifies 116 independent variants influencing neuroticism. Nat Genet 2018;50(1):6–11.
OpenUrl CrossRef PubMed

[35] 35.↵
De Moor MH, Van Den Berg SM, Verweij KJ, et al. Meta-analysis of genome-wide association studies for neuroticism, and the polygenic association with major depressive disorder. JAMA psychiatry 2015;72(7):642–50.
OpenUrl

[36] 36.↵
Bann D, Kuh D, Wills AK, et al. Physical activity across adulthood in relation to fat and lean mass in early old age: findings from the MRC National Survey of Health and Development, 1946- 2010. Am J Epidemiol 2014;179(10):1197–207.
OpenUrl CrossRef PubMed Web of Science

[37] 37.↵
Black SV, Cooper R, Martin KR, et al. Physical activity and mental well-being in a cohort aged 60– 64 years. Am J Prev Med 2015;49(2):172–80.
OpenUrl CrossRef

[38] 38.↵
Moulton V, Goodman A, Nasim B, et al. Parental Wealth and Children’s Cognitive Ability, Mental, and Physical Health: Evidence From the UK Millennium Cohort Study. Child Dev 2021;92(1):115–23.
OpenUrl

[39] 39.↵
Galobardes B, Shaw M, Lawlor DA, et al. Indicators of socioeconomic position (part 1). J Epidemiol Community Health 2006;60(1):7–12.
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Munafò MR, Tilling K, Taylor AE, et al. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol 2018;47(1):226–35.
OpenUrl CrossRef PubMed

[41] 41.↵
Pereira SMP, Geoffroy M-C, Power C. Depressive symptoms and physical activity during 3 decades in adult life: bidirectional associations in a prospective cohort study. JAMA psychiatry 2014;71(12):1373–80.
OpenUrl

[42] 42.↵
Gibbs BB, Aaby D, Siddique J, et al. Bidirectional 10-year associations of accelerometer-measured sedentary behavior and activity categories with weight among middle-aged adults. Int J Obes 2020;44(3):559–67.
OpenUrl

[43] 43.↵
Mostafa T, Wiggins D. The impact of attrition and non-response in birth cohort studies: a need to incorporate missingness strategies. Longitudinal and Life Course Studies 2015;6(2):131–46.
OpenUrl CrossRef

[44] 44.↵
Mostafa T, Narayanan M, Pongiglione B, et al. Improving the plausibility of the missing at random assumption in the 1958 British birth cohort: A pragmatic data driven approach. 2020

[45] 45.↵
Smith GD. Epidemiology, epigenetics and the ‘Gloomy Prospect’: embracing randomness in population health research and practice. Int J Epidemiol 2011;40(3):537–62.
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Senn S. Mastering variation: variance components and personalised medicine. Stat Med 2016;35(7):966–77.
OpenUrl PubMed

What if risk factors influenced the variability of health outcomes as well as the mean? Evidence and implications

Abstract

Introduction

Methods

Study sample

Health outcomes

Risk factors

Analytical strategy

Results

Body mass index

Mental wellbeing – Warwick-Edinburgh Mental Wellbeing Scale

Comparison with quantile regression findings

Discussion

Strengths and limitations

Potential implications

Conclusion

Data Availability

Funding

Kernel density plots for BMI and WEMWEBS by risk factor group

References

Citation Manager Formats

Subject Area