Abstract
Study Objective Sleep is essential for both physical and mental health. There is an increasing interest in understanding how different factors shape individual variation in sleep duration, quality and patterns, or confer risk for sleep disorders. The present study aimed to identify novel causal relationships between sleep-related traits and other phenotypes, using a genetics-driven hypothesis-free approach not requiring longitudinal data.
Methods We used genetic data and the latent causal variable (LCV) method to screen the phenome and infer causal relationships between seven sleep-related traits (insomnia, daytime dozing, easiness of getting up in the morning, snoring, sleep duration, napping, and morningness) and 1,527 different phenotypes.
Results We identify 84 significant causal relationships. Among other findings, poor health of musculoskeletal and connective tissue disorders increase insomnia risk and reduce sleep duration; depression-related traits increase insomnia and daytime dozing; insomnia, napping and snoring are affected by obesity and cardiometabolic traits and diseases; and working with asbestos, thinner, or glues increases insomnia, potentially through an increased risk of respiratory disease.
Conclusion Overall, our results indicate that changes in sleep variables are predominantly the consequence, rather than the cause, of other underlying phenotypes and diseases. These insights could inform the design of future epidemiological and interventional studies in sleep medicine and research.
Introduction
Sleep is a complex neurological and physiological state that is essential for various biological processes and systems, ranging from homeostasis and restoration of energy levels to memory consolidation.1 Individual differences in sleep quality, duration, and patterns have been associated with numerous variables, including sex, age, genetics, body size, occupation, mental and physical illness status, and cultural and environmental factors.2 Furthermore, growing evidence suggests that inadequate sleep and sleep disorders contribute to the aetiology of metabolic, psychiatric and neurodegenerative conditions.3–5
Epidemiological studies of sleep have focused on a few measurable traits, such as sleep duration, snoring, chronotype, and daytime dozing, and diseases such as insomnia and sleep apnoea.6,7 Twin and family studies have shown that genetic factors partly explain individual variation in sleep-related traits, with heritability estimates ranging from 0.30 for insomnia and 0.34 for daytime dozing,8 to 0.42 for morningness 9 and 0.52 for snoring.10,11
More recently, genome-wide association studies (GWAS) have achieved considerable progress in the characterisation of the genetic architecture of sleep-related traits.12–15 GWAS have identified 351 genomic risk loci associated with chronotype,13 41 genomic loci associated with snoring,15 and 202 genomic loci associated with insomnia.16 Furthermore, significant genetic overlap with other conditions has been uncovered. For instance, genetic correlations (rG) were found between insomnia and depressive symptoms, major depression, anxiety disorder, higher neuroticism scores and lower subjective well-being scores.16 While snoring was genetically correlated with sleep apnoea, excessive daytime sleepiness, body mass index (BMI), daytime dozing, schizophrenia, anorexia nervosa and neuroticism score, among others.15
A genetic correlation between two polygenic traits may be due to horizontal pleiotropic effects, and therefore is not necessarily indicative of a causal relationship.17 Horizontal pleiotropy poses challenges to statistical methods designed to infer causal relationships between environmental exposures and health outcomes using genetic data, such as Mendelian randomisation (MR). Besides pleiotropy, MR can also become limited by weakly-powered discovery GWAS studies, which increase the likelihood of false-positive findings.17,18 Given the limitations of MR, alternative methods such as Latent Causal Variable (LCV) have been developed.17 LCV mediates the genetic correlation between two traits through a latent variable that has a causal effect on each trait. The LCV method distinguishes between genetic correlation due to horizontal pleiotropy and full or partial genetic causation by estimating the genetic causality proportion (GCP), a parameter that can range from 0 (no genetic causality) to 1 (full genetic causality of trait A on trait B) or −1 (full genetic causality of trait B on trait A).17
Due to the importance of sleep in human health, there is growing interest in understanding the determinants of sleep-related traits and their relationships with other health conditions. Previous studies have reported associations between tobacco smoking, alcohol consumption, and body mass index with an increased risk of snoring;15 insomnia and an increased risk of depression, diabetes, and cardiovascular disease;16 longer sleep duration with an increased risk of breast cancer in women19 and shorter sleep duration with a higher risk for myocardial infarction.20 Nonetheless, most of these studies lack the design principles to perform causal inferences. In fact, many interventional studies on sleep would be considered unethical.
In the present study, we sought to explore the potential causal relationships of sleep-related traits with a broad spectrum of variables, using new statistical methods to infer causation which rely single nucleotide polymorphism (SNP) data from well-powered GWAS for pairs of traits measured on the same, or different samples. We leverage the extensive collection (n=1,527) of GWAS summary statistics in the Complex Traits Genetics Virtual Lab (CTG-VL) to conduct a hypothesis-free phenome-wide screening of variables causally associated with seven sleep-related phenotypes: insomnia, daytime dozing, easiness to get up, snoring, sleep duration, napping, and morningness. Our results confirm some of the causal associations hypothesised through observational studies, and provide new insights into the relationships between sleep, lifestyle and health.
Methods
Datasets
The present study used summary statistics from genome-wide association studies (GWAS) for the seven sleep phenotypes under investigation. The summary statistics resulting from a genome-wide scan summarise relevant parameters including allele frequency, effect size, standard error and the p-value of each genetic variant tested on the trait of interest. Most published GWAS have made their summary statistics available to the scientific community, which enables researchers to leverage previous findings to advance knowledge in distinct fields. The GWAS summary statistics used here correspond to studies of snoring, insomnia, daytime dozing, getting up, sleep duration, napping, and morningness (information for each study is listed in Table 1). Datasets were obtained from the repositories reported in their corresponding publications (Campos and García-Marín et al. 2019 and Jansen et al. 2018)15,16
Causal architecture analysis pipeline
Summary statistics from genome-wide association studies (GWAS) for seven sleep-related traits were collected from previous studies (Jansen et al. 2018 and Campos and García-Marín et al. 2019)15,16. Then, they were formatted using in-house scripts and uploaded onto the Complex Trait Genomics Virtual Lab (CTG-VL; https://genoma.io/) web-based platform. Subsequently, the MASSIVE analysis pipeline, which includes bivariate LD-score regression and latent causal variable analysis (LCV) was implemented for each sleep-related trait of interest. Finally, causal architecture plots were used to depict the LCV results (Figure 1).
Genetic correlations
A genetic correlation between two traits describes the relationship of genetic effects sizes at mutual genetic variants across two different phenotypes.21 The LCV method estimates a genetic correlation between traits A and B through a modified linkage disequilibrium score regression.22 If the genetic correlation is nominally significant, then a latent variable L is introduced into the model to assess causality between trait A and trait B, assuming that L is the causal component that mediates the genetic correlation between both traits (see below).17,18 We corrected for multiple testing using Benjamini-Hochberg’s False Discovery Rate (FDR < 5%).
Genetic causal proportion
Latent causal variable (LCV) is a method that uses summary statistics from genome-wide association studies to estimate the genetic causality proportion (GCP) parameter, by mediating the genetic correlation between Trait A and Trait B with a latent variable L. A GCP of 0 indicates no genetic causality. A GCP of 1 or −1 is indicative of full genetic causality, of Trait A on Trait B, or Trait B on Trait A, respectively. Values between 0 and 1 or 0 and −1 would indicate partial genetic causality.17,18 Although causality is often thought of as a binary characteristic, the idea of partial genetic causality is consistent with both a causal relationship between two complex traits and the notion of distinct genetic components underlying complex traits (e.g. a disease could be caused by both, direct genetic effects, and genetic predisposition to one or more environmental exposures).
An advantage of the LCV method over other methods such as Mendelian randomisation is that it differentiates horizontal from vertical pleiotropy. The model assumes that given a directed effect of trait A on trait B, the effects of genetic variants underlying trait A are expected to have proportional effects on trait B, but not vice versa. Thus, by mediating the genetic correlation between trait A and trait B through the L parameter, one can estimate the GCP (see O’Connor & Price17 for more details). A GCP value close to 0 suggests that horizontal pleiotropy mediates the genetic correlation between traits A and B, and thus any intervention targeting trait A should not affect trait B.17
Notably, the LCV method assumes no bidirectional causality and no confounding by environmental correlates of genotypes. Therefore, care is required when these assumptions are not met.18 Moreover, the most attractive features of this method include that it is robust to sample overlap, has higher statistical power than MR and is unconfounded by horizontal pleiotropy.18 Multiple testing in GCP was corrected for using Benjamini-Hochberg’s False Discovery Rate (FDR < 5%).
Results
Insomnia
We identified genetic correlations between insomnia and 608 traits (FDR < 5%). Forty-eight of these traits showed a causal effect on insomnia risk (negative GCP estimates), and we did not find any significant causal effect of insomnia on another trait (Table 1). The traits with the strongest evidence of causal effect on insomnia were dyspepsia (rG = 0.34, GCP = −0.97, p-valueGCP = 2.37 × 10−213), often worked with materials containing asbestos (rG = 0.26, GCP = −0.97, p-valueGCP = 7.23 × 10−202) and chest pain during physical activity (rG = 0.57, GCP = −0.96, p-valueGCP = 1 33 × 10−100; Table 2). Consistently, rarely/never worked with materials containing asbestos was causally associated with reduced insomnia risk (rG = −0.44, GCP= −0.92, p-valueGCP = 2.30 × 10−33; Figure 2 and Supplementary File 1).
Traits related to connective tissue and musculoskeletal health, such as other specific joint derangements/joint disorders (rG = 0.64, GCP = −0.73, p-valueGCP = 3.21 × 10−04), synovitis and tenosynovitis (ICD10) (rG = 0.28, GCP = −0.50, p-valueGCP = 9.65 × 10−26) and other arthrosis (rG = 0.51, GCP = −0.79, p-valueGCP = 1.17 × 10−07), were also found to causally influence insomnia risk. Additionally, respiratory-related traits including interstitial lung disease (ILD) (rG = 0.38, GCP = −0.65, p-valueGCP = 0.003) and chronic obstructive pulmonary disease (COPD) (rG = 0.38, GCP = −0.65, p-valueGCP = 0.003) showed evidence of increasing the risk for insomnia as did gastrointestinal phenotypes including other gastritis (rG = 0.51, GCP = −0.76, p-valueGCP = 2.35 × 10−06). A similar pattern was observed for stopped smoking due to an illness (rG = 0.46, GCP = −0.85, p-valueGCP = 1.64 × 10−14) or due to a doctor’s advice (rG = 0.40, GCP = −0.75, p-valueGCP = 7.20 × 10−06) (Figure 2).
Daytime dozing
We identified 147 traits with a significant genetic correlation with daytime dozing and 9 with evidence for a causal relationship. Of those, seven influence dozing, and two are putative consequences of it (Figure 3a). Moreover, five out of the nine causal relationships directly involve depression, with the trait no bipolar disorder or depression showing the most robust evidence for decreasing the risk for daytime dozing (rG = −0.32, GCP = −0.68, p-valueGCP = 2.13 × 10−05), followed by seeing a doctor (GP) for nerves, anxiety, tension or depression, which increased risk for daytime dozing (rG = 0.27, GCP = −0.53, p-valueGCP = 2.15 × 10−04;Table 3).
Snoring
Snoring was genetically correlated with 299 different traits, ten of which held a causal effect on snoring (Figure 3b). The identified causal relationships that increased the risk of snoring included umbilical hernia (rG = 0.25, GCP = −0.42, p-valueGCP = 4.55 × 10−11), angina pectoris (rG = 0.26, GCP = −0.80, p-valueGCP = 3.90 × 10−09) and obesity (rG = 0.27, GCP = −0.73, p-valueGCP = 2.54 × 10−05; Table 3), all of which were ascertained as an International Classification of Diseases (ICD10) diagnosis (see discussion).
Sleep Duration
Two hundred and two traits were genetically correlated with sleep duration. Five of them were found to causally influence sleep duration. Similar to insomnia, traits related to connective tissue and musculoskeletal health showed evidence of a causal association, including disorders of synovium and tendon + bursophaties (rG = −0.33, GCP = −0.83, p-valueGCP = 2.39 × 10−16) and primary gonarthrosis (bilateral) (rG = −0.35, GCP = −0.79, p-valueGCP = 9.34 × 10−08), both decreasing sleep duration (Figure 4a and Table 3).
Getting Up
The ease of getting up in the morning was genetically correlated with 192 traits. We found evidence for ten traits causally influencing getting up and one being causally influenced by it. The age at the first episode of depression was the only trait associated with being easier for an individual to get up in the morning (rG = 0.44, GCP = −0.79, p-valueGCP = 1.51 × 10−07). In contrast, the only trait that was influenced by getting up was ever had prolonged feelings of sadness or depression (rG = −0.37, GCP = 0.68, p-valueGCP = 0.002). Out of the 11 causal relationships that were identified, six of them directly involve either depression, anxiety or panic attacks (Figure 4b and Table 3).
Napping and morningness
Out of the 35 traits genetically correlated with napping, only triglyceride levels held a significant causal effect that increased the risk for napping (rG = 0.16, GCP = −0.83, p-valueGCP = 1.30 × 10−14; Table 3; Supplementary Figure 1a). For morningness, genetic correlations with 84 traits were identified. However, none of them supported a potential causal relationship independent of pleiotropy (Supplementary Figure 1b and Supplementary File 1).
Discussion
This study provides new insights into the determinants and consequences of seven sleep-related traits. We examined potential causal associations between sleep-related phenotypes and 1527 traits and identified 84 significant causal relationships based on genetic evidence. Overall, our results indicate that changes in sleep variables are predominantly the consequence, rather than the cause, of other underlying phenotypes and diseases.
We identified causal genetic influences of several conditions on insomnia risk. Consistent with previous studies, 23,24 gastrointestinal disorders such as dyspepsia and other gastritis, including duodenitis, and respiratory diseases, increased the risk of insomnia. The effects of asthma on insomnia have been described before, showing that uncontrolled asthma is a risk factor for insomnia.25,26 Additionally, COPD and with ILD, also showed a causal effect on increased insomnia risk. Exposure to asbestos, dust and substances containing solvents such as paint, thinners and glues was also a causal factor for insomnia. Asbestos and solvents are hazardous chemicals that induce an inflammatory response in the respiratory system that may lead to pulmonary fibrosis,27 ILD,28 COPD,29 and lung cancer.30,31 Therefore, our results suggest that exposure to asbestos and solvents could lead to insomnia as a consequence of the development of severe respiratory diseases such as COPD, ILD and asthma.
Musculoskeletal conditions and connective tissue disorders also increased insomnia risk. Synovitis and tenosynovitis (ICD10), disorders of synovium and tendon, self-reported sciatica, fibroblastic disorders (ICD10), self-reported cervical spondylosis, primary gonarthrosis (bilateral) and knee pain for more than three months, among others, could be used as a proxy for poor musculoskeletal and connective tissue health. Previous studies have found an association between shorter sleep duration and insomnia with chronic pain.32 In the present study, sciatica, primary gonarthrosis (bilateral) and disorders of synovium and tendon were also causally associated with shorter sleep duration. We speculate that the discomfort and inflammation arising from problems in the musculoskeletal system and connective tissue may reduce sleep duration and, in cases where these disorders prevail for long periods, this relationship may be mediated by chronic pain. However, more research is needed to understand the intricate relationship between pain and sleep.
Depression and anxiety are common among people with insomnia, and previous studies suggest that insomnia may increase the risk of depression and anxiety.33,34 According to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5), insomnia is considered a secondary symptom of major depression.35 Although our results did not show a direct causal relationship between insomnia and anxiety or depression diagnosis, the frequency in the use of Diazepam, a common medication for anxiety disorders36 and a proxy for anxiety severity, had a causal association with insomnia. Also, depression diagnosed by a professional and age at last episode of depression among other depression-related traits, showed a causal relationship increasing the risk for daytime dozing. This is expected, as excessive daytime sleepiness is a common symptom of depression.37,38
The relationship between insomnia and cardiometabolic diseases has been described before providing an unclear set of conclusions. Most studies suggest that insomnia, particularly in the context of short sleep duration, poses a risk for cardiometabolic diseases, in particular, hypertension 39–42 and diabetes mellitus.40,43 In contrast, others suggest that insomnia symptoms are not positively associated with hypertension.44 In the present study, results uncovered insomnia-increasing causal relationships with cardiometabolic traits including self-reported type 2 diabetes, electrolyte and acid-base balance, as well as endocrine and metabolic diseases, indicating insomnia is most likely a consequence of these diseases. Furthermore, the causal relationships found with chest pain during physical activity, the use of treatment/medication: GTN 400 micrograms spray, which is commonly prescribed for hypertension,45 and small vessel stroke, which is known to be a consequence of hypertension,46 suggest that the development of cardiometabolic disease may be causal for insomnia.
Mouth ulcers and stopped smoking due to an illness or a doctor recommendation shown insomnia-increasing causal relationships. Nonetheless, these relationships are likely explained by a single causal effect on insomnia. Smoking cessation is accompanied by an abstinence phase, which is well-known as a risk factor for mouth ulcers 47,48 and insomnia.48,49 We speculate that the apparent causal relationship of mouth ulcers on insomnia is mediated through smoking cessation. Formally testing this hypothesis was outside the scope of the present study.
A negative association between the ease of getting up in the morning and depression has been reported previously.50 Our results consistently showed causal relationships for anxiety traits such as panic attacks diagnosed by a professional and use of citalopram, a common antidepressant,51,52 increasing the difficulty to get out of bed in the morning. Further, the age at the first episode of depression, which is a proxy for the severity and recurrence of depression,53 showed a causal relationship with getting up, where higher age increases the ease of getting up. Consistently, we identified a causal relationship where the risk of ever having prolonged feelings of sadness or depression was lower for individuals who can effortlessly get up in the morning. These results agree with the fact that sleep problems and reduced energy are part of the diagnostic criteria for clinical depression.
Relationships between snoring and cardiometabolic traits have been reported before.15,54 We previously reported a causal link between BMI and snoring15 and putative causal links of whole-body fat mass on snoring risk, and snoring increasing blood pressure and pulse rate.15 Other studies had also suggested that snoring is a risk factor for cardiometabolic traits such as hypertension55 and angina pectoris.56 In this study, we identified several factors that influence snoring risk, including obesity (ICD10), angina pectoris (ICD10), a known risk factor for coronary heart disease (CHD),57,58 self-reported high cholesterol and use of candesartan, a common drug for treating hypertension.59 Although this suggests that coronary heart disease exerts a causal relationship on snoring, we cannot currently rule out whether this is mediated through the genetic component for obesity that underlies CHD. The causal relationship found for triglycerides causing napping is consistent with previous studies in the Chinese population.60 However, no evidence was found between napping and CHD as described in other studies.60 We hypothesize that well-powered GWAS would show a relationship with CHD and obesity, all known to correlate with triglyceride levels.
Some limitations of the present study need to be acknowledged. First, our analyses only employed data from individuals of European ancestry. Given that previous studies have highlighted ethnic differences in sleep-related traits,61–64 the generalizability of the results may be limited. Also, the GCP estimates are tied to the statistical power of the GWAS, limiting the capacity to identify causal effects for some traits.64 In addition, despite the inclusion of more than 1500 traits, other causal associations not tested here may exist. Further, it is crucial to keep in mind the possible biases or designs of the GWAS involved. For instance, our results implicate several medication use GWAS, however, our results suggest these should be interpreted as a proxy for the underlying disease or symptom requiring the medication. Finally, our study highlights the challenge of dealing with non-pleiotropic horizontal associations, where a third trait may moderate the association between two other traits through a shared genetic component. An example is the association of cardiovascular disease-related phenotypes and snoring, which could be mediated through obesity. While we cannot rule out a direct causal association, the most likely explanation is that obesity causes both snoring and cardiovascular disease through a shared genetic component. This limitation is implicit in the bivariate nature of the LCV approach, and future developments on statistical genetics could leverage causal architecture statistical networks to disentangle confounding effects.
In summary, we provide evidence for the causal architecture of sleep-related traits and show that insomnia, daytime dozing, getting up, snoring, sleep duration and napping are mostly consequences of other phenotypes. Our analyses uncovered the role of musculoskeletal and connective tissue disorders in increasing the risk for insomnia and shorter sleep duration. Also, we show the influence of depression on insomnia getting up, and daytime dozing as well as the role of obesity and potentially cardiometabolic traits and diseases, causing an increased risk for insomnia, napping and snoring. We also observed an influence of diet and lifestyle-related variables such as working with asbestos, thinner or glues on respiratory diseases, which in turn increase insomnia risk. Altogether, our results generate testable hypotheses that, if confirmed, could inform the design of novel treatment and intervention strategies to support better sleep quality and overall health.
Data Availability
Results will be made publicly available via the Complex Trait Genomics Virtual Lab.
Funding
AIC is supported by a UQ Research Training Scholarship from The University of Queensland (UQ). MER thanks the support of the NHMRC and Australian Research Council (ARC) through a Research Fellowship (APP1102821).
Disclosure statement
The authors declare no conflicts of interest.
SUPPLEMENTARY FIGURES
SUPPLEMENTARY FILES
Supplementary File 1. LCV output for insomnia, dozing, getting up, snoring, sleep duration, napping and morningness.
Acknowledgments
We thank our colleague Jackson Thorp for his valuable feedback and for proof-reading the manuscript.
Footnotes
↵¶ These authors jointly supervised this study