Genome-wide analysis identifies significant contribution of brain-expressed genes in chronic, but not acute, back pain
======================================================================================================================

* Andrey V. Bortsov
* Marc Parisien
* Samar Khoury
* Dmitri V. Zaykin
* Amy E Martinsen
* Marie Udnesseter Lie
* Ingrid Heuch
* HUNT All-In Pain
* Kristian Hveem
* John-Anker Zwart
* Bendik S Winsvold
* Luda Diatchenko

## ABSTRACT

Back pain is the leading cause of disability worldwide. Although most cases of back pain are acute, 20% of people with acute back pain go on to experience chronic back pain symptoms. It is unclear if acute and chronic pain states have similar or distinct underlying genetic mechanisms. Here we performed a genome-wide analysis for acute and chronic back pain in 375,158 individuals and found a significant genetic contribution to chronic, but not to acute, back pain. Using the UK Biobank cohort for discovery and the HUNT cohort for replication, we identified 7 loci for chronic back pain, of which 3 are novel. Pathway analyses, tissue-specific heritability enrichment analyses, epigenetic characterization, and tissue-specific transcriptome mapping in mouse pain models suggest a substantial genetic contribution to chronic, but not acute, back pain from the loci predominantly expressed in the central nervous system. Our findings show that chronic back pain is more heritable than acute back pain and is driven mostly by genes expressed in the central nervous system.

## INTRODUCTION

Back pain is the world’s leading cause of disability, reducing quality of life, and imposing significant health care costs.1–4 According to the International Association for the Study of Pain (IASP) criteria, a back pain episode lasting less than 3 months is defined as acute back pain while pain lasting for 3 months or longer is defined as chronic.5 Although the majority of individuals with acute back pain resolve their pain,6,7 a substantial proportion of these cases transition to chronic back pain. As a result, chronic back pain is the most common painful condition which affects 10–15% of the adult population. 2,3,8 Mechanisms involved in the development of chronic back pain remain largely unknown, and seem to be different from causes leading to the onset of acute back pain. While some believe that the majority (up to 85%) of cases of chronic back pain cannot be attributed to a known specific pathology,9 others attribute chronic back pain to processes such as central sensitization,10 disc herniation causing inflammation of the affected nerve,11 facet joint syndrome,12 sacroiliac joint pain,13–15 lumbar spinal stenosis16, and lumbar disc degeneration.17

Heritability estimates for back pain from twin studies range from 30 to 68%.18–21 Twin studies also show shared heritability between chronic back pain and lumbar disc degeneration,18 depression,22 anxiety,22 education,23 obesity,24 chronic widespread musculoskeletal pain.25 Recent genome-wide association studies (GWAS) using the UK Biobank cohort have identified a number of genes associated with chronic back pain, including *SPOCK2, SOX5*, and *DCC*.26,27 These studies confirm genetic involvement both in the etiology of chronic back pain and its related comorbidities.

Despite the mounting evidence that acute and chronic back pain are distinct states with different pathologies, there is a significant gap in knowledge about the genetic risk factors and molecular pathophysiology associated with acute versus chronic back pain. Previous genetic studies have looked either at back pain in general,26 or only at the chronic pain condition.27 One important question is whether acute and chronic pain states have similar underlying genetic pathways. Here we used GWAS for both acute and chronic back pain states and characterized the corresponding molecular and cellular pathways contributing to these genetics assotiations. We hypothesized that acute and chronic back pain were different with respect to individual genetic risk loci, overall genetic heritability, heritability distribution across tissues and cells, and contributing molecular and cellular pathways.

## RESULTS

In our genome-wide exploration of genetic factors contributing to acute versus chronic back pain we used data on individuals of white British ancestry from the UK Biobank cohort (n=375,158) (Fig. 1). Acute and chronic back pain phenotypes were inferred from an affirmative response to the question of whether the respondent had experienced back pain in the last month that interfered with their usual activities. Those who reported back pain in the last month were asked if their back pain lasted for more than 3 months. Acute back pain was defined as back pain (that interfered with activities) in the last month that lasted no more than three months. Chronic back pain was defined as back pain that lasted for more than three months. Individuals with no acute or chronic back pain (but possibly with other types of pain) were used as controls. The defined phenotypes were used as outcome variables to perform the GWAS.

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/07/2020.09.04.20187575/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/F1)

Figure 1. 
Genome-wide association analysis of chronic back pain (A, B) and acute back pain (C, D) in 375,158 participants from the UK Biobank cohort. **(A, C)** Miami plots show –log10 P-values for imputed (top, orange) and genotyped (bottom, green) SNPs (N, number of analyzed subjects). Horizontal red lines indicate genome-wide statistical significance at 5×10−8. Genetic heritability estimate (h2) shown with corresponding P-value (*P*h). **(B, D)** QQ plots show observed –log10 P-values (*P*observed) as a function of expected –log10 P-values (*P*expected), for both genotyped (green) and imputed (orange) SNPs. Genomic control parameter (*λ*GC), LD Score regression intercept (I) and their ratios (*θ*) are shown.

A genome-wide association analysis of chronic back pain (n=375,158; 70,633 cases; 304,525 controls) revealed 355 SNPs that reached the genome-wide significance threshold (5×10−8), grouped into 13 loci (Fig. 1A, Supplementary Table 1). The majority of the significant loci (8 of 13) have already been reported as significantly associated with back pain in the UK Biobank cohort by other groups (chr4: TMED11P • SPON2, chr8: C8orf34-AS1 • C8orf34 • LINC01592, chr8: CCDC26 • GSDMC, chr9: NRARP • EXD3 • NOXA1, chr10: PSAP • CHST3 • SPOCK2 • ASCC1, chr10: BNIP3 • JAKMIP3 • DPYSL4, chr12: LOC101928441 • SOX5 • SOX5-AS1, chr18: LINC01630 • DCC • LINC01919), and five loci were new (chr5: LOC105374704 • CDH6, chr5: NUDT12 • RAB9BP1, chr10: LINC00844 • CCEPR, chr14: SLC25A21-AS1 • MIPOL1 • FOXA1, chr19: BCAM • NECTIN2 • TOMM40 • APOE • APOC1 • APOC1P1).26,27 The estimated narrow-sense SNP heritability (h2) for chronic back pain was 4.6% (*P*=1.4×10−78, Fig. 1A). While there was some inflation of test statistics (genomic control parameter λGC = 1.31), the LD Score regression intercept was close to 1 (Fig. 1B), indicating that the inflation was due to polygenicity rather than confounding factors such as population stratification and/or cryptic relatedness, the use of a mixed model, or population stratification.28 The latter confounding factors were mitigated using BOLT-LMM via genetic principal components and kinship covariance matrices.29,30

The GWAS of acute back pain (n=336,734; 32,209 cases and 304,525 controls) revealed no genetic variants reached the genome-wide statistical significance threshold (Fig. 1C). The estimated narrow-sense SNP heritability (h2=0.81%) for acute back pain was also lower than for chronic back pain; however, the heritability was significantly different from zero (*P*=1.4×10−8, Fig. 1C). The LD Score regression intercept was 1, suggesting that the moderate inflation (λGC1.05) was due to polygenicity rather than confounding factors (Fig. 1D). Of note, amongst the 13 statistically significant loci in chronic back pain, only one locus on chromosome 8 was associated with both acute and chronic back pain (Supplementary Table 2).

In the HUNT replication cohort, we found similar differences in heritability estimates between acute and chronic back pain. Heritability of chronic back pain was 3.4% (*P*=0.0011), whereas heritability of acute back pain was 0.6%(*P*= 0.851). From this we concluded that the heritability of acute back pain is approximately 5–6 times smaller than the heritability of chronic back pain. A total of 7 out of 13 statistically significant loci were replicated in the HUNT cohort for chronic pain cases (Table 1) at the P < 0.05 level using sets of SNPs at high LD (r > 0.8).

View this table:
[Table 1.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/T1)

Table 1. 
SNP-level discovery and replication results of chronic back pain GWAS. Replication of the top hit within each locus was performed using SNP sets in high LD (r2 > 0.8). SNPs with lowest replication P-values for each locus in the discovery and replication cohort are shown.

At the pathway level, a total of 21 Gene Ontology (GO) pathways were significantly enriched (FDR 20%) in chronic back pain and 10 pathways were enriched (FDR 20%) in acute back pain in the UK Biobank discovery cohort (Table 2). Importantly, different pathways were identified for acute and chronic pain states. Neurogenesis and synaptic plasticity were significant in chronic back pain while odontogenesis, cardiac muscle depolarization, and immune response through Th2-helpers were significant in acute back pain. Of note, the significance of the odontogenesis pathway is driven by genes linked to connective tissue disorders (*RSPO2*) and bone remodeling (*TNFRSF11B*).31,3233 Three pathways for chronic pain were replicated (*P* < 0.05) in the HUNT cohort: Spinal cord ventral commissure morphogenesis (GO:0021965), Positive regulation of kinase activity (GO:0033674,) and Positive regulation of transferase activity (GO:0051347). None of the acute pain pathways were replicated in the HUNT cohort.

View this table:
[Table 2.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/T2)

Table 2. 
Pathway analysis in the discovery (UK Biobank) and replication (HUNT) cohorts. Pathways that pass the significance threshold FDR 20% in the discovery cohort are shown.

We next investigated how the genetic heritabilities of chronic and acute back pain were distributed across different tissues. We measured the enrichment for the genes corresponding to identified genomic loci expressed in various human tissues. A total of 152 cell types and tissues from Fehtmann *et al*,34 grouped into eight categories (adipose, blood and immune, cardiovascular, central nervous system (CNS), digestive, endocrine, musculoskeletal and connective, and other tissues) were included in the analyses (Fig. 2). For chronic back pain, estimates for heritability that exhibited the smallest *P*-values were attributed to genomic regions expressed only in brain regions or brain as a whole (Fig. 2A). Specific parts of the brain that reached the significance threshold (FDR 10%) were the limbic system, parietal lobe, brain stem, cerebral cortex, entorhinal cortex, cerebellum, hippocampus, and metencephalon (Fig. 2B). In contrast, for acute back pain, none of the cell types or tissues reached the statistical significance threshold (Fig. 2C), likely due to low overall heritability of acute back pain. Furthermore, the pattern of tissue-specific partitioned heritabilities for acute back pain appeared to be different from those for chronic back pain, with notable absence of significant signals from brain regions (Fig. 2A and 2C).

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/07/2020.09.04.20187575/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/F2)

Figure 2. 
Tissue-specific heritability enrichment for chronic back pain, acute back pain, and other related phenotypes. Genomic tissue-specific annotations for 152 tissues were grouped into eight categories: adipose, ADI (n=3, purple); blood + immune, B+I (n=37, red), cardiovascular, CVS (n=9, brown); central nervous system, CNS (n=19, green); digestive, DIG (n=14, pink); endocrine, END (n=9, blue); musculoskeletal + connective, M+C (n=15, orange); and other, OTH (n=46, grey). Vertical bars in each plot denote –log10(FDR) values for enrichment of each tissue. FDR 10% threshold is denoted by a horizontal grey line. **(A, B)** Tissue-specific partitioned heritability for chronic back pain. **(C)** Tissue-specific partitioned heritability for acute back pain, genetic correlation (Rg) with chronic back pain, and genetic correlation P-value. **(D)** Tissue-specific partitioned heritability for depression, body mass index, insomnia, neuroticism and standing height, genetic correlation (Rg) with chronic back pain and genetic correlation P-value. **(E)** Brain regions partitioned heritability enrichment correlation between chronic back pain (horizontal axis, not shown) and acute back pain, depression, body mass index, insomnia, neuroticism, and standing height (vertical axis, not shown). Squared correlation coefficient (r2) and associated P-value (*P*) are shown in orange.

Next, we used an LD Score regression to estimate genetic correlations between acute and chronic back pain and several selected phenotypes known to be correlated with chronic back pain at the phenotypic level, including body mass index (BMI), insomnia, neuroticism, and depression (Fig. 2D). All of these phenotypes were moderately genetically correlated with chronic back pain, and some of them (BMI, insomnia, and neuroticism) appeared to have significant partitioned heritability in brain tissues, similar to that of chronic back pain (Fig. 2D). In order to look at the similarities/dissimilarities of heritability patterns in brain tissues between chronic back pain and the other phenotypes, we performed a correlation analysis of the heritability estimate coefficients in brain regions (Fig. 2E). A positive correlation would suggest that partitioned heritability is similarly distributed in brain regions between chronic back pain and another phenotype. Interestingly, chronic back pain and acute back pain had strong overall genetic correlation (Rg=0.97, *P*< 10−194), suggesting that genetic heritability for acute pain, although small, largely overlaps with genetic heritability for chronic back pain. However, when considering only brain tissues, no correlation was found between acute and chronic back pain(*P*=0.53, Fig. 2E). Positive correlations of partitioned heritability coefficients in chronic back pain were found for BMI, depression, insomnia, and neuroticism, suggesting that heritability estimates are distributed similarly across brain tissues for these conditions. In contrast, partitioned heritability estimates for brain tissues were not statistically significant in standing height, and did not correlate with heritability estimates for chronic back pain (Fig. 2D–E).

Our findings suggested that chronic back pain is determined by genes predominantly expressed in the brain. To validate our findings, we performed a gene enrichment analysis using data on tissue-specific differential gene expression in mouse assays of inflammatory and neuropathic pain.35 Four tissues, namely the whole brain, spinal cord, dorsal root ganglia, and whole blood, were available for investigation in two mouse pain assays: Complete Freund Adjuvant (CFA) for inflammatory pain, and Spared Nerve Injury (SNI) for neuropathic pain. In line with our partitioned heritability findings, we observed enrichment of genes upregulated in the nervous system tissues in both mouse pain models (Fig. 3), suggesting that both neuropathic and inflammatory components contribute to human back pain. Inflammatory pain-related genes that were differentially expressed in the brain, spinal cord, and dorsal root ganglia in mice were significantly enriched in chronic, but not acute, back pain GWAS (Fig. 3A). Similarly, neuropathic pain-related genes that were differentially expressed in the mouse brain were significantly enriched in chronic, but not acute, back pain GWAS (FDR 10%, Fig. 3B). Altogether, heritability of chronic back pain is not only largely mediated by SNPs in genes expressed in the brain, but also by those genes that are differentially-expressed in a pain-conditioned brain.

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/07/2020.09.04.20187575/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/F3)

Figure 3. 
Tissue-specific heritability enrichment for back pain GWAS results using transcriptomic annotations from inflammatory **(A)** and neuropathic **(B)** mouse pain assays. Annotations for genes upregulated in whole brain (BRN), spinal cord (SC), dorsal root ganglia (DRG), and whole blood (BLD) were used to determine heritability enrichment in chronic and acute back pain GWAS. Shown are –log10 FDR-corrected P-values for enrichment of heritability. Vertical black lines indicate FDR 10%; vertical grey lines indicate FDR 20%.

We then hypothesized that epigenetic markers of the 13 genomic loci that reached genome-wide significance (Table 1) in our chronic back pain analyses would be enriched in brain tissues (Fig. 4). To test this hypothesis, we first mapped the significant SNPs onto corresponding genes and defined 13 gene clusters (Fig. 4A). Then we looked at the intersection of SNPs in linkage disequilibrium (LD) with the lead SNP in each cluster with NIH Epigenetics Roadmap activation markers (H3K4me3 for promoter regions,36,37 H3K4me1 for enhancers,36 and H3K36me3 for transcribed regions38) for statistically significant enrichment in 396 different tissues39 (Fig. 4B). Our results again aligned with the partitioned heritability findings: 7 out of 13 loci co-localized with epigenetic markers in multiple brain tissues, as evidenced by statistically significant P-values (Bonferroni-corrected *P*<0.05, Fig. 4B). These results suggest that these regions are transcriptionally active in the CNS. Using a different chromatin epigenome mapping (NIH Epigenetic Roadmap chromatin 15-state model),40 we found that 9 out of 13 loci had significant enrichment in multiple brain tissues (Fig. 4C). Thus, our epigenetic characterization of the chronic back pain-related variants suggests that expression of the genes in the CNS plays a significant role in the chronic back pain phenotype, followed by genes expressed in the musculoskeletal system (osteoblasts, chondrocytes, and muscles).

![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/09/07/2020.09.04.20187575/F4.medium.gif)

[Figure 4.](http://medrxiv.org/content/early/2020/09/07/2020.09.04.20187575/F4)

Figure 4. 
Epigenetic characterization of the GWAS significant 13 chronic back pain loci. **(A)** Gene clusters. Lead SNP effect of minor allele in the discovery cohort: protective (green) or risk (red). **(B)** Intersection of SNPs in LD with lead SNP (r2≥ 0.5) with NIH Epigenetics Roadmap activation markers in selected tissues. Markers are: H3K4me3, H3K4me1, H3K36me3. Darker hues for increased signal. **(C)** Intersection of SNPs in LD with lead SNP (r2 ≥ 0.5) with NIH Epigenetics Roadmap 15-states chromatin model in selected tissues. States are: transcription start site (state 1; dark brown) and enhancer (states 6 or 7; light brown) indicating active transcription, and bivalent or poised transcription start site (state 10; orange) indicating absence of active ≤ transcription. Statistical significance for ChIP-Seq signal established at *P* 0.05/(3 markers × 13 loci × 19 tissues). Cells are colored white otherwise.

## DISCUSSION

Previous studies suggest substantial differences between acute and chronic back pain in terms of predisposing factors.41,42 Our results from two large cohorts indicate the genetic contribution to chronic back pain is greater than to acute back pain, and much of the heritability of chronic back pain can be traced to genes predominantly expressed in the CNS. At the pathway level, we found the enrichment for genes in the spinal cord ventral commissure morphogenesis pathway in both cohorts. We also mapped the genetic component of chronic but not acute pain states to genes differentially expressed in the brain in mouse pain models. Interestingly, we found this enrichment for brain specific differentially-expressed genes in both inflammatory and neuropathic mouse pain models, and also in the spinal cord and in dorsal root ganglia of the inflammatory pain model.

Our observation that acute back pain is significantly less heritable than chronic back pain(0.8% for acute versus 4.6% for chronic, narrow-sense SNP heritability) may be due to a greater role of environmental (e.g. tissue injury) factors in acute pain. In turn, transition to chronic back pain, or development of chronic back pain *per se*, appears to require predisposing genetic and/or epigenetic risk factors. It was observed that epigenetic regulation was still occurring in the forebrains of mice that underwent peripheral nerve injuries long after the injury.43

While we found a high genetic correlation between chronic and acute back pain, this observation does not contradict our finding that acute back pain is considerably less heritable than chronic back pain. Indeed, genetic correlation is the genetic covariance, i.e., the covariance between sets of SNP contributions to heritability for two traits divided by the square root of h12 and h22. This genetic correlation can be large if total heritability of one of the traits (h1) due to a set of SNPs is relatively small compared to total heritability of the second trait (h2) and when h1 constitutes a subset of h2 in terms of individual SNP contributions. It is inevitable that some individuals with acute back pain in our study will not resolve their pain over time. It is conceivable that the positive genetic correlation may predispose these individuals to chronic back pain. Genetic correlations between chronic back pain and other phenotypes (BMI, insomnia, neuroticism, and depression) were of moderate strength, and are in line with previous studies’ results showing genetic similarities between these phenotypes.26,44,45

The main finding in our study is that heritability of chronic back pain, but not acute back pain, is largely attributed to genes expressed in brain tissues. Involvement of the CNS in chronic back pain is an area of active research,46 supported by a significant comorbidity between chronic back pain and psychological disorders.47 Indeed, we found genetic overlap of chronic back pain with sleep disorders, neuroticism, and BMI. While this overlap has been previously described in several genetic studies 22,26,48), here we show that the overlap is localized in genes predominantly expressed in the CNS. Specifically, we found that chronic back pain heritability is mainly localized in the following brain regions: limbic system, parietal lobe, brain stem, cerebral cortex, entorhinal cortex, cerebellum, hippocampus, and metencephalon. Moreover, the distribution of heritability across brain regions was similar for chronic back pain, insomnia, neuroticism, and BMI, suggesting shared genetic and pathophysiological mechanisms between these phenotypes. Our finding of predominant involvement of brain-expressed genes in chronic back pain, but not acute back pain, corroborates previous studies’ findings suggesting that spontaneous pain in chronic pain patients involves specific spatiotemporal neuronal mechanisms implicating a salient role for the emotional brain, distinct from mechanisms observed for acute experimental pain.49 Interestingly, in our study we found only limited evidence of the involvement of musculoskeletal tissues in chronic back pain.

Using the UK Biobank cohort for discovery allowed use of a larger sample size than in previous GWAS studies 26,27 and enabled us to identify 13 loci that exceeded a genome-wide significance threshold for chronic back pain. We have replicated seven loci in the HUNT cohort. Of those, 3 loci are novel (Chr5: LOC105374704 • CDH6, Chr5: NUDT12 • RAB9BP1, Chr14: SLC25A21-AS1 • MIPOL1 • FOXA1). Another locus (Chr18: LINC01630 • DCC • LINC01919) has never previously been replicated in an independent cohort.

Our results suggest that chronic pain is much more heritable than acute pain. Furthermore, despite of the strong genetic correlation between acute and chronic pain, the most significant pathways contributing to these pain states seem to be different. Pathway analyses of chronic back pain GWAS revealed enrichment for genes involved in the spinal cord ventral commissure morphogenesis pathway in both the UK Biobank and HUNT cohorts. The ventral white commissure is comprised of A delta and C fibers, which are known to be involved in pain signal transduction. This suggests that molecular mechanisms of nerve fiber growth might be involved in the pathophysiology of chronic back pain.50 Interestingly, the most significant pathway in acute back pain, GO:00042483, contains two genes linked to connective tissue disorders (RSPO2) and bone remodeling (TNFRSF11B). 31,32,33 It is possible, therefore, that different gene subsets drive the bulk of the onset and persistence of back pain, with the majority of the pathways contributing to acute back pain not necessarily contributing to pain becoming chronic, in addition to a strong environmental component in acute back pain.

A limitation of our study is in the extent to which the phenotypes of interest, acute and chronic back pain, are ascertained in the UK Biobank cohort. Pain intensity, frequency of episodes, and pain medication use were not available for analyses. Additionally, phenotype definitions varied slightly between the UK Biobank and the HUNT cohorts (back pain in the UK Biobank and low back pain in the HUNT). However, we concluded that these differences did not affect the validity of our findings given the high replication success rate of our findings between the two cohorts.

In conclusion, our analyses show that chronic back pain is substantially more heritable than acute back pain, and this heritability is mostly attributed to genes expressed in the brain. Furthermore, our results suggest that different genetic pathways may be responsible for acute and chronic pain manifestation. Molecular pathophysiology of acute back pain is largely contributed to by connective and bone tissue remodeling and inflammatory pathways while chronic back pain is contributed to by neuronal development processes. These results provide substantial insight into physiological systems and pathways responsible for acute and chronic back pain, and should be expanded upon to further to develop targeted treatments.

## METHODS

### The UK Biobank cohort

The UK Biobank cohort was used as a discovery cohort. This cohort is a high-powered prospective study of 500,000 people recruited in the United Kingdom.51,52 It is mainly comprised of individuals of Caucasian ancestry whose ages range between 37 and 73 years, with a female to male participant ratio of about 1.2:1. Chronic cases were defined as those answering “yes” to the question “Have you had back pains for more than 3 months?” (field 3571). This question was asked to participants if they answered “yes” for the “Back Pain” option at this question: “In the last month have you experienced any of the following that interfered with your usual activities?” (field 6159). Participants were discarded based on the following criteria: answering “no”, “do not know” or “prefer not to answer” (field 3571), not “White British”, failed genotyping quality control or sex mismatch, and voluntary withdrawal from the study. Acute back pain cases were identified as those answering “no” at field 3571. All other participants (i.e. those who did not met the criteria for acute or chronic back pain) were qualified as controls. Therefore, control individuals might have experienced acute or chronic pain at other bodily sites than in the back. Genotyping, quality control, and genomic imputation in the UK Biobank cohort were previously described.53

### The HUNT cohort

The Nord-Trøndelag Health Study (HUNT) cohort was used for replication of results. The HUNT study has been conducted in Nord-Trøndelag County, Norway, in three consecutive waves54: HUNT in 1984–1986, HUNT2 in 1995–1997 and HUNT3 in 2006–2008. This work combines baseline data from the second survey, HUNT2,55 and the third survey, HUNT3.54 All residents of Norway aged 20 years and older were invited to take part in the HUNT2 survey. They were requested to complete a questionnaire on health status and invited to a clinical consultation that included measuring height and weight. In the HUNT3 survey 11 years later, similar information was collected using a questionnaire and a clinical examination. Each participant in the HUNT2 and HUNT3 surveys signed a written informed consent regarding the collection and use of data for research purposes.

One question in the HUNT2 and HUNT3 questionnaires was expressed: ‘During the last year, have you suffered from pain and/or stiffness in your muscles and joints that has lasted for at least 3 consecutive months?’ Each participant answering yes was given the following question: ‘Where did you have these complaints?’ Several body regions were listed. Individuals answering yes to the first question and including the lower back as a relevant region were regarded as having chronic lower back pain.56 Acute low back pain cases (n=4,379) were defined as those who had lumbar pain in the last month (at HUNT2), but not had musculoskeletal pain for more than 3 of the past 12 months (at HUNT2). Controls (n=11,309) were defined as those participants without musculoskeletal pain in the last month or musculoskeletal pain for more than 3 of the past 12 months (at HUNT2) and if they did not have musculoskeletal pain for more than 3 of the past 12 months (at HUNT3). Chronic low back pain cases (n=19,760) were defined as those who had lumbar pain for more than 3 of the past 12 months. Controls (n=28,674) had no musculoskeletal pain for more than 3 of the past 12 months at either HUNT2 or HUNT3.

Genotyping of study subjects was performed in three batches using the Illumina HumanCoreExome (Illumina Inc, CA, USA).

## Statistical analyses

Genome-wide association tests in the UK Biobank cohort were performed using a linear mixed model as implemented in BOLT-LMM software, version 2.3.29,30 Covariates were sex, age, age2, genotyping platform, first 40 genetic principal components, and recruitment centers. Kinship was considered by BOLT-LMM using genotyped position data. Retained SNPs had minor allele frequencies of at least one-in-a-thousand, departed from Hardy-Weinberg equilibrium with *P-*values greater or equal to 10−12, and were part of the Haplotype Reference Consortium (HRC) panel.57 Association analyses in the HUNT cohort were performed using a mixed logistic regression(SAIGE), adjusted for sex, genotyping batch, and 5 principal components.

Epigenetic data for functional characterization of GWAS significant SNPs was taken from the NIH Roadmap Epigenomics Consortium.39,40 At each significant locus, the list of SNPs in high LD (r2 ≥ 0.5) with the locus’ lead SNP was retrieved from LDlink58 using the Great Britain population as the reference. We used the ‘intersect’ option of the bedtools59 to retrieve high-LD SNPs that overlapped with epigenetic features. Two analyses of epigenetic markers were performed: the first one in relation to the overlap of high-LD SNPs with known epigenetic activation marker peaks, and the second one in relation to the overlap with NIH Roadmap’s 15-states chromatin model built from peaks of activation/repression markers, focusing on transcription start sites (state 1), transcription enhancers (states 6 or 7) indicating active transcription, and bivalent or poised transcription start site (state 10) indicating absence of active transcription. For analyses of activation peaks, retained broad peaks had experimental ChIP-Seq evidence P-values ≤ (0.05 / (13×19×3)), correcting for 13 genome-wide significant loci in 19 pain-relevant tissues for 3 epigenetic activation markers (H3K4me3, H3K4me1, and H3K36me3). For chromatin states, we only considered the presence of an overlap with one of these states (since no P-values were available), prioritized as follows: state 1; states 6 and 7; state 10. All data files were downloaded from the following URLs: [https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/gappedPeak/](https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/gappedPeak/), [https://egg2.wustl.edu/roadmap/data/byFileType/chromhmmSegmentations/ChmmModels/coreMarks/jointModel/final/](https://egg2.wustl.edu/roadmap/data/byFileType/chromhmmSegmentations/ChmmModels/coreMarks/jointModel/final/).

Partitioned heritability analyses were performed to determine heritability enrichment in annotated SNP sets using an LD Score Regression (LDSC).60 Functional genomic annotations (tissue- and cell-specific gene expression) were retrieved from an online source database comprised of 152 different human tissues.34

Additional custom annotations were built from genes overexpressed in two mouse pain assays.35 The mouse pain assays were Complete Freund’s Adjuvant, a model for inflammatory pain, and Spared Nerve Injury, a model for neuropathic pain. Quantitation of gene expression from the RNA-Seq data (GEO accession GSE111216) was obtained using STAR61 on Genome Reference Consortium’s GRCm38 genome, followed by featureCount62 to extract reads per genes. Over-expressed genes in the two pain assays compared to naïve were identified via DESeq2.63 A total of three replicate samples per condition (naïve, CFA, SNI) were available. Mouse gene names were translated to human gene names using the BioMart tool.64 The top 10% (n=1,663) genes with the greatest positive test statistic (more expression in the pain assay) were retained to build LDSC’s genetic annotation track, as recommended.60 The reference or null model annotation track was built from all annotated genes (n=16,629).

For pathway analyses, the summary GWAS data were first analyzed using MAGMA,65,66 which aggregated GWAS SNP P-values into gene-level ones, while considering linkage-disequilibrium between the SNPs. MAGMA was also used to deduce pathway-level P-values from gene-level ones. Pathways were sourced from Gene Ontology’s biological processes,67,68 obtained from the Bader lab at URL: [http://download.baderlab.org/EM\_Genesets/December\_01\_2019/](http://download.baderlab.org/EM_Genesets/December_01_2019/). Pathways in the discovery cohort (the UK Biobank) at the FDR 20% level were ascertained for replication in the replication cohort (HUNT) with P-value of replication smaller or equal to 0.05.

## Data Availability

The datasets used in this work can be downloaded from the following sources.

[https://www.ukbiobank.ac.uk/](https://www.ukbiobank.ac.uk/) 

[https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/gappedPeak/](https://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/gappedPeak/) 

[http://download.baderlab.org/EM\_Genesets/December\_01\_2019/](http://download.baderlab.org/EM_Genesets/December_01_2019/) 

## Author contributions

LD and AB conceived the paper; AB, MP, and LD designed the analytical plan and experiments. MP, SK, and AEM performed the bioinformatics analyses. AB and MP interpreted the results and wrote the manuscript. AEM, MUL, IH, KH, JAZ, BSW provided the data from the HUNT study. SK, DZ, AEM, MUL, IH, KH, JAZ, BSW, and LD critically reviewed the manuscript. All the authors read and approved the final manuscript.

## Competing interests

The authors declare no competing financial interests.

## Funding

Funding for this work was kindly provided by the Canadian Excellence Research Chairs (CERC) Program ([https://www.cerc.gc.ca](https://www.cerc.gc.ca)) grant CERC09 (to LD). DVZ was supported in part by the Intramural Research Program of the NIH.

## Acknowledgments

We would like to thank all participants enrolled in the studies herein. The analysis in the UK Biobank was performed under application 20802. The Nord-Trøndelag Health Study (The HUNT Study) is a collaboration between HUNT Research Centre (Faculty of Medicine and Health Sciences, NTNU, Norwegian University of Science and Technology), Trøndelag County Council, Central Norway Regional Health Authority, and the Norwegian Institute of Public Health. The genotyping was financed by the National Institute of Health (NIH), University of Michigan, The Norwegian Research Council, and Central Norway Regional Health Authority and the Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU). The genotype quality control and imputation has been conducted by the K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health andN, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU).

*   Received September 4, 2020.
*   Revision received September 4, 2020.
*   Accepted September 7, 2020.


*   © 2020, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## REFERENCES

1.  Dagenais, S., Caro, J. & Haldeman, S. A systematic review of low back pain cost of illness studies in the United States and internationally. Spine J 8, 8–20, doi:10.1016/j.spinee.2007.10.005 (2008).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.spinee.2007.10.005&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18164449&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000252601300002&link_type=ISI) 

2.  Lim, K.-L., Jacobs, P. & Klarenbach, S. A population-based analysis of healthcare utilization of persons with back disorders: results from the Canadian Community Health Survey 2000–2001. Spine 31, 212–218 (2006).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.brs.0000194773.10461.9f&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16418643&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000234715300016&link_type=ISI) 

3.  Von Korff, M. et al. Chronic spinal pain and physical–mental comorbidity in the United States: results from the national comorbidity survey replication. Pain 113, 331–339 (2005).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.pain.2004.11.010&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15661441&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000226930300013&link_type=ISI) 

4.  Coyte, P.C., Asche, C.V., Croxford, R. & Chan, B. The economic cost of musculoskeletal disorders in Canada. Arthritis & Rheumatism 11, 315–325 (1998).
    
    

5.  Deyo, R.A. et al. Report of the NIH Task Force on research standards for chronic low back pain. J Pain 15, 569–585, doi:10.1016/j.jpain.2014.03.005 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jpain.2014.03.005&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24787228&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000337331400001&link_type=ISI) 

6.  Cassidy, J. D., Côté, P., Carroll, L.J. & Kristman, V. Incidence and course of low back pain episodes in the general population. Spine 30, 2817–2823 (2005).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.brs.0000190448.69091.53&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16371911&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000234181300020&link_type=ISI) 

7.  Hancock, M.J. et al. Assessment of diclofenac or spinal manipulative therapy, or both, in addition to recommended first-line treatment for acute low back pain: a randomised controlled trial. The Lancet 370, 1638–1643 (2007).
    
    

8.  Coyte, P.C., Asche, C.V., Croxford, R. & Chan, B. The economic cost of musculoskeletal disorders in Canada. Arthritis & Rheumatism: Official Journal of the American College of Rheumatology 11, 315–325 (1998).
    
    

9.  Krismer, M. & Van Tulder, M. Strategies for prevention and man agementof muscul oskeletal conditions. Best Pract Res Clin Rheumatol 21, 77–91 (2007).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.berh.2006.08.004&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17350545&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

10. Correa, J.B., Costa, L.O., de Oliveira, N.T., Sluka, K.A. & Liebano, R.E. Central sensitization and changes in conditioned pain modulation in people with chronic nonspecific low back pain: a case-control study. Exp Brain Res 233, 2391–2399, doi:10.1007/s00221-015-4309-6 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00221-015-4309-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25963754&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

11. Kreiner, D.S. et al. An evidence-based clinical guideline for the diagnosis and treatment of lumbar disc herniation with radiculopathy. Spine J 14, 180–191, doi:10.1016/j.spinee.2013.08.003 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.spinee.2013.08.003&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24239490&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

12. van Kleef, M. et al. 12. Pain originating from the lumbar facet joints. Pain Pract 10, 459–469, doi:10.1111/j.1533-2500.2010.00393.x (2010).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1533-2500.2010.00393.x&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20667027&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000296463700008&link_type=ISI) 

13. Patel, V. Sacroiliac joint injuries. Orthopedics 35, 398, doi:10.3928/01477447-20120426-05 (2012).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3928/01477447-20120426-05&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22588394&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

14. Manchikanti, L. et al. An update of comprehensive evidence-based guidelines for interventional techniques in chronic spinal pain. Part II: guidance and recommendations. Pain Physician 16, S49–283 (2013).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23615883&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

15. Dreyfuss, P., Dreyer, S.J., Cole, A. & Mayo, K. Sacroiliac joint pain. J Am Acad Orthop Surg 12, 255–265, doi:10.5435/00124635-200407000-00006 (2004).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5435/00124635-200407000-00006&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15473677&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000223467800006&link_type=ISI) 

16. Ciricillo, S.F. & Weinstein, P.R. Lumbar spinal stenosis. West J Med 158, 171–177 (1993).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8434469&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1993KM08800008&link_type=ISI) 

17. Zheng, C.J. & Chen, J. Disc degeneration implies low back pain. Theor Biol Med Model 12, 24, doi:10.1186/s12976-015-0020-3 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12976-015-0020-3&link_type=DOI) 

18. Battie, M.C., Videman, T., Levalahti, E., Gill, K. & Kaprio, J. Heritability of low back pain and the role of disc degeneration. Pain 131, 272–280, doi:10.1016/j.pain.2007.01.010 (2007).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.pain.2007.01.010&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17335977&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000250178400009&link_type=ISI) 

19. Junqueira, D.R. et al. Heritability and lifestyle factors in chronic low back pain: results of the Australian twin low back pain study (The AUTBACK study). Eur J Pain 18, 1410–1418, doi:10.1002/ejp.506 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ejp.506&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24733726&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

20. MacGregor, A.J., Andrew, T., Sambrook, P.N. & Spector, T.D. Structural, psychological, and genetic influences on low back and neck pain: a study of adult female twins. Arthritis Rheum 51, 160–167, doi:10.1002/art.20236 (2004).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/art.20236&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15077255&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000220764600003&link_type=ISI) 

21. Nyman, T., Mulder, M., Iliadou, A., Svartengren, M. & Wiktorin, C. High heritability for concurrent low back and neck-shoulder pain: a study of twins. Spine (Phila Pa 1976) 36, E1469–1476, doi:10.1097/BRS.0b013e3181e2c878 (2011).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/BRS.0b013e3181e2c878&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21192295&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

22. Pinheiro, M.B. et al. Genetics and the environment affect the relationship between depression and low back pain: a co-twin control study of Spanish twins. Pain 156, 496–503, doi:10.1097/01.j.pain.0000460330.56256.25 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.j.pain.0000460330.56256.25&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25679471&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

23. Zadro, J.R. et al. Does educational attainment increase the risk of low back pain when genetics are considered? A population-based study of Spanish twins. Spine J 17, 518–530, doi:10.1016/j.spinee.2016.10.021 (2017).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.spinee.2016.10.021&link_type=DOI) 

24. Dario, A.B. et al. The relationship between obesity, low back pain, and lumbar disc degeneration when genetics and the environment are considered: a systematic review of twin studies. Spine J 15, 1106–1117, doi:10.1016/j.spinee.2015.02.001 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.spinee.2015.02.001&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25661432&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

25. Malkin, I. et al. Low back and common widespread pain share common genetic determinants. Ann Hum Genet 78, 357–366, doi:10.1111/ahg.12074 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/ahg.12074&link_type=DOI) 

26. Freidin, M.B. et al. Insight into the genetic architecture of back pain and its risk factors from a study of 509,000 individuals. Pain 160, 1361–1373, doi:10.1097/j.pain.0000000000001514 (2019).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/j.pain.0000000000001514&link_type=DOI) 

27. Suri, P. et al. Genome-wide meta-analysis of 158,000 individuals of European ancestry identifies three loci associated with chronic back pain. PLoS Genet 14, e1007601, doi:10.1371/journal.pgen.1007601 (2018).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1007601&link_type=DOI) 

28. Bulik-Sullivan, B.K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 47, 291–295, doi:10.1038/ng.3211 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

29. Loh, P.R., Kichaev, G., Gazal, S., Schoech, A.P. & Price, A.L. Mixed-model association for biobank-scale datasets. Nat Genet 50, 906–908, doi:10.1038/s41588-018-0144-6 (2018).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0144-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29892013&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

30. Loh, P.R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat Genet 47, 284–290, doi:10.1038/ng.3190 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3190&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25642633&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

31. Nakajima, M., Kou, I., Ohashi, H., Genetic Study Group of the Investigation Committee on the Ossification of Spinal, L. & Ikegawa, S. Identification and Functional Characterization of RSPO2 as a Susceptibility Gene for Ossification of the Posterior Longitudinal Ligament of the Spine. Am J Hum Genet 99, 202–207, doi:10.1016/j.ajhg.2016.05.018 (2016).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2016.05.018&link_type=DOI) 

32. Ten Dam, E.J., van Beuge, M.M., Bank, R.A. & Werker, P.M. Further evidence of the involvement of the Wnt signaling pathway in Dupuytren’s disease. J Cell Commun Signal 10, 33–40, doi:10.1007/s12079-015-0312-8 (2016).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12079-015-0312-8&link_type=DOI) 

33. Ren, X. et al. Osteoprotegerin reduces osteoclast resorption activity without affecting osteogenesis on nanoparticulate mineralized collagen scaffolds. Sci Adv 5, eaaw4991, doi:10.1126/sciadv.aaw4991 (2019).
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo4OiJhZHZhbmNlcyI7czo1OiJyZXNpZCI7czoxMjoiNS82L2VhYXc0OTkxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDkvMDcvMjAyMC4wOS4wNC4yMDE4NzU3NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

34. Fehrmann, R.S. et al. Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat Genet 47, 115–125, doi:10.1038/ng.3173 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3173&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25581432&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

35. Parisien, M. et al. Genetic pathway analysis reveals a major role for extracellular matrix organization in inflammatory and neuropathic pain. Pain 160, 932–944, doi:10.1097/j.pain.0000000000001471 (2019).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/j.pain.0000000000001471&link_type=DOI) 

36. Heintzman, N.D. et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat Genet 39, 311–318, doi:10.1038/ng1966 (2007).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng1966&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17277777&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000244480000012&link_type=ISI) 

37. Bernstein, B.E. et al. Genomic maps and comparative analysis of histone modifications in human and mouse. Cell 120, 169–181, doi:10.1016/j.cell.2005.01.001 (2005).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2005.01.001&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15680324&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000226740000007&link_type=ISI) 

38. Bonasio, R., Tu, S. & Reinberg, D. Molecular signals of epigenetic states. Science 330, 612–616, doi:10.1126/science.1191078 (2010).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzMzAvNjAwNC82MTIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOS8wNy8yMDIwLjA5LjA0LjIwMTg3NTc1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

39. Bernstein, B.E. et al. The NIH Roadmap Epigenomics Mapping Consortium. Nat Biotechnol 28, 1045–1048, doi:10.1038/nbt1010-1045 (2010).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nbt1010-1045&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20944595&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282870500025&link_type=ISI) 

40. Roadmap Epigenomics, C. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330, doi:10.1038/nature14248 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature14248&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25693563&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

41. Gatchel, R.J., Polatin, P.B. & Mayer, T.G. The dominant role of psychosocial risk factors in the development of chronic low back pain disability. Spine (Phila Pa 1976) 20, 2702–2709, doi:10.1097/00007632-199512150-00011 (1995).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00007632-199512150-00011&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8747248&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1995TM51200011&link_type=ISI) 

42. Jayson, M. Chronic back pain is not the same as acute back pain lasting longer. BMJ 314, 1639–1640 (1997).
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjEzOiIzMTQvNzA5NS8xNjM5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDkvMDcvMjAyMC4wOS4wNC4yMDE4NzU3NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

43. Tajerian, M. et al. Peripheral nerve injury is associated with chronic, reversible changes in global DNA methylation in the mouse prefrontal cortex. PLoS One 8, e55259, doi:10.1371/journal.pone.0055259 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0055259&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23383129&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

44. Ropponen, A. et al. Personality traits and life dissatisfaction as risk factors for disability pension due to low back diagnoses: A 30-year longitudinal cohort study of Finnish twins. Journal of psychosomatic research 73, 289–294 (2012).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22980535&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

45. Suri, P. et al. Modifiable risk factors for chronic back pain: insights using the co-twin control design. The Spine Journal 17, 4–14 (2017).
    
    

46. Schouppe, S., Van Oosterwijck, S., Danneels, L., Van Damme, S. & Van Oosterwijck, J. Are Functional Brain Alterations Present in Low Back Pain? A Systematic Review of EEG Studies. J Pain, doi:10.1016/j.jpain.2019.06.010 (2019).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jpain.2019.06.010&link_type=DOI) 

47. Vachon-Presseau, E. et al. Identification of traits and functional connectivity-based neurotraits of chronic pain. PLoS Biol 17, e3000349, doi:10.1371/journal.pbio.3000349 (2019).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pbio.3000349&link_type=DOI) 

48. Pinheiro, M.B. et al. Genetic and environmental contributions to sleep quality and low back pain: a population-based twin study. Psychosomatic medicine 80, 263–270 (2018).
    
    

49. Baliki, M.N. et al. Chronic pain and the emotional brain: specific brain activity associated with spontaneous fluctuations of intensity of chronic back pain. J Neurosci 26, 12165–12173, doi:10.1523/JNEUROSCI.3576-06.2006 (2006).
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Njoiam5ldXJvIjtzOjU6InJlc2lkIjtzOjExOiIyNi80Ny8xMjE2NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA5LzA3LzIwMjAuMDkuMDQuMjAxODc1NzUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

50. Cross, S.A. in Mayo Clinic Proceedings. 375–383 (Elsevier).
    
    

51. Allen, N.E., Sudlow, C., Peakman, T., Collins, R. & Biobank, U.K. UK biobank data: come and get it. Sci Transl Med 6, 224ed224, doi:10.1126/scitranslmed.3008601 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/scitranslmed.3008601&link_type=DOI) 

52. Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 12, e1001779, doi:10.1371/journal.pmed.1001779 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1001779&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25826379&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

53. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209, doi:10.1038/s41586-018-0579-z (2018).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0579-z&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

54. Krokstad, S. et al. Cohort Profile: the HUNT Study, Norway. Int J Epidemiol 42, 968–977, doi:10.1093/ije/dys095 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dys095&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22879362&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000325167800009&link_type=ISI) 

55. Holmen, J. et al. The Nord-Trøndelag Health Study 1995–97 (HUNT 2): objectives, contents, methods and participation. Norsk epidemiologi 13, 19–32 (2003).
    
    

56. Bogduk, N. Management of chronic low back pain. Medical journal of Australia 180, 79–83 (2004).
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14723591&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188382500011&link_type=ISI) 

57. McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48, 1279–1283, doi:10.1038/ng.3643 (2016).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3643&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27548312&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

58. Machiela, M.J & Chanock, S.J. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557, doi:10.1093/bioinformatics/btv402 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btv402&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26139635&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

59. Quinlan, A. R. & Hall, I.M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842, doi:10.1093/bioinformatics/btq033 (2010).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btq033&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20110278&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000275243500019&link_type=ISI) 

60. Finucane, H. K.. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet 47, 1228–1235, doi:10.1038/ng.3404 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3404&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26414678&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

61. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21, doi:10.1093/bioinformatics/bts635 (2013).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bts635&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23104886&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000312654600003&link_type=ISI) 

62. Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930, doi:10.1093/bioinformatics/btt656 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btt656&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24227677&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000334078300005&link_type=ISI) 

63. Love, M. I., Huber, W. & Anders, S.. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550, doi:10.1186/s13059-014-0550-8 (2014).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-014-0550-8&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25516281&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

64. Smedley, D. et al. The BioMart community portal: an innovative alternative to large, centralized data repositories. Nucleic Acids Res 43, W589–598, doi:10.1093/nar/gkv350 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkv350&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25897122&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

65. 65 de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D.. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol 11, e1004219, doi:10.1371/journal.pcbi.1004219 (2015).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1004219&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25885710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

66. de Leeuw, C. A., Neale, B. M., Heskes, T. & Posthuma, D.. The statistical properties of gene-set analysis. Nat Rev Genet 17, 353–364, doi:10.1038/nrg.2016.29 (2016).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nrg.2016.29&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27070863&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 

67. Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25, 25–29, doi:10.1038/75556 (2000).
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/75556&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10802651&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000086884000011&link_type=ISI) 

68. Gene\_Ontology\_Consortium. Gene Ontology Consortium: going forward. Nucleic Acids Res 43, D1049–1056, doi:10.1093/nar/gku1179 (2015)
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gku1179&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25428369&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F09%2F07%2F2020.09.04.20187575.atom)