Abstract
Aims Juvenile idiopathic arthritis (JIA) is an autoimmune disease and a common cause of chronic disability in children. Diagnosis of JIA is based purely on clinical symptoms, leading to treatment delays. Despite JIA having substantial heritability, the construction of genomic risk scores (GRSs) to aid or expedite diagnosis has not been assessed. Here, we generate GRSs for JIA and its subtypes and evaluate their performance.
Methods We examined three case/control cohorts (UK, US, and Australia) with genome-wide single nucleotide polymorphism (SNP) genotypes. We trained GRSs for JIA and its subtypes using lasso-penalised linear models in cross-validation on the UK cohort, and externally tested in the Australian and US cohorts.
Results The JIA GRS alone achieved cross-validated AUC=0.670 in the UK cohort and externally validated AUCs of 0.657 and 0.671 in US-based and Australian cohorts, respectively. In logistic regression of case/control status, the corresponding odds ratios per standard deviation (s.d.) of GRS were 1.831 [1.685-1.991] and 2.008 [1.731-2.345], and were unattenuated by adjustment for sex or the top 10 genetic principal components. Extending our analysis to JIA subtypes revealed that enthesitis-related JIA had both the longest time-to-referral and the subtype GRS with the strongest predictive capacity overall across datasets: AUCs 0.80 in UK; 0.83 Australian; 0.69 US-based. The particularly common oligoarthritis JIA subtype also had a subtype GRS outperformed those for JIA overall, with AUCs of 0.71, 0.75 and 0.77, respectively.
Conclusions A genomic risk score for JIA has potential to augment purely clinical JIA diagnosis protocols, prioritising higher-risk individuals for follow-up and treatment. Consistent with JIA heterogeneity, subtype-specific GRSs showed particularly high performance for enthesitis-related and oligoarthritis JIA.
Introduction
Juvenile idiopathic arthritis (JIA) is an autoimmune disease that comprises all form of arthritis arising before the age of 16 years and persisting for more than 6 weeks1. JIA has a significant impact on quality of life, physical function, and future development, and its prevalence is estimated at 0.07–4 per thousand children of European descent2–4. The International League of Associations of Rheumatology (ILAR) classification system recognises seven subtypes of JIA based on the number of joints affected, age of onset, and other features5. The aetiology of JIA is not well understood and its diagnosis remains purely clinical with no sensitive or specific tests available to assist clinicians in making the diagnosis.
Early diagnosis and treatment of JIA is critical as delays increase the risk of prolonged and more difficult to control diseases and consequently poorer long-term outcomes6–9. However, in most cases, general practitioners, emergency medicine physicians, and paediatricians have limited experience in recognising and diagnosing JIA. This might affect time to diagnosis in two ways (Figure 1). Firstly, JIA may not be suspected in a symptomatic child, and secondly, children with generalised symptoms may be incorrectly referred to paediatric rheumatologists for management10–12, putting undue pressure on clinics and waiting lists. For example, a study in Australia13 found that the average time from parent reported symptom onset to diagnosis of JIA by a paediatric rheumatologist was 7 months14. Therefore, there is an urgent need for tools that assist clinicians in assessing children with musculoskeletal symptoms who may be JIA cases, and thus reduce complications of the disease and poor long-term health outcomes.
Schematic of typical clinical path from first symptoms to JIA diagnosis and treatment. Potential informative points are included for JIA genomic risk scores to prioritising higher-risk individuals for referral, follow-up, and treatment.
JIA is a complex disease15 where susceptibility is due to a complex interaction between genetic and environmental factors. It has been shown that JIA is heritable and that it possesses a genetic architecture similar to other autoimmune diseases, including shared susceptibility genes and strong HLA associations16–18. The sibling recurrence risk has been estimated to be 11.6-fold, and for first cousins 5.8-fold19. Moreover, a 2015 study estimated SNP-based heritability (SNP-h2) of JIA to be ∼75%18. Taken together, it is apparent that genetics has an important aetiological role in JIA and may have utility in risk prediction, potentially via stratification of JIA cases from non-cases to aid clinical diagnosis.
Genetics is increasingly used to aid risk prediction, diagnosis and earlier treatment, with HLA testing for various immune disorders being an exemplar. More recently, the clinical utility of genetic and polygenic risk scores for diverse aetiologies, from coeliac disease to cardiovascular diseases, is under intense investigation20–24. In coeliac disease, research has shown that a genomic risk score (GRS) based on genome-wide genetic variation (SNPs) can accurately predict coeliac disease cases with high specificity and sensitivity as compared to population-based controls23. By using a regularised linear model fitted to individual-level genotype data, it was found that a GRS of ∼200 SNPs provided superior risk prediction over other approaches, including HLA typing. Furthermore, array technologies are relatively affordable with genotyping only needing to be performed once at any point in the lifetime of an individual. GRSs themselves are quantitative measurements of the likelihood that an individual of unknown phenotype has or will have a particular disease, providing potential advantages in terms of flexibility for clinical translation as compared to other tests which are temporally variable or offer only a binary ‘susceptible/not susceptible’ output.
This study aims to create a GRS which in-principle could be used to support the current clinical JIA diagnosis practice. We utilised three large-scale independent cohorts of European ancestry to develop and test a GRS for JIA. Furthermore, we extended the GRS approach to design JIA subtype-specific genetic models, which we externally tested to quantify their potential relative clinical value in supporting each JIA subtype’s time to diagnosis. We make these JIA GRSs publicly available to the community via the Polygenic Score Catalog (www.pgscatalog.org).
Results
Our analyses utilised three JIA case-control collections from the UK (n=7,505), the Children’s Hospital of Philadelphia (CHOP; n=3,513) and Australian ChiLdhood Arthritis Risk factor Identification sTudY (CLARITY; n=940) which have undergone genome-wide genotyping, quality control, and imputation using standard protocols (for more details see Methods and Table 1). The UK dataset was used to train the GRS model since it was the dataset with the greatest number of cases and had the least population stratification. For each outcome (JIA or JIA subtype status versus control), GRSs were generated using lasso-penalised regression as implemented in the SparSNP software25 and run on the intersecting set of genotypes across the UK, CHOP, and CLARITY datasets (5,545,761 SNPs with MAF>1% across all datasets). After GRS training in the UK dataset, each GRS was externally validated in CHOP and CLARITY (Figure S1 shows the research workflow).
Dataset characteristics after imputation and quality control.
Training and external validation of the JIA genomic risk score
We used 10×10-fold cross-validation to tune the penalised model and estimate the area under the receiver operating characteristic curve (AUC) in the UK dataset, achieving a maximum AUC=0.671 (95% CI 0.668−0.674) and selecting 26 SNPs in the model (Figure S2). The selected SNPs and their weights in the GWAS clearly showed the strong HLA dependency of JIA (Figure S3). The final model (SNP weights) was held fixed for external validation GRS (http://www.pgscatalog.org/pgs/PGS000114/).
We applied the GRS to the CLARITY and CHOP datasets and evaluated the prediction power of our model in terms of AUC and odds ratios (Table 2). Overall, the GRS showed highly consistent performance across both validation datasets, with AUCs of 0.657 and 0.671 for CHOP and CLARITY, respectively. The corresponding odds ratios per s.d. of GRS were 1.831 (95% CI 1.685-1.991) and 2.008 (95% CI 1.731-2.345) in CHOP and CLARITY, respectively. The associations of the GRS with JIA status were unattenuated when adjusting for the top 10 genetic principal components (PCs) and sex. Furthermore, we compared the GRS model with a model generated using known SNPs highly association with JIA (Table 1 from Hinks et al.16). Using this model, we achieved AUC of 0.614, 0.642 and 0.648 for UK, CHOP, and CLARITY respectively, showing that the GRS model was a stronger risk predictor.
The predictive power of the GRS in the validation sets. Based on logistic regression on the test sets, optionally adjusted for sex and 10 genetic principal components. Effect sizes are per standard deviation of the GRS.
Recent work has also shown that a metaGRS approach can substantially improve genomic prediction of common diseases21, thus we compared the metaGRS method to the lasso-penalised regression above (Supplementary Material). We did not observe that a JIA metaGRS, based on a set of related autoimmune disease GWAS summary statistics, significantly improved upon the JIA GRS above, thus subsequent analyses utilised lasso-regression.
Subtype analysis
We next extended our analysis to consider subtypes of JIA and construct subtype-specific GRSs thereof. The International League of Associations of Rheumatology (ILAR) classification system recognises seven subtypes of JIA, systemic arthritis, oligoarthritis, rheumatoid-factor-positive polyarthritis (RF-Positive), rheumatoid-factor-negative polyarthritis (RF-Negative), enthesitis-related arthritis (ERA), psoriatic arthritis, and undifferentiated arthritis5. Subtypes vary substantially in average age at onset, sex distribution, number of joint affected, and clinical features (see Figure 2)26. Of particular interest clinically is the time between onset of symptoms to visiting a paediatric rheumatologist which has been estimated to vary from 11 months (range: 8 to 70 weeks) in the case of enthesitis-related JIA to 1 month (range: 2 to 36 weeks) for systemic JIA14,27. The heterogeneity of JIA was reflected in our case data; all seven subtypes were present, albeit at frequencies ranging from common (∼41% of UK cases were oligoarthritis JIA) to relatively rare (∼4% of UK cases were undifferentiated JIA) (Table 3).
Characteristics of JIA subtypes across cohorts, including rate (%) of each subtype amongst cases of each cohort. CLARITY excludes 29 cases with no subtype classification.
For each of the seven subtypes, we utilised the UK dataset to train subtype specific GRSs, employing a similar lasso-regression in cross-validation approach as for the JIA GRS above (see Methods). Each subtype GRS was trained on that subtype’s cases and all non-JIA controls, with all other JIA subtypes were excluded from training, and then externally validated in CHOP and CLARITY. We found that there was a high degree of variability in discrimination between subtype GRSs, with some subtypes displaying cross-validated AUCs greater than the JIA GRS and others not being significantly different than random chance (AUC=0.5; Figure 2).
10×10 cross-validated AUC (LOESS-smoothed) AUC achieved by training the seven JIA subtype specific models (top) and median time taken by an individual with JIA to be referred for first time to a paediatric rheumatologist visit (in months; bottom)27.
When compared to a previous estimate of the median time for a child with JIA to be referred to a paediatric rheumatologist visit27, we found that the JIA subtype (enthesitis-related) with the longest time-to-referral (∼11 months) also had the strongest subtype GRS (AUC=0.80 with 149 SNPs). This result may be partially due to circulating HLA-B27 antigen being a component of the enthesitis-related subtype classification criteria5,28. To test this, we utilised previously reported HLA-B27 tag SNPs29,30 (Supplementary Material), and found that our enthesitis-related JIA GRS still outperformed the HLA-B27 tag SNPs model in the UK dataset (AUC=0.75). The next strongest subtype GRS (AUC=0.71 with 25 SNPs) was for oligoarthritis JIA, which has a median time-to-referral of ∼3 months. While JIA subtype rates are not well characterised, oligoarthritis JIA is the most common clinical subtype reported31.
The weakest subtype GRSs were for the undifferentiated (AUC=0.573 with 119 SNPs) and systemic (AUC=0.546 with 3,430 SNPs) subtypes. This was not unexpected as these subtypes are somewhat different to the other five subtypes. Systemic JIA is considered an autoinflammatory disease and, while there is little genetic overlap with the other JIA subtypes32, it has strong genetic signals from the HLA class II molecule encoded by HLA-DRB1*11, confirming the role of the adaptive immune system33. Children are diagnosed with the undifferentiated subtype when their symptoms do not fit within other subtypes, or meet the criteria for multiple subtypes.
External validation of the subtype GRSs in CLARITY showed highly consistent AUC estimates with cross-validation performance in the UK (Table 4). The CHOP dataset showed somewhat less consistent external validation than CLARITY for subtype GRSs, but in both CLARITY and CHOP the top performing subtype GRSs were both the enthesitis-related JIA GRS (AUCs of 0.83 and 0.69, respectively) and oligoarthritis JIA GRS (0.75 and 0.77, respectively). In both external validations, the systemic JIA subtype GRS had poor predictive capacity with AUCs of 0.47 and 0.55 in CLARITY and CHOP, respectively.
External validation of the subtype-specific GRSs in CLARITY and CHOP. Based on logistic regression on the test sets. Effect sizes are per standard deviation of the GRS.
Discussion
The diagnosis of juvenile idiopathic arthritis has a pressing, yet unmet, clinical need. In this study, we aimed to address the paucity of molecular tools to aid the entirely clinical diagnosis of JIA by leveraging the wealth of human genomic data developed over the last decade to develop a series of genomic risk scores for JIA. We showed that genomic machine learning can yield highly predictive GRSs for JIA as a composite diagnosis as well as subtype-specific GRSs which can classify for example the common oligoarthritis JIA subtype as well as the enthesitis-related JIA subtype, which can present with non-specific pains initially and be difficult to diagnose clinically. Given the cost effectiveness of a genotyping array and the time invariant properties of assaying germline DNA, these JIA GRSs hold promise for rapid clinical translation with at-risk children being non-invasively stratified as high risk much earlier in the diagnostic pathway. It also allows for children with low risk non-inflammatory disease to be appropriately triaged and managed earlier. To facilitate translation and clinical uptake, we have made the genetic variants and weights of the JIA GRSs developed here freely available (http://www.pgscatalog.org/pgs/PGS000114/).
A strength of this study was that the JIA GRS was developed on a UK dataset and externally validated in two independent studies in Australia and the US, indicating the robustness of the score. A limitation of the current study is that the vast majority of participants in our cohorts (and the original JIA GWASs) were of European descent and we were unable to assess the performance of a JIA GRS in individuals of non-European ancestry34.
Many clinicians at the primary health level, from general practitioners through to tertiary hospital settings, often have difficulty recognising and diagnosing JIA in children and adolescents. There are many non-inflammatory conditions that are common in children and other conditions presenting with musculoskeletal pain that mimic JIA. In primary care, accessing paediatric rheumatology specialist services is difficult as wait lists are usually lengthy and access to care is problematic due to workforce shortages worldwide35.
Therefore, in the diagnostic algorithm a JIA-GRS would be particularly useful when assessing children with musculoskeletal symptoms who may be JIA cases, for example enabling the general practitioner to facilitate appropriate triage to a paediatric rheumatologist, and thus reduce pain, complications of the disease and poor long-term health outcomes.
Methods
Ethics Statement
All participants gave informed consent and the study protocols were approved by the relevant institutional or national ethics committees. The Australian CLARITY cohort collection was approved by the Royal Children’s Hospital Human Research Ethics Committee; UK ethical approval was obtained from the North West Multi-centre for Research Ethics Committee (MREC:02/8/104 and MREC:99/8/84), West Midlands Multi-centre Research Ethics Committee (MREC:02/7/106), North West Research Ethics Committee (REC:09/H1008/137), and the NHS Research Ethics Committee (REC:05/Q0508/95); and the US CHOP cohort collection was approved by the institutional review boards of the Texas Scottish Rite Hospital for Children, the Children’s Mercy Hospitals and Clinics, and the Children’s Hospital of Philadelphia.
Phenotype and clinical data
The International League of Associations for Rheumatology (ILAR) classification system5 provides generally accepted guidelines for researchers and clinicians to delineate the seven mutually-exclusive categories of JIA based on the predominate clinical and laboratory features. For example, to diagnose the enthesitis-related JIA subtype, an individual has have at least two of the following conditions: presence of HLA-B27 antigen; onset of arthritis in a male over age six; acute anterior uveitis; presence of or history of sacroiliac joint tenderness and/or inflammatory lumbosacral pain; or have a first-degree relative with history of ankylosing spondylitis, enthesitis-related arthritis, Reiter’s syndrome, or acute anterior uveitis.
In this work, the JIA diagnosis for the three cohorts was made according to the ILAR revised criteria and all JIA cases were of age of onset <16 years old. In the UK cohort16, the JIA cases were obtained from five sources: The British Society for Paediatric and Adolescent Rheumatology National Repository of JIA; a group of UK cases with long-standing JIA described previously36; a cohort collected as part of the Childhood Arthritis Prospective Study10; a cohort of children recruited for the SPARKS-CHARM (Childhood Arthritis Response to Medication)37; and an ongoing collection of UK cases from the UK JIA Genetics Consortium. The UK controls were from the shared UK 1958 Birth cohort and UK Blood Services Common Controls, established as part of the WTCCC38.
For the CHOP cohort18,39, the clinical data of JIA cases were collected from the electronic health records (EHR) completed by the pediatric rheumatology specialist within the Division of Rheumatology at CHOP and abstracted into a JIA Registry maintained within the Center for Applied Genomics (CAG) at CHOP. Control subjects were unrelated and disease-free children recruited by the CAG recruitment team within the CHOP Healthcare Network. In addition, control subjects had no history of JIA or other chronic illnesses and were screened as negative for a diagnosis of autoimmune diseases, based on data from CHOP’s electronic health record and by intake questionnaires obtained by the recruiting staff from the CHOP Center for Applied Genomics.
Finally, in the CLARITY cohorts13,40, all the cases were aged 0–18 years at interview, with diagnosis of JIA by a paediatric rheumatologist. Incident cases were defined as children recruited within six months of diagnosis and prevalent cases were defined as those children diagnosed more than 6 months before recruitment and since 1997. Controls were recruited through the Royal Children’s Hospital Day Surgery Unit. Exclusion criteria for cases and controls were the presence of major congenital abnormalities or illness that would forgo school attendance in the one year prior to recruitment.
Genotype data and quality control
All collected cases are pediatric rheumatologist-diagnosed and controls have no history of any autoimmune disease. Additionally, all participants are white European descent with the exception of ∼20% of the participants in CLARITY, who are of non-European ancestries. Finally, all genotyped included in each dataset were aligned to the GRCh37/hg19 assembly build and passed stringent quality control measures (detailed below).
The initial UK dataset consists of 2,758 cases and 5,187 controls. Controls were obtained from the Wellcome Trust Case Control Consortium (WTCCC) which have been demonstrated to be well-matched to UK JIA cases16,41. UK cases were collected through five cohort studies recruiting from multiple tertiary centers around the UK, where 1,670 were genotyped on the Illumina HumanOmniExpress array and 1,088 on the Illumina HumanCoreExome array. The CHOP (US) dataset consisted of 1,229 cases from US and Norway, and 5,512 pediatric controls, all recruited from within the CHOP Healthcare Network. All CHOP controls and 1,008 JIA cases were genotyped using Illumina HumanHap550 or Human610-Quad arrays40 and the remaining 250 cases were genotyped using the Illumina HumanCore array. Lastly, the Australian CLARITY (ChiLdhood Arthritis Risk factor Identification sTudY)13 consisted of 558 cases and 704 controls collected from the Royal Children’s Hospital and Monash Medical Centre in Melbourne. All controls and 406 cases were genotyped using the Illumina HumanCore array, with the remaining 152 cases genotyped on the Illumina HumanHap55017,18.
Before genotype imputation, we applied stringent quality control (QC) procedures standardised across all datasets. The initial QC performed, using plink1.942,43, consisted of removing non-autosomal SNPs, SNPs with MAF<1%, SNPs and individuals with missingness>10%, and SNPs with deviations from Hardy-Weinberg Equilibrium in controls (P<10−3). Additionally, using KING v2.1.544, we identified and removed one of two individuals with a second or higher degree relatedness within the datasets. The resulting datasets consisted of 7,505 individuals (2,324 cases and 5,181 controls) with 144,964 SNPs for UK, and 6,741 individuals (1,229 cases and 5,512 controls) with 489,672 SNPs for CHOP. The CLARITY dataset consisted of three batches; after quality control, these consisted of 152 individuals (all cases) with 490,629 SNPs, 838 individuals (247 cases and 591 controls) with 246,814 SNPs, and 118 individuals (5 cases and 113 controls) with 243,662 SNPs, respectively.
For genotype imputation, we used the Michigan Server to impute the datasets using Minimac3 and the 2016 Haplotype Reference Consortium (HRC) as the reference panel45. We merged all the subsets of CLARITY into a single dataset. Then, for each dataset (UK, CHOP and CLARITY), we removed multi-allelic and duplicated SNPs, SNPs with imputation r2<0.5 and MAF<0.01, SNPs deviating from the Hardy-Weinberg Equilibrium in controls (P<10−3), and those with ambiguous strand (A/T or C/G alleles).
Next, we performed principal component analysis (PCA) using FlashPCA246,47 over the filtered data (Figures S8–S11). For each dataset we selected the largest homogeneous subset of individuals based on the first five principal components within each set, where we defined as outliers the set of cases (controls) individuals that clearly differed in the PCA plots from the controls (cases). The UK dataset was kept in its entirety; for CLARITY n=168 individuals were removed and for CHOP n=3,228 individuals were removed (including non-US individuals). Figures S12 and S13 show the principal component analysis for the resulting subsets. Table 1 shows the final datasets; for the analysis we used the n=5,545,761 genotyped and imputed SNPs which were available post-QC on all three datasets.
Before training our model we also estimated the heritability and SNP-heritability of JIA in our datasets. The sibling recurrence risk of JIA has previously been estimated to be 11.6-fold19, and for first cousins 5.8-fold; assuming the JIA prevalence to be 1 in 1000, we estimated JIA heritability to be approximately 54%. For SNP-heritability, we used GCTA version 1.91.727,48 and adjusted for the first ten principal components together with an assumed JIA population prevalence of 1/1000. The estimated SNP-heritability was 0.25 (std err 0.02) for UK, 0.37 (std err 0.13) for CLARITY, and 0.51 (std err 0.07) for CHOP.
Development and validation of the genomic risk score
Because it was the most homogenous dataset with the largest case sample size (2,324 cases, 5,181 controls), we utilised the UK dataset to train our models. To account for potential confounding by the case/control genotyping batch in the UK dataset, we used logistic regression of case/control status on sex and the first 10 genetic PCs. The PCs were computed over a subset of the SNPs of UK, excluding the HLA region as well as known or putative JIA risk loci16 (defined here as SNPs with P<10−5 and all SNPs within 1 Mb of the former). The residuals from the regression were then used as the phenotype for constructing the GRS.
To create the GRS we used SparSNP25, which is an efficient implementation of a lasso-penalised linear model which has been previously shown to outperform other methods when there are known to be strong effects within regions of complex LD, such as the MHC49. This model is induced by minimising the L1-penalised squared loss function (standard lasso) over N individuals and m SNPs,
where xi is the m-vector of genotypes for the ith sample in allele-dosage coding {0,1,2}, yi is the phenotype,
is the m-vector of weights,
is the intercept, and λ is the L1 penalty. All SNPs in the selected training dataset are considered in the model but typically only a small number of SNPs receive a non-zero weight. The number of SNPs with non-zero weight varies depending on the value of the L1 penalty (higher penalties lead to fewer SNPs) which was tuned via 10-fold cross-validation. The optimal number of SNPs selected (non-zero weighted) in the chosen model is decided based on the model with the highest average AUC (or R2 if the phenotype is continuous) across the replications. We also explored adding an L2 penalty to the model (elastic-net); however, no significant improvement was found.
Once a model was chosen, we computed genomic risk scores (GRS) for each of the test datasets (CHOP and CLARITY). Assuming that the number of SNPs is m, then the GRS gi for a new individual with genotypes xi is
where and
are the SNP weights obtained from the model. Subsequently, in order to validate the GRS, we evaluated it using logistic regression in the CHOP and CLARITY datasets, adjusting for sex and ten first genetic principal components of each test set.
Data Availability
For the individual-level data, note that CHOP is available through the eMERGE Network dbGaP and the WTCCC controls are available through the Wellcome Trust Case Control Consortium webpage https://www.wtccc.org.uk/). For the individual-level genotype data for the UK JIA cases and CLARITY, researchers should contact the cohort PIs (Prof Wendy Thomson [Wendy.Thomson{at}manchester.ac.uk] and Dr Jane Munro [Jane.Munro{at}rch.org.au], respectively). Finally, the JIA GRSs presented will be publicly available to the community via the Polygenic Score Catalog (www.pgscatalog.org)
Acknowledgments
This study was supported in part by the Victorian Government’s OIS Program, the Australian National Health and Medical Research Council (NHMRC Project no. 1122744), the Murdoch Children’s Research Institute, and the Royal Children’s Hospital Foundation (grant no. 2017-896). Also, this work was supported by core funding from the UK Medical Research Council (MR/L003120/1), the British Heart Foundation (RG/13/13/30194; RG/18/13/33946) and the National Institute for Health Research (Cambridge Biomedical Research Centre at the Cambridge University Hospitals NHS Foundation Trust)*. It was also supported by Health Data Research UK, which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome. GA was supported by an NHMRC Early Career Fellowship (no. 1090462). MI was supported by the Munz Chair of Cardiovascular Prediction and Prevention. This study acknowledges the use of the following UK JIA cohort collections: The Biologics for Children with Rheumatic Diseases (BCRD) study (funded by Arthritis Research UK Grant 20747). The British Society for Paediatric and Adolescent Rheumatology Etanercept Cohort Study (BSPAR-ETN) (funded by a research grant from the British Society for Rheumatology (BSR). BSR has previously also received restricted income from Pfizer to fund this project). Childhood Arthritis Prospective Study (CAPS) (funded by Versus Arthritis, grant reference number 20542), Childhood Arthritis Response to Medication Study (CHARMS) (funded by Sparks UK, reference 08ICH09, and the Medical Research Council, reference MR/M004600/1), United Kingdom Juvenile Idiopathic Arthritis Genetics Consortium (UKJIAGC). Genotyping of the UK JIA case samples were supported by the Versus Arthritis grants reference numbers 20385 and 21754. This research was funded by the NIHR Manchester Biomedical Research Centre and supported by the Manchester Academic Health Sciences Centre (MAHSC). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health. We would like to acknowledge the assistance given by IT Services and the use of the Computational Shared Facility at The University of Manchester. Finally, the CHOP data used were funded by an Institute Development Fund to the CAG center from The Children’s Hospital of Philadelphia and by NIH grant, U01-HG006830, from the NHGRI-sponsored eMERGE Network.