Abstract
Objectives To identify serum protein biomarkers which might separate early inflammatory arthritis (EIA) patients with psoriatic arthritis (PsA) from those with rheumatoid arthritis (RA) to provide an accurate diagnosis and support appropriate early intervention.
Methods In an initial protein discovery phase, the serum proteome of a cohort of patients with PsA and RA was interrogated using unbiased liquid chromatography mass spectrometry (LC-MS/MS) (n=64 patients), a multiplexed antibody assay (Luminex) for 48 proteins (n=64 patients) and an aptamer-based assay (SOMAscan) targeting 1,129 proteins (n=36 patients). Subsequently, analytically validated targeted multiple reaction monitoring (MRM) assays were developed to further evaluate those proteins identified as discriminatory during the discovery. During an initial verification phase, MRM assays were developed to a panel of 150 proteins (by measuring a total of 233 peptides) and used to re-evaluate the discovery cohort (n=60). During a second verification phase, the panel of proteins was expanded to include an additional 23 proteins identified in other proteomic discovery analyses of arthritis patients. The expanded panel was evaluated using a second, independent cohort of PsA and RA patients (n=167).
Results Multivariate analysis of the protein discovery data revealed that it was possible to discriminate PsA from RA patients with an area under the curve (AUC) of 0.94 for nLC-MS/MS, 0.69 for Luminex based measurements; 0.73 for SOMAscan analysis. During the initial verification phase, random forest models confirmed that proteins measured by MRM could differentiate PsA and RA patients with an AUC of 0.79 and during the second phase of verification the expanded panel could segregate the two disease groups with an AUC of 0.85.
Conclusion We report a serum protein biomarker panel which can separate EIA patients with PsA from those with RA. We suggest that the routine use of such a panel in EIA patients will improve clinical decision making and with continued evaluation and refinement using additional patient cohorts will support the development of a diagnostic test for patients with PsA.
Psoriatic Arthritis (PsA) is a form of inflammatory arthritis (IA) affecting approximately 0.25% of the population [1-4]. It is a highly heterogeneous disorder associated with joint damage, disability, disfiguring skin disease and poor patient-related quality of life outcome measures [4]. Inherently irreversible and frequently progressive, the process of joint damage begins at, or before, the clinical onset of disease. Indeed, structural joint damage, which is likely to result in joint deformity and disability, is present in 47% of patients within 2 years of disease onset [3, 5]. Reductions in quality of life and physical function are comparable to rheumatoid arthritis (RA) and compounded by the presence of chronic disfiguring skin disease [6-9]. Direct and in-direct health costs pose a significant economic burden on society and rise with severe physical dysfunction [9].
Early diagnosis and management of PsA leads to better long-term outcomes however with no diagnostic laboratory test available, the diagnosis is often delayed or missed and this has significant consequences for individuals with PsA [10-12]. At disease onset PsA often resembles other forms of arthritis including RA. Despite the clinical similarities between PsA and RA, their distinctive pathologies often require different treatments. For example, drugs targeting the IL-12/IL-23 and IL-17 pathway which are highly effective in psoriasis and PsA, are ineffective in RA, while drugs targeting B cells such as rituximab are effective in RA but have not been proven beneficial in PsA [4, 13].
PsA is most often diagnosed when a patient presents with musculoskeletal inflammation in the presence of psoriasis and in the absence of rheumatoid factor (RF). However, a clear diagnosis can be difficult as up to 10% of PsA patients may have RF or Anti-Citrullinated Peptide Antibody (ACPA) and joint involvement may precede the development of skin or nail psoriasis in 15 % of patients with PsA [14]. The “ClASsification criteria for Psoriatic ARthritis” (CASPAR) are accepted as having high sensitivity (98.7 %) and specificity (91.4 %) in classifying patients with long-standing PsA. CASPAR shows reduced sensitivity in patients with early disease (87.4 %) though specificity is improved (99.1 %) [15]. CASPAR are valid when including patients in research studies or in clinical trials, but it is recognised that they should not be used for diagnosis and are of little value therefore in a primary care or dermatology setting where specialist rheumatological expertise is very often not readily available [4, 16]. An effective clinical laboratory test is needed to improve diagnosis and clinical decision making in PsA.
Ideally, a clinical laboratory test should be based on an easily accessible biological sample such as blood [10]. Therefore, we set out to discover serum-based biomarkers that could discriminate between patients with PsA or RA. With advances in multiplexed technologies, it has become possible to simultaneously measure multiple analytes. However, in complex bio-fluids such as serum, it is apparent that no single technological platform is capable of measuring the entire protein content of a given sample [3, 4, 17]. For this reason, we undertook a comprehensive and complementary analysis of the serum proteome in an EIA cohort. We used unbiased nLC-MS/MS to identify the more abundant, differentially expressed proteins. In parallel, the SOMAscan and Luminex assays were employed to target low abundant proteins not easily detectable by nLC-MS/MS. Statistical analysis revealed that proteins identified by nLC-MS/MS were the most useful in discriminating individuals with PsA from those with RA. Hence in subsequent steps we prioritised these proteins for further investigation. Using multiple reaction monitoring (MRM), a form of targeted mass spectrometry, we designed a two-phase verification study: in the first phase, we measured a panel of 150 proteins identified in the discovery cohort and, in the second phase, we measured an expanded panel of 173 proteins in an independent cohort. Figure 1 provides an overview of the workflow of the study.
Patients and Methods
Patients
In the discovery phase and initial verification phase, a total of 64 patient samples were used and a full description of the extensive clinical characterisation of the cohort can be found in Szentpetery et al. [18]. In brief, recent-onset (symptom duration <12 months), treatment naïve PsA and RA patients with active joint inflammation, aged 18 to 80 years were enrolled. PsA patients (n=32) fulfilled the CASPAR criteria [19] and patients with RA (n=32) met the 2010 ACR/EULAR classification criteria [20]. Baseline serum samples were obtained from each patient using standard methodology, aliquoted and frozen at -70°C (Suppl Doc 1). The study was approved by St.Vincent’s Healthcare Group Ethics and Medical Research Committee and patients were enrolled only after agreeing to participate in the study and having given their informed consent.
Samples from a total of 167 patients were used in the second verification phase. There were 95 patients recruited from a cross-sectional cohort of established PsA patients, all meeting CASPAR criteria and 72 patients recruited from the RA Biologics Registry of Ireland (RABRI) who all met 2010 ACR/EULAR classification criteria and had similar levels of active disease, as the PsA patients. Again, baseline serum samples were obtained, aliquoted and frozen at - 70°C.
Label Free nLC-MS/MS Analysis
A detailed description of the unbiased LC-MS/MS workflow can be found in Mc Ardle et al. [10] In brief serum samples (1,700 ug) were depleted of the 14 most abundant serum proteins (albumin, transferrin, haptoglobin, IgG, IgA, 1-antitrypsin, fibrinogen, β2-macroglobulin, 1-acid glycoprotein, complement C3, IgM, apolipoprotein AI, apolipoprotein AII, and transthyretin)) using the Agilent Multiple Affinity Removal System comprising a Hu-14 column (HuMARS14) (4.6 × 100 mm; Agilent Technologies, 5188-6557) on a Biocad Vision Workstation. Depleted fractions (containing 50 µg protein) were reduced, denatured and alkylated prior to trypsinization. The digested samples were desalted and purified using C18 resin pipette stage tips. Purified samples were dried under vacuum and resuspended in mass spectrometer compatible buffer A (3% ACN, 0.1% formic acid) [21, 22]. Label-free nano-flow LC-MS/MS analysis was performed on a Q-Exactive mass spectrometer equipped with a Dionex Ultimate 3000 (nano-RSLC) chromatography system (Thermo Fisher Scientific). Two microliters (equivalent to 2 µg of digested protein) of each sample was injected onto a fused silica emitter separated by an increasing acetonitrile gradient over 101.5 min (flow rate of 250 nL/min) [10].
Bioinformatic Data Analysis
As previously reported nLC-MS/MS data were visually inspected using XCalibur software (2.2 SP1.48). MaxQuant (1.4.12) was then used for quantitative analysis of the LC-MS/MS data (Thermo Scientific. Raw) while Perseus software (1.5.0.9) supported statistical analysis [10, 23].
SOMAscan Analysis
Individual patient serum samples were subjected to a multiplexed aptamer-based assay (SOMAscan) developed by Gold et al. to measure the levels of 1129 proteins as previously reported [10].
Luminex Analysis
Individual serum samples were subjected to an in-house developed and validated multiplexed immunoassays measuring 48 analytes with Luminex xMAP proteomics technology (Austin TX, USA). The assays, including the analyses, were undertaken as previously described by the Multiplex Core Facility Laboratory of Translational Immunology LTI, in the University Medical Centre Utrecht [10].
MRM design and Optimisation
The development and optimisation of MRM assays was performed using Skyline software (version 3.6.0.1062) (MacCoss laboratory, Washington DC) [24]. Assays were developed to proteotypic peptides for all proteins of interest where peptides showed no missed cleavages or ‘ragged ends’ and sequence length was between 7-25 amino acids. Where possible, peptides sequences with reactive cysteine (C) or methionine (M) residues were avoided but not excluded. An MRM assay was deemed to be analytically validated when it demonstrated the following characteristics: dot product ≥0.8; signal to noise ≥10; data points under the curve ≥10 [25]; and percentage coefficient of variance showing a retention time ≤ 1 % and area ≤ 20 % [26]. The majority of MRM assays developed significantly exceeded these criteria.
Sample Preparation for LC-MRM Analysis
Initial Phase
Crude serum (2 µl) was added to the wells of 96 deep well plates (Thermo) and diluted 1 in 50 with NH4CO3 (Sigma). Rapigest denaturant (Waters), resuspended in 50mM NH4CO3 to give a stock solution of 0.1% w/v 50 ul stock solution, was added to each sample so that the final concentration of Rapigest was 0.05 %. Plates were covered with adhesive foil (Thermo) and samples were incubated in the dark at 80 °C for 10 minutes (min). After incubation plates were centrifuged at 2,000 relative centrifugal force (rcf) at 4 °C for 2 min to condense droplets. Subsequently, dithiothreitol (DTT) was added to each sample at a final concentration of 20 mM. Samples were then incubated at 60°C for 1 hr followed by centrifugation at 2,000 rcf at 4 °C for 2 min. Next iodoacetamide (IAA) was added to each sample to give a final concentration of 10 mM and plates were incubated at 37 °C in the dark for 30 min. Plates were again centrifuged at 2,000 rcf at 4 °C for 2 min and samples were then diluted with LC-MS/MS grade H2O to give a final concentration of 25 mM NH4CO3. Trypsin (Promega) was added to each sample so that the protein enzyme ratio was 25:1. The reaction was stopped with the addition of 2 ul of neat trifluoroacetic acid (TFA, Sigma) to each sample and incubated for a further 30 min at 37 °C. In order to pellet Rapigest, digests were transferred from 96 well plates to 1.5 ml low-bind Eppendorf tubes and centrifuged for 30 min at 12,000 rcf. Supernatants were removed and transferred into clean Eppendorf tubes and lyophilised by speed vacuum at 30 °C for 2 hr. Lyophilised samples were stored at -80 °C until further use. For the second phase: the denaturant used previously (rapigest) was substituted with 25 µL denaturant solution comprising 50 % trifluoroethanol (TFE) in 50 mM NH4HCO3with 10 mM DTT and this mitigated the need for the high-speed spin and transfer of supernatant which represented an additional processing step less compatible with 96 well plate workflows.
LC-MRM Analysis
MRM analysis was performed using an Agilent 6495 triple quadrupole (QqQ) mass spectrometer with a JetStream electrospray source (Agilent) coupled to a 1290 Quaternary Pump HPLC system. Peptides were separated using analytical Zorbax Eclipse plus C18, rapid resolution HT: 2.1 x 50 mm, 1.8um, 600 Bar columns (Agilent) before introduction to the QqQ. A linear gradient of acetonitrile (99.9 % ACN & 0.1 % FA) 3 -75 % over 17 mins was applied at a flow rate of 0.400 µl/min with a column oven temperature of 50 °C. Source parameters were as follows; gas temp: 150 °C, gas flow 15 l/min, nebuliser psi 30, sheath gas temp 200 °C and sheath gas flow 11 l/min. Peptide retention times and optimised collision energies were supplied to MassHunter (B0.08 Agilent Technologies) to establish a dynamic MRM scheduled method based on input parameters of 80 millisecond (ms) cycle times and 2 min retention time windows. The percentage coefficient of variance (% Cv) of biological and technical replicates was used as a measure of variance and was calculated using the following standard calculation: % CV = (standard deviation/mean) x 100.
Enzyme linked Immunosorbent Assay Analysis
CRP levels were evaluated at St Vincent’s University Hospital, Dublin using an automated CRPL3 Tina-quant assay (Roche Diagnostics, GmbH).
Statistical Analysis
Graphpad Prism software package (7.00) was used to investigate the statistical significance of Luminex data whereas SOMAsuite (1.0) was used to analyse SOMAscan data. The ability of quantified proteins/peptides to predict the diagnosis (PsA or RA) of individual patients was assessed using the Random forest package in R (version 3.3.2). The most important variables in providing the area under the receiver operating curve were selected by using the variable importance index and the Gini decrease in impurity was used to assess the importance of each variable. All area under the curve (AUC) values were obtained using the ROCR package in R (version 3.3.2).
Results
Patient sample characterisation and study design
For the discovery of candidate novel protein biomarkers, serum samples were collected at baseline from early onset, treatment naïve PsA (n=32) and RA (n=32) patients. Samples from a second independent cohort (PsA n=95; RA n=72) were used to confirm the performance of the putative markers identified during discovery. While these PsA and RA patients may have been on treatment at time of baseline serum sampling, there were similar levels of active disease (as reflected by CRP, ESR and joint counts) in both patient groups. Key demographic and clinical features of all patients are summarised in Table 1.
Unbiased nLC-MS/MS based protein analysis
To investigate differential serum protein expression between patients with PsA and RA, individual serum samples which had been depleted of high abundance serum proteins were analysed by nLC-MS/MS using a QExactive Hybrid Quadrupole-Orbitrap Mass Spectrometer mass spectrometer. A total of 451 proteins were identified of which 121 were identified in all 64 individual serum samples. Univariate analysis was applied to the 121 commonly identified proteins and multivariate analysis was applied to the complete data set. Univariate analysis (Student t-test using a Benjamini-Hochberg FDR of 0.01) showed that 66 proteins were significantly differentially expressed between PsA and RA (Suppl. Table 1). Unsupervised hierarchical cluster and principle component analysis on these 66 proteins revealed the overall differences/similarities between serum protein levels in the individual PsA and RA patients; clear within group clustering and between group separations could be observed (Figure 2). Random forest analysis of data from 451 proteins identified in the 64 patient samples demonstrated that patients with PsA and RA could be differentiated with an AUC of 0.94 (Table 2) (ROC plot Suppl. Fig 1A). Together these data strongly suggest that there is a difference in the serum protein profiles between newly diagnosed PsA and RA patients. The top 50 proteins providing the AUC are listed in (Suppl. Table 2).
SOMAscan and Luminex targeted protein analysis
To extend the breadth and depth of proteome coverage afforded by nLC-MS/MS, serum samples were subjected to analysis on 2 complementary protein measurement platforms. SOMAscan analysis supported the quantification of 1129 proteins in a subset of the patient samples PsA (n=18) and RA (n=18). Univariate analysis revealed that 175 proteins were significantly differentially expressed between PsA and RA patients (Suppl. Table 3). Multivariate analysis of the data obtained from the SOMAscan analysis revealed that it was possible to discriminate PsA from RA patients with an AUC of 0.73 (Table 2) (ROC plot Suppl. Fig 1B).
Based largely on their known importance in PsA and RA (from a literature review [3]), 48 proteins were selected for analysis using in-house developed multiplexed Luminex assays [10]. Of the 48 proteins targeted, 23 were identified in every sample. T-tests revealed that 4 proteins; IL-18 (p ≤ 0.001), Il-18 binding protein (BPa) (p ≤ 0.05), hepatocyte growth factor (HGF) (p ≤ 0.05) and tumour necrosis factor receptor superfamily member 6 (FAS) (p ≤ 0.05) were differentially expressed between PsA and RA samples (Suppl Fig 2). Random forest analysis of the Luminex data showed that patients could be segregated with an AUC of 0.64 (Table 2) (Suppl Fig 1C). In comparison to the nLC-MS/MS analysis, the candidate protein biomarker discovery by both SOMAscan and by Luminex yielded data sets with reduced predictive power and therefore the subsequent evaluation process was streamlined to focus only on proteins identified by nLC-MS/MS.
LC-MRM verification of nLC-MS/MS identified biomarkers
Multiple reaction monitoring (MRM) is a targeted MS technology which is increasingly used to support candidate biomarker evaluation following LC-MS/MS and other protein discovery approaches. Both the cost of MRM analysis and the time required to develop and optimise MRM assays are considerably less than antibody-based methods [27]. For these and other reasons, MRM based measurement of the nLC-MS/MS-identified proteins represented an attractive approach for verification and evaluation of their biomarker performance. The multiplexing capabilities afforded by MRM facilitated the development of an assay that included the top-ranking discriminatory candidate proteins from univariate and multivariate analysis of the nLC-MS/MS discovery data described above but also allowed for the inclusion of additional proteins identified previously during studies of pooled patient samples (data not shown). During a first verification phase, a total of 233 proteins represented by 735 peptides and 3735 transitions (5 per peptide) were brought forward for MRM assay development. Of the 233 proteins brought forward it was possible to develop analytically validated assays for 150 of them, represented by 299 peptides; the remaining candidates were either undetectable in crude serum or had assays that did not meet our analytical criteria plasma (Of the 50 priority proteins listed in Suppl Table 2, 33 were included in the assay). This MRM assay panel was then used to verify the candidate proteins in the discovery cohort (n=60). It is noteworthy that to minimise any technical bias both the pre-analytical processing and MRM analysis of was undertaken in a randomised manner. Random forest analysis revealed that using this MRM assay panel, it was possible to discriminate PsA from RA with an AUC of 0.79 (Figure 3A). During a second verification phase, an additional 23 proteins, identified as being discriminatory in other analyses of inflammatory arthritis patients, were added to the initial MRM assay panel, giving a new total number of proteins of 173 (represented by 334 peptides) [28]. This expanded panel was used to measure protein candidates in an independent verification cohort of 95 PsA and 72 RA patients (Table 1). Seven synthetic isotopically labelled (SIL) peptides were incorporated into the assay to control for potential analytical variation. Summed intensity values from the SIL peptides were used to normalise patient data. Random forest analysis revealed that PsA patients could be separated from those with RA with an AUC of 0.85 (Figure 3B). The proteins ranked most important in providing the AUC values are reported in Table 3. The data demonstrate clear overlap between proteins used to segregate PsA and RA patients included in the discovery and verification cohorts. The differential expression levels of these overlapping proteins are illustrated in Suppl Fig 3. To this end, Alpha-2-HS-glycoprotein (A2AGL), Alpha-1-antichymotrypsin AACT), Haptoglobin (HPT), Haptoglobin-related protein (HPTR), Rheumatoid factor C 6 light Chain (V-Kappa-1) were found to be significantly upregulated in RA patients compared to PsA when measured by MRM in the cohorts included in phase I and phase II. Alpha-1-acid glycoprotein (A1AG) and coagulation factor XI (FA11) were also found to be upregulated in RA compared to PsA during both rounds of verification, however the observation only reached significance during the second phase. In contrast thrombospondin 1 (TSP1) was found to slightly upregulated in RA patients during the first phase but significantly upregulated in PsA patients during the second evaluation phase. These observations confirmed the validity and performance of the initially developed classifier, but also demonstrated how further development of the assay enhanced performance of the predictive algorithm. This on-going evolution of the MRM assay panel and associated artificial intelligence (AI) and machine learning algorithms represents a new and powerful approach to biomarker development.
Finally, there are at least two potential routes to implementing a multiplexed protein biomarker panel in the clinical setting. One is to use MRM assays and the other to develop antibody-based assays to the proteins of interest. To explore the extent to which MRM data may align with ELISA we compared our MRM data for CRP with results obtained by standard clinical laboratory ELISA. MRM measurements were compared to the ELISA measurements in the 60 samples from the discovery set. It was not surprising to find that serum levels of CRP were significantly upregulated in patients with RA as compared to those PsA when measured by both ELISA (p ≤ 0.0009) and MRM (p ≤ 0.0006) (Figure 4B). Furthermore, CRP values from both platforms were strongly correlated (R2 = 0.8345) (Figure 4C) indicating that a measurement made by MRM can give a value similar to that obtained by an existing immuno-assay.
Discussion
PsA is a complex disease with diverse manifestations; the clinical features observed within individuals with PsA often vary substantially but also overlap with other diseases. While differentiating between PsA and RA can be clinically challenging because of the similarities in their clinical presentation [29], making an accurate diagnosis is increasingly important in order to determine which therapeutic strategy will optimise clinical and radiographic outcomes [30]. Only a few studies have investigated whether there are biomarkers which discriminate between PsA and RA. In one study of synovial tissue, mRNA for VEGF and ANG-2 were elevated in PsA patients compared to RA [31]. Obtaining a synovial biopsy however is an invasive procedure and the discomfort, time and cost associated with tissue sampling makes it undesirable for use in routine clinical practice [31, 32]. More recently, Siebert et al. identified 170 urinary peptides which discriminated between patients with long-standing PsA from other arthropathies including early RA with an AUC of 0.97 [33]. These results are promising but urine collection is especially vulnerable to physiological variation arising from diet and liquid intake. Additionally, urine tends to be a very dilute matrix high in salt and low in protein concentration. Thus, in the absence of stepwise workflows for sample concentration and clean up, the quantification of proteins in urine can prove difficult as a result interfering signals present in the matrix [34].
Serum is well recognised as a suitable matrix for biomarker discovery, not least because proteins are shed from relevant affected tissue into the circulation but also because it is readily obtained under standardised operating procedures [35]. Hence, our studies were using serum samples analysed on three proteomic platforms (nLC-MS/MS, SOMAscan and Luminex), each capable of measuring a limited but complementary range of proteins present at different abundance levels. This approach was adopted in order to maximise coverage of serum proteome and to date it is the most comprehensive analysis of the serum proteome in patients with PsA and RA. Although 3 platforms were used to identify putative biomarkers, results from the nLC-MS/MS analysis were the most discriminatory compared to the Luminex and SOMAscan platforms. A potential reason for this is that LC-MS/MS analysis allows for unbiased discovery of biomarkers whereas the other approaches are limited by having fixed panels of protein markers. Furthermore, the SOMAscan platform uses a single aptamer to capture proteins thus potentially reducing the specificity of readouts [36]. It is also possible that the smaller number of patient samples used in the SOMAscan experiments may have constrained the statistical power of the analysis. With respect to the Luminex platform, the 48 carefully selected proteins we measured may not have included key candidate cytokines and chemokines which could be support the discrimination between PsA and RA. The proteins were selected based on their known importance in the pathogenesis of PsA and RA but the panel was limited by the availability of proteins measurable with the in-house assay. With no compelling evidence to justify the time and the cost required to develop further multiplex antibody and/or aptamer assays, we instead focused on the nLC-MS/MS data and performed follow up studies using multiple reaction monitoring (MRM).
MRM represents an excellent tool for supporting large scale multi-protein biomarker studies. It is typically used to refine an initial list of candidate proteins derived from discovery experiments to the subset that may truly address the clinical question under study [37]. MRM analysis is performed on triple quadrupole (QqQ) mass spectrometers which inherently have higher sensitivity and greater linear dynamic range than the orbitrap mass spectrometer used in the discovery experiments here. This boost in sensitivity facilitates the detection of lower abundant proteins in complex samples and therefore reduces the need for sample pre-enrichment steps [38]. Thus, MRM supports more robust workflows as well as time and cost-effective assay development compared to traditional antibody-based approaches. MRM is frequently less sensitive than an equivalent immuno-assay and it was also for this reason that we did not initially attempt to develop MRM assays for putative markers identified only by the SOMAscan or the Luminex analysis [17, 39]. The development of immunoMRM assays to these candidate biomarker proteins represents an obvious way in which improving the performance of the existing panel could be explored [40].
From the two phase of MRM analysis carried described here it was especially interesting to note that a subpanel of 8 proteins (Leucine-rich-alpha-2-glycoprotein, alpha-1-antichymotrypsin, haptoglobin, haptoglobin -related protein, rheumatoid factor C6 light chain, alpha-1-acid glycoprotein 1, coagulation factor XI, thrombospondin-1) were identified as highly discriminatory during the initial verification phase were again confirmed as highly discriminatory during the second evaluation phase. Follow up T-test analysis was performed on this set of proteins and it was found that 7 out of 8 proteins were upregulated in RA compared to PsA during both phases of analysis. Whereas thrombospondin-1 was found to be significantly upregulated in PsA vs RA during the second phase, whereas no significant difference was observed in the initial. This discordance may relate to differences in patient numbers included in the 2 phases or it may relate to the differences in the patients included with the initial phase subjects having early onset, treatment-naive disease whereas those included in the second phase had longer standing disease and were receiving therapy. This in part highlights the advantage of maintaining large panels of proteins for on-going evaluation in patient cohorts. Further analysis of this 8 protein subpanel was carried out using a web-based resource “Search Tool for the retrieval of Interacting Genes/Proteins” (STRING) https://string-db.org/cgi/network.pl revealing the biological functions of these 8 markers of interest, see Suppl Table 4 for a description. It is interesting to note that this panel is enriched for proteins functionally involved in structural remodelling, angiogenesis, homeostasis and transportation. This perhaps is not surprising since PsA and RA are characterised by an increase in bone turnover and dysregulated angiogenesis.[3, 41]. The radiographic features in PsA and RA can be quite different, with bony erosion observed in both conditions but osteoproliferation only seen in PsA [3]. Hence, in the context of this investigation, it was not unanticipated, that markers of structural remodelling contributed to an algorithm discriminating between individuals with PsA and RA.
Here we demonstrated that a major advantage of using MRM is that it allows the investigator to rapidly adapt a panel to include new candidate biomarkers. We have also shown with our CRP assay developed on MRM over a few days that values highly correlated with those made by ELISA.
Our study has several strengths, including the comprehensive and logical approach to biomarker development. Limitations include the modest number of patient samples in both study phases as well as the absence of healthy and disease controls. Differentiating between PsA from RA is the focus of the current study but it is not the only challenge faced by clinicians, as it can also be challenging to distinguish PsA from other arthropathies and from patients who have skin psoriasis only [14]. This certainly represents a future objective and assessing this biomarker panel in the appropriate additional cohorts is a critical next step. It is noteworthy that the independent cohort included in the second phase of evaluation included patients that had long-standing disease compared to the discovery cohort which were defined as early onset. The performance of the panels may reflect a genuine difference in the protein profile between PsA and RA patients at different stages of disease progression and further work in a larger number of patient samples, will be required to establish if the biomarkers identified here could also be used to distinguish PsA from other diseases and from healthy individuals, at early stages of the disease or as we anticipate, the panel while suited for initial intended use will benefit from further development. Finally, although disease controls weren’t included in our present analysis it is worth highlighting research by Chandran et al. that identified differences in serum proteins is patients with PsA compared to patients with osteoarthritis [42] and psoriasis [43]. The protein markers identified in these studies are prime candidates which should be included in future generations of the panel MRM assays.
At present there is no diagnostic test for PsA and as a result, the diagnosis is often late or missed resulting in functional consequences to the patient [12, 44]. With at least 20 % of individuals referred to early arthritis clinics having PsA, there is an urgent need to develop a test to support early detection of this disease [45]. The work described here-in represents a significant contribution towards the development of such a test. Fundamental next steps have been outlined and the MRM approach is ideally suited to support the large-scale studies required to develop and validate a robust panel of discriminatory biomarkers. We believe with further development it will be possible to establish a diagnostic test for PsA which will reduce diagnostic delay, inform treatment selection and improve both short-term and long-term outcomes.
Data Availability
Data is included in suppl material. Any additional information is available upon request.
Supplementry Figures & Tables
Suppl Doc 1
Standard Operating Procedure for Serum Processing from Whole Blood
This Standard Operating Procedure is developed in accordance with the International Conference on Harmonisation (ICH) Harmonised Tripartite guideline for Good Clinical Practice (GCP) and Clinical Trials on Products for Human Medicinal Use Regulations 2004 to 2006 (SI 190 of 2004 & SI 374 of 2006), where applicable.
The Standard Operating Procedure applies to all staff involved in MIAMI projects.
Objectives: To ensure the biomarker discovery and validation (mRNA, miRNA and proteomic) samples are collected, processed and stored in an appropriate manner.
Scope: This SOP applies to the proteomic studies of MIAMI (WP4)
Definition: Collection, processing and storage of study samples.
Prepared by: Stephen Pennington, Conway Institute, University College Dublin with input from Phil Gallagher, St Vincent’s University Hospital, Dublin
Procedure:
All sample processing information should be recorded and stored in the sample database associated with the sample i.d./labels.
Gloves and laboratory coat should be worn throughout sample processing to protect the sample from contamination (this includes labelling the tubes prior to sample processing). All tubes and pipettes should be covered at all times to minimise dust contamination.
Blood samples should be collected using a vacutainer to avoid haemolysis.
Three 7.5 ml / 10 ml anti-coagulant free tube (red top serum tubes) will be used to collect serum.
Filled blood tubes should be left to sit upright after blood is drawn at room temperature for a minimum of 30 to a maximum of 60 minutes to allow the clot to form.
Centrifuge the blood sample at the end of the clotting time (30-60 minutes)
for 15 minutes at 1800 g at room temperature. If the blood is not centrifuged immediately after the clotting time, the tubes should be refrigerated (4°C) for no longer than 4 hours prior to centrifugation.
In all cases the time of clotting and temperature the samples were subjected to should be recorded.
Using pipette transfer the serum from the vacutainers into a larger tube and mix.
Pipette serum into the labelled cryovials in aliquots of 1000 μl. Close the cap on the vials tightly. This process should be completed within 1 hour of centrifugation.
Note: To avoid picking up red blood cells when aliquoting keep the pipet above the red blood cell layer and leaving a small amount of serum in the tube.
Samples should be labelled as described in MIAMI SOP:OO2.
Place all aliquots upright in a specimen box and store at -80°C.
Footnotes
↵* Oliver FitzGerald and Steve Pennington are joint co-leads on this work
This work was supported by the European Commission under the EU FP7 Programme ‘MIAMI’ (Monitoring innate Immunity in Arthritis and Mucosal Inflammation) project grant (ref. 305266) and Health Research Board grant ref. HRA-HSR-2015-1284. The UCD Conway Institute is funded by the program for research in Third Level Institutions as administered by the Higher Education Authority of Ireland.