Identification and Analysis of Novel Synovial Tissue-based Biomarkers and Interacting Pathways for Rheumatoid Arthritis
=======================================================================================================================

* Paridhi Latawa

## Abstract

Rheumatoid arthritis (RA) is an inflammatory disorder with autoimmune pathogenesis characterized by the immune system attacking the synovium. It is a clinically heterogeneous disease that affects approximately 1.2 million Americans and 20 million people worldwide. It is advantageous to diagnose RA before extensive erosion as treatments are more effective at early stages. RA treatments have made notable progress, yet a significant number of patients still fail to respond to current medication and most of these come with harmful side effects. While the mechanistic reason for such failure rates remains unknown, the cellular and molecular signatures in the synovial tissues of patients with RA are likely to play a role in the variable treatment response and heterogeneous clinical evolution. While blood-based criteria are currently employed for diagnostics and treatments. such serologic parameters do not necessarily reflect biological actions in the target tissue of the patient and are relatively nonspecific to RA. Synovial tissue-based biomarkers are especially attractive as they can provide a confirmed diagnosis for RA. The shortage of accurate synovial tissue-based identifiers for RA diagnosis encouraged this research. This study analyzed data from several gene expression studies for differentially expressed genes in donor synovial tissue. Bioinformatics tools were used to construct and analyse protein interaction networks. Analysis deduced that regulating hematopoietic stem cell migration could serve as a potential RA diagnostic. VAV1, CD3G, LCK, PTPN6, ITGB2, CXCL13, CD4, and IL7R are found to be previously unclassified, potential biomarkers.

## Introduction

Rheumatoid Arthritis is a clinically heterogeneous and complex autoimmune disease that affects approximately 1.3 million Americans and 20 million people worldwide14. The disease manifests between the age of 20 and 40 years, being twice more common in women than men. According to the World Health Organization (WHO), within ten years of the onset of RA, at least 50% of patients in developed countries have to discontinue a full-time job due to the disability that ensues with RA8. RA increases inflammation throughout the body, leading to the risk of heart disease and attack on the pericardium. It can also lead to obesity as the joint pain results in decreased exercise among patients49. RA is characterized by the release of inflammatory chemicals by the immune system that attacks the synovium, which is the tissue lining around a joint that produces a fluid to help the joint move smoothly. The inflamed synovium thickens and causes the joint area to feel painful and tender. Uncontrolled inflammation can cause destruction and wear down the cartilage, leading to joint deformities and painful bone erosion37. Early diagnosis of RA is essential to promote effective treatments. However, it is relatively difficult to do so, since RA’s early-stage symptoms are usually subtle and generic, such as fatigue, slight fever, and stiffness38.

Established RA displays clinical heterogeneity, reflected by variable diagnosis leading to the unpredictable, rapid progression of structural damage and inconsistent response to therapy26. There have been advancements in synthetic and biological therapies that target specific immune-mediated pathways, but a significant number of patients still fail to respond to current medication--with only 20-30% reaching low disease activity status26. The mechanistic reason for such failure rates remains unknown. However, the cellular and molecular variation found in the synovial tissue of patients with long-standing RA is likely to play a role in the variable treatment response and heterogeneous clinical evolution. RA is partially due to genetics, therefore it could potentially be diagnosed through genetic screening, but other cases arise spontaneously or due to environmental factors and autoimmunity24.

Early detection and personalized approaches to RA treatment are known to improve patients’ treatment outcomes, minimizing or preventing joint destruction51. Therefore, diagnostic tests and biomarkers are in demand for RA. There are various likely avenues for biomarker discovery, and researchers are actively exploring various factors like antibodies and serum. While ACR & EULAR criteria are based mostly on blood tests that measure the ESR and levels of CRP, RF, & ACPAs, such serologic parameters do not necessarily reflect biological actions in the target tissue of the patient and are relatively non-specific to RA4, 17. Synovial tissue biomarkers are especially attractive as they can provide a confirmed diagnosis of RA.

Currently, there is still a shortage of an accurate synovial tissue-based biomarker panel for RA diagnosis. Besides diagnosis tests, synovial-tissue-based biomarkers could be applied to develop novel RA treatments, developing therapeutics that can halt or reverse RA progression.

Tissue-based molecular biomarkers can be profiled using various methods, such as microarrays and RNA-seq, both of which are used to perform gene expression studies42. Microarray data has been previously used to identify biomarkers for other autoimmune and skeletal system disorders, such as osteoarthritis (OA)23. Differential gene expression among RA patients, OA patients, and controls provides detailed information about the mechanisms of RA and could offer specific valuable biomarkers for diagnosis. Studies have previously identified RA biomarker genes through RNA-seq and microarray analysis. For example, one research group identified IL7R as a potential RA biomarker using microarray data53. Anti-Cyclic Citrullinated Peptide and Rheumatoid Factor have also been established as serum biomarkers44.

Although gene expression methods are instrumental, there are often discrepancies between data due to differences in methodologies or samples, meaning that genes identified as significant in one study might be not so in another. To get greater confidence in biomarkers, it is important to identify genes that are significant across multiple studies, which decreases the likelihood of genes being expressed due to chance.

In this study, the microarray data from 3 separate studies were analyzed to deduce biomarkers for RA. The studies gathered data on gene expression based on the synovial tissue of RA and control patients. DEGs of interest were identified across studies, and their interactions were studied further.

## Methods

### GEO Dataset Selection

NCBI has created a data repository called Gene Expression Omnibus (GEO) that contains public gene expression data from various studies3, 16. Datasets matching “synovial,” “Rheumatoid Arthritis,” “expression profiling by array,” and “*Homo sapiens*” were searched in GEO. The GSE55235, GSE2053, and GSE1919 datasets were selected for this study47, 48, 50. Through three studies, a total of 19 RA patients and 19 controls were analyzed. All three studies used microarrays for collecting gene expression data from RNA obtained from synovial tissue or synovial membrane was profiled. Microarrays are a chip-based technology that uses probes to capture strands of complementary DNA (cDNA), which is reverse transcribed from the mRNA in the tissue samples, to measure the level of gene expression20.

### GEO2R Differential Expression Analysis

GEO provides the GEO2R tool which was used to analyze the three datasets3, 16. GEO2R processes gene expression data to identify differentially expressed genes (DEGs) between user-defined groups, in this case, RA and control. For each dataset, thousands of genes were found by GEO2R as statistically significant. T-tests were then used to determine p-values, and GEO2R calculated the fold change for each gene. Fold change is the ratio of the average expression of a gene in one group divided by the average expression in a different group, in this case, comparing RA expression values vs. control values. Some genes are overexpressed in RA while others are underexpressed in RA to controls, hence fold change can be greater than or less than one.

The normal distribution of gene expression values using box-plotting was verified using GEO2R and no outliers were present. Since the gene expression values were all relatively similar, normalization was not required.

Google sheets were used to process the DEGs obtained from GEO2R. DEGs with p-values greater than 0.05 were removed, and the remaining DEGs were then sorted into the two categories of overexpression and underexpression, as some genes were less expressed in RA than the control, while others are more highly expressed in RA. Only genes with a fold change greater than 1.25 were kept.

### Classification of Common Genes

A Venn Diagram for the classification of genes in each of the three studies was generated using the limma package in R studio39. Genes found in only one of the three studies were removed from the list. Genes found in only two of the three studies were saved separately from the genes found in all three studies.

### STRING and Cytoscape Network Analysis

The STRING database was used to analyze the two final lists of DEGs, including genes found in all three datasets and the genes found in two out of three datasets. STRING was used to create a network of the protein-protein interactions, and calculate statistically significant Gene Ontology (GO) processes and pathways43. A tab-separated value (TSV) file containing the text version of the gene graph was inputted into Cytoscape, a software that can perform more robust analyses and functions on networks40. Cytoscpae’s Network Analyzer tool calculated gene network metrics like clustering coefficient and node degree, which is the number of genes connected to a particular gene. When comparing the DEGs found in the list with at least two of the three datasets, hub genes were genes with a node degree of ≥ 8. When comparing the DEGs found in the list with all three datasets, hub genes with a node degree of ≥ 3.

### Gene Ontology Analysis

The biological pathways, processes, and genes linked to them are stored in a database called Gene Ontology (GO) 2, 19. The built-in PANTHER tool in GO accepts a gene list and identifies relevant biological mechanisms29. Enriched biological processes and molecular functions were isolated from the two DEG lists that were submitted to PANTHER.

### Kegg Analysis

The hub genes identified were inputted into the Kyoto Encyclopedia of Genes and Genomes (KEGG) database to analyze the high-level function of the biological systems that the inputted genes are involved in22.

### Reactome Analysis

The genes found in all three datasets were inputted into Reactome to analyze the cellular pathways involved. Reactome is a browser-based database storing information on molecular details of various biological pathways21.

## Results

GSE55235, GSE2053, and GSE1919 were analyzed in GEO2R, which generated a list of DEGs. The lists were downloaded, and gene expression value distribution was confirmed to be normal with no outliers based on box-plots of the sample expression distribution.

![Figure 1,2,3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F1.medium.gif)

[Figure 1,2,3:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F1)

Figure 1,2,3: 
Box plot for gene expression value distribution for GSE55235, GSE2053, and GSE1919. There are no units for the y-axis. The box plots represent the gene expression values for patient samples.

Cut-off criteria for genes were p-value < 0.05. In GSE55235, 5754 genes passed the criteria, in GSE2053 960, and in GSE1919 2220. There was substantial overlap between studies. 64 genes were found in all three datasets, while 1264 genes were found in two out of three datasets.

![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F2.medium.gif)

[Figure 4:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F2)

Figure 4: 
The Venn diagram shows the number of genes that are shared between GSE55235, GSE2053, and GSE1919. 64 genes were differentially expressed in all three sets.

First, the genes that were common in two out of the three microarray dataset were analyzed. Out of the 1264 shared genes in all three datasets, further cut-off criteria were applied: fold change >= 1.2511. 320 genes passed all cut-off criteria and were submitted to the STRING tool43. 292 out of 320 genes were successfully mapped, as shown in the network below. Solitary genes with no interactions are hidden. The minimum confidence score for protein interactions was set to 0.900 (p-value = 0.009). There were 559 connections between proteins instead of 234 expected interactions, indicating that the network has significantly more interactions than prognosticated of a random sample.

![Figure 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F3.medium.gif)

[Figure 5:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F3)

Figure 5: 
STRING network of protein interactions between 292 DEGs.

The colors of the nodes are arbitrary, and the colors of the edges connecting the nodes display how the interaction was determined. STRING identifies several biological processes and interactions as significant among the inputted genes.

Next, Cytoscape software was used to analyze network characteristics, particularly node degree, to find hub genes.

View this table:
[Table 1:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T1)

Table 1: 
Cytoscape generated clustering coefficients and node degrees for the genes in the network. The clustering coefficient measures how clustered nodes are around a single node, specifically the hub genes. A clustering coefficient of 1 represents the maximum density of a node cluster while 0 is the least dense.

Genes with a node degree of at least 13 are presented, with VAV1, CD3G, LCK, PTPN6, ITGB2, and CD3D with the highest degree.

STRING found 26 gene products that interacted with VAV1. VAV1 codes for an intracellular signal transduction protein involved in T cell proliferation, development, and signal transduction. It has been suggested that VAV1 is involved with rheumatoid arthritis as well as osteogenesis, lymphopoiesis, cardiovascular homeostasis, various cancers, autoimmune diseases, and ML31.

The PANTHER tool was then used to analyze the gene lists. PANTHER was set to Annotation Data Set: GO biological process complete, Test Type: Fisher’s Exact, and Correction: Calculate False Discovery Rate (FDR).

PANTHER identified several statistically significant biological processes45. Notably, the “negative regulation of antigen processing and presentation of peptide antigen via MHC class II” was enriched by 67.52 times with p-value = 1.25E-03 and an FDR of 3.21E-02. The enrichment value is a number that is the genes in the inputted list that belong to a specific biological process divided by the expected number of genes that would belong to a specific biological process if taken from a random sample of genes. A higher enrichment means more genes in an inputted list are involved in a particular biological process. “Positive regulation of hematopoietic stem cell migration”, “interleukin-10 mediated signaling pathway”, “neutrophil aggregation”, “DN2 thymocyte differentiation”, “cell-matrix adhesion involved in amoeboid cell migration”, and “positive regulation of type III interferon production” were also enriched 67.52 times.

View this table:
[Table 2:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T2)

Table 2: 
panther outputs a table showing biological processes, ranked by fold enrichment, that are overrepresented or under-represented among the list of genes. GO accession terms are provided for each process.

Many GO molecular functions were also altered. The deoxycytidine deaminase activity function was enriched by 25.32 times with a p-value of 4.66E-04 and an FDR of 4.73E-02. Receptor activity and protein binding appear to be altered in RA.

View this table:
[Table 3:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T3)

Table 3: 
panther outputs a table showing the various molecular functions that are over or under-represented among the list of genes. GO accession terms are provided for each function.

The genes that were only present in all three sets were also analyzed.

Out of the 64 shared genes in all three datasets, further cut-off criteria were applied for stringency: fold change >= 1.251. 24 genes passed all cut-off criteria and were submitted to the STRING tool. 17 out of 24 genes were successfully mapped, and the following graph was created. Solitary genes are hidden to improve clarity. The minimum confidence score for whether a protein interaction existed was set to 0.400 (p-value = 0.004). There were 16 connections between proteins instead of an expected 3, indicating that the network has significantly more interactions than predicted of a random sample.

![Figure 6:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F4.medium.gif)

[Figure 6:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F4)

Figure 6: 
STRING network of protein interactions between 17 DEGs. IL7R, a purple node, is shown to have many interactions with other proteins.

Cytoscape software was then used to analyze the node degree network characteristic to identify hub genes.

View this table:
[Table 4:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T4)

Table 4: 
Cytoscape generated clustering coefficients and node degrees for the genes in the network. The genes with a node degree of at least 3 are presented: CXCL13, CXCL9, CXCL10, CCL19. CD4, and IL7R.

STRING found 4 gene products to interact with CXCL10. CXCL10 codes for a chemokine and ligand for the receptor CXCR3. Binding of the CXCL10 protein to CXCR3 results in the stimulation of monocytes, natural killer migration, & T-cell migration7. This gene has been known to be involved in cytokine storm inflammatory response. CXCL10 was upregulated in all three microarray datasets.

The PANTHER tool was then used to analyze the gene lists. The previously mentioned settings were utilized.

PANTHER identified a number of statistically significant biological processes. The process “regulation of T cell chemotaxis” was enriched >100 times, with p-value 1.36E-04 and FDR of 1.87E-02. “Regulation of myoblast fusion”, “positive regulation of cell adhesion mediated by integrin”, “regulation of lymphocyte chemotaxis”, “regulation of syncytium formation by plasma membrane fusion”, and “positive regulation of cytokine secretion” were also enriched.

View this table:
[Table 5:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T5)

Table 5: 
panther outputs a table showing the various biological processes that are over or under-represented among the list of genes.

Many GO molecular functions were also altered. CCR10, CXCR3, and CXCR chemokine receptor binding functions were enriched by >100 times, which is quite substantial. Receptor binding functions appear to be commonly altered in RA as well.

View this table:
[Table 6:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/T6)

Table 6: 
panther outputs a table showing the various molecular functions that are over or under-represented among the list of genes.

These genes are then inputted into Reactome to analyze the pathways that were involved21. Pathways in the immune system, related to signal transduction, transport of small molecules, gene expression, disease, cell cycle, extracellular matrix organization, and programmed cell death were shown to be involved with the inputted list of genes.

![Figure 7:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F5.medium.gif)

[Figure 7:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F5)

Figure 7: 
Reactome outputs a map of pathways. Gray pathways represent inputted genes that are not involved in the respective pathways, while yellow-colored pathways are those involved with the inputted genes.

## Discussion

Rheumatoid arthritis is a common autoimmune disease that is often difficult to diagnose at early stages due to the lack of specific symptoms. A synovial tissue-based gene biomarker approach may be beneficial in diagnosing patients earlier to receive more effective treatment.

Analysis of the 3 microarray datasets on RA gene expression yielded 344 DEGs found across studies. These genes pass criteria for p-value and fold change and are attractive for further research as they are both biologically relevant and reproducible. While 320 genes were present in two out of three studies, only 24 were found in all datasets, demonstrating how variable gene expression studies can be and underscoring the need for this study.

VAV1, CD3G, LCK, PTPN6, and ITGB2 had at least 13 interactions with other genes, indicating that they play an essential role in RA. VAV1, LCK, CD3G, and CD3D were located in a cluster along with other genes, such as LAT, CFIP2, CD86, and ZAP70. CXCL13, CXCL9, CXCL10, CCL19, CD4, and IL7R were found in all three datasets and had at least three interactions with other genes, indicating that they may also play an important role in RA. CXCL13, CXCL9, CXCL10, and CCL19 were located in a cluster.

VAV1 was shared across two studies and was a hub gene with degree 26. There is a suggested association between the VAV1 gene rs2617822 polymorphism and RA, yet this research was only preliminary [31]. This study confirms the need for an investigation into the role of VAV1, and other VAV family proteins, in RA pathogenesis. The aforementioned genes that VAV1 was found clustered with, namely LCK, CD3G, CD3D, LAT, CFIP2, CD86, and ZAP70, indicate a potential signal transduction pathway that is useful in RA pathogenesis.

CD3G is a hub gene with degree 22. It is known to play an essential role in the regulation ofT-cell receptor (TCR) expression at the cell surface and is also involved in signal transduction in T-cell activation12. Currently, there is interest in elucidating the pathways that drive the production of cytokines and other inflammatory mediators in RA. However, the current kinases that are being researched are ubiquitously expressed, which can lead to side effects15. There is a need for more specific targets that can lead to the complete resolution of chronic inflammation. CD3G is not ubiquitously expressed, meaning it can serve as a specific target. Further elucidation on this could lead to the identification of successful therapeutic targets6.

![Figure 8:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F6.medium.gif)

[Figure 8:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F6)

Figure 8: 
Expression analysis of HPA RNA-seq normal tissues. RNA-sequencing was performed on tissue samples from 95 human individuals representing 27 different tissues to determine the tissue-specificity of the CD3G gene. The CD3G gene is not ubiquitously expressed, indicating that it can serve as a specific therapeutic target.

LCK is a gene that is involved in T-cell signaling. T cells are components of the immune cell infiltrate that have been detected in the joints of RA patients28. Since T cell activation is central to mounting the immune response, the inhibition of LOCK blocks T cell activation and suppresses the immune response. While there has been an indication that LCK can serve as an ideal target for the development of novel immunosuppressant agents, extensive experimentation has not been conducted.

PTPN6, also known as SHP-1, modulates signaling through tyrosine-phosphorylated cell surface receptors and plays a role in hematopoiesis46. There has currently only been one study that investigates the role of SHP-1 in arthritis, and this study has only been conducted using animals overexpressing this phosphatase27. The activation of the SHP-1 coil is considered as a potential treatment of RA.

CXCL13, CXCL9, CXCL10, CCL19, CD4, and IL7R were found in all three datasets and had at least three interactions with other genes, indicating that they may also play an important role in RA. CXCL13, CXCL9, CXCL10, and CCL19 were located in a cluster.

CXCL13, which is involved in B-cell and tumor cell responses, has been found to be important for the diagnosis of early RA, having had a superior performance as compared to RF and anti-CCP, which have been the past biomarkers for RA1. Therefore, CXCL13 should be considered a potential biomarker of RA disease activity.

IL-7, the gene that codes for a protein that is a hematopoietic growth factor that is secreted in the bone marrow, and IL-7R, coding for a protein found on the surface of cells, have been linked to RA. IL-7 has been found to have a role in RA angiogenesis34. IL-7R should be explored further to explore its relation to RA pathogenesis.

CCL19 has also been found to have a role in RA angiogenesis33. CXCL10 signaling has been found to be involved in the pathogenesis of RA25. CXCL9 has been found to have increased serum levels when evaluated by ELISA35. CXCL9 may serve as a potential biomarker for RA These six genes were inputted into KEGG. As shown in the table below, KEGG indicates that 5 out of the 6 genes are involved in the Cytokine-cytokine receptor interaction pathway. This pathway is involved in the Chemokine signaling pathway, which is associated with leukocyte transendothelial migration and cytokine production. Chemokines have been found to be involved in RA pathogenesis, especially in CCL2 expression, which may contribute to chronic inflammation that is associated with RA52. Targeting the Chemokine signaling pathway as well as the cytokine-cytokine receptor interaction pathway as potential avenues of therapeutics for RA should be explored.

![Figure 9:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/26/2021.03.19.21253995/F7.medium.gif)

[Figure 9:](http://medrxiv.org/content/early/2021/03/26/2021.03.19.21253995/F7)

Figure 9: 
KEGG Mapper Pathway results for CXCL13, CXCL9, CXCL10, CCL19, CD4, and IL7R. 5/6 genes found to be involved in the Cytokine-cytokine receptor interaction pathway - hsa04060. CD4 is not shown in the table as it is in the non-classified section, but it was also found to be interacting in this pathway.

While the hub genes are promising, other genes among the 344 may be of interest as well. “Negative regulation of antigen processing and presentation of peptide antigen via MHC class II” genes are significantly over-represented in the list of DEGs (67.52 times enriched). Peptide bodies have been explored as serum biomarkers10. Further research could be done on harnessing the effects of positive stimulation for RA therapeutics.

Genes involved in “positive regulation of hematopoietic stem cell migration” were also overrepresented by 67.52 times. This implies that Hematopoietic stem cells (HSCs), which give rise to other blood cells through Hematopoiesis, migrate to the defined microenvironments in the bone marrow in which they can proliferate and survive9. Further research could be done exploring HSCs as an indicator of RA pathogenesis.

“Regulation of T cell chemotaxis” genes were expressed>100 times in the list of DEGs found in all three sets. T cells and other immune cells do migrate to the synovial tissue during RA, which contributes to its pathogenesis28. While studies have dashed the promise of targeting individual cytokine receptors of RA, alternative strategies that are aimed at the interactions between the chemokine signaling pathways presented by KEGG and the other involved genes from all three datasets should be considered for RA therapy.

Reactome presented TICAM-1 deficiency in RA. The TICAM 1 gene codes for a protein that mediates protein-protein interactions between toll-like receptors (TLRs) and signal transduction components13. While Toll-like receptors have potentially been linked to RA, not much has been found in TICAM-1 deficiency in RA.

There was also an overrepresentation of the Adenylate cyclase inhibitory pathway. In this pathway, Adenylate cyclase is inhibited, the production of cAMP from ATP is inhibited, which ultimately decreases the activity of cAMP-dependent protein kinase21. Becker et al. found that the caAMP pathway can serve as a therapeutic target in autoimmune and inflammatory diseases, as cAMP (Cyclic adenosine monophosphate) is a regulator of innate and adaptive immune cell function36. There is a plethora of evidence that suggests that the activation of the innate immune system is linked to RA. cAMP as a therapeutic target is a potential avenue that could lead to better therapeutic development for RA.

Defective SFTP2 causing IPF was also overrepresented in Reactome. Mutations in SFTPA2 disrupt protein structure, and the defective protein is retained in the ER membrane which causes idiopathic pulmonary fibrosis21. While formerly RA has been linked with pulmonary fibrosis, this study links RA to the development of IPF.

The strengths of this study include the rigor applied to DEG selection. To qualify for the final list, a gene had to have a fold change of at least 1.25, a p-value of < 0.05, and be present in at least 2 datasets. The selective analysis was done on genes found in all 3 datasets. When inputted into STRING, the highest confidence interval of 0.900 was used to analyze the genes that were in at least 2 out of 3 datasets, indicating the statistical significance of the protein-protein interactions that were identified. Additionally, the samples were mixed between males and females, indicating that the significant differences in gene expression and RA pathology between sexes were taken into account in this study. Another strength is the sample size. RA is known for its clinical heterogeneity between patients. By considering 20 RA patients, there is greater assurance of obtaining a comprehensive view of the gene expression of RA than with just a few samples.

The limitations of this study are that no normalization was performed between datasets, though this is common with cross-study gene expression analysis. Additionally, RA patients were not compared with other arthritis controls. Thus, there is a degree of uncertainty around whether the DEGs found are specific to RA or also occur in, say, osteoarthritis. Yet, considering that RA and osteoarthritis differ as A involves the immune system, there can be a differentiation between the gene expression data. The studies that developed the three datasets also contain gene expression data from osteoarthritis, so the next step in the research is to evaluate the biomarkers found in this study and examine how their expression compares in RA vs. other skeletal disorders. Lastly, the results of this study should be confirmed in vitro to certify that the aforementioned pathways and hub genes are statistically significant.

The DEGs identified are useful as diagnostic markers for RA tests and may also be useful targets for novel treatments for RA. The present study elucidates several pathomechanisms, such as the negative regulation of antigen processing and presentation of peptide antigen via MHC class II, the positive regulation of hematopoietic stem cell migration, and the regulation of T cell chemotaxis, that could be avenues for research in alleviating RA symptoms. Currently, not much is known about the involvement of Hematopoietic stem cells in RA development, however, there is growing interest. Mills et al. find that RA can cause hematopoietic stem cell reprogramming30. This study’s finding that HSCs are affected through RA pathogenesis provides further evidence for Mills’s findings.

Future research directions include finding genes that can distinguish between RA and other skeletal and immune conditions. Additionally, epigenetic factors such as miRNAs, histone modification, and PTMs and their interactions with genes should be explored. Examining the role of Hematopoiesis and signal transduction pathways in RA may also yield important information.

## Data Availability

Supplementary information is available for this paper. All sources are cited, and the repository of differentially expressed genes is provided. Correspondence and requests for materials should be addressed to paridhi.latawa@gmail.com.

[https://docs.google.com/spreadsheets/d/11yDBMAnymuDmIV6B3Tc2y1ltM\_YRYu\_QE04G4mHFQHQ/edit#gid=0](https://docs.google.com/spreadsheets/d/11yDBMAnymuDmIV6B3Tc2y1ltM_YRYu_QE04G4mHFQHQ/edit#gid=0) 

## Supplement

Differentially Expressed Genes [https://docs.google.com/spreadsheets/d/11yDBMAnymuDmIV6B3Tc2y1ltM\_YRYu\_QE04G4mHFQHQ/edit?usp=sharing](https://docs.google.com/spreadsheets/d/11yDBMAnymuDmIV6B3Tc2y1ltM_YRYu_QE04G4mHFQHQ/edit?usp=sharing)

*   Received March 19, 2021.
*   Revision received March 19, 2021.
*   Accepted March 26, 2021.


*   © 2021, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Allam, S. I., Sallam, R. A., Elghannam, D. M., & El-Ghaweet, A. I. (2019). Clinical significance of serum B cell chemokine (CXCL13) in early rheumatoid arthritis patients. The Egyptian Rheumatologist, 41(1), 11–14. doi:10.1016/j.ejr.2018.04.003
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ejr.2018.04.003&link_type=DOI) 

2.  2.Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., … Sherlock, G. (2000). Gene Ontology: Tool for the unification of biology. Nature Genetics, 25(1), 25–29. doi:10.1038/75556
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/75556&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10802651&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000086884000011&link_type=ISI) 

3.  3.Barrett, T., Wilhite, S. E., Ledoux, P., Evangelista, C., Kim, I. F., Tomashevsky, M., … Soboleva, A. (2012). NCBI GEO: Archive for functional genomics data sets—update. Nucleic Acids Research, 41(D1). doi:10.1093/nar/gks1193
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gks1193&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23193258&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000312893300141&link_type=ISI) 

4.  4.Biomarker Tests for Rheumatoid Arthritis Improve Care. (2015, November 24). Retrieved from [http://blog.arthritis.org/rheumatoid-arthritis/biomarker-tests-rheumatoid-arthritis/#:~:text=To](http://blog.arthritis.org/rheumatoid-arthritis/biomarker-tests-rheumatoid-arthritis/#:~:text=To) spot RA and measure,fairly non-specific to RA
    
    

5.  5.Burkhardt, J., Blume, M., Petit-Teixeira, E., Teixeira, V. H., Steiner, A., Quente, E., … Kirsten, H. (2014). Cellular Adhesion Gene SELP Is Associated with Rheumatoid Arthritis and Displays Differential Allelic Expression. PLoS ONE, 9(8). doi:10.1371/journal.pone.0103872
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0103872&link_type=DOI) 

6.  6.CD3G CD3g molecule [Homo sapiens (human)] - Gene - NCBI. (n.d.). Retrieved from [https://www.ncbi.nlm.nih.gov/gene/917](https://www.ncbi.nlm.nih.gov/gene/917)
    
    

7.  7.CXCL10 C-X-C motif chemokine ligand 10 [Homo sapiens (human)] - Gene - NCBI. (n.d.). Retrieved from [https://www.ncbi.nlm.nih.gov/gene/3627](https://www.ncbi.nlm.nih.gov/gene/3627)
    
    

8.  8.Chronic rheumatic conditions. (2016, November 17). Retrieved from [https://www.who.int/chp/topics/rheumatic/en/](https://www.who.int/chp/topics/rheumatic/en/)
    
    

9.  9.Colmegna, I., & Weyand, C. M. (2010). Haematopoietic stem and progenitor cells in rheumatoid arthritis. Rheumatology, 50(2), 252–260. doi:10.1093/rheumatology/keq298
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/rheumatology/keq298&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20837497&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

10. 10.Cruyssen, B. V., Peene, I., Cantaert, T., Hoffman, I., Rycke, L. D., Veys, E., & Keyser, F. D. (2005). Anti-citrullinated protein/peptide antibodies (ACPA) in rheumatoid arthritis: Specificity and relation with rheumatoid factor. Autoimmunity Reviews, 4(7), 468–474. doi:10.1016/j.autrev.2005.04.018
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.autrev.2005.04.018&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16137613&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000232287200009&link_type=ISI) 

11. 11.Dalman, M. R., Deeter, A., Nimishakavi, G., & Duan, Z. (2012). Fold change and p-value cutoffs significantly alter microarray interpretations. BMC Bioinformatics, 13(S2). doi:10.1186/1471-2105-13-s2-s11
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2105-13-s2-s11&link_type=DOI) 

12. 12.Database, G. H. (n.d.). CD3G Gene (Protein Coding). Retrieved from [https://www.genecards.org/cgi-bin/carddisp.pl?gene=CD3G#:~:text=In](https://www.genecards.org/cgi-bin/carddisp.pl?gene=CD3G#:~:text=In) addition to this role,sorting motif present in CD3G
    
    

13. 13.Database, G. H. (n.d.). TICAM1 Gene (Protein Coding). Retrieved from [https://www.genecards.org/cgi-bin/carddisp.pl?gene=TICAM1#:~:text=ICAM](https://www.genecards.org/cgi-bin/carddisp.pl?gene=TICAM1#:~:text=ICAM) 1 Gene (Protein Coding)&text=This gene encodes an adaptor
    
    

14. 14.Dennis, G., Holweg, C. T., Kummerfeld, S. K., Choy, D. F., Setiadi, A. F., Hackney, J. A., … Townsend, M. J. (2014). Synovial phenotypes in rheumatoid arthritis correlate with response to biologic therapeutics. Arthritis Research & Therapy, 16(2). doi:10.1186/ar4555
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/ar4555&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25167216&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

15. 15.Drexler, S. K., Kong, P. L., Wales, J., & Foxwell, B. M. (2008). Cell signalling in macrophages, the principal innate immune effector cells of rheumatoid arthritis. Arthritis Research & Therapy, 10(5), 216. doi:10.1186/ar2481
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/ar2481&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18947379&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

16. 16.Edgar, R. (2002). Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Research, 30(1), 207–210. doi:10.1093/nar/30.1.207
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/30.1.207&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11752295&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000173077100055&link_type=ISI) 

17. 17.Fueldner, C., Mittag, A., Knauer, J., Biskop, M., Hepp, P., Scholz, R., … Lehmann, J. (2012). Identification and evaluation of novel synovial tissue biomarkers in rheumatoid arthritis by laser scanning cytometry. Arthritis Research & Therapy, 14(1). doi:10.1186/ar3682
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/ar3682&link_type=DOI) 

18. 18.Gao, Andrew. “Identification of Blood-Based Biomarkers for Early Stage Parkinson’s Disease.” MedRxiv, Cold Spring Harbor Laboratory Press, 1 Jan. 2020, [www.medrxiv.org/content/10.1101/2020.10.22.20217893v1](http://www.medrxiv.org/content/10.1101/2020.10.22.20217893v1).
    
    

19. 19.The Gene Ontology Resource: 20 years and still GOing strong. (2018). Nucleic Acids Research, 47(D1). doi:10.1093/nar/gky1055
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky1055&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30395331&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

20. 20.Heller, M. J. (2002). DNA Microarray Technology: Devices, Systems, and Applications. Annual Review of Biomedical Engineering, 4(1), 129–153. doi:10.1146/annurev.bioeng.4.020702.153438
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1146/annurev.bioeng.4.020702.153438&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12117754&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000177827800007&link_type=ISI) 

21. 21.Jassal, B., Matthews, L., Viteri, G., Gong, C., Lorente, P., Fabregat, A., … D’Eustachio, P. (2019). The reactome pathway knowledgebase. Nucleic Acids Research. doi:10.1093/nar/gkz1031
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz1031&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31691815&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

22. 22.Kanehisa, M. (2000). KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research, 28(1), 27–30. doi:10.1093/nar/28.1.27
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/28.1.27&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10592173&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000084896300007&link_type=ISI) 

23. 23.Kuo, T., & Chen, C. (2017). Bone biomarker for the clinical assessment of osteoporosis: Recent developments and future perspectives. Biomarker Research, 5(1). doi:10.1186/s40364-017-0097-4
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s40364-017-0097-4&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28529755&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

24. 24.Kurkó, J., Besenyei, T., Laki, J., Glant, T. T., Mikecz, K., & Szekanecz, Z. (2013). Genetics of Rheumatoid Arthritis — A Comprehensive Review. Clinical Reviews in Allergy & Immunology, 45(2), 170–179. doi:10.1007/s12016-012-8346-7
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12016-012-8346-7&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23288628&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

25. 25.Lee, J., Kim, B., Jin, W. J., Kim, H., Ha, H., & Lee, Z. H. (2017). Pathogenic roles of CXCL10 signaling through CXCR3 and TLR4 in macrophages and T cells: Relevance for arthritis. Arthritis Research & Therapy, 19(1). doi:10.1186/s13075-017-1353-6
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13075-017-1353-6&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28724396&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

26. 26.Lewis, M. J., Barnes, M. R., Blighe, K., Goldmann, K., Rana, S., Hackney, J. A., … Pitzalis, C. (2019). Molecular Portraits of Early Rheumatoid Arthritis Identify Clinical and Treatment Response Phenotypes. Cell Reports, 28(9). doi:10.1016/j.celrep.2019.07.091
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.celrep.2019.07.091&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

27. 27.Markovics, A., Toth, D. M., Glant, T. T., & Mikecz, K. (2020). Regulation of autoimmune arthritis by the SHP-1 tyrosine phosphatase. Arthritis Research & Therapy, 22(1). doi:10.1186/s13075-020-02250-8
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13075-020-02250-8&link_type=DOI) 

28. 28.Mellado, M. (2015). T cell migration in rheumatoid arthritis. Frontiers in Immunology, 6. doi:10.3389/fimmu.2015.00384
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2015.00384&link_type=DOI) 

29. 29.Mi, H., Muruganujan, A., Ebert, D., Huang, X., & Thomas, P. D. (2018). PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Research, 47(D1). doi:10.1093/nar/gky1038
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky1038&link_type=DOI) 

30. 30.Mills, T., Hernandez, G., Rabe, J. L., Kuldanek, S., Chavez, J., Kirkpatrick, G., … Pietras, E. (2018). Rheumatoid Arthritis Causes Hematopoietic Stem Cell Reprogramming to Maintain Functionality. Blood, 132(Supplement 1), 2573–2573. doi:10.1182/blood-2018-99-120272
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1182/blood-2018-99-120272&link_type=DOI) 

31. 31.Novel Small Molecule Inhibitor of Rheumatoid Arthritis. (n.d.). Retrieved from [https://www.sbir.gov/sbirsearch/detail/76336](https://www.sbir.gov/sbirsearch/detail/76336)
    
    

32. 32.Pawlik, A., Malinowski, D., Paradowska-Gorycka, A., Safranow, K., & Dziedziejko, V. (2020). VAV1 Gene Polymorphisms in Patients with Rheumatoid Arthritis. International Journal of Environmental Research and Public Health, 17(9), 3214. doi:10.3390/ijerph17093214
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/ijerph17093214&link_type=DOI) 

33. 33.Pickens, Sarah R., et al. “Characterization of CCL19 and CCL21 in Rheumatoid Arthritis.” Arthritis & Rheumatism, vol. 63, no. 4, 2011, pp. 914–922., doi:10.1002/art.30232.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/art.30232&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21225692&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289421100009&link_type=ISI) 

34. 34.Pickens, S. R., Chamberlain, N. D., Volin, M. V., Pope, R. M., Talarico, N. E., Mandelin, A. M., & Shahrara, S. (2011). Characterization of interleukin-7 and interleukin-7 receptor in the pathogenesis of rheumatoid arthritis. Arthritis & Rheumatism, 63(10), 2884–2893. doi:10.1002/art.30493
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/art.30493&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21647866&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000295293000007&link_type=ISI) 

35. 35.Proceedings of the 26th European Paediatric Rheumatology Congress: Part 1. (2020). Pediatric Rheumatology, 18(S2). doi:10.1186/s12969-020-00469-y
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12969-020-00469-y&link_type=DOI) 

36. 36.Raker, V. K., Becker, C., & Steinbrink, K. (2016). The cAMP Pathway as Therapeutic Target in Autoimmune and Inflammatory Diseases. Frontiers in Immunology, 7. doi:10.3389/fimmu.2016.00123
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2016.00123&link_type=DOI) 

37. 37.Rheumatoid Arthritis: Causes, Symptoms, Diagnosis & Treatments. (n.d.). Retrieved from [https://my.clevelandclinic.org/health/diseases/4924-rheumatoid-arthritis](https://my.clevelandclinic.org/health/diseases/4924-rheumatoid-arthritis)
    
    

38. 38.Rheumatoid arthritis. (2019, March 01). Retrieved from [https://www.mayoclinic.org/diseases-conditions/rheumatoid-arthritis/diagnosis-treatment/drc-20353653#:~:text=Rheumatoid](https://www.mayoclinic.org/diseases-conditions/rheumatoid-arthritis/diagnosis-treatment/drc-20353653#:~:text=Rheumatoid) arthritis can be difficult,for swelling, redness and warmth
    
    

39. 39.Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., & Smyth, G. K. (2015). Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research, 43(7). doi:10.1093/nar/gkv007
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkv007&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25605792&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

40. 40.Shannon, P. (2003). Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Research, 13(11), 2498–2504. doi:10.1101/gr.1239303
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjEwOiIxMy8xMS8yNDk4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDMvMjYvMjAyMS4wMy4xOS4yMTI1Mzk5NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

41. 41.Singh, P. K., Kashyap, A., & Silakari, O. (2018). Exploration of the therapeutic aspects of Lck: A kinase target in inflammatory mediated pathological conditions. Biomedicine & Pharmacotherapy, 108, 1565–1571. doi:10.1016/j.biopha.2018.10.002
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.biopha.2018.10.002&link_type=DOI) 

42. 42.Su, Z., Fang, H., Hong, H., Shi, L., Zhang, W., Zhang, W., … Tong, W. (2014). An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era. Genome Biology, 15(12). doi:10.1186/s13059-014-0523-y
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13059-014-0523-y&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25633159&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

43. 43.Szklarczyk, D., Gable, A. L., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., … Mering, C. (2018). STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research, 47(D1). doi:10.1093/nar/gky1131
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gky1131&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30476243&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

44. 44.Taylor, P., Gartemann, J., Hsieh, J., & Creeden, J. (2011). A Systematic Review of Serum Biomarkers Anti-Cyclic Citrullinated Peptide and Rheumatoid Factor as Tests for Rheumatoid Arthritis. Autoimmune Diseases, 2011, 1–18. doi:10.4061/2011/815038
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4061/2011/815038&link_type=DOI) 

45. 45.Thomas, P. D. (2003). PANTHER: A Library of Protein Families and Subfamilies Indexed by Function. Genome Research, 13(9), 2129–2141. doi:10.1101/gr.772403
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjEzLzkvMjEyOSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzI2LzIwMjEuMDMuMTkuMjEyNTM5OTUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

46. 46.Tyrosine-protein phosphatase non-receptor type 6 (human). (n.d.). Retrieved from [https://pubchem.ncbi.nlm.nih.gov/protein/P29350](https://pubchem.ncbi.nlm.nih.gov/protein/P29350)
    
    

47. 47.Ungethuem, U., Haeupl, T., Witt, H., Koczan, D., Krenn, V., Huber, H., … Bläß, S. (2010). Molecular signatures and new candidates to target the pathogenesis of rheumatoid arthritis. Physiological Genomics, 42A(4), 267–282. doi:10.1152/physiolgenomics.00004.2010
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1152/physiolgenomics.00004.2010&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20858714&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F26%2F2021.03.19.21253995.atom) 

48. 48.Ungethuem, U., Haeupl, T., Witt, H., Koczan, D., Krenn, V., Huber, H., … Bläß, S. (2010). Molecular signatures and new candidates to target the pathogenesis of rheumatoid arthritis. Physiological Genomics, 42A(4), 267–282. doi:10.
    
    

49. 49.Vandever, L. (2019, January 15). Rheumatoid Arthritis by the Numbers: Facts, Statistics, and You. Retrieved from [https://www.healthline.com/health/rheumatoid-arthritis/facts-statistics-infographic#Complications](https://www.healthline.com/health/rheumatoid-arthritis/facts-statistics-infographic#Complications)
    
    

50. 50.Woetzel, D., Huber, R., Kupfer, P., Pohlers, D., Pfaff, M., Driesch, D., … Kinne, R. W. (2014). Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation. Arthritis Research & Therapy, 16(2). doi:10.1186/ar4526
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/ar4526&link_type=DOI) 

51. 51.Yap, H., Tee, S., Wong, M., Chow, S., Peh, S., & Teow, S. (2018). Pathogenic Role of Immune Cells in Rheumatoid Arthritis: Implications in Clinical Treatment and Biomarker Development. Cells, 7(10), 161. doi:10.3390/cells7100161
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/cells7100161&link_type=DOI) 

52. 52.Zhang, L., Yu, M., Deng, J., Lv, X., Liu, J., Xiao, Y., … Li, C. (2015). Chemokine Signaling Pathway Involved in CCL2 Expression in Patients with Rheumatoid Arthritis. Yonsei Medical Journal, 56(4), 1134. doi:10.3349/ymj.2015.56.4.1134
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3349/ymj.2015.56.4.1134&link_type=DOI) 

53. 53.Zhang, R., Yang, X., Wang, J., Han, L., Yang, A., Zhang, J., … Xiong, Y. (2018). Identification of potential biomarkers for differential diagnosis between rheumatoid arthritis and osteoarthritis via integrated genome-wide gene expression profiling analysis. Molecular Medicine Reports. doi:10.3892/mmr.2018.9677
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3892/mmr.2018.9677&link_type=DOI)