Abstract
Gene expression signatures can stratify patients with heterogeneous diseases, such as Systemic Lupus Erythematosus (SLE), yet understanding the contributions of ancestral background to this heterogeneity is not well elucidated. We hypothesized that ancestry would significantly influence gene expression signatures and measured 34 gene modules in 1566 SLE patients of african (AA), european (EA) or native american (NAA) ancestry to determine the impact of ancestry on gene expression. Healthy subject ancestry-specific gene expression provided the transcriptomic background upon which the SLE patient signatures were built. Although standard therapy affected every gene signature, and significantly increased myeloid cell signatures, logistic regression analysis determined that ancestral background significantly changed 23/34 gene signatures. Additionally, the strongest association to gene expression changes was autoantibodies and this also had etiology in ancestry; the AA predisposition to have both RNP and dsDNA autoantibodies compared to EA predisposition to have only antidsDNA. A machine learning approach was used to determine a gene signature characteristic to distinguish AA SLE and was most influenced by genes characteristic of the perturbed B cell axis in AA SLE patients.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Protocols
https://figshare.com/articles/AMPEL_BioSolutions_Gene_Set_Variation_Analysis_Code/7780457
Funding Statement
Funding for this work was provided by the RILITE foundation.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Dataset GSE88884: Clinical Trials NCT01205438 and NCT01196091 were conducted by Eli Lilly and Company and are listed at clinicaltrials.gov. The protocol was approved by each institutional review board subject to applicable laws and regulations and ethical principles consistent with the Declaration of Helsinki .IRB approval was obtained and written informed consent at each of the 186 clinical trial sites for NCT01205438 and the 192 study locations for NCT01196091. Dataset GSE35846: All samples were obtained under written informed consent for participation in the Center for Health Discovery and Well Being study with the approval of the Institutional Review Boards of Emory University and the Georgia Tech. Dataset GSE45291: The study protocol for SPARE (Study of biological Pathways, disease Activity and REsponse markers in patients with systemic lupus Erythematosus) was approved by the Johns Hopkins University School of Medicine Institutional Review Board. All patients provided written informed consent. Dataset GSE111368: The study was approved by the NHS National Research Ethics Service, Outer West London REC (09/H0709/52, 09/MRE00/67). Patients or their legally authorised representatives provided informed consent. Additional adult healthy controls were recruited as part of a separate study and consented to their samples being used in additional studies (Central London 3 Research Ethics Committee, 09/H0716/41). Informed consent was obtained from all participants and we complied with all relevant ethical regulations.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All microarray datasets in this publication are available on the NCBI database Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/). All bioinformatic software used in this publication is open source, freely available for R. Additionally, code used in this paper for LIMMA, GSVA and WGCNA are available at figshare, www.figshare.com. File names are AMPEL BioSolutions LIMMA Differential Expression Analysis Code, AMPEL BioSolutions Gene Set Variation Analysis Code, and AMPEL BioSolutions Weighted Correlation Network Analysis WGCNA Code.
https://www.ncbi.nlm.nih.gov/gds/?term=GSE88884[Accession]
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE45291