RT Journal Article SR Electronic T1 A gene-based clustering approach reveals QSOX1/IL1RAP as promising biomarkers for the severity of non-alcoholic fatty liver disease JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2023.07.26.23293038 DO 10.1101/2023.07.26.23293038 A1 Ma, Wenfeng A1 Huang, Jinrong A1 Cai, Benqiang A1 Shao, Mumin A1 Yu, Xuewen A1 Kjær, Mikkel Breinholt A1 Lv, Minling A1 Zhong, Xin A1 Xu, Shaomin A1 Zhan, Bolin A1 Li, Qun A1 Huang, Qi A1 Ma, Mengqing A1 Cheng, Lei A1 Luo, Yonglun A1 Grønæk, Henning A1 Zhou, Xiaozhou A1 Lin, Lin YR 2024 UL http://medrxiv.org/content/early/2024/05/01/2023.07.26.23293038.abstract AB Background and Aims Non-alcoholic fatty liver disease (NAFLD) is a progressive liver disease that ranges from simple steatosis to inflammation, fibrosis, and cirrhosis. To address the unmet need for new NAFLD biomarkers, we aimed to identify candidate biomarkers using publicly available RNA sequencing (RNA-seq) and proteomics data.Methods An approach involving unsupervised gene clustering was performed using homogeneously processed and integrated RNA-seq data of 625 liver specimens to screen for NAFLD biomarkers, in combination with public proteomics data from healthy controls and NAFLD patients. Additionally, we validated the results in the NAFLD and healthy cohorts using enzyme-linked immunosorbent assay (ELISA) of plasma and immunohistochemical staining (IHC) of liver samples.Results We generated a database (https://dreamapp.biomed.au.dk/NAFLD/) for exploring gene expression changes along NAFLD progression to facilitate the identification of genes and pathways involved in the disease’s progression. Through cross-analysis of the gene and protein clusters, we identified 38 genes as potential biomarkers for NAFLD severity. Up-regulation of Quiescin sulfhydryl oxidase 1 (QSOX1) and down-regulation of Interleukin-1 receptor accessory protein (IL1RAP) were associated with increasing NAFLD severity in RNA-seq and proteomics data. Particularly, the QSOX1/IL1RAP ratio in plasma demonstrated effectiveness in diagnosing NAFLD, with an area under the receiver operating characteristic (AUROC) of up to 0.95 as quantified by proteomics profiling, and an AUROC of 0.82 with ELISA.Conclusions We discovered a significant association between the levels of QSOX1 and IL1RAP and NAFLD severity. Furthermore, the QSOX1/IL1RAP ratio shows promise as a non-invasive biomarker for diagnosing NAFLD and assessing its severity.Lay Summary This study aimed to find non-invasive biomarkers for non-alcoholic fatty liver disease (NAFLD). Researchers utilized a new gene clustering method to analyze RNA-seq data from 625 liver samples. The identified biomarkers were further validated using plasma proteomics profiling, enzyme-linked immunosorbent assay (ELISA), and liver immunohistochemical staining (IHC) in three separate groups of healthy controls and NAFLD patients. The study revealed that the levels of QSOX1 were elevated while IL1RAP levels were reduced with increasing severity of NAFLD. Importantly, the ratio of QSOX1 to IL1RAP expression in plasma showed promise as a non-invasive diagnostic tool for assessing the severity of NAFLD, eliminating the reliance on liver biopsy.HighlightsRNA-seq data from 625 liver specimens comprising healthy controls and NAFLD patients with increasing severity were utilized for screening NAFLD biomarkers.An unsupervised method for clustering genes based on the similarity of gene expression trajectory across all samples enhanced the discovery of novel effective non-invasive NAFLD biomarkers.QSOX1, IL1RAP, and especially the QSOX1/IL1RAP ratio, were found to be associated with NAFLD severity.The high sensitivity of the QSOX1/IL1RAP ratio in predicting NAFLD severity was validated with plasma proteomics quantification (AUROC = 0.95) and ELISA (AUROC = 0.82) in two independent patient cohorts.Competing Interest StatementHenning Gronbaek has received research grants from Abbvie, Intercept, ARLA Food for Health, ADS AIPHIA Development Services AG. Consulting Fees from Ipsen, NOVO, Pfizer. Lecturer for AstraZeneca and EISAI; and on Data Monitoring Committee at CAMURUS AB. All other authors have no conflicts of interest to declare.Funding StatementThis research was funded by the Shenzhen Science and Technology Project and Sanming Project of Medicine in Shenzhen, China (grant nos. SZSM201612074, JCYJ20210324120405015).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Ethics committee/IRB of Shenzhen Traditional Chinese Medicine Hospital, China gave ethical approval for this workI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe data and code that support the findings of this study are available at https://github.com/cynthia139/NAFLD-RNA-seq. https://dreamapp.biomed.au.dk/NAFLD/ AUROCarea under the receiver operating characteristicavg_log2FClog fold-change of the average expression between the two groupsBMI,body mass indexCK18circulating keratin 18 fragmentsECMextracellular matrixELISAenzyme-linked immunosorbent assayFFibrosis scoreFIB-4Fibrosis-4GEOGene Expression OmnibusGOGene OntologyGRCh37Genome Reference Consortium Human Build 37HCChepatocellular carcinomaIHCimmunohistochemistry stainingIL1RAPInterleukin-1 receptor accessory proteinlg, log10PCA, Principal components analysisQSOX1Quiescin sulfhydryl oxidase 1RNA-seqRNA sequencingNNAS scoreNAFLNon-alcoholic Fatty LiverNAFLDNon-alcoholic fatty liver diseaseNAFLD-DBNAFLD gene expression databaseNAFLD_ngtNAFLD with normal glucose toleranceNAFLD_T2DNAFLD with type 2 diabetesNASNAFLD activity scoresNASHNon-alcoholic SteatohepatitisncRNAsnon-coding RNAsscRNA-seqsingle-cell RNA sequencingsnRNA-seqSingle-nuclei RNA sequencingSZTCMHShenzhen Traditional Chinese Medicine Hospital, ChinaTHBS2thrombospondin 2TMMTrimmed Mean of M-valuesTPMTranscript Per Million