PT - JOURNAL ARTICLE AU - Ma, Wenfeng AU - Zhong, Xin AU - Cai, Benqiang AU - Shao, Mumin AU - Yu, Xuewen AU - Lv, Minling AU - Xu, Shaomin AU - Zhan, Bolin AU - Li, Qun AU - Ma, Mengqing AU - Kjær, Mikkel Brejnholt AU - Huang, Jinrong AU - Luo, Yonglun AU - Grønbæk, Henning AU - Lin, Lin TI - Unveiling the pathogenesis of non-alcoholic fatty liver disease by decoding biomarkers through integrated single-cell and single-nucleus profiles AID - 10.1101/2023.10.05.23296635 DP - 2023 Jan 01 TA - medRxiv PG - 2023.10.05.23296635 4099 - http://medrxiv.org/content/early/2023/11/01/2023.10.05.23296635.short 4100 - http://medrxiv.org/content/early/2023/11/01/2023.10.05.23296635.full AB - Background and Aims Non-alcoholic fatty liver disease (NAFLD) is a heterogenous liver disease encompassing pathological changes ranging from simple steatosis, inflammation and fibrosis to cirrhosis. To further unravel NAFLD pathogenesis, we aimed to decode the candidate NAFLD biomarkers associated with NAFLD severity using publicly available single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) data.Methods Seurat v5 and anchor-based reciprocal principal components analysis (RPCA) integration were performed to integrate and analyze the scRNA-seq and snRNA-seq data of 82 liver and Peripheral Blood Mononuclear Cell (PBMC) specimens from NAFLD patients and healthy controls to decode the candidate NAFLD biomarkers generated previously. Using the ‘CellChat’ R package, we analyzed ligand-receptor interactions of our candidate biomarkers from secreted genes to understand their signaling crosstalk and implications in NAFLD’s biological processes.Results We generated a database (https://dreamapp.biomed.au.dk/NAFLD-scRNA-seq/) to present the NAFLD pathogenesis by analyzing integrated scRNA-seq and snRNA-seq data. Through cell-level decoding, we discovered the expression distribution of the candidate biomarkers associated with NAFLD severity. The analysis of ligand-receptor pairs in NAFLD liver and PBMC data suggests that the IL1B-(IL1R1+IL1RAP) interaction between liver monocytes and hepatocytes/cholangiocytes may explain the correlation between NAFLD severity and IL1RAP down-regulation.Conclusions We confirmed a strong correlation between liver QSOX1/IL1RAP concentrations and NAFLD severity at the cellular level. Additionally, our analysis of comprehensive data unveiled new aspects of NAFLD pathogenesis and intercellular communication through the use of scRNA and snRNA sequencing data. (ChiCTR2300073940).HighlightsIntegrated single-cell and single-nucleus profiles from 82 liver and PBMC specimens comprising NAFLD patients and healthy controls with increasing severity were utilized to unveil the NAFLD pathogenesis through decoding candidate biomarkers of NAFLD.In cell-level observations, we decoded 16 up-regulated and 22 down-regulated secreting genes previously identified as associated with increasing NAFLD severity in the liver RNA-seq and plasma proteomics data.QSOX1, enriched in fibroblasts, and IL1RAP, enriched in hepatocytes, have been further validated and interpreted in integrated single-cell and single-nucleus profiles for their potential to predict NAFLD severity.The analysis of intercellular crosstalk, focusing on secreted signaling from our previously identified candidate biomarkers sourced from secreted genes, highlighted the IL1B-(IL1R1+IL1RAP) pathway between liver monocytes and hepatocytes/cholangiocytes. This suggests that this pathway might be a potential reason for the observed downregulation of IL1RAP in NAFLD liver.Lay Summary We integrated single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) data to unravel non-alcoholic fatty liver disease (NAFLD) pathogenesis. We decoded candidate biomarkers associated with NAFLD progression, which were previously screened from RNA sequencing (RNA-seq) data of 625 liver samples with a novel gene clustering method. A new version of the R package ‘’Seurat v5’ and anchor-based reciprocal principal components analysis (RPCA) integration were performed to process and integrate scRNA-seq and snRNA-seq data of 82 liver and Peripheral Blood Mononuclear Cell (PBMC) specimens from NAFLD patients and healthy controls. The research delved deeper into the cellular expression patterns of the candidate biomarkers and examined the intercellular communication of their secreted signaling.Competing Interest StatementHenning Gronbaek has received research grants from Abbvie, Intercept, ARLA Food for Health, ADS AIPHIA Development Services AG. Consulting Fees from Ipsen, NOVO, Pfizer. Lecturer for AstraZeneca and EISAI; and on Data Monitoring Committee at CAMURUS AB. All other authors have no conflicts of interest to declare.Funding StatementThis research was funded by the Shenzhen Sanming Project of Medicine in Shenzhen, China (grant nos. SZSM201612074).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:GSE122083, GSE125188, GSE136103, GSE144430, GSE159977, GSE169446, GSE175793, GSE179886, GSE182159, GSE189539, GSE212837 and GSE217235. Ma W, Huang J, Cai B, Shao M, Yu X, Kjaer MB, Lv M, et al. A novel gene-screening approach reveals QSOX1/IL1RAP as promising biomarkers for the severity of non-alcoholic fatty liver disease. medRxiv 2023:2023.2007.2026.23293038.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe data and code that support the findings of this study are available at https://github.com/cynthia139/NAFLD-scRNA-seq. https://dreamapp.biomed.au.dk/NAFLD-scRNA-seq/ avg_log2FClog fold-change of the average expression between the two groupsBMbasement membranecDC1type 1 conventional dendritic cellcDNAscomplementary DNAsECMextracellular matrixFFibrosis scoreGEOGene Expression OmnibusGOGene OntologyGRCh37Genome Reference Consortium Human Build 37HCChepatocellular carcinomaHSCshepatic stellate cellsIHCimmunohistochemistry stainingIL-1RItype I IL-1 receptorIL1RAPInterleukin-1 receptor accessory proteinLECslymphatic endothelial cellsNNAS scoreNAFLNon-alcoholic Fatty LiverNAFLDNon-alcoholic fatty liver diseaseNAFLD-DBNAFLD gene expression databaseNASNAFLD activity scoresNASHNon-alcoholic SteatohepatitisNK cellsNatural killer cellsPBMCPeripheral blood mononuclear cellPCAPrincipal components analysisPCsPrincipal ComponentsQSOX1Quiescin sulfhydryl oxidase 1RCAregulators of complement activationRNA-seqRNA sequencingRPCAreciprocal principal components analysisscRNA-seqsingle-cell RNA sequencingsIL1RAPthe soluble isoform of the IL-1 receptor accessory proteinSNNshared nearest neighborsnRNA-seqSingle-nucleus RNA sequencingSZTCMHShenzhen Traditional Chinese Medicine Hospital, ChinaUMAPUniform Manifold Approximation and Projection.