Abstract
Background and Aims Non-alcoholic fatty liver disease (NAFLD) is a heterogenous liver disease encompassing pathological changes ranging from simple steatosis, inflammation and fibrosis to cirrhosis. To further unravel NAFLD pathogenesis, we aimed to decode the candidate NAFLD biomarkers associated with NAFLD severity using publicly available single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) data.
Methods Seurat v5 and anchor-based reciprocal principal components analysis (RPCA) integration were performed to integrate and analyze the scRNA-seq and snRNA-seq data of 82 liver and Peripheral Blood Mononuclear Cell (PBMC) specimens from NAFLD patients and healthy controls to decode the candidate NAFLD biomarkers generated previously. Using the ‘CellChat’ R package, we analyzed ligand-receptor interactions of our candidate biomarkers from secreted genes to understand their signaling crosstalk and implications in NAFLD’s biological processes.
Results We generated a database (https://dreamapp.biomed.au.dk/NAFLD-scRNA-seq/) to present the NAFLD pathogenesis by analyzing integrated scRNA-seq and snRNA-seq data. Through cell-level decoding, we discovered the expression distribution of the candidate biomarkers associated with NAFLD severity. The analysis of ligand-receptor pairs in NAFLD liver and PBMC data suggests that the IL1B-(IL1R1+IL1RAP) interaction between liver monocytes and hepatocytes/cholangiocytes may explain the correlation between NAFLD severity and IL1RAP down-regulation.
Conclusions We confirmed a strong correlation between liver QSOX1/IL1RAP concentrations and NAFLD severity at the cellular level. Additionally, our analysis of comprehensive data unveiled new aspects of NAFLD pathogenesis and intercellular communication through the use of scRNA and snRNA sequencing data. (ChiCTR2300073940).
Highlights
Integrated single-cell and single-nucleus profiles from 82 liver and PBMC specimens comprising NAFLD patients and healthy controls with increasing severity were utilized to unveil the NAFLD pathogenesis through decoding candidate biomarkers of NAFLD.
In cell-level observations, we decoded 16 up-regulated and 22 down-regulated secreting genes previously identified as associated with increasing NAFLD severity in the liver RNA-seq and plasma proteomics data.
QSOX1, enriched in fibroblasts, and IL1RAP, enriched in hepatocytes, have been further validated and interpreted in integrated single-cell and single-nucleus profiles for their potential to predict NAFLD severity.
The analysis of intercellular crosstalk, focusing on secreted signaling from our previously identified candidate biomarkers sourced from secreted genes, highlighted the IL1B-(IL1R1+IL1RAP) pathway between liver monocytes and hepatocytes/cholangiocytes. This suggests that this pathway might be a potential reason for the observed downregulation of IL1RAP in NAFLD liver.
Lay Summary We integrated single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) data to unravel non-alcoholic fatty liver disease (NAFLD) pathogenesis. We decoded candidate biomarkers associated with NAFLD progression, which were previously screened from RNA sequencing (RNA-seq) data of 625 liver samples with a novel gene clustering method. A new version of the R package ‘’Seurat v5’ and anchor-based reciprocal principal components analysis (RPCA) integration were performed to process and integrate scRNA-seq and snRNA-seq data of 82 liver and Peripheral Blood Mononuclear Cell (PBMC) specimens from NAFLD patients and healthy controls. The research delved deeper into the cellular expression patterns of the candidate biomarkers and examined the intercellular communication of their secreted signaling.
Competing Interest Statement
Henning Gronbaek has received research grants from Abbvie, Intercept, ARLA Food for Health, ADS AIPHIA Development Services AG. Consulting Fees from Ipsen, NOVO, Pfizer. Lecturer for AstraZeneca and EISAI; and on Data Monitoring Committee at CAMURUS AB. All other authors have no conflicts of interest to declare.
Funding Statement
This research was funded by the Shenzhen Sanming Project of Medicine in Shenzhen, China (grant nos. SZSM201612074).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
GSE122083, GSE125188, GSE136103, GSE144430, GSE159977, GSE169446, GSE175793, GSE179886, GSE182159, GSE189539, GSE212837 and GSE217235. Ma W, Huang J, Cai B, Shao M, Yu X, Kjaer MB, Lv M, et al. A novel gene-screening approach reveals QSOX1/IL1RAP as promising biomarkers for the severity of non-alcoholic fatty liver disease. medRxiv 2023:2023.2007.2026.23293038.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
complement: The data and code that support the findings of this study are available at https://github.com/cynthia139/NAFLD-scRNA-seq. For subtype cell clustering, a shared nearest neighbor (SNN) modularity optimization-based algorithm was employed, with the dimensions of reduction set to 1:15 and the resolution parameter set to 0.05 (HSC/MFB).
Data Availability
The data and code that support the findings of this study are available at https://github.com/cynthia139/NAFLD-scRNA-seq.
Abbreviations
- avg_log2FC
- log fold-change of the average expression between the two groups
- BM
- basement membrane
- cDC1
- type 1 conventional dendritic cell
- cDNAs
- complementary DNAs
- ECM
- extracellular matrix
- F
- Fibrosis score
- GEO
- Gene Expression Omnibus
- GO
- Gene Ontology
- GRCh37
- Genome Reference Consortium Human Build 37
- HCC
- hepatocellular carcinoma
- HSCs
- hepatic stellate cells
- IHC
- immunohistochemistry staining
- IL-1RI
- type I IL-1 receptor
- IL1RAP
- Interleukin-1 receptor accessory protein
- LECs
- lymphatic endothelial cells
- N
- NAS score
- NAFL
- Non-alcoholic Fatty Liver
- NAFLD
- Non-alcoholic fatty liver disease
- NAFLD-DB
- NAFLD gene expression database
- NAS
- NAFLD activity scores
- NASH
- Non-alcoholic Steatohepatitis
- NK cells
- Natural killer cells
- PBMC
- Peripheral blood mononuclear cell
- PCA
- Principal components analysis
- PCs
- Principal Components
- QSOX1
- Quiescin sulfhydryl oxidase 1
- RCA
- regulators of complement activation
- RNA-seq
- RNA sequencing
- RPCA
- reciprocal principal components analysis
- scRNA-seq
- single-cell RNA sequencing
- sIL1RAP
- the soluble isoform of the IL-1 receptor accessory protein
- SNN
- shared nearest neighbor
- snRNA-seq
- Single-nucleus RNA sequencing
- SZTCMH
- Shenzhen Traditional Chinese Medicine Hospital, China
- UMAP
- Uniform Manifold Approximation and Projection.