ABSTRACT
Understanding the genetic basis of gene expression can help us understand the molecular underpinnings of human traits and disease. Expression quantitative trait locus (eQTL) mapping can help in studying this relationship but have been shown to be very cell-type specific, motivating the use of single-cell RNA sequencing and single-cell eQTLs to obtain a more granular view of genetic regulation. Current methods for single-cell eQTL mapping either rely on the “pseudobulk” approach and traditional pipelines for bulk transcriptomics or do not scale well to large datasets. Here, we propose SAIGE-QTL, a robust and scalable tool that can directly map eQTLs using single-cell profiles without needing aggregation at the pseudobulk level. Additionally, SAIGE-QTL allows for testing the effects of less frequent/rare genetic variation through set-based tests, which is traditionally excluded from eQTL mapping studies. We evaluate the performance of SAIGE-QTL on both real and simulated data and demonstrate the improved power for eQTL mapping over existing pipelines.
Competing Interest Statement
B.M.N. is a member of the scientific advisory board at Deep Genomics and Neumora. M.D. is a founder of Maze Therapeutics.
Funding Statement
W.Z. was supported by the National Human Genome Research Institute of the National Institutes of Health under award number K99/R00HG012222. A.C. was supported by an EMBO Long Term Postdoctoral Fellowship (EMBO ALTF 424-2022). B.M.N. is supported by the Novo Nordisk Foundation (NNF21SA0072102) and R01MH101244. BMN and GC are supported by R37MH107649.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
https://onek1k.org/ https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE196830
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The OneK1K single-cell RNA-seq and genotype data were available via Gene Expression Omnibus (GSE196830). Summary statistics for the eQTLs in this study can be found on Zenodo
Data availability
The OneK1K single-cell RNA-seq and genotype data were available via Gene Expression Omnibus (GSE196830).
Summary statistics for the eQTLs in this study can be found on Zenodo: https://zenodo.org/records/10811106.
GWAS summary stats for the MR analysis were downloaded from: i) CD and IBD GWAS were downloaded from IBD Genetics Consortium website (www.ibdgenetics.org) from Liu et al. Nature Genetics. 2015; ii) RA GWAS summary was downloaded from GWAS Catalog (study ID: GCST90132223) in Ishigaki et al. Nature Genetics, 2022 (https://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90132001-GCST90133000/GCST90132223/GCST90132223_buildGRCh37.tsv.gz); iii) SLE GWAS summary was downloaded from GWAS Catalog (study ID: GCST90018917) in Sakaue et al. Nature Genetics, 2021 (https://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90018001-GCST90019000/GCST90018917/GCST90018917_buildGRCh37.tsv.gz).