Data Availability
All single-cell multiome data sets analyzed are publicly available. The 10x PBMC data set is available at https://www.10xgenomics.com/datasets. The Luecken BMMC and SHARE-Seq LCL data sets are available at Gene Expression Omnibus (accession codes GSE194122 and GSE140203, respectively).
Linking scores and percentiles for pgBoost and constituent methods have been made publicly available at 10.5281/zenodo.11211926.
Fine-mapped eQTL data from ref.34 are available at https://www.finucanelab.org/data.
ABC scores are available on the ENCODE portal (https://www.encodeproject.org/).
Biosample IDs and file accessions are listed in Supplementary Table 7.
The CRISPR data set is available at https://github.com/EngreitzLab/CRISPR_comparison/blob/ main/resources/crispr_data/EPCrisprBenchmark_ensemble_data_GRCh38.tsv.gz.
GWAS-derived SNP-gene links from ref.43 are available at https://github.com/Deylab999/GWAS_benchmark_IGVF/blob/main/UKBiobank.ABCGene.anyabc.tsv.
GWAS fine-mapping results from ref.44 are available at https://www.finucanelab.org/data.