Abstract
The polygenic risk score (PRS) is an important method for assessing genetic susceptibility to diseases; however, its clinical utility is limited by a lack of interpretability tools. To address this problem, we introduce eXplainable PRS (XPRS), an interpretation and visualization tool that decomposes PRSs into genes/regions and single nucleotide polymorphism (SNP) contribution scores via Shapley additive explanations (SHAPs), which provide insights into specific genes and SNPs that significantly contribute to the PRS of an individual. This software features a multilevel visualization approach, including Manhattan plots, LocusZoom-like plots and tables at the population and individual levels, to highlight important genes and SNPs. By implementing with a user-friendly web interface, XPRS allows for straightforward data input and interpretation. By bridging the gap between complex genetic data and actionable clinical insights, XPRS can improve communication between clinicians and patients.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was supported by Brain Pool Plus (Brain Pool+) Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT [2020H1D3A2A03100666].
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study used ONLY openly available human data that were originally located at public repositories before the initiation of the study. The genotype data were sourced from the 1000 Genomes Project (available at [https://www.internationalgenome.org/]) and the PRS scoring file was obtained from the PGS Catalog (available at [https://www.pgscatalog.org/]). Additionally, GWAS association files were retrieved from the GWAS Catalog (available at [https://www.ebi.ac.uk/gwas/]). These data sources are freely accessible and required no prior application, screening, or registration for access.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
SNPs are mapped to genes based on their genomic positions using the RefGene annotation from the UCSC Genome Browser (https://hgdownload.soe.ucsc.edu/goldenPath/hg38/database/). The cS2G file utilized in this process was obtained from Zenodo (https://zenodo.org/records/6354007).