Abstract
Genetic variants have robustly been associated with multiple traits through genome-wide association studies (GWAS) over the past two decades. However, pinpointing the true causal genetic variant and its biological mechanism is still a considerable challenge. Recently, much concerned has been raised about the weak overlap between expression quantitative trait loci or DNA methylation with GWAS variants, when these very same molecular phenotypes have been routinely used to interpret GWAS variants. Therefore, we propose to takes the opposite approach to conventional methods and to infer variant causal networks by leveraging pleiotropy. We introduce PRISM (Pleiotropic Relationships to Infer the SNP Model) that aims to distinguish between true direct effects and pleiotropic effects in order to infer a causal network for each genetic variant. The fundamental principle of PRISM is to reassess GWAS associations to test for the consistency of a given variant-trait effect in the pleiotropic context of the other traits. PRISM clusters significant genetic variant effects in 3 categories: trait-mediated, confounder-mediated, and direct effects. By cross-referencing the information on all traits, a causal network is built for each genetic variant. On simulations, PRISM was able to recover direct effects with high precision in complex networks of traits. Then, we applied PRISM to a set of 61 heritable traits and diseases, using GWAS summary statistics from the UK Biobank. Interestingly, direct effects represent less than 13% of total significant effects, while vertical and confounding effects represent 43% and 44% respectively. Direct variants were largely enriched in per-variant heritability compared to GWAS-significant variants and pleiotropic variants. Pathways from direct variants lead to higher enrichment than GWAS variants. PRISM was able to pinpoint direct variants mapped to more trait-specific genes than GWAS, and the PRISM gene-trait network appeared disentangled and more relevant compared to the GWAS gene-trait network. Finally, we could show the concordance of the causal networks inferred by PRISM with some networks for a panel of validated variants from the literature.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
MV is supported by the French National Research Agency (ANR) (ANR-21-CE45-0023-01).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study used only openly available human data in the form of summary statistics from the UK Biobank.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
PRISM is implemented in R and a user-friendly tutorial can be found on github. As long as GWAS summary statistics are available and the studied variants are mapped in HapMap3, it is possible to compute any network of traits of interest to distinguish direct variants from vertical and network variants. PRISM results are easily accessible through an online user-friendly interface. We developed a ShinyR interface, freely available online, to display PRISM results on our network of 61 highly heritable traits. Results can be visualized at the trait level. It is also possible to display the pleiotropic network of a genetic variant of interest.