1. Abstract
Accurate disease risk stratification can lead to more precise and personalized prevention and treatment of diseases. As an important component to disease risk, genetic risk factors can be utilized as an early and stable predictor for disease onset. Recently, the polygenic risk score (PRS) method has combined the effects from hundreds to millions of single nucleotide polymorphisms (SNPs) into a score that can be used for genetic risk stratification. However, current PRS approaches only utilize the additive associations between SNPs and disease risk in a one-dimensional score. Here, we show that leveraging multiple types of genetic effects in multi-dimensional risk vectors, or a polygenic risk vector (PRV), can improve the stratification of cardio-metabolic diseases risks. Using data from UK Biobank (UKBB) and Electronic Medical Records and Genomics (eMERGE) Network biobank linked electronic health records (EHR) as development and evaluation data, we found that the combined effects between the additive PRS and the dominant PRS outperformed either one in terms of disease risk stratification, especially for the individuals in the high-risk group. Our results demonstrate that disease risks are likely to be influenced by multiple types of genetic effects, and PRV could utilize these effects for better risk stratification while retaining the simplicity of the PRS method.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
We would like to acknowledge the grant support from NIH LM010098.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
I confirm all relevant ethical guidelines have been followed, and all necessary IRB approvals have been obtained. UKBB Research Ethics Committee has approved the collection of the UK Biobank (UKBB) data. The UKBB genotype and phenotype data used in the study were obtained under application #32133. eMERGE is a national network organized and funded by the National Human Genome Research Institute (NHGRI). The eMERGE data was obtained under application NT432
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The UK biobank data can be applied from https://www.ukbiobank.ac.uk/. The eMERGE data can be applied from https://emerge-network.org/