ABSTRACT
Genetics plays an important role in psychiatric disorders. A clinically relevant question is whether we can predict psychiatric traits from genetics, which holds promise for early detection and tailored intervention. Imputed gene expression, also known as genetically-regulated expression (GRE), reflects the tissue-specific regulatory effects of multiple single nucleotide polymorphisms (SNPs) on genes. In this work, we explored the utility of GRE for trait association studies and how the GRE-based polygenic risk score (gPRS) compared with SNP-based PRS (sPRS) in predicting psychiatric traits. A total of 13 schizophrenia-related gray matter networks identified in another study served as the target brain phenotypes for assessing genetic associations and prediction accuracies in 34,149 individuals from the UK Biobank cohort. GRE was computed leveraging MetaXcan and GTEx tools for 56,348 genes across 13 available brain tissues. We then estimated the effects of individual SNPs and genes separately on each tested brain phenotype in the training set. The effect sizes were then used to compute gPRS and sPRS in the testing set, whose correlations with the brain phenotypes were used to assess the prediction accuracy. The results showed that, with the testing sample size set to 1,138, for training sample sizes from 1,138 up to 33,011, overall both gPRS and sPRS successfully predicted the brain phenotypes with significant correlations observed in the testing set, and higher accuracies noted for larger training sets. In addition, gPRS outperformed sPRS by showing significantly higher prediction accuracies across 13 brain phenotypes, with greater improvement noted for training sample sizes below ∼15,000. These findings support that GRE may serve as the primary genetic variable in brain phenotype association and prediction studies. Future imaging genetic studies may consider GRE as an option depending on the available sample size.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This project was funded by the National Institutes of Health (P20GM103472, R01EB005846, 1R01EB006841, R01MH106655, 5R01MH094524) and the National Science Foundation (1539067).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The current work leveraged the population-based UK Biobank cohort. The UK Biobank study was approved by the North West Haydock Research Ethics Committee, and the data used in our work were obtained under data application number 34175.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors.