Abstract
SNP heritability is a fundamental quantity in the genetic analysis of complex traits. For binary phenotypes, in which the continuous distribution of risk in the population is unobserved, observed-scale heritabilities must be transformed to the more interpretable liability-scale. We demonstrate here that the field standard approach for performing the liability conversion can downwardly bias estimates by as much as ∼20% in simulation and ∼30% in real data. These attenuated estimates stem from the standard approach failing to appropriately account for varying levels of ascertainment across the cohorts comprising the meta-analysis. We formally derive a simple procedure for incorporating cohort-specific ascertainment based on the summation of effective sample sizes across the contributing cohorts, and confirm via simulation that it produces unbiased estimates of liability-scale heritability.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
ADG was supported by NIH Grants R01MH120219 and RF1AG073593. EMTD was supported by NIH grants R01MH120219 and RF1AG073593 and the Jacobs Foundation. EMTD is a faculty associate of the Population Research Center at the University of Texas, which is supported by NIH grant P2CHD042849. MGN is additionally supported by ZonMW grants 849200011 and 531003014 from The Netherlands Organisation for Health Research and Development, a VENI grant awarded by NWO (VI.Veni.191G.030), NIH grant R01MH120219 and is a Jacobs Foundation Fellow. J.F. is member of the Population Research Center (PRC) and Center on Aging and Population Sciences (CAPS) at The University of Texas at Austin, which are supported by National Institutes of Health (NIH) grants P2CHD042849 and P30AG066614, respectively.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The primary analyses presented in the paper are for simulation results only and do not use any human subject data. For the real data analyses, we have used de-identified GWAS summary statistics. For those no IRB oversight is necessary. Summary statistics used in the present analyses are publicly available for download.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The data that support the findings of this study are all publicly available. Summary statistics for data from the Psychiatric Genomics Consortium (PGC) for ADHD, ALCH, AN, ASD, BIP, CUD, MDD, OCD, PTSD, SCZ and TS can be downloaded here: https://www.med.unc.edu/pgc/download-results/ The summary statistics for ALZ can be found here: https://www.niagads.org/datasets/ng00075 LD-scores and reference files used to estimate LD-score regression can be downloaded here: https://alkesgroup.broadinstitute.org/LDSCORE/
https://www.med.unc.edu/pgc/download-results/