Abstract
In large cohort studies the number of unaffected individuals outnumbers the number of affected individuals, and the power can be low to detect associations for outcomes with low prevalence. We consider how including recorded family history in regression models increases the power to detect associations between genetic variants and disease risk. We show theoretically and using Monte-Carlo simulations that including family history of the disease, with a weighting of 0.5 compared to true cases, increases the power to detect associations. This is a powerful approach for detecting variants with moderate effects, but for larger effect sizes a weighting of >0.5 can be more powerful. We illustrate this both for common variants and for exome sequencing data for over 400,000 individuals in UK Biobank to evaluate the association between the burden of protein-truncating variants in genes and risk for 4 cancer types.
Competing Interest Statement
JRBP and EJG are employees of Insmed Innovation UK and hold stock/stock options in Insmed Inc. JRBP also receives research funding from GSK and engages in paid consultancy for WW International Inc.
Funding Statement
QC of the data has been funded by the Medical Research Council (unit programs: MC_UU_12015/2, MC_UU_00006/2). The research has been conducted using the UKB Resource under application number 28126. N.W. was supported by the International Alliance for Cancer Early Detection, an alliance between Cancer Research UK (C14478/A29329), Canary Center at Stanford University, the University of Cambridge, OHSU Knight Cancer Institute, University College London and the University of Manchester. J.D. was supported by core funding from the NIHR Cambridge Biomedical Research Centre (NIHR203312). X.W. and J.P.T. were supported by Cancer Research UK (PPRPGM-Nov20\100002, PRCPJT-May21\100006 and G110748).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The research has been conducted using the UKB Resource under application number 28126
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors