ABSTRACT
Objective While population screening programs for cancer colorectal (CRC) have proven benefit, risk-stratified approaches may improve screening outcomes further. To date, genome-wide polygenic risk scores (PRS) for CRC have not been integrated with non-genetic risk factors. We aimed to evaluate several genome-wide approaches, and the benefit of adding PRS to the QCancer-10 (colorectal cancer) non-genetic risk model, to identify those at highest risk of CRC.
Design Using UK Biobank we developed and compared six different PRS for CRC. The top-performing genome-wide and GWAS-significant PRS were then combined with QCancer-10 and performance compared to QCancer-10 alone.
Results PRS derived using LDpred2 software performed best, with an odds-ratio per standard deviation of 1.58, and top age- and sex-adjusted C-statistic of 0.733 in logistic regression and 0.724 in Cox regression models in the Geographic Validation Cohort. Integrated QCancer-10+PRS models out-performed QCancer-10, with C-statistics of 0.730 and 0.693, and explained variation of 28.1% and 21.0% from QCancer-10+LDpred2 and QCancer-10 respectively in men; performance improvements in women were similar. Men in the top 20% of risk accounted for 47.6% of cases, and women 42.5% using QCancer-10+LDpred2 models, with a 3.49-fold increase in risk in men and 2.75-fold increase in women in the top 5% of risk, compared to average risk. Decision curve analysis showed that adding PRS to QCancer-10 improved net-benefit and interventions avoided across most probability thresholds.
Conclusion Integrated QCancer-10+PRS models out-perform existing CRC risk prediction models. Evaluation of risk stratified screening using this approach in a bowel screening population could be warranted.
What is already known about this subject
Risk stratification based on genetic or environmental risk factors may improve cancer screening outcomes
Many polygenic risk scores (PRS) based on a limited number of genome-wide significant SNPs have been assessed in colorectal cancer (CRC), but just two studies have examined the use of genome-wide PRS methodologies
No previously published study has examined integrated models combining genome-wide PRS and non-genetic risk factors beyond age
QCancer-10 (colorectal cancer) is the top-performing non-genetic risk prediction model for CRC
What are the new findings?
PRS derived using LDpred2 software outperform existing models, and other genome-wide and genome-wide significant models evaluated here
Adding either LDpred2 PRS or genome-wide significant PRS improves the performance and clinical benefit of the QCancer-10 model, with greater gain from the LDpred2 model
How might it impact on clinical practice in the foreseeable future?
The performance and clinical benefit of QCancer-10 is improved by adding PRS, to a level that suggests utility in stratifying CRC screening and prevention
Competing Interest Statement
JHC is director of the QResearch database, a not-for-profit collaboration between University of Oxford and EMIS (commercial supplier of NHS computer systems). She is founder and shareholder of ClinRisk Ltd and was its medical director until June 2019. ClinRisk Ltd supplies free open-source software for research purposes. It also licenses other closed source software to implement risk prediction tools into NHS computer systems outside the submitted work. She is also an adviser to the CMO in England on cancer screening. JEE has served on clinical advisory boards for Lumendi, Boston Scientific, and Paion; has served on the clinical advisory board and owns share options in Satisfai Health; and reports speaker fees from Falk. JEE serves on the ACPGBI / BSG guideline group for implementation FIT for the detection of CRC in patients with symptoms suspicious of CRC.
Funding Statement
SEB is supported by an MRC Clinical Research Training Fellowship (MR/P001106/1). JEE and SW receive funding from the NIHR Oxford Biomedical Research Centre (BRC). This work of the Houlston Laboratory (PL, RH) is supported by a grant from Cancer Research UK (CR-UK) (C1298/A25514). JHC received funding from the John Fell Oxford University Press Research Fund, grants from CR-UK grant number C5255/A18085, through the Cancer Research UK Oxford Centre, grants from the Oxford Wellcome Institutional Strategic Support Fund (204826/Z/16/Z) and other research councils, during the conduct of the study. MD is funded by CR-UK Programme Grant C348/A12076. IT is funded by CR-UK Programme Grant C6199/A27327. The research was supported by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z with funding from the NIHR Oxford BRC. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. Funders had no role in the study design; in the collection, analysis and interpretation of the data; in the writing of the report; or in the decision to submit the paper for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The UK Biobank study has ethical approval from the North West Multi-centre Research Ethics Committee (16/NW/0274). This study was performed under UK Biobank application number 8508.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
UK Biobank data can be obtained through http://www.ukbiobank.ac.uk/. Genotype data are available in the European Genome-phenome Archive under accession numbers EGAS00001005412, EGAS00001005421, or from the Edinburgh University DataShare Repository (https://datashare.ed.ac.uk/). Finnish cohort samples can be requested from the THL Biobank https://thl.fi/en/web/thl-biobank. PRS SNP inclusion lists and model specifications will be deposited in the PGS catalogue repository (https://www.pgscatalog.org/). Risk scores for UKB participants will be returned to UK Biobank for use by approved researchers.