Abstract
Background Proteins are essential for the development and progression of cancer and for the human body’s defense against tumor onset. The availability of a large panel of protein measurements and whole exome sequence data in the UK Biobank has enabled the simultaneous examination of plasma protein associations with risk across multiple cancer sites and their potential role in cancer etiology.
Methods We investigated the associations of plasma proteins with incidence of 19 cancers and 9 cancer subsites in up to 44,645 middle-aged adults in the UK Biobank, who had measurements of 1,463 plasma proteins generated using Olink Explore Proximity Extension Assay in baseline blood samples (2006-2010). Using multivariable-adjusted Cox regression, we estimated the risk of each protein with each cancer overall and by time-to-diagnosis after correction for multiple-testing. Identified protein-cancer associations were further assessed in an analysis of cancer risk using cis-pQTL and exome-wide protein genetic scores (exGS) in all UK Biobank participants (n=337,543).
Results We identified 371 proteins associated with the risk of at least one incident cancer, represented by a total of 621 protein-cancer associations. These proteins were associated with cancers of the blood (201 proteins), liver (131), kidney (51), lung (28), esophagus (22), colorectum (15), stomach (8), breast (5), prostate (3), endometrium (3), ovary (2), bladder (1), head and neck (1), and brain (1). 100 of these 621 protein-cancer associations persisted for cases diagnosed more than seven years after blood draw. Of these 621 associations, there was further support from cis-pQTL analyses for the etiological role of TNFRSF14 in risk of non-Hodgkin lymphoma (NHL), and from whole exome protein score (exGS) analyses for 28 other protein-cancer associations, including SRP14 and risk of leukemia. Proteins with directionally concordant evidence from long time-to-diagnosis analyses and from both cis-pQTL and exGS analyses were SFTPA2 for lung cancer, TNFRSF1B and CD74 for NHL, and ADAM8 for leukemia.
Conclusions For the first time using an integrated multi-omics and cross-cancer approach, we have comprehensively assessed the plasma proteome in relation to cancer risk and identified multiple novel etiological candidates. Differences in the levels of many circulating proteins were detectable more than seven years before cancer diagnosis; while some of these are likely to be markers of early cancer processes that may inform risk stratification, and/or risk factors, concordant evidence from genetic analyses suggests that some may have a role in cancer development.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by Cancer Research UK (grant numbers C8221/A29017 and C8221/A29186) to fund the centralized pooling, checking and data analysis. Tammy Tong is supported by an NDPH fellowship. Keren Papier is supported by Wellcome, Our Planet Our Health (Livestock, Environment and People LEAP) [grant number 205212/Z/16/Z]. Trishna Desai is supported by a Cancer Research UK studentship (grant number C8221/A30904). Chibuzor F. Ogamba is supported by an NDPH studentship. The funders had no role in study design, data collection, analysis, decision to publish, or preparation of the manuscript.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by the National Information Governance Board for Health and Social Care and the National Health Service Northwest Multicenter Research Ethics Committee (06/MRE08/65)
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
↵* Keren Papier and Joshua R Atkins are shared first authors.
Conflict of interests: None
Data Availability
The UK Biobank resource is available to bona fide researchers for health-related research in the public interest
Abbreviations
- CI
- confidence interval
- cis-pQTL
- cis protein quantitative trait locus
- exGS
- exome-wide genetic score
- HR
- hazard ratio
- pLI
- probable loss-of-function intolerant