Genetic architectures of proximal and distal colorectal cancer are partly distinct
ABSTRACT
Objective An understanding of the etiologic heterogeneity of colorectal cancer (CRC) is critical for improving precision prevention, including individualized screening recommendations and the discovery of novel drug targets and repurposable drug candidates for chemoprevention. Known differences in molecular characteristics and environmental risk factors among tumors arising in different locations of the colorectum suggest partly distinct mechanisms of carcinogenesis. The extent to which the contribution of inherited genetic risk factors for sporadic CRC differs by anatomical subsite of the primary tumor has not been examined.
Design To identify new anatomical subsite-specific risk loci, we performed genome-wide association study (GWAS) meta-analyses including data from 48,214 CRC cases and 64,159 controls of European ancestry. We characterized effect heterogeneity at CRC risk loci using multinomial modeling.
Results We identified 13 loci that reached genome-wide significance (P<5×10⁻⁸) and that were not reported by previous GWAS for overall CRC risk. Multiple lines of evidence support candidate genes at several of these loci. We detected substantial heterogeneity between anatomical subsites. Just over half (61) of 109 known and new risk variants showed no evidence for heterogeneity. In contrast, 22 variants showed association with distal CRC (including rectal cancer), but no evidence for association or an attenuated association with proximal CRC. For two loci, there was strong evidence for effects confined to proximal colon cancer.
Conclusion Genetic architectures of proximal and distal CRC are partly distinct. Studies of risk factors and mechanisms of carcinogenesis, and precision prevention strategies should take into consideration the anatomical subsite of the tumor.
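As an illustration of the meta-analysis step and the genome-wide significance threshold mentioned in the abstract, the following is a minimal sketch of an inverse-variance weighted fixed-effect meta-analysis for a single variant. It is not the authors' pipeline: the function names, the per-study log odds ratios, and the standard errors are hypothetical, and the sketch does not cover the multinomial modeling used for the heterogeneity analysis.

```python
import math

# Conventional genome-wide significance threshold quoted in the abstract.
GENOME_WIDE_ALPHA = 5e-8


def fixed_effect_meta(betas, ses):
    """Combine per-study log odds ratios by inverse-variance weighting.

    Returns the pooled estimate and its standard error.
    """
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    return beta, se


def two_sided_p(z):
    """Two-sided P value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical per-study estimates for one variant (illustrative values only).
study_betas = [0.10, 0.12, 0.08]
study_ses = [0.02, 0.03, 0.025]

pooled_beta, pooled_se = fixed_effect_meta(study_betas, study_ses)
p_value = two_sided_p(pooled_beta / pooled_se)
is_genome_wide_significant = p_value < GENOME_WIDE_ALPHA
```

Inverse-variance weighting gives more influence to studies with smaller standard errors, which is why pooling many studies can push a modest per-study signal below the 5×10⁻⁸ threshold.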
Significance of this study
What is already known about this subject?
Heterogeneity among colorectal cancer (CRC) tumors originating at different locations of the colorectum has been revealed in somatic genomes, epigenomes, and transcriptomes, and in some established environmental risk factors for CRC.
Genome-wide association studies (GWAS) have identified over 100 genetic variants for overall CRC risk; however, a comprehensive analysis of the extent to which genetic risk factors differ by the anatomical sublocation of the primary tumor is lacking.
In this large consortium-based study, we analyzed clinical and genome-wide genotype data of 112,373 CRC cases and controls of European ancestry to comprehensively examine whether CRC case subgroups defined by anatomical sublocation have distinct germline genetic etiologies.
We discovered 13 new loci at genome-wide significance (P<5×10⁻⁸) that were specific to certain anatomical sublocations and that were not reported by previous GWAS for overall CRC risk; multiple lines of evidence support strong candidate target genes at several of these loci, including PTGER3, LCT, MLH1, CDX1, KLF14, PYGL, BCL11B, and BMP7.
Systematic heterogeneity analysis of the genetic risk variants for CRC identified thus far revealed that the genetic architectures of proximal and distal CRC are partly distinct.
Taken together, our results further support the idea that tumors arising in different anatomical sublocations of the colorectum may have distinct etiologies.
Our results provide an informative resource for understanding the differential role that genes and pathways may play in the mechanisms of proximal and distal CRC carcinogenesis.
The new insights into the etiologies of proximal and distal CRC may inform the development of new precision prevention strategies, including individualized screening recommendations and the discovery of novel drug targets and repurposable drug candidates for chemoprevention.
Our findings suggest that future studies of etiological risk factors for CRC and molecular mechanisms of carcinogenesis should take into consideration the anatomical sublocation of the colorectal tumor.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Funding statements and acknowledgements are given in the supplemental text.
Author Declarations
All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Disclosures: The authors disclose no conflicts of interest.
Disclaimer: Where authors are identified as personnel of the International Agency for Research on Cancer / World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy or views of the International Agency for Research on Cancer / World Health Organization.
Data Availability
All genotype data analyzed in this study have been previously published and have been deposited in the database of Genotypes and Phenotypes (dbGaP), which is hosted by NCBI, under accession numbers phs001415.v1.p1, phs001315.v1.p1, and phs001078.v1.p1. The UK Biobank resource was accessed through application number 8614. CRC-relevant epigenome data were retrieved from the NCBI Gene Expression Omnibus (GEO) database under accession numbers GSE77737 and GSE36401.