Abstract
Polygenic risk scores (PRSs) are useful to predict breast cancer risk, but the prediction accuracy of existing PRSs in women of African ancestry (AA) remain relatively low. We aim to develop optimal PRSs for prediction of overall and estrogen receptor (ER) subtype-specific breast cancer risk in women of African ancestry. The AA dataset comprised 9,235 cases and 10,184 controls from four genome-wide association study (GWAS) consortia and a GWAS study in Ghana. We randomly divided samples into training and validation sets. Genetic variants were selected by forward stepwise logistic regression or lasso penalized regression in the training set and the corresponding PRSs were evaluated in the validation set. To improve accuracy, we also developed joint PRSs that combined 1) the best PRSs built in the AA training dataset, 2) a previously-developed 313-variant PRS in women of European ancestry, and 3) PRSs using variants that were discovered in previous GWASs in women of European and African ancestry and were nominally significant the training set. For overall breast cancer, the odd ratio (OR) per standard deviation of the joint PRS in the validation set was 1.39 (95%CI: 1.31-1.46) with area under receiver operating characteristic curve (AUC) of 0.590. Compared to women with average risk (40th-60th PRS percentile), women in the top decile of the PRS had a 2.03-fold increased risk (95%CI: 1.68-2.44). For PRSs of ER-positive and ER-negative breast cancer, the AUCs were 0.609 and 0.597, respectively. The proposed PRS can improve prediction of breast cancer risk in women of African ancestry.
Author Summary Polygenic risk scores have been developed to predict breast cancer risk in non-Hispanic white American women, where polygenic risk score combines the effects of multiple single nucleotide polymorphisms. However, reliable polygenic risk scores do not exist for women of African ancestry, including African Americans, African Barbadians, and indigenous Africans. Due to distinct allele frequencies and linkage disequilibrium structures across populations, polygenic risk scores developed in European ancestry populations have an attenuated predictive value when applied to African ancestry populations. In this study, we constructed polygenic risk scores for African ancestry women by using African ancestry datasets. Since the sample sizes of existing African ancestry datasets are much smaller than those from European-ancestry studies, these polygenic risk scores using only African ancestry datasets may have limited accuracy. To increase the prediction accuracy, we constructed joint polygenic risk scores by combining polygenic risk scores trained in African ancestry datasets with polygenic risk scores that were previously developed using a large European ancestry dataset. Results showed that the joint polygenic risk scores could improve prediction of breast cancer risk in women of African ancestry.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The ROOT investigators were supported by National Cancer Institute grants R01-CA228198, R01-CA142996, R01-CA89085, and P20-CA233307. DH and OIO were also supported by Breast Cancer Research Foundation (BCRF-21-071). DH and GG were also partially supported by the National Cancer Institute (R03-CA227357 and R01-CA242929). KLN and SD were also supported by Basser Center for BRCA. FZ was supported by the Susan G. Komen Foundation (TREND21675016). AABC was supported by a Department of Defense Breast Cancer Research Program Era of Hope Scholar Award to CAH [W81XWH-08-1-0383] and the Norris Foundation. Each of the participating AABC studies was supported by the following grants: MEC (National Institutes of Health grants R01-CA63464 and R37-CA54281); CARE (National Institute for Child Health and Development grant NO1-HD-3-3175, K05 CA136967); WCHS (U.S. Army Medical Research and Material Command (USAMRMC) grant DAMD-17-01-0-0334, the National Institutes of Health grant R01-CA100598, and the Breast Cancer Research Foundation; SFBCS (National Institutes of Health grant R01-CA077305 and United States Army Medical Research Program grant DAMD17-96-6071); NC-BCFR (National Institutes of Health grant U01-CA069417); CBCS (National Institutes of Health Specialized Program of Research Excellence in Breast Cancer, grant number P50-CA58223, and Center for Environmental Health and Susceptibility National Institute of Environmental Health Sciences, National Institutes of Health, grant number P30-ES10126); PLCO (Intramural Research Program, National Cancer Institute, National Institutes of Health); NBHS (National Institutes of Health grant R01-CA100374). The Breast Cancer Family Registry (BCFR) was supported by the National Cancer Institute, National Institutes of Health under RFA-CA-06-503 and through cooperative agreements with members of the Breast Cancer Family Registry and Principal Investigators. The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centers in the BCFR, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government or the BCFR. MP was supported by Breast Cancer Research Foundation, Tower Cancer Research Foundation, and a gift from Dr. Richard Balch. AMBER was supported by the National Cancer Institute grants P01-CA151135, R01-CA098663, R01-CA058420, UM1-CA164974, R01-CA100598, P50-CA58223, R01CA202981, U01-CA164974, R01-CA228357, and the University Cancer Research Fund of North Carolina. JRP was supported by the Susan G. Komen Foundation and the Karin Grunebaum Foundation. Pathology data were obtained from numerous state cancer registries (Arizona, California, Colorado, Connecticut, Delaware, District of Columbia, Florida, Georgia, Hawaii, Illinois, Indiana, Kentucky, Louisiana, Maryland, Massachusetts, Michigan, New Jersey, New York, North Carolina, Oklahoma, Pennsylvania, South Carolina, Tennessee, Texas, Virginia). For the studies included in AMBER, individual protocols were approved by the relevant Institutional Review Boards (IRBs) and by the IRBs of participating cancer registries as required. The results reported do not necessarily represent the views of the National Institutes of Health, or the state cancer registries. BCAC is funded by Cancer Research UK [C1287/A16563, C1287/A10118], the European Union's Horizon 2020 Research and Innovation Programme (grant numbers 634935 and 633784 for BRIDGES and B-CAST respectively), and by the European Community's Seventh Framework Programme under grant agreement number 223175 (grant number HEALTH-F2-2009-223175) (COGS). The EU Horizon 2020 Research and Innovation Programme funding source had no role in study design, data collection, data analysis, data interpretation or writing of the report. The Sister Study was funded by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences (Z01-ES044005). GBHS authors acknowledge the research contributions of the Cancer Genomics Research Laboratory for their expertise, execution, and support of this research in the areas of project planning, wet laboratory processing of specimens, and bioinformatics analysis of generated data. This project has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under NCI Contract No. 75N910D00024. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. The success of this investigation would not have been possible without exceptional teamwork and the diligence of the field staff who oversaw the recruitment, interviews and collection of data from study subjects. Special thanks are due to the following individuals: Korle Bu Teaching Hospital,Accra-Dr Adu-Aryee, Obed Ekpedzor, Angela Kenu, Victoria Okyne, Naomi Oyoe Ohene Oti, Evelyn Tay; Komfo Anoyke Teaching Hospital, Kumasi: Marion Alcpaloo, Bernard Arhin, Emmanuel Asiamah, Isaac Boakye, Samuel Ka-chungu and; Peace and Love Hospital, Kumasi: Samuel Amanama, Emma Abaidoo, Prince Agyapong, Thomas Agyei, Debora Boateng-Ansong, Margaret Frempong, Bridget Nortey Mensah, Richard Opoku, and Kofi Owusu Gyimah. The study was further enhanced by surgical expertise provided by Dr Lisa Newman of the University of Michigan and by pathological expertise provided by Drs. Stephen Hewitt and Petra Lenz of the National Cancer Institute and Dr. Maire A. Duggan from the Cumming School of Medicine, University of Calgary, Canada. Study management assistance was received from Ricardo Diaz, Shelley Niwa, and Usha Singh. Appreciation is also expressed to the many women who agreed to participate in the study and to provide information and biospecimens in hopes of preventing and improving outcomes of breast cancer in Ghana.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The IRB of the University of Chicago gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The genotype datasets used in this study are available via dbGaP (https://www.ncbi.nlm.nih.gov/gap/).
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000851.v1.p1
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001265.v1.p1
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000669.v1.p1
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000383.v1.p1
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs002387.v1.p1