ABSTRACT
Breast and ovarian cancers harboring homologous recombination deficiency (HRD) are sensitive to PARP inhibitors and platinum chemotherapy. Conventionally, detecting HRD involves screening for defects in BRCA1, BRCA2, and other relevant genes. Recent analyses have shown that HRD cancers exhibit characteristic mutational patterns due to the activities of HRD-associated mutational signatures. At least three machine learning tools exist for detecting HRD based on mutational patterns. Here, using sequencing data from 1,043 breast and 182 ovarian cancers, we trained Homologous Recombination Proficiency Profiler (HRProfiler), a machine learning method for detecting HRD using six mutational features. HRProfiler’s performance is assessed against prior approaches using additional independent datasets of 417 breast and 115 ovarian cancers, including retrospective data from a clinical trial involving patients treated with PARP inhibitors. Our results demonstrate that HRProfiler is the only tool that robustly and consistently predicts clinical response from whole-exome sequenced breast and ovarian cancers.
SIGNIFICANCE HRProfiler is a novel machine learning approach that harnesses only six mutational features to detect clinically useful HRD from both whole-genome and whole-exome sequenced breast and ovarian cancers. Our results provide a practical way for detecting HRD and caution against using individual HRD-associated mutational signatures as clinical biomarkers.
Competing Interest Statement
LBA is a co-founder, CSO, scientific advisory member, and consultant for io9, has equity and receives income. The terms of this arrangement have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. LBA's spouse is an employee of Hologic. ENB is a consultant for io9, has equity, and receives income. AA and LBA declare U.S. provisional patent application filed with UCSD with serial numbers 63/366,392 for detecting homologous recombination deficiency from genomics data. ENB and LBA declare U.S. provisional patent application filed with UCSD with serial numbers 63/269,033 for artificial intelligence architecture for predicting cancer biomarkers, including homologous recombination deficiency. LBA also declares U.S. provisional applications with serial numbers: 63/289,601; 63/483,237; 63/412,835; and 63/492,348. All other authors declare that they have no competing interests.
Funding Statement
This work was supported by the US National Institute of Health grants R01ES030993-01A1, U01DE033345, R01ES032547-01, and R01CA269919-01 to LBA as well as LBA's Packard Fellowship for Science and Engineering and Cancer Research UK Grand Challenge Award C98/A24032. The research in this grant was also supported by a Curebound Targeted grant to LBA and RRM. The computational analyses reported in this manuscript have utilized the Triton Shared Computing Cluster at the San Diego Supercomputer Center of UC San Diego. The funders had no roles in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
All data used in this study are publicly available. Genomics data were downloaded from dbGaP accession numbers phs001783.v1.p1 and phs003019 as well as from ftp://ftp.sanger.ac.uk/pub/cancer/Nik-ZainalEtAl-560BreastGenomes/, https://data.mendeley.com/datasets/2mn4ctdpxp/, and https://dcc.icgc.org/releases/PCAWG.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data used in this study were previously generated by others and are publicly available.
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001783.v6.p1
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs003019.v1.p1
ftp://ftp.sanger.ac.uk/pub/cancer/Nik-ZainalEtAl-560BreastGenomes/