RT Journal Article SR Electronic T1 Leveraging sequences missing from the human genome to diagnose cancer JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.08.15.21261805 DO 10.1101/2021.08.15.21261805 A1 Georgakopoulos-Soares, Ilias A1 Barnea, Ofer Yizhar A1 Mouratidis, Ioannis A1 Bradley, Rachael A1 Easterlin, Ryder A1 Chan, Candace A1 Chen, Emmalyn A1 Witte, John S. A1 Hemberg, Martin A1 Ahituv, Nadav YR 2021 UL http://medrxiv.org/content/early/2021/08/17/2021.08.15.21261805.abstract AB Cancer diagnosis using cell-free DNA (cfDNA) can significantly improve treatment and survival but has several technical limitations. Here, we show that tumor-associated mutations create neomers, DNA sequences 11-18bp in length that are absent in the human genome, that can accurately detect cancer subtypes and features. We show that we can detect twenty-one different tumor-types with higher accuracy than state-of-the-art methods using a neomer-based classifier. Refinement of this classifier via supervised learning identified additional cancer features with even greater precision. We also demonstrate that neomers can precisely diagnose cancer from cfDNA in liquid biopsy samples. Finally, we show that neomers can be used to detect cancer-associated non-coding mutations affecting gene regulatory activity. Combined, our results identify a novel, sensitive, specific and straightforward cancer diagnostic tool.Competing Interest StatementI.G.S, O.Y.B, I.M, M.H and N.A are co-founders and equity holders of Neomer Diagnostics. Certain authors of this manuscript have filed a patent application covering embodiments and concepts disclosed in the manuscript.Funding StatementThis work was supported in part by the Benioff Initiative for Prostate Cancer Research. MH was supported by core funding from the Wellcome Trust and core funding from the Evergrande Center.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:All data for this study is deidentified and publicly available. This includes: TCGA data: publicly available mutation calls were obtained at: https://portal.gdc.cancer.gov/ Permission to use the data from https://ega-archive.org/studies/EGAS00001003206 was obtained from the Data Access Committee after contacting Dr Ellen Heitzer whose study was approved by the Ethics Committee of the Medical University of Graz (approval number 21-228 ex 09/10 [prostate cancer] and 29-272 ex 16/17 [high-resolution analysis of plasma DNA]), conducted according to the Declaration of Helsinki and written informed consent was obtained from all patients and healthy probands, respectively.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesFor the TCGA data, mutation calls are publicly available in: https://portal.gdc.cancer.gov/ Permission to use the data from https://ega-archive.org/studies/EGAS00001003206 was obtained from the DAC after contacting Dr Ellen Heitzer. https://portal.gdc.cancer.gov/ https://ega-archive.org/studies/EGAS00001003206