PT - JOURNAL ARTICLE AU - Yuan, Bo AU - Schulze, Katharina AU - Batzir, Nurit Assia AU - Sinson, Jefferson AU - Dai, Hongzheng AU - Zhu, Wenmiao AU - Bocanegra, Francia AU - Fong, Chin-To AU - Holder, Jimmy AU - Nguyen, Joanne AU - Schaaf, Christian AU - Yang, Yaping AU - Bi, Weimin AU - Eng, Christine AU - Shaw, Chad AU - Lupski, James R. AU - Liu, Pengfei TI - Sequencing individual genomes with recurrent deletions reveals allelic architecture and disease loci for autosomal recessive traits AID - 10.1101/2021.02.16.21251842 DP - 2021 Jan 01 TA - medRxiv PG - 2021.02.16.21251842 4099 - http://medrxiv.org/content/early/2021/02/22/2021.02.16.21251842.short 4100 - http://medrxiv.org/content/early/2021/02/22/2021.02.16.21251842.full AB - In medical genetics, discovery and characterization of new “disease genes” and alleles depend on patient ascertainment strategies to enrich previously uncharacterized alleles. Here, we present a novel strategy of new allele and gene discovery for recessive/biallelic disease traits. In this approach, patients with large recurrent genomic deletions mediated by nonallelic homologous recombination (NAHR) are sequenced, and new discoveries are revealed in the hemizygous chromosomal regions in trans to the large deletion, essentially enabling haploid genomic segment genetics. We demonstrate through computational analyses that a collection of 30 large recurrent genomic deletions scattered in the human genome contribute to more than 10% of individual disease load for 2.13% of all known “recessive disease genes”. We performed meta-analyses for all literature reported patients affected with the 13 genes whose carrier burden are predicted to be almost exclusively from large recurrent genomic deletions. The results suggest that current sequencing efforts for personal genomes with large recurrent deletions is under-appreciated. By retrospective analyses of previously undiagnostic exome sequencing (ES) data on 69 subjects harboring 26 types of recurrent deletions, probable diagnostic variants were uncovered in genes including COX10, ERCC6, PRRT2 and OTUD7A, demonstrating new disease allele/gene/mechanism characterization. Findings from this study support the contention that more whole genome sequencing (WGS) may further resolve molecular diagnoses and provide evidence for multi-locus pathogenic variation (MPV). Such analyses benefit all stakeholders in both research development and patient clinical care.Competing Interest StatementBaylor College of Medicine (BCM) and Miraca Holdings Inc. have formed a joint venture with shared ownership and governance of Baylor Genetics (BG), which performs clinical exome sequencing and chromosomal microarray genomics assay services. The authors who are affiliated with BG are employees of BCM and derive support through a professional services agreement with the BG. JRL has stock ownership in 23andMe, is a paid consultant for Regeneron Pharmaceuticals and Novartis, and is a co-inventor on multiple United States and European patents related to molecular diagnostics for inherited neuropathies, eye diseases, and bacterial genomic fingerprinting.Funding StatementThis work was supported by the Baylor College of Medicine Precision Medicine Initiative Pilot Award to PL, National Human Genome Research Institute (NHGRI)/ National Heart Lung and Blood Institute (NHLBI) grant number UM1HG006542 to the Baylor Hopkins Center for Mendelian Genomics (BHCMG) to JRL, the National Institute of Neurological Disorders and Stroke (NINDS) R35NS105078 to JRL.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study has been performed in accordance with the research protocol approved by Institutional Review Boards at Baylor College of Medicine. A waiver of informed consent has been obtained (H-41191).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll patient variants are included in the manuscript and will be deposited in ClinVar. The methods and results of the computational analyses are available at https://github.com/liu-lab/cnvNAHR/. https://github.com/liu-lab/cnvNAHR/