RT Journal Article SR Electronic T1 A validated heart-specific model for splice-disrupting variants in childhood heart disease JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2023.11.23.23298903 DO 10.1101/2023.11.23.23298903 A1 Lesurf, Robert A1 Breckpot, Jeroen A1 Bouwmeester, Jade A1 Hanafi, Nour A1 Jain, Anjali A1 Liang, Yijing A1 Papaz, Tanya A1 Lougheed, Jane A1 Mondal, Tapas A1 Alsalehi, Mahmoud A1 Altamirano-Diaz, Luis A1 Oechslin, Erwin A1 Audain, Enrique A1 Dombrowsky, Gregor A1 Postma, Alex V A1 Woudstra, Odilia I A1 Bouma, Berto J A1 Hitz, Marc-Phillip A1 Bezzina, Connie R A1 Blue, Gillian A1 Winlaw, David S A1 Mital, Seema YR 2023 UL http://medrxiv.org/content/early/2023/11/27/2023.11.23.23298903.abstract AB Congenital heart disease (CHD) is the most common congenital anomaly. Non-canonical splice-disrupting variants are not routinely evaluated by clinical tests. Algorithms including SpliceAI predict such variants, but are not specific to cardiac-expressed genes. Whole genome (WGS) (n=1083) and myocardial RNA-Sequencing (RNA-Seq) (n=114) of CHD cases was used to identify splice-disrupting variants. Using features of variants confirmed to affect splicing in myocardial RNA, we trained a machine learning model that outperformed SpliceAI for predicting cardiac-specific splice-disrupting variants (AUC 0.92 vs 0.66), and was independently validated in 43 cardiomyopathy probands (AUC 0.88 vs 0.64). Application of this model to 971 CHD WGS samples identified 9% patients with splice-disrupting variants in CHD genes. Forty-one% of predicted splice-disrupting variants were deeply intronic. The burden of variants in CHD genes was higher in cases compared with 2,570 controls. Our model improved genetic yield by identifying splice-disrupting variants that are not evaluated by routine tests.Competing Interest StatementSeema Mital is on the Advisory Board of Bristol Myers Squibb, and Tenaya Therapeutics.Funding StatementThis project was supported by the Canadian Institutes of Health Research (ENP 161429) under the frame of ERA PerMed (RL, MH, CB, SM), the Ted Rogers Centre for Heart Research (SM), and the Data Sciences Institute at the University of Toronto (SM). SM holds the Heart and Stroke Foundation of Canada & Robert M Freedom Chair in Cardiovascular Science. CRB and AVP are supported by the CVON project 2014-18 CONCOR-genes. EO held the Bitove Family Professorship of Adult Congenital Heart Disease until March 2021. GB is supported by a NSW CVRN Career Advancement Grant. JB is supported by a senior clinical investigator fellowship of FWO Flanders and by the Frans Van de Werf fund for clinical cardiovascular research.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Institutional Research Ethics Boards of The Hospital for Sick Children, Amsterdam Medical Center, The Children's Hospital at Westmead and Kompetenznetz Angeborene Herzfehler gave ethical approval for the collection and use of biospecimens through respective registries The Heart Centre Biobank (Ontario, Canada), CONCOR (Amsterdam, Netherlands), Kids Heart BioBank (Sydney, Australia) and German Heart Registry (Berlin, Germany). Written informed consent was obtained from all patients and/or their parents/legal guardians and study protocols adhered to the Declaration of Helsinki.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesSequencing data for the Discovery and Extension cohorts will be deposited in the European Genome-Phenome Archive (EGA), and will be available for download upon approval by the Data Access Committee. Sequencing data for the cardiomyopathy Validation cohort is available in EGA under accession EGAS00001004929, and are available for download upon approval by the Data Access Committee. Control cohort MGRB data are available by controlled access in EGA under accession EGAS00001003511. Additional data generated or analyzed during this study are included in the supplementary information files, and additional raw data used for figures and results are available from the corresponding author on reasonable request. https://ega-archive.org/studies/EGAS00001004929 https://ega-archive.org/studies/EGAS00001003511