Abstract
Clinical genetic testing of protein-coding regions identifies a likely causative variant in only ∼35% of severe developmental disorder (DD) cases. We screened 9,858 patients from the Deciphering Developmental Disorders (DDD) study for de novo mutations in the 5’untranslated regions (5’UTRs) of dominant haploinsufficient DD genes. We identify four single nucleotide variants and two copy number variants upstream of MEF2C that cause DD through three distinct loss-of-function mechanisms, disrupting transcription, translation, and/or protein function. These non-coding variants represent 23% of disease-causing variants identified in MEF2C in the DDD cohort. Our analyses show that non-coding variants upstream of known disease-causing genes are an important cause of severe disease and demonstrate that analysing 5’UTRs can increase diagnostic yield, even using existing exome sequencing datasets. We also show how non-coding variants can help inform both the disease-causing mechanism underlying protein-coding variants, and dosage tolerance of the gene.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
NW is currently supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 220134/Z/20/Z). Initial work was completed whilst NW was supported by a Rosetrees and Stoneygate Imperial College Research Fellowship. NQ is supported by the Imperial College Academic Health Science Centre. This work is additionally supported by The Rosetrees Trust (Grant Number H5R01320), Fondation Leducq [16 CVD 03], the National Institute for Health Research (NIHR) Imperial College Biomedical Research Centre, the Cardiovascular Research Centre, Royal Brompton & Harefield NHS Trust, and the NIHR Oxford Biomedical Research Centre Programme. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund (Grant Number HICF-1009-003) a parallel funding partnership between the Wellcome Trust and the Department of Health, and the Wellcome Trust Sanger Institute (Grant Number WT098051). See Nature 2015;519:223-8 or www.ddduk.org/access.html for full acknowledgement. This research was made possible through access to the data and findings generated by the 100,000 Genomes Project. The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100,000 Genomes Project is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The DDD Study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The DDD study data are available under managed access from the European Genome-phenome Archive (Study ID EGAS00001000775), and likely diagnostic variants are available open access in DECIPHER. Code used for modelling case and population variants on the MEF2C protein structure can be found here: https://github.com/matteoferla/MEF2C_analysis