Abstract
The genetic aetiology of a major fraction of patients with intellectual disability (ID) remains unknown. De novo mutations (DNMs) in protein-coding genes explain up to 40% of cases, but the potential role of regulatory DNMs is still poorly understood. We sequenced 63 whole genomes from 21 ID probands and their unaffected parents (trios). Additionally, we analysed 30 previously sequenced genomes from exome-negative ID probands. We found that regulatory DNMs were selectively enriched in fetal brain-specific and human-gained enhancers. DNM-containing enhancers were associated with genes that show preferential expression in the prefrontal cortex, have been previously implicated in ID or related disorders, and exhibit intolerance to loss-of-function mutations. Moreover, we found that highly interacting regulatory regions from intermediate progenitor cells of the developing human cortex were strongly enriched for ID DNMs. Furthermore, we identified recurrently mutated enhancer clusters that regulate genes involved in nervous system development (CSMD1, OLFM1, and POU3F3). The majority of the DNMs from ID probands showed allele-specific enhancer activity when tested in a luciferase assay. Using CRISPR-mediated mutation and editing of epigenomic marks, we show that regulatory elements harbouring DNMs indeed function as enhancers and that DNMs at regulatory elements affect the expression of putative target genes. Our results therefore provide new evidence that DNMs in fetal brain-specific enhancers play an essential role in the aetiology of ID.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
National Institute for Health Research (NIHR) Imperial Biomedical Research Centre; Wellcome Trust Institute Strategic Support; Wellcome Trust; Medical Research Council; European Research Council Advanced Grant; UKRI/MRC; Barts charity
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics approval for this study was obtained from the Multi-centre Research Ethics Committees (MRECs), Scotland. The application was approved by the Scottish MREC under application number 05/MRE00/74.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The whole-genome sequence (WGS) data is not publicly available because it contains information that could compromise research participant privacy/consent. However, WGS data and variant calls that support the findings of this study are available on request from the corresponding author [S.S.A.].
Abbreviations
- ASD: Autism Spectrum Disorder
- ATAC-seq: Assay for Transposase-Accessible Chromatin with high-throughput sequencing
- BWA: Burrows-Wheeler Aligner
- ccREs: candidate cis-Regulatory Elements
- ChIP-seq: Chromatin ImmunoPrecipitation sequencing
- CNV: Copy Number Variant
- CRISPR: Clustered Regularly Interspaced Short Palindromic Repeats
- CRISPRi: CRISPR interference
- DDD: Deciphering Developmental Disorders
- DNA: Deoxyribonucleic Acid
- DNM: De novo Mutation
- eN: excitatory Neurons
- ENCODE: Encyclopedia of DNA Elements
- ExAC: Exome Aggregation Consortium
- FBSE: Fetal Brain-Specific Enhancers
- GATK: Genome Analysis Toolkit
- GEL: Genomics England Limited
- GoNL: Genome of the Netherlands
- GQ: Genotype Quality
- GWAS: Genome-Wide Association Studies
- H3K27ac: acetylation of histone H3 at lysine 27
- H3K4me1: mono-methylation of histone H3 at lysine 4
- H3K4me3: tri-methylation of histone H3 at lysine 4
- HEK293T: Human Embryonic Kidney 293T
- HGE: Human-Gained Enhancers
- Hi-C: High-throughput Chromosome Conformation Capture
- HPO: Human Phenotype Ontology
- ID: Intellectual Disability
- iN: interNeurons
- Indels: Insertions and deletions
- IPC: Intermediate Progenitor Cells
- LoF: Loss of Function
- PCHi-C: Promoter Capture Hi-C
- PLAC-seq: Proximity Ligation-Assisted ChIP-seq
- pLI: probability of Loss of function Intolerance
- PTV: Protein-Truncating Variants
- RG: Radial Glia
- SFARI: Simons Foundation Autism Research Initiative
- SID4x: four copies of Sin3 Interacting Domain
- SNV: Single Nucleotide Variant
- TAD: Topologically Associating Domains
- TF: Transcription Factors
- TFBS: TF Binding Sites
- TPM: Tags Per Million
- VCF: Variant Call Format
- WGS: Whole-Genome Sequencing