Abstract
Purpose Structural variants such as multi-exon deletions and duplications are an important cause of disease, but are often overlooked in standard exome/genome sequencing analysis. We aimed to evaluate the detection of copy number variants (CNVs) from exome sequencing (ES) in comparison to genome-wide low-resolution and exon-resolution chromosomal microarrays (CMA), and to characterise the properties of de novo CNVs in a large clinical cohort.
Methods We performed CNV detection using ES of 13,462 parent-offspring trios in the Deciphering Developmental Disorders (DDD) study, and compared them to CNVs detected from exon-resolution array comparative genomic hybridization (aCGH) in 5,197 probands from the DDD study.
Results Integrating calls from multiple ES-based CNV algorithms using random forest machine learning generated a higher quality dataset than using individual algorithms. Both ES- and aCGH-based approaches had the same sensitivity of 89% and detected the same number of unique pathogenic CNVs not called by the other approach. Of DDD probands pre-screened with low resolution CMA, 2.6% had a pathogenic CNV detected by higher resolution assays. De novo CNVs were strongly enriched in known DD-associated genes and exhibited no bias in parental age or sex.
Conclusion ES-based CNV calling has higher sensitivity than low-resolution CMAs currently in clinical use, and comparable sensitivity to exon-resolution CMA. With sufficient investment in bioinformatic analysis, exomebased CNV detection could replace low-resolution CMA for detecting pathogenic CNVs.
Competing Interest Statement
M.E.H. is a co-founder of, consultant to, and holds shares in, Congenica Ltd, a genetics diagnostic company. E.J.G. is an employee of and holds shares in Adrestia Therapeutics Ltd. Other authors declare no potential conflict of interest.
Funding Statement
This study makes use of DECIPHER (http://decipher.sanger.ac.uk), which is funded by the Wellcome.The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003], a parallel funding partnership between Wellcome and the Department of Health, and the Wellcome Sanger Institute [grant number WT098051].
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC). All participants gave informed consent, as required by the REC. All published data were de-identified.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
↵* meh{at}sanger.ac.uk
Data Availability
Sequence and variant-level data and phenotypic data for the DDD study data are available from the European Genome-phenome Archive (EGA; https://www.ebi.ac.uk/ega/) with study ID EGAS00001000775. Clinically interpreted variants and associated phenotypes from the DDD study are available through DECIPHER (https://decipher.sanger.ac.uk).