Abstract
The diagnostic rate of Mendelian disorders in sequencing studies continues to increase, along with the pace of novel disease gene discovery. However, variant interpretation in novel genes not currently associated with disease is particularly challenging and strategies combining gene functional evidence with approaches that evaluate the phenotypic similarities between patients and model organisms have proven successful.
A full spectrum of intolerance to loss-of-function variation has been previously described, providing evidence that gene essentiality should not be considered as a simple and fixed binary property. Here we further dissected this spectrum by assessing the embryonic stage at which homozygous loss-of-function results in lethality in mice from the International Mouse Phenotyping Consortium, classifying the set of lethal genes into one of three windows of lethality: early, mid or late gestation lethal.
We studied the correlation between these windows of lethality and various gene features including expression across development, paralogy and constraint metrics together with human disease phenotypes, and found that the members of the early gestation lethal category show distinctive characteristics and a strong enrichment for genes linked with recessive forms of inherited metabolic disease.
Based on these findings, we explored a gene similarity approach for novel gene discovery focused on this subset of lethal genes. Finally, we investigated unsolved cases from the 100,000 Genomes Project recruited under this disease category to look for signs of enrichment of biallelic predicted pathogenic variants among early gestation lethal genes and highlight two novel candidates with phenotypic overlap between the patients and the mouse knockout.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by NIH grant U54 HG006370 (P.C, C.H.W., V.M-F., J.M., H.M.M., H.P, A-M.M., D.S). Other National Institute of Health grants include R01 HD083311 (J.M), UM1 HG006348 (M.E.D, C.H., J.D.H, L.T, S.W), UM1 OD023221 (C.M. K.C.K.L., L.L), UM1 21 OD023221-09S1 (C.M.), UM1 OD0023222 (S.A.M.), U42 OD011174 and 5UM1 HG006348- 10 (L.T, S.W., J.C., R.B.S., M.S., J.H.). Additional support was provided by the Medical Research Council, Strategic Award A410-53658 (L.T, S.W., J.C., R.B.S., M.S., J.H.). This research was made possible through access to the data and findings generated by the 100,000 Genomes Project (http://www.genomicsengland.co.uk). The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100,000 Genomes Project is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support and we are grateful to both for making this available.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee/IRB of Genomics England gave ethical approval for this work
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
↵# A full list of collaborators is provided at the end of the manuscript
Data Availability
The supplementary files supporting the findings of this study are made publicly available through Zenodo: https://doi.org/10.5281/zenodo.5796622. Viability reports and additional files containing mouse embryo and adult phenotypes are available through the IMPC web portal (https://www.mousephenotype.org/) and the IMPC FTP repository (http://ftp.ebi.ac.uk/pub/databases/impc/).