ABSTRACT
While over a hundred genes are significantly associated with autism, little is known about the prevalence of variants affecting them in the general population. Nor do we fully appreciate the phenotypic diversity beyond the formal autism diagnosis. Using data from more than 13,000 autistic individuals and 210,000 undiagnosed individuals, we provide a gene-level map of the odds ratio for autism associated to rare loss-of-function (LoF) variants in 185 genes robustly associated with autism, alongside 2,492 genes displaying intolerance to LoF variants. In contrast to autism-centric approaches, we investigated the phenotype of undiagnosed individuals heterozygous for such variants and show that they exhibit a decrease in fluid intelligence, qualification level and income, and an increase in material deprivation. These effects were larger for LoFs in autism-associated genes than in other LoF-intolerant genes and appeared largely independent of sex and polygenic scores for autism. Using brain imaging data from 21,049 UK-Biobank individuals, we provide evidence for smaller cortical surface area and volume among carriers of LoFs in genes with high odds ratios for autism. Our gene-level map is a key resource to distinguish genes with high and low odds ratio for autism, and highlights the importance of including quantitative data on both diagnosed and undiagnosed individuals to better delineate the effect of genetic variants beyond the categorical diagnosis. Data are available at https://genetrek.pasteur.fr/.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research has been conducted using the Simons Simplex Collection and Simons Powering Autism Research for Knowledge from the Simons Foundation Autism Research Initiative, and the UK-Biobank cohort (projects 51869, 40980 and 18584). This work was supported by a grant from SFARI (#: 240059, TB). We are grateful to all of the families at the participating Simons Simplex Collection (SSC) sites, at the participating Simons Searchlight sites, the Simons Searchlight Consortium, as well as the principal investigators (A. Beaudet, R. Bernier, J. Constantino, E. Cook, E. Fombonne, D. Geschwind, R. Goin-Kochel, E. Hanson, D. Grice, A. Klin, D. Ledbetter, C. Lord, C. Martin, D. Martin, R. Maxim, J. Miles, O. Ousley, K. Pelphrey, B. Peterson, J. Piggot, C. Saulnier, M. State, W. Stone, J. Sutcliffe, C. Walsh, Z. Warren, E. Wijsman). We appreciate obtaining access to SNP arrays, WES and phenotypic data on SFARI Base. Approved researchers can obtain the SSC population dataset and the Simons Searchlight population dataset described in this study by applying at https://base.sfari.org. The authors would like to thank the members of the Human Genetics and Cognitive Functions lab for helpful discussions, and Kuldeep Kumar, Annabelle Harvey, Andreanne Proulx and Hanad Sharmarke for helping with the quality control of the UK-Biobank rs-fMRI preprocessed data. This work was funded by Institut Pasteur, the Bettencourt-Schueller Foundation, Universite de Paris, the Conny-Maeva Charitable Foundation, the Cognacq Jay Foundation, the Eranet-Neuron (ALTRUISM), the GenMed Labex, AIMS-2-TRIALS which received support from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No 777394 and the Inception program (Investissement d Avenir grant ANR-16-CONV-0005). This project has received funding from the European Union s Horizon 2020 research and innovative program CANDY under grant agreement No 847818. The views expressed here are the responsibility of the author(s) only. The EU Commission takes no responsibility for any use made of the information set out. The iPSYCH team was supported by grants from the Lundbeck Foundation (R102-A9118, R155-2014-1724, and R248-2017-2003) and the Universities and University Hospitals of Aarhus and Copenhagen. High-performance computer capacity for handling and statistical analysis of iPSYCH data on the GenomeDK HPC facility was provided by the Center for Genomics and Personalized Medicine and the Centre for Integrative Sequencing, iSEQ, Aarhus University, Denmark (grant to ADB). SBC received funding from the Wellcome Trust 214322\Z\18\Z, support from the European Union s Horizon 2020 research and innovation programme and EFPIA and AUTISM SPEAKS, Autistica, SFARI. SBC also received funding from the Autism Centre of Excellence, SFARI, the Templeton World Charitable Fund, the MRC, and the NIHR Cambridge Biomedical Research Centre. The research was supported by the National Institute for Health Research (NIHR) Applied Research Collaboration East of England. Any views expressed are those of the author(s) and not necessarily those of the funder.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The Institut Pasteur IRB (IRB00006966) exempted this study review according to the 45CFR46 provisions.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
↵* e-mail: thomas.rolland{at}pasteur.fr, thomas.bourgeron{at}pasteur.fr
This version is a fully revised and augmented version of the manuscript previously titled "Towards a gene-level map of resilience to genetic variants associated with autism".
Data Availability
Whole-exome and SNP genotyping data from the SSC and SPARK cohorts can be obtained by applying at SFARI Base (https://www.sfari.org/resource/sfari-base/). The UK-Biobank whole-exome, SNP genotyping and brain imaging data can be obtained by applying at the UK-Biobank database (https://www.ukbiobank.ac.uk/). The human neurodevelopmental transcriptome dataset is available on the BrainSpan database (http://www.brainspan.org). Functional annotations can be obtained from SynGO (https://syngoportal.org/) and Gene Ontology (http://current.geneontology.org/annotations/goa_human.gaf.gz). Electronic health records and healthcare claims data used in the present study for the UK-Biobank individuals are not publicly available due to patient privacy concerns. Prevalence and autism-OR measures can be visualized and downloaded on https://genetrek.pasteur.fr/.
https://www.sfari.org/resource/sfari-base/
http://current.geneontology.org/annotations/goa_human.gaf.gz