Abstract
The ability to read is an important life skill and a major route to education. Individual differences in reading ability are influenced by genetic variation, with a heritability of 0.66 for word reading, estimated by twin studies. Until recently, genomic investigations were limited by modest sample size. Here we use a multivariate genome-wide association study (GWAS) method, MTAG, to leverage summary statistics from two independent GWAS efforts, boosting power for analyses of reading ability; GenLang meta-analysis of word reading (N = 27 180) and the 23andMe, Inc., study of dyslexia (Ncases = 51 800, Ncontrols = 1 087 070). We increase effective sample size to N = 102 082, representing the largest genetic study of reading ability, to date. We identified 35 independent genome-wide significant loci, including 7 regions not previously reported. Single-nucleotide polymorphism (SNP) based heritability was estimated at 24%. We observed clear positive genetic correlations with cognitive and educational measures. Gene-set analyses implicated neuronal synapses and proneural glioblastoma pathways, further supported by enrichment of neuronally expressed genes in the developing embryonic brain. Polygenic scores of our multivariate results predicted between 2.29-3.50% of variance in reading ability in an independent sample, the National Child Development Study cohort (N = 6 410). Polygenic adaptation was examined using a large panel of ancient genomes spanning the last ∼15k years. We did not find evidence of selection, suggesting that reading ability may not have been subject to recent selection pressure in Europeans. By combining existing datasets to improve statistical power, these results provide novel insights into the biology of reading.
Competing Interest Statement
PF, AA and the 23andMe Research Team are employed by and hold stock or stock options in 23andMe, Inc. All other authors declare no conflicts of interest.
Funding Statement
SEF and EE are supported by the Max Planck Society (Germany). EE is also supported by a Veni grant of the Dutch Research Council (NWO; VI.Veni.202.072). HSM is supported by the Biotechnology and Biological Sciences Research Council [BB/T000813/1].
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The School of Philosophy, Psychology and Language Sciences Research Ethics Committee of the University of Edinburgh gave ethical approval for this work (PPLSREC 29-1819/8)
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The full summary statistics for each dyslexia GWAS presented in this paper will be made available through 23andMe website (https://research.23andme.com/dataset-access/) to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. SNPs that met suggestive genome-wide significance (P<1×10-5) are made available in the supplementary tables.