Chromosome X-Wide Common Variant Association Study (XWAS) in Autism Spectrum Disorder

Marla Mendes; Desmond Zeya Chen; Worrawat Engchuan; Thiago Peixoto Leal; Bhooma Thiruvahindrapuram; Brett Trost; Jennifer L. Howe; Giovanna Pellecchia; Thomas Nalpathamkalam; Roumiana Alexandrova; Nelson Bautista Salazar; Ethan Alexander McKee; Natalia Rivera Alfaro; Meng-Chuan Lai; Sara Bandres-Ciga; Delnaz Roshandel; Clarrisa A. Bradley; Evdokia Anagnostou; Lei Sun; Stephen W. Scherer

doi:10.1101/2024.07.18.24310640

Abstract

Autism Spectrum Disorder (ASD) displays a notable male bias in prevalence. Research into rare (<0.1) genetic variants on the X chromosome has implicated over 20 genes in ASD pathogenesis, such as MECP2, DDX3X, and DMD. The "female protective effect" in ASD suggests that females may require a higher genetic burden to manifest similar symptoms as males, yet the mechanisms remain unclear. Despite technological advances in genomics, the complexity of the biological nature of sex chromosomes leave them underrepresented in genome-wide studies. Here, we conducted an X chromosome-wide association study (XWAS) using whole-genome sequencing data from 6,873 individuals with ASD (82% males) across Autism Speaks MSSNG, Simons Simplex Cohort SSC, and Simons Foundation Powering Autism Research SPARK, alongside 8,981 population controls (43% males). We analyzed 418,652 X-chromosome variants, identifying 59 associated with ASD (p-values 7.9×10⁻⁶ to 1.51×10⁻⁵), surpassing Bonferroni-corrected thresholds. Key findings include significant regions on chrXp22.2 (lead SNP=rs12687599, p=3.57×10⁻⁷) harboring ASB9/ASB11, and another encompassing DDX53/PTCHD1-AS long non-coding RNA (lead SNP=rs5926125, p=9.47×10⁻⁶). When mapping genes within 10kb of the 59 most significantly associated SNPs, 91 genes were found, 17 of which yielded association with ASD (GRPR, AP1S2, DDX53, HDAC8, PCDH19, PTCHD1, PCDH11X, PTCHD1-AS, DMD, SYAP1, CNKSR2, GLRA2, OFD1, CDKL5, GPRASP2, NXF5, SH3KBP1). FGF13 emerged as a novel X-linked ASD candidate gene, highlighted by sex-specific differences in minor allele frequencies. These results reveal significant new insights into X chromosome biology in ASD, confirming and nominating genes and pathways for further investigation.

1. Introduction

Autism Spectrum Disorder (ASD [MIM 209850]) is a neurodevelopmental condition defined by social communication atypicalities, restrictive interests and repetitive sensory–motor behaviors. It is diagnosed in ∼1% of the population worldwide^1,2, with a 3-4:1 male:female prevalence ratio^3,4.

This difference may have demographic and social components, with one example being that some autistic traits, such as restricted interests, may be more normalized in females compared with male individuals, and consequently ASD could be underdiagnosed⁵. However, there is evidence for a significant biological influence on the sex-differential likelihood of ASD^6–11. For example, females with neurodevelopmental disorders, including ASD, tend to have an excess of deleterious autosomal copy number variants (CNVs), and deleterious autosomal single-nucleotide variants (SNVs)^6,7,10–13. Variation in steroid hormones and differential gene expression in males and females may also influence ASD likelihood and characteristics⁸. Another consideration, which may be influenced by the afore-mentioned observations, is that in a family with a son having ASD, the likelihood of a female sibling also being affected is 4.2%, a number that increases to 12.9% if the sib is male¹⁴.

Collectively the evidence may suggest a hypothetical "female protective effect" whereby females require a quantitatively greater etiologic load than males to exhibit the same degree of clinical presentation of ASD^15–17. The sex ratio contribution approaches 1:1 when considering de novo mutations affecting presumed ‘penetrant’ autosomal genes and copy number variants (CNVs) in ASD and other neurodevelopmental conditions^18–20. However, some studies suggest that the etiology of ASD includes qualitative sex differences, particularly involving genetic variations on the X chromosome.²¹. Sex hormones, known influencers of typical male and female brain development²², may also contribute to sex-varied penetrance in ASD²³. For example, a surge of testosterone in the male fetus, combined with XY chromosomal determinants, may impact the neuroimmune system, affecting dendritic arborization²⁴ and the number of microglia and neurons²⁵, hence contribute to the sex-difference biology of ASD.

Currently, there are 23 SFARI²⁶ score 1 and 36 SFARI²⁶ score 2 genes with evidence to be involved in ASD mapping to the X chromosome²⁶. Nine of these reach a sufficient “Evaluation of Autism Gene Link Evidence (EAGLE)” score to be considered definitively involved in more narrowingly-defined ASD²⁷ (the SFARI and EAGLE genes are often used in diagnostic testing panels for ASD)²⁸. Since upwards of 75% of genome-wide studies do not consider rare or common variants (including polygenic score analysis) on the sex chromosomes in their analysis ²⁹, it is anticipated there are additional gene loci to be validated and others to be discovered (Table S1 summarizes the published genome-wide manuscripts examining the X-chromosome). One study has attempted a genetic association test for ASD using common variants on the X chromosome³⁰, finding TBL1X as a candidate locus (Table S1).

There are, however, complications in studying the X chromosome as it has a lower genetic diversity compared to the autosomes, because, apart from the small pseudoautosomal region, this genomic region does not recombine in males²⁹. Thus, the X chromosome can be more sensitive to evolutionary events, such as sex-bias admixture, bottlenecks and natural selection, and it can have different mutation rates from autosomes³¹. Moreover, in females, the X-inactivation phenomenon can occur where a random X chromosome copy may be inactive (i.e. X chromosome dosage compensation)^29,31,32. The issue of 50% reduced X-chromosome coverage in males (46XY) in microarray and sequencing experiments has also led to the understudy of this important sex chromosome²⁹.

Recent development, however, now enables more robust X-wide association studies (XWAS) by dealing with X-specific quality control, statistical tests stratified by sex, estimation of significant thresholds, and accounting for the potential heterogeneity of allelic effect between males and females and chromosome inactivation bias ^29,33.

Here, we conducted a comprehensive XWAS of ASD from 6,873 ASD individuals (5,639 males and 1,234 females) sourced from three different whole-genome sequencing (WGS) datasets, alongside 8,981 control individuals (3,911 males and 5,070 females), from two additional datasets (Figure 1, Figure S1).

Figure 1: XWAS workflow.

A) Outline of the XWAS pipeline detailing data sources including MSSNG (Autism Speaks), SSC (Simons Simplex Cohort), SPARK (Simons Foundation Powering Autism Research), 1KGP (1000 Genome Project), HostSeq (The Host Genome Sequencing Initiative), and MGRB (Medical Genome Reference Bank). The significance threshold was determined using Bonferroni correction, individually calculated for the Male-XWAS, Female-XWAS, and Both-XWAS approaches. For Meta-XWAS, we used the threshold inferred from the Both-XWAS result. B) Replication and robustness studies conducted.

2. Material and methods

2.1 Database

2.1.1. ASD Datasets

The Autism Speaks MSSNG resource^34,35 is a dataset of genetic and phenotype information from individuals diagnosed with ASD as well as members of their families ^34,35. The affected individuals were diagnosed according to the Diagnostic and Statistical Manual of Mental Disorders (DSM)³⁶, also supported in many individuals by the Autism Diagnostic Interview-Revised (ADI-R)^37,38 and/or the Autism Diagnostic Observation Schedule (ADOS) ^39,40. The Province of Ontario Neurodevelopmental Network (POND) is part of MSSNG and continues to contribute with new data. We used data from 9,621 individuals for the analysis done here.

The Simons Simplex Collection (SSC) includes WGS data from approximately 2,600 ASD simplex families (one affected child plus unaffected parents and siblings)⁴¹. The ASD diagnoses were performed following the University of Michigan Autism and Communication Disorders Center guidance to guarantee uniformity across the 12 university-affiliated research clinics involved in this initiative. We used 9,209 ASD participants from SSC in this analysis. Also from SFARI⁴², the SPARK data (Simons Foundation Powering Autism Research) is an autism research initiative that includes both WES (Whole Exome Sequence) and WGS data from US individuals, besides behaviour and phenotypic information. We used WGS from 12,519 individuals for the X-chromosome analysis.

2.1.2. Population/Control Datasets

For ancestry inference, we used genetic information from 3,202 samples from 1000 Genomes Project of five different ancestries (Africans, Americans, East Asians, Europeans and South Asians). For this, we used the high-coverage 2020 version released by the New York Genome Center (NYGC) (https://www.internationalgenome.org/data-portal/data-collection/30x-grch38)⁴³.

As ASD-controls we used data from 2,561 samples from the Medical Genome Reference Bank (MGRB)⁴⁴, which is a WGS dataset from ∼4,000 healthy, elderly Australians⁴⁴. The MGRB dataset includes most individuals of European ancestry but does not exclude samples from different genetic backgrounds. We also used 9,802 samples from the Host Genome Sequencing Initiative (HostSeq)⁴⁵ which is a collection of 14 Canadian research studies examining responses to COVID-19.

2.2 Quality Control

2.2.1 Autosomes

After selecting biallelic variants we used the SmartQC software (https://github.com/ldgh/MosaiQC-public) to perform the basic control quality steps to remove: (i) variants with the chromosome notation equal to “0”, (ii) remove variants with duplicated IDs, (iii) remove variants and samples with missing data greater than 10% (plink --geno 0.1; plink --mind 0.1), (iv) impute sex codes using SNP data through PLINK’s ’--impute-sex --check-sex’ functionality. (v) remove A|T and C|G variants (ambiguous SNPs), (vi) remove 100% heterozygous variants (plink --hardy) and (vii) annotate the variants for dbSNP ID and LiftOver for hg38 if necessary.

Using plink --bmerge, we merged the data from MSSNG, SSC, SPARK, MGRB, HostSeq. The merged file had a total of 22,242 samples and 1,407,803 variants (Figure 1, Figure S1).

For the XWAS analysis using Principal Components (PCs) based on the autosomal information as covariates for logistic regression, we cleaned our data based on the pipeline of Leal et al (2023)⁴⁶ (https://github.com/MataLabCCF/XWAS) in the merged file with MSSNG, SSC, SPARK, MGRB, and HostSeq. This cleaning pipeline adds the following steps; (i) removal of monomorphic SNPs, or those located in structural variants, using the list of SNPs located in structural variants from Le Guen et al. (2021)³² created using Tri-Typer⁴⁷. (ii) remove of potential probe sites using gnomAD⁴⁸, and (iii) relationship control using KING⁴⁹ to calculate the kinship coefficient and NAToRA⁵⁰ to remove samples with relatedness closer than second degree. After this XWAS cleaning pipeline the autosomal file had 1,075,065 SNPs and 21,089 samples (Figure S1).

The final XWAS analysis was restricted to individuals with more than 75% European ancestry. To achieve this, we conducted an ancestry check utilizing ADMIXTURE software⁵¹ with five clusters. The reference populations included Europeans, Africans, East Asians, South Asians, and Americans from the 1000 Genomes Project (1KGP)⁵². After merging our XWAS data with samples from the 1000 Genomes Project (1KGP), which underwent the same quality control process, we obtained a dataset containing 24,291 samples (Figure S1). To enhance data quality for ancestry inference, we conducted a filtering step to exclude variants exhibiting high levels of Linkage Disequilibrium (LD), using the command ’plink --indep-pairwise 100 10 0.1’. Additionally, variants located in regions known to be under recent selection were removed from the dataset^53–55. We then ran ADMIXTURE using a total of 131,291 SNPs.

2.2.2 X Chromosome

After completing the general quality control steps described in section 2.2.1, we separated the variants on the X chromosome (coded as chromosome 23 in PLINK) from those in the pseudoautosomal regions (coded as chromosome 25 in PLINK). This separation was based on a dbSNP reference file. We also applied the XWAS cleaning pipeline (Figure 1)⁴⁶ for the X chromosome, which includes; (i) selecting the remaining individuals from the autosomal cleaning process, including samples without relatedness greater than second degree and samples with more than 75% of European ancestry, (ii) removal of SNPs following the same parameters used for the autosomes, besides SNPs with differential missingness between ASD individuals and controls with p-values lower than 10^-5, (iii) removal of SNPs with differential missingness between males and females with p-values lower than 10^-5, (v) heterozygous SNPs found in males were assigned as missing data. For the XWAS logistic regression we used a final of 418,652 X chromosomal variants and 15,499 samples (Figure S1).

2.3. XWAS

After data cleaning, we conducted the XWAS analysis using two input files. The first file contained autosomal data with 1,075,065 variants, intended for principal component inferences to be used as covariates in the XWAS logistic regression. The second file consisted of X chromosome data with 418,652 variants. Both files contained data from the same 15,854 samples. Among these samples, 9,550 were male (3,911 controls and 5,639 ASD individuals), and 6,304 were females (5,070 controls and 1,234 ASD individuals). The principal component inference was done with the GENESIS package stratified by sex (one PCA for males and one for females), where all samples with standard deviation greater than three from major clusters were defined as being outliers and removed from subsequent analyses. For the primary XWAS analysis, we utilized X chromosome data for principal component analysis (PCA). Additionally, we conducted a replication analysis using data from autosomal chromosomes. The resulting 10 PCs from males only and females only were employed as covariates for Male-XWAS and Female-XWAS, respectively. Non-outlier males and females were merged to create both datasets. Subsequently, this new dataset underwent another PCA, where outliers were detected and excluded based on the same parameters. The 10 resulting PCs from this process were used as covariates in “Both-XWAS” (Figure 1).

The final regression analysis was performed using logistic regression in PLINK2 ⁵⁶ (--glm) for three approaches (Figure 1); (i) Male-XWAS: Based on 5,639 ASD male individuals and 3,911 male controls. This approach used the 10 top PCs from males as covariates; (ii) Female-XWAS: Based on 1,234 ASD female individuals and 5,070 female controls. This approach used the 10 top PCs from females as covariates; and (iii) Both-XWAS: Based on 6,873 ASD individuals and 8,981 controls. This approach used the 10 top PCs from both and sex as covariates.

We also performed a meta-analysis from the sex-stratified results (Male-XWAS and Female-XWAS) implemented on GWAMA^57,58. This result incorporates the "gender_heterogeneity_p-value," which is derived from assessing heterogeneity between sex-specific allelic effects. This result incorporates the "gender_heterogeneity_p-value," which is derived from assessing heterogeneity between sex-specific allelic effects using one degree of freedom. This test involved analyzing males and females separately in each XWAS. It entailed obtaining male- and female-specific allelic effect estimates in a fixed-effects meta-analysis, followed by testing for heterogeneity between the sexes ⁵⁸.

2.4 X-Chromosome Significance Threshold

Given that our association tests are conducted on a single chromosome, the number of effective tests performed is lower compared to a genome-wide analysis. Typically, in genome-wide analyses, the significance threshold is set at p-value < 5×10^-8. To determine an appropriate significance threshold for our XWAS analysis, we applied a Bonferroni correction by dividing 0.05 by the number of effective tests ⁵⁹.

The number of effective tests (N_eff) was calculated by dividing the squared number of variants by the sum of the R² correlation coefficients between all variants present in the dataset ⁴⁶: V= Total number of variants

L = the R² correlation coefficient between all variants (V) present in the datasets (L is a matrix with size V by V); i and j = Matrix indexes.

To generate the R2 matrix among all variants in our dataset, we utilized the command ’plink -

-r2 square gz yes-really’. The sum of our corresponding matrix was: Female: 37441288.90; Male: 27721944.23 and Both: 53006792.62. Thus, the respective number of effective tests (N_eff): Female: 418,652²/37441288.90 = 4,681.18; Male: 418,652²/ 27721944.23 = 6,322.40; Both: 418,652²/ 53006792.62 = 3,306.54 with the final significance threshold being; Female: 0.05/4,681.18 = 1.07×10^-5; Male: 0.05/6,322.40 = 7.9×10^-6; Both: 0.05/3,306.54 = 1.51×10^-5.

2.5. sdMAF

Sex differences in allele frequencies were analyzed with the sdMAF software ^60,61. We initially split the pseudoautosomal regions (PAR) with the PLINK⁶² --split-par hg38 command. Since PLINK was not able to properly handle male homozygous in the bed file and simply assigned them all to missing, we bypassed the problem by changing the chromosome code to 22 prior to generating genotype counts. The chromosome number in the ‘gcount’ file was then re-coded back to 23 and subsequently pipelined into the sdMAF software as suggested by the sdMAF documentation⁶¹. To select the significant sdMAF results, we utilized the same conservative, Bonferroni-corrected significance level for XWAS-Both analysis (1.51×10^-5), given that we are testing the same number of SNPs.

2.6 Rare variant analysis

We further investigated the impact of rare genetic variations inside the candidate regions identified from the XWAS analysis by comparing the frequency of rare predicted damaging single nucleotide variants (SNVs; gnomAD frequency <0.1%), insertion and deletions smaller than 50bp (indels; gnomAD frequency <0.1%), and exonic copy number deletions (CNV deletions; gnomAD frequency <1%) impacting genes between ASD-probands and family members.

The initial reads were aligned to the GRCh38 human genome reference. Small variants (SNVs and Indels) and CNVs were called using GATK and in-house CNV calling pipeline, respectively⁶³. Standard output files were generated, including CRAMs for alignment, and VCFs for small variants, and CNVs. Per sample analysis metrics were also generated. The small variant calls were annotated using an ANNOVAR-based pipeline⁶⁴. Using an in-house script, we filtered high quality small variants that were found in less than 0.1% of gnomAD samples. We then selected only damaging small variants if they result in a stop gain or a frameshift, or, they are nonsynonymous SNVs predicted to be damaging by four different in-silico tools (i.e., sift_score⁶⁵ <=0.05, polyphen_score⁶⁶>=0.9, mt_score⁶⁷>=0.5, and CADD_phred⁶⁸ >= 15). For this SNVs analysis, besides the WGS data previous described (session 2.1.1), we also used whole exome data (WES) from SPARK, given a final number of 47,840 ASD-probands (79% males), 19,820 ASD-unaffected siblings (47% males), and 63,692 ASD-parents (40% fathers).

The deletions were detected using a previously described read depth-based pipeline^34,63. We only considered high-quality deletions, which were tagged based on the following criteria; (i) length >= 5kb, ii. called by both ERDS⁶⁹ and CNVnator⁷⁰ with at least 50% reciprocally overlapped in length, (ii) having < 70% of its length overlap with repetitive or low complexity regions of the genome (i.e., telomere, centromere, and segmental duplications), and (iii). for the X chromosomal calls in males, CNVs in PAR were filtered out. For the CNV comparison we only used WGS data, and we also included data from the new MSSNG release (MSSNGdb7), resulting in a total of 9,691 ASD-probands (82% males), 5,591 ASD-unaffected siblings (38% males) and 17,470 ASD-parents (50% fathers).

For both small variants and deletions, independently, we performed an association analysis using a conditional logistic regression stratifying the test by the family. For sex-combined analysis, we also used sex as covariates.

2.7 Brain gene Expression Analysis

Exon-averaged gene expression data were obtained from BrainSpan (Allen Brain Atlas)⁷¹. With this microarray data, we further applied quantile normalization and standardization across both genes and samples for the comparative analysis. Subsequently, we generated a brain map plot wherein colors ranging from blue (indicating downregulation) to red (indicating upregulation) denote the average expression levels of the selected genes within each brain region. This visualization was created for five developmental stages: Early Fetal (less than 16 weeks), Late Fetal (more than 16 weeks to birth), Early Childhood (birth to three years old), Childhood/Teenage (three years to 20 years), and Adulthood (more than 20 years).

3. Results

3.1 Association Test

After performing the four different XWAS tests (Figure 1), which included sex-stratified tests (Male-XWAS and Female-XWAS), sex-combined mega-analysis (Both-XWAS), and meta-analysis (Meta-XWAS), we identified 59 variants as significant in at least one of the four approaches (Table S2). These variants correspond to a total of 20 risk loci, encompassing 23 genes with variants in high linkage disequilibrium (r^2 > 0.7) with the lead SNP (Table 1). The genomic loci detected from the four XWAS approaches utilized are shown in Table 1. Among these, 42 were found uniquely by a unitary XWAS approach: 27 in the Male-XWAS, five in the Female-XWAS, one in the Both-XWAS (performed with males and females together, using sex as a covariate), and nine in the Meta-XWAS (a meta-analysis of Male-XWAS and Female-XWAS results using GWAMA⁵⁷ software, because it includes a "meta-analysis using sex-differentiated and sex heterogeneity"⁵⁷). Additionally, 17 variants showed significant p-values in more than one test (Table S2). Each test underwent visual inspection via histograms and QQ plots, revealing no distortions as indicated by the genomic inflation factor (λ=0.928 - 1.036), which measures systematic bias in the statistical test (Figure S2, Figure 2). Among the 59 variants, 30 exhibit a “gender heterogeneity p-value” (test for heterogeneity between sexes with one degree of freedom)⁵⁸ below 0.05, all of them in the sex stratified approaches (26 in the Male-XWAS and four in the Female-XWAS), suggesting significant differences in allelic effects between males and females for these variants. Notably, two of these variants, identified in the Male-XWAS within the ASB11 gene, attained a “gender heterogeneity p-value” of less than 9×10⁻⁵. In the presence of heterogeneity in allelic effects between the sexes, a loss of power for sex-combined association tests can occur if the allele has opposite direction of effect in the other sex⁵⁸. This type of biological phenomena may explain why variants are not detected in Both-XWAS and Meta-XWAS.

Figure 2. ASD-XWAS manhattan and qq plots.

Each panel shows a Manhattan plot on the left part and qqPlot on the right part. The graphs result from XWAS testing using 6,873 ASD individuals (5,639 males and 1,234 females) and 8,981 controls (3,911 males and 5,070 females) with a total of 418,652 X chromosomal variants originated from WGS data (46 variants in PAR regions) for (A) Male-XWAS, B) Female-XWAS, C) Both-XWAS, and D) the Meta-XWAS, a meta-analysis from the sex stratified approaches implemented on GWAMA⁵⁷.

View this table:

Table 1. Genomic Risk Loci.

Genomic loci detected from four XWAS analyses (Male-XWAS, Female-XWAS, Both-XWAS, Meta-XWAS). The unique ID as well as the p-value refer to the lead SNPs specified. The 23 genes in the last column are within the gene locus, and encompass variants exhibiting strong linkage disequilibrium with the lead SNP (r2 > 0.7).

3.1.2 Robustness Study

We performed XWAS analyses using various configurations, including one ASD dataset against all controls, as well as all ASD against each control dataset, to mitigate potential bias stemming from dataset heterogeneity and to conduct robust sanity replications, (Table S3). Consequently, we obtained XWAS results for; (i) MSSNG as cases versus HostSeq and MGRB as controls, (ii) SSC as cases versus HostSeq and MGRB as controls, (iii) SPARK as cases versus HostSeq and MGRB as controls, (iv) MSSNG, SSC, and SPARK as cases versus HostSeq as controls, (v) MSSNG, SSC, and SPARK as cases versus MGRB as controls and (vi) control versus control (sanity test; MGRB was labeled as cases and HostSeq as control).

Among the 27 variants exclusively found in males, all replication tests yielded a p-value below 0.05, except for eight variants solely in robustness test "v" (involving all case datasets versus MGRB controls). Notably, these eight variants reside within the first significant region identified in the Male-XWAS, spanning between 15.27 and 15.36 Mb. Even after excluding these eight variants, we retained 19 significant SNPs in this region with a p-value lower than 0.05 across all replication tests.

Regarding the five variants in the detected exclusive in Female-XWAS, two did not reach a p-value lower than 0.05 across all robustness tests. One of them, rs749183760 in ENOX2, was not captured by test "ii". Additionally, the intergenic SNP rs182249604, located within the first significant genomic region (between 16.7Mb and 17.33Mb), did not yield a p-value lower than 0.05 on test “iii”. However, even after excluding these variants, we still observe significant SNPs in this region, including variants within the TXLN gene.

Considering the results from the Both-XWAS replications, there is only one significant variant (rs767542284 in PDHA1), that also demonstrated a significant p-value in all subset (i-v) analyses; five of nine variants detected in the Meta-XWAS achieved significant p-values in all cohort tests, and these are located within DMD, PABPC1L2A and PCDH11X. When comparing the significant variants detected in both Meta-XWAS and Both-XWAS (8 variants located on PTCHD1-AS, HDAC8, and LOC124905257 genes), we replicated five results across all cohort tests (i to v). Notably, the variants that did not reach a significant p-value in all tests include two variants in the HDAC8 gene (rs5958792, rs73218354) and one intergenic variant (rs5981334), all of which were not replicated only in test "iii" (SPARK versus all control cohorts).

All six results identified in both the Female-XWAS and another XWAS (Meta-XWAS, Both-XWAS) were situated within two different genes, ENOX2 and HTR2C. None of these variants achieved a p-value < 0.05 in test ii (SSC versus all controls). Three significant variants were detected in both the Male-XWAS and Meta-XWAS. All three variants had significant p-values in all tests except for one variant (rs12835197 - PCDH19) in test "v" (All cases versus MGRB).

We performed a sanity check employing logistic regression, where controls were compared against controls (Test “vi”), using MGRB as cases and Hostseq as controls. To fortify the reliability of our findings, we assessed whether our candidate variants yielded non-significant p-values (≥0.05) in this sanity test as well. At least one variant in the genes ASB9 and ASB11 from Male-XWAS analysis meet the criteria of the sanity test. Hence, we retained both genes in the final results. In the Female-XWAS results, three out of the five detected variants failed to pass the sanity test, resulting in only the ENOX2 gene being included among the final genes.

3.1.3 XWAS replication using Autosomal Principal Components as covariates

Our principal component analysis (PCA) focused solely on the X chromosome due to its unique biological features (see Methods section 2.3). Therefore, the top 10 PCs were then considered as covariates⁴⁶. We also implemented XWAS with autosomal PCs to assess the generalizability of findings (Table S4, Figure S3 and Figure S4). The modified model revealed a total of 58 significant loci spanning over 12 genes: ASB9, ASB11, PIGA, PCDH19, TXLNG, HTR2C, ENOX2, PDHA1, PTCHD1-AS, DMD, HDAC8, and PABPC1L2A (Table 2); 11 of these overlap with the genes detected by the primary analysis using X chromosome-only PCs (Figure S4). The PIGA gene was identified exclusively with the autosomal PC model, noting it is located in proximity to ASB11(3.8kb) and ASB9 (48.9kb), which were detected in the Male-XWAS results using the X chromosome PCs. Among the 14 genes discovered by the XWAS using the X chromosome-only PCs only X, LOC124905257 and PCDH11X were not present when using autosomal PCs in the XWAS.

View this table:

Table 2. Significantly associated ASD genes based on our main XWAS and sdMAF results.

The values in red are the p-values considered significant based on the specific Bonferroni corrections (Males: 7.9×10^-6, Females: 1.07×10^-5, Both: 1.51×10^-5).

The genomic control lambdas observed in the QQ plots ranged from 0.913 to 1.119 (Figure S4). Additionally, the correlation between the XWAS results obtained from X chromosome PCs and autosomal PCs was 0.76 for males and 0.79 for females (Figure S3). Overall, the results from the main analysis are generalizable and robust.

3.1.4 Annotation

All XWAS results (Male-XWAS, Female-XWAS, Both-XWAS, Meta-XWAS) were annotated using both modules of FUMA⁷²: SNP2GENE and GENE2FUNC. SNP2GENE mapped the genes corresponding to the significantly associated SNPs, while GENE2FUNC annotated gene expression and gene sets from the previously mapped genes. The 59 significant associated variants were mapped (within a 10kb distance) to a total of 93 genes (Table S5). Through the gene-based test conducted using MAGMA⁷³, significant associations were identified for ASB11 (p-value = 2.87×10^-6) in the Male-XWAS (Figure 2, Figure 3-A), where initial SNPs were mapped to 704 genes given a significance threshold defined as 7.1×10^-5 (0.05/704). This gene was also mapped in the Female-XWAS analysis, being situated within at least 10kb distance from a significantly associated SNP. Notably, ASB11 is located within one of the most significant Linkage Disequilibrium (LD) regions identified in the Male-XWAS results, spanning between 15.27 and 15.36Mb, which also encompasses the genes ASB9 and PIGA (Genomic Locus 1-Male-XWAS; Table 1). The corresponding LocusZoom plot, along with the Combined Annotation Dependent Depletion (CADD)^68,74 score and RegulomeDB score^75,76 plots for this region, are presented in Figure 3-A. The CADD score assesses the deleteriousness of genetic variants, while the RegulomeDB score evaluates their functional significance, aiding in the interpretation of their potential biological effects. When considering only significant SNPs falling internal to the gene rather than within a 10kb range, 13 candidates were identified: ASB11, ASB9, DMD, ENOX2, HDAC8, HTR2C, LOC124905257, PABPC1L2A, PCDH11X, PCDH19, PDHA1, PTCHD1-AS and TXLNG (Figure 2, Table 2).

Figure 3. Annotation details for the genomic risk Locus 1_Male-XWAS and 1_Both XWAS.

A) Details for the genomic risk locus 1_Male-XWAS. The upper panel shows the LocusZoom plot for the correspondent region with the lead SNP rs12687599 highlighted in purple. The used LD reference panel was Europeans from 1000G data⁷⁷ for both sexes together. Following the LocusZoom plot, on the left, we provide annotation results displaying CADD and RegulomeDB scores. On the right, the Manhattan plot illustrates the gene-based test computed by MAGMA in FUMA. The SNPs were mapped to 704 protein-coding genes, hence the genome-wide significance threshold (indicated by the red dashed line in the plot) was conservatively set at P = 0.05/704 = 7.10×10^-5. B) LocusZoom plot of the genomic locus 1_Both-XWAS, followed by the CADD and Regulome profiles of the same region.

In the sex-stratified analysis, the majority of the SNPs found to have significant association were located in intronic regions, accounting for 68.4% of the Male-XWAS results and 60% of the Female-XWAS results. Within the Both-XWAS results, 45.8% of the SNPs were intergenic, 28.9% were non-coding RNAs, 22.9% were intronic, and an additional 2.4% located in UTR regions.

3.2 Sex differences in minor allele frequencies (sdMAF)

Evolutionary forces can influence allele frequency on the X chromosome between sexes compared to the autosomes ^78,79. To ensure the effectiveness of the quality control process, we have implemented sdMAF^60,61 analysis on the same set of genomic data. Subsequently, we removed all sdMAF significant results from the XWAS findings. These signals could be capturing either true biological sex differences or genotyping error, inducing spurious association between ASD and variants.

However, the sdMAF results also provided valuable insights. We applied sdMAF separately to ASD individuals and controls cohorts; and we observed scatters of statistically significant variants in both ASD individuals and controls (Figure 4). Single or few scatters were expected to be caused by genotyping error. The results from the region of FGF13 gene was particularly prominent. Notably, FGF13 is a previously ASD-associated gene with a SFARI score of 3S (Figure 4). Interestingly, the detection of this region is solely through sdMAF but not via logistic regressions, highlighting the potential of sdMAF being used as a tool for association studies of sex-biased diagnoses.

Figure 4. sdMaf Results.

A) Left, the Manhattan plot illustrates the sdMAF p-values obtained from ASD datasets exclusively. Right, the Manhattan plot represents the sdMAF p-values obtained from control datasets only. B) The LocusZoom plot displays the region identified in the sdMAF-cases results, highlighting the gene FGF13. The LD reference panel used was Europeans from 1000G data⁷⁷ for both sexes together.

3.3 Rare variants analysis

Recognizing the significant role of rare variants in ASD genetic architecture ^34,80–83, we checked in the same ASD datasets (MSSNG, SSC and SPARK) for rare predicted damaging small variants (SNV/indels with less than 0.1% of frequency on gnomAD⁴⁸) and CNV deletions (<1% frequency in gnomAD⁴⁸) overlapping at least one exon of the 14 significant detected genes (13 from XWAS and one from sdMAF).

Among the total of 14 XWAS genes analyzed (Figure 5), 11 exhibited rare predicted damaging SNVs. Among the remaining three genes, two were non-coding RNAs (LOC124905257 at HG38 chrX:20606477:20727481 and PTCHD1-AS at HG38 chrX:22193005:23293146), while the third was PABPC1L2A (HG38 chrX: 73077276:73079512). In the male frequency comparisons, almost all genes showed a higher frequency of these variants in ASD-probands compared to other family members, except for PCDH11X and PCDH19. In females, five genes (ASB11, DMD, HDAC8, PCDH19, and HTR2C) showed a higher frequency in ASD-probands. Combining both sexes, four genes (ASB11, DMD, HDAC8, HTR2C, and FGF13) showed a higher frequency in ASD-probands.

Figure 5. Rare Variant Frequency Analysis.

The figure compares the frequencies of rare variants among different groups: ASD-Probands (red bars), ASD-Unaffected Siblings (green bars), and ASD-Parents (gray bars). The left panel shows the frequency of rare predicted damaging SNVs (<0.1% frequency in general population) across 11 genes (ASB9, ASB11, TXLNG, PDHA1, PTCHD1-AS, DMD, HDAC8, PCDH11X, PCDH19, HTR2C, ENOX2, FGF13) detected through XWAS common variant data analysis (Table 2). The right panel illustrates the frequency of rare CNV deletions overlapping exons (< 1% frequency in general population), found in four XWAS-genes (PTCHD1-AS, DMD, ENOX2, FGF13). In each graph, the corresponding p-value from a conditional logistic regression is shown at the bottom, conducted separately for males, females, and both sexes combined (using “sex” as covariate).

We successfully identified rare deletions overlapping exons in the joint ASD datasets for the gene detected in sdMAF (FGF13) and for three of the 13 genes from the main XWAS results, including PTCHD1-AS, DMD, and ENOX2 (Figure 5, Table 2). Comparing the frequency of these CNVs in unaffected family members, we observed an enrichment in cases compared to unaffected family members for deletions impacting PTCHD1-AS in males, DMD and ENOX2 in females and both sexes combined, and FGF13 in males and both sexes combined. However, none of the association test results reached a p-value lower than 0.05, but this might be expected because of sample size.

3.4 Brain Gene expression analysis

Utilizing data from BrainSpan (Allen Brain Atlas)⁷¹, we generated a visualization to examine the mean expression patterns of 12 of 14 candidate genes detected in our previous analysis across various brain regions during distinct developmental periods (Figure 6). Data for LOC124905257 and PTCHD1-AS were not available in BrainSpan. In general, the ASD-XWAS candidate genes showed different expression levels in all different time ranges when compared with the plotted controls (Figure 6 last three columns).

Figure 6. Gene Expression by Brain Regions in different development times.

Brain map showing the gene expression levels in different parts of the brain in five developmental stages (Early Fetal, Late fetal, Early childhood, Childhood/Teenage and Adulthood). Left to right shows the gene expression levels from all 12 ASD-candidate genes with available expression data (ASB11, ASB9, DMD, ENOX2, FGF13, HDAC8, HTR2C, PABPC1L2A, PCDH11X, PCDH19, PDHA1, TXLNG), followed by three genes from Male-XWAS (ASB11, ASB9, PCDH19), three genes from Female-XWAS (TXLNG, HTR2C, ENOX2) and the correspondent control comparison with all the ∼800 X chromosome genes in both sexes and also in male brains only and female brains only. The color scales go from blue (downregulated) to red (upregulated).

During the early fetal stage, the 12 XWAS genes exhibit up-regulation in the cerebellum, which contrasts with the pattern observed in the Female-XWAS genes, showing notably low expression levels in the same region. In males, XWAS genes in the early fetal stage demonstrate down-regulated expression in the primary motor cortex and the primary visual cortex, alongside up-regulated expression in the prefrontal, primary somatosensory, and posteroventral parietal cortex. In this stage, the most expressed brain regions in Female-XWAS genes include the primary visual, primary auditory, and temporal cortex.

Transitioning to the late fetal stage, the most pronounced pattern includes down-regulated expression of Male-XWAS genes across nearly all analyzed brain regions. In contrast, All-XWAS genes exhibit heightened expression in the primary auditory, temporal, and prefrontal cortex. In early childhood, spanning the initial three years of life, a consistent down-regulated expression pattern is observed in the cerebellum across all approaches (All XWAS genes and sex-stratified comparisons). Furthermore, during this phase, the posteroventral parietal cortex displays elevated expression levels for All-XWAS genes.

From ages three to 20 (childhood to teenage years), X candidate genes remain downregulated in the cerebellum, while both sex-stratified approaches indicate up-regulation in the primary auditory and visual cortex. Additionally, the prefrontal cortex exhibits high expression levels for Male-XWAS genes.

In adulthood (after 20 years), the cerebellum maintains a down regulated pattern for all XWAS genes and for the genes identified in Male-XWAS, while exhibiting slightly higher expression levels in the genes identified in Female-XWAS. Conversely, the prefrontal cortex demonstrates low expression levels for the genes identified in Male-XWAS, with an upregulation pattern observed in the genes identified in Female-XWAS.

4. Discussion

Our XWAS analyses identified 59 SNP variants on the X chromosome that exhibited a statistically significant association with ASD (Table S2). These variants were mapped to 91 distinct genes, of which 11 had previously been associated with ASD through the detection of rare variants or CNVs, as reported in databases (Table S5). Out of the 59 significant variants identified in the main analysis, 35 were also successfully detected in our robustness study (Table S3), spanning all five different tests. Among these, 33 variants passed the sanity test by not reaching a significant value in the Control vs Control test “vi”. These 33 X-Chromosome variants were located in intergenic regions as well as in the genes ASB9, ASB11, PDHA1, LOC124905257, PTCHD1-AS, HDAC8, PABPC1L2A, and PCDH11X (Table 2).These new results will increase our understanding of the genes involved in ASD and provide a basis for improving polygenic risk scores (PRS), which currently are significantly underpowered regarding ASD^34,84.

In the Male-XWAS results we detected an LD region encompassing the genes ASB9, ASB11 and PIGA. The lead SNP, rs12687599, is reported in the GWAS catalog⁸⁵ for being associated with sex hormone-binding globulin levels⁸⁶. Autism was previously associated with a decreased level of maternal serum sex hormone binding globulin⁸⁷. This finding could imply an etiological association between sex hormone pathways and ASD status particularly in males ^16,23. Still in the Male-XWAS, we identified the gene PCDH19, also found in the Meta-XWAS. This gene has the highest significance score of 1 in the SFARI database, indicating its significant relevance to ASD²⁶. It is also classified as syndromic, primarily expressed in brain tissue and plays a role in cell adhesion⁸⁸, suggesting that mutations within it are associated with an increase in ASD likelihood and are consistently linked to neurodevelopmental and neuropsychiatric characteristics beyond those necessary for an ASD diagnosis.

The Both-XWAS and Meta-XWAS identified significantly associated variants in the lncRNA PTCHD1-AS (PTCHD1 antisense RNA)⁸⁹. This gene is part of a complex on chromosome Xp22.11, which also encompasses DDX53, placing this locus among the most prevalent and impactful genetic factors for ASD⁹⁰ and other neurodevelopmental disorders. Ross et al., 2021⁸⁹, conducted an analysis compiling data from previously reported variants on PTCHD1-AS. They found that 69% of these variants associated with this long non-coding RNA (lncRNA) are linked to ASD or ASD-related features. Consequently, the EAGLE score, a metric evaluating a gene’s relevance to ASD, definitively assigns PTCHD1-AS a final score of 17.6²⁷. However, despite this association, the functional significance of these variants remains unknown.

In the Meta-XWAS we identified significant variants associated with ASD in DMD and HDAC8. Notably, HDAC8 was also highlighted in the Both-XWAS results. Both genes carry a syndromic status on the SFARI gene score. Both DMD and HDAC8 are linked to intellectual disability, with DMD additionally implicated in attention-deficit hyperactivity disorder (ADHD) and extra-pyramidal syndrome (EPS). The DMD gene was the only gene to reach a significant enrichment p-value (0.01) when comparing rare deletions in probands against unaffected family members specifically for females. This finding suggests a potential sex-specific effect of rare deletions in the DMD gene, with females exhibiting a significant enrichment compared to unaffected family members. Our previous genomic studies of CNVs ³⁴ further support the importance of rare deletions in DMD.

We also applied a case-only sdMAF analysis in a complementary way to the traditional case-control association analysis. This analysis pointed out a significant peak overlapping FGF13⁹¹ with variants in this gene being involved in infantile-onset developmental and epileptic encephalopathy, which can be important associated features of ASD.

In summary, our XWAS study of individuals with ASD and controls has generated significant new data that further validate the roles of specific genes in autism and unveil novel candidates for future research. Our approach, utilizing XWAS ’common variant’ analyses alongside parallel ’rare variant’ examinations of the same samples, provides a unique paradigm for dissecting the genomic architecture involved in ASD and potentially other complex conditions. Additionally, while the development of an X-chromosome-based Polygenic Risk Score (X-PRS) is of interest, it is beyond the scope of this paper and may require new methodologies²⁹.

Data Availability

All data produced are available online at https://research.mss.ng/ https://base.sfari.org https://www.internationalgenome.org https://www.cgen.ca/hostseq-databank-access-request MGRB genomic data is deposited at the European Genome-Phenome Archive under study ID EGAS00001003511

https://research.mss.ng/

https://base.sfari.org

https://www.internationalgenome.org

https://www.cgen.ca/hostseq-databank-access-request

Declaration of interests

At the time of this study and its publication, S.W.S. served on the Scientific Advisory Committee of Population Bio. Intellectual property from aspects of his research held at The Hospital for Sick Children are licensed to Athena Diagnostics and Population Bio. These relationships did not influence data interpretation or presentation during this study but are disclosed for potential future considerations.

Web resources

Approved researchers can obtain the MSSNG dataset by applying at https://research.mss.ng/; and the SSC and SPARK datasets at https://base.sfari.org. 1000 genomes data is publicly available at https://www.internationalgenome.org/, HostSeq data can be also available after applying at https://www.cgen.ca/hostseq-databank-access-request and MGRB genomic data is deposited at the European Genome-Phenome Archive under study ID EGAS00001003511.

Author contributions

Conceptualization: M.M.A, D.Z.C, T.P.L, L.S, S.W.S

Data curation: M.M.A, W.E, B.Thiruvahindrapuram, B.Trost, J.L.H, G.P, T.N

Formal Analysis: M.M.A, D.Z.C, W.E, E.A.M, S.B.C, D.R

Funding acquisition: S.W.S

Investigation: M.M.A, D.Z.C, W.E, B.Trost,, J.L.H, M.C.L, C.A.B, L.S, S.W.S

Methodology: M.M.A, D.Z.C, W.E, T.P.L, B.Thiruvahindrapuram, G.P, T.N, R.A, N.B.S, E.A.M, N.R.A, S.B.C, D.R

Project administration: J.L.H, L.S, S.W.S

Resources: S.W.S

Software: M.M.A, D.Z.C, T.P.L, L.S

Supervision: L.S, S.W.S

Validation: M.M.A, W.E, N.B.S, C.A.B

Visualization: M.M.A, W.E

Writing – original draft: M.M.A, W.E, B.Trost, M.C.L, L.S, S.W.S

Writing – review & editing: M.M.A, D.Z.C, W.E, T.P.L, B.Thiruvahindrapuram, B.Trost, J.L.H, G.P, T.N, R.A, N.B.S, E.A.M, N.R.A, M.C.L, S.B.C, D.R, C.A.B, L.S, S.W.S

Acknowledgments

We thank the families participating in MSSNG, SSC, and SPARK, as well as the resources provided by Autism Speaks and The Centre for Applied Genomics. M.M.A was supported by the CGEn HostSeq/CIHR fellowship (CGE 185054) and the SickKids Restracomp Fellowship. S.B.C was supported in part by the Intramural Research Program of the NIH and the National Institute on Aging (NIA). M.C.L is supported by a Canadian Institutes of Health Research Sex and Gender Science Chair (GSB 171373). S.W.S holds the Northbridge Chair in Pediatric Research at The Hospital for Sick Children and the University of Toronto.

References

1.↵
Zeidan, J., Fombonne, E., Scorah, J., Ibrahim, A., Durkin, M.S., Saxena, S., Yusuf, A., Shih, A., and Elsabbagh, M. (2022). Global prevalence of autism: A systematic review update. Autism Res. 15, 778–790. doi:10.1002/aur.2696
OpenUrl CrossRef PubMed
2.↵
Lord, C., Elsabbagh, M., Baird, G., and Veenstra-Vanderweele, J. (2018). Autism spectrum disorder. The Lancet 392, 508–520. doi:10.1016/S0140-6736(18)31129-2
OpenUrl CrossRef PubMed
3.↵
Loomes, R., Hull, L., and Mandy, W.P.L. (2017). What Is the Male-to-Female Ratio in Autism Spectrum Disorder? A Systematic Review and Meta-Analysis. J. Am. Acad. Child Adolesc. Psychiatry 56,.doi:10.1016/j.jaac.2017.03.013
OpenUrl CrossRef PubMed
4.↵
Maenner, M.J., Warren, Z., Williams, A.R., Amoakohene, E., Bakian, A.V., Bilder, D.A., Durkin, M.S., Fitzgerald, R.T., Furnier, S.M., Hughes, M.M., et al. (2023). Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2020. MMWR Surveill. Summ. 72,. doi:10.15585/mmwr.ss7202a1
OpenUrl CrossRef
5.↵
Lai, M.C., Lombardo, M.V., Auyeung, B., Chakrabarti, B., and Baron-Cohen, S. (2015). Sex/gender differences and autism: setting the scene for future research. J. Am. Acad. Child Adolesc. Psychiatry 54,.doi:10.1016/j.jaac.2014.10.003
OpenUrl CrossRef PubMed
6.↵
Jacquemont, S., Coe, B.P., Hersch, M., Duyzend, M.H., Krumm, N., Bergmann, S., Beckmann, J.S., Rosenfeld, J.A., and Eichler, E.E. (2014). A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am. J. Hum. Genet. 94, 415–425. doi:10.1016/j.ajhg.2014.02.001
OpenUrl CrossRef PubMed
7.↵
Pinto, D., Delaby, E., Merico, D., Barbosa, M., Merikangas, A., Klei, L., Thiruvahindrapuram, B., Xu, X., Ziman, R., Wang, Z., et al. (2014). Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. Am. J. Hum. Genet. 94, 677–694. doi:10.1016/j.ajhg.2014.03.018
OpenUrl CrossRef PubMed
8.↵
Leow, K.Q., Tonta, M.A., Lu, J., Coleman, H.A., and Parkington, H.C. (2024). Towards understanding sex differences in autism spectrum disorders. Brain Res. 1833, 148877. doi:10.1016/j.brainres.2024.148877
OpenUrl CrossRef
9.
Palmer, N., Beam, A., Agniel, D., Eran, A., Manrai, A., Spettell, C., Steinberg, G., Mandl, K., Fox, K., Nelson, S.F., et al. (2017). Association of Sex With Recurrence of Autism Spectrum Disorder Among Siblings. JAMA Pediatr. 171, 1107–1112. doi:10.1001/jamapediatrics.2017.2832
OpenUrl CrossRef PubMed
10.↵
Napolitano, A., Schiavi, S., La Rosa, P., Rossi-Espagnet, M.C., Petrillo, S., Bottino, F., Tagliente, E., Longo, D., Lupi, E., Casula, L., et al. (2022). Sex Differences in Autism Spectrum Disorder: Diagnostic, Neurobiological, and Behavioral Features. Front. Psychiatry 13, 889636. doi:10.3389/fpsyt.2022.889636
OpenUrl CrossRef
11.↵
Zhang, Y., Li, N., Li, C., Zhang, Z., Teng, H., Wang, Y., Zhao, T., Shi, L., Zhang, K., Xia, K., et al. (2020). Genetic evidence of gender difference in autism spectrum disorder supports the female-protective effect. Transl. Psychiatry 10, 4. doi:10.1038/s41398-020-0699-8
OpenUrl CrossRef
12.
Antaki, D., Guevara, J., Maihofer, A.X., Klein, M., Gujral, M., Grove, J., Carey, C.E., Hong, O., Arranz, M.J., Hervas, A., et al. (2022). A phenotypic spectrum of autism is attributable to the combined effects of rare variants, polygenic risk and sex. Nat. Genet. 54,. doi:10.1038/s41588-022-01064-5
OpenUrl CrossRef
13.↵
Warrier, V., Zhang, X., Reed, P., Havdahl, A., Moore, T.M., Cliquet, F., Leblond, C.S., Rolland, T., Rosengren, A., Rowitch, D.H., et al. (2022). Genetic correlates of phenotypic heterogeneity in autism. Nat. Genet. 54,. doi:10.1038/s41588-022-01072-5
OpenUrl CrossRef
14.↵
Wigdor, E.M., Weiner, D.J., Grove, J., Fu, J.M., Thompson, W.K., Carey, C.E., Baya, N., van der Merwe, C., Walters, R.K., Satterstrom, F.K., et al. (2022). The female protective effect against autism spectrum disorder. Cell Genomics 2,. doi:10.1016/j.xgen.2022.100134
OpenUrl CrossRef
15.↵
Elsabbagh, M. (2020). Linking risk factors and outcomes in autism spectrum disorder: is there evidence for resilience? BMJ 368, l6880. doi:10.1136/bmj.l6880
OpenUrl Abstract/FREE Full Text
16.↵
Werling, D.M. (2016). The role of sex-differential biology in risk for autism spectrum disorder. Biol. Sex Differ. 7,. doi:10.1186/s13293-016-0112-8
OpenUrl CrossRef
17.↵
D., Kroll, K.L., Gutmann, D.H., Constantino, J.N., et al. (2022). Can the “female protective effect” liability threshold model explain sex differences in autism spectrum disorder? Neuron 110,. doi:10.1016/j.neuron.2022.06.020
OpenUrl CrossRef
18.↵
Leppa, V.M., Kravitz, S.N., Martin, C.L., Andrieux, J., Le Caignec, C., Martin-Coignard, D., DyBuncio, C., Sanders, S.J., Lowe, J.K., Cantor, R.M., et al. (2016). Rare Inherited and De Novo CNVs Reveal Complex Contributions to ASD Risk in Multiplex Families. Am. J. Hum. Genet. 99, 540–554. doi:10.1016/j.ajhg.2016.06.036
OpenUrl CrossRef PubMed
19.
Han, J., Walters, J.T.R., Kirov, G., Pocklington, A., Escott-Price, V., Owen, M.J., Holmans, P., O’Donovan, M.C., and Rees, E. (2016). Gender differences in CNV burden do not confound schizophrenia CNV associations. Sci. Rep. 6, 25986. doi:10.1038/srep25986
OpenUrl CrossRef
20.↵
Martin, J., Tammimies, K., Karlsson, R., Lu, Y., Larsson, H., Lichtenstein, P., and Magnusson, P.K.E. (2019). Copy number variation and neuropsychiatric problems in females and males in the general population. Am. J. Med. Genet. B Neuropsychiatr. Genet. 180, 341–350. doi:10.1002/ajmg.b.32685
OpenUrl CrossRef
21.↵
Mitra, I., Tsang, K., Ladd-Acosta, C., Croen, L.A., Aldinger, K.A., Hendren, R.L., Traglia, M., Lavillaureix, A., Zaitlen, N., Oldham, M.C., et al. (2016). Pleiotropic Mechanisms Indicated for Sex Differences in Autism. PLoS Genet. 12, e1006425. doi:10.1371/journal.pgen.1006425
OpenUrl CrossRef
22.↵
McCarthy, M.M. (2020). A new view of sexual differentiation of mammalian brain. J. Comp. Physiol. A Neuroethol. Sens. Neural Behav. Physiol. 206, 369–378. doi:10.1007/s00359-019-01376-8
OpenUrl CrossRef PubMed
23.↵
Amestoy, A., Baudrillard, C., Briot, K., Pizano, A., Bouvard, M., and Lai, M.C. (2023). Steroid hormone pathways, vitamin D and autism: a systematic review. J. Neural Transm. 130,. doi:10.1007/s00702-022-02582-6
OpenUrl CrossRef
24.↵
Lenz, K.M., Wright, C.L., Martin, R.C., and McCarthy, M.M. (2011). Prostaglandin E₂ regulates AMPA receptor phosphorylation and promotes membrane insertion in preoptic area neurons and glia during sexual differentiation. PLoS One 6, e18500. doi:10.1371/journal.pone.0018500
OpenUrl CrossRef PubMed
25.↵
VanRyzin, J.W., Marquardt, A.E., Argue, K.J., Vecchiarelli, H.A., Ashton, S.E., Arambula, S.E., Hill, M.N., and McCarthy, M.M. (2019). Microglial Phagocytosis of Newborn Cells Is Induced by Endocannabinoids and Sculpts Sex Differences in Juvenile Rat Social Play. Neuron 102, 435–449.e6. doi:10.1016/j.neuron.2019.02.006
OpenUrl CrossRef PubMed
26.↵
Abrahams, B.S., Arking, D.E., Campbell, D.B., Mefford, H.C., Morrow, E.M., Weiss, L.A., Menashe, I., Wadkins, T., Banerjee-Basu, S., and Packer, A. (2013). SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Mol. Autism 4, 36. doi:10.1186/2040-2392-4-36
OpenUrl CrossRef PubMed
27.↵
Schaaf, C.P., Betancur, C., Yuen, R.K.C., Parr, J.R., Skuse, D.H., Gallagher, L., Bernier, R.A., Buchanan, J.A., Buxbaum, J.D., Chen, C.-A., et al. (2020). A framework for an evidence-based gene list relevant to autism spectrum disorder. Nat. Rev. Genet. 21, 367–376. doi:10.1038/s41576-020-0231-2
OpenUrl CrossRef
28.↵
Hoang, N., Buchanan, J.A., and Scherer, S.W. (2018). Heterogeneity in clinical sequencing tests marketed for autism spectrum disorders. Npj Genomic Medicine 3, 1–4. doi:10.1038/s41525-018-0066-3
OpenUrl CrossRef
29.↵
Sun, L., Wang, Z., Lu, T., Manolio, T. A., & Paterson, A. D. (2023). eXclusionarY: 10 years later, where are the sex chromosomes in GWASs?. American journal of human genetics, 110(6), 903–912. doi:10.1016/j.ajhg.2023.04.009
OpenUrl CrossRef
30.↵
Chung, R.-H., Ma, D., Wang, K., Hedges, D.J., Jaworski, J.M., Gilbert, J.R., Cuccaro, M.L., Wright, H.H., Abramson, R.K., Konidari, I., et al. (2011). An X chromosome-wide association study in autism families identifies TBL1X as a novel autism spectrum disorder candidate gene in males. Mol. Autism 2, 18. doi:10.1186/2040-2392-2-18
OpenUrl CrossRef PubMed
31.↵
Gottipati, S., Arbiza, L., Siepel, A., Clark, A.G., and Keinan, A. (2011). Analyses of X-linked and autosomal genetic variation in population-scale whole genome sequencing. Nat. Genet. 43, 741–743. doi:10.1038/ng.877
OpenUrl CrossRef PubMed
32.↵
Le Guen, Y., Napolioni, V., Belloy, M.E., Yu, E., Krohn, L., Ruskey, J.A., Gan-Or, Z., Kennedy, G., Eger, S.J., and Greicius, M.D. (2021). Common X-Chromosome Variants Are Associated with Parkinson Disease Risk. Ann. Neurol. 90, 22–34. doi:10.1002/ana.26051
OpenUrl CrossRef
33.↵
Gao, F., Chang, D., Biddanda, A., Ma, L., Guo, Y., Zhou, Z., and Keinan, A. (2015). XWAS: A Software Toolset for Genetic Data Analysis and Association Studies of the X Chromosome. J. Hered. 106, 666–671. doi:10.1093/jhered/esv059
OpenUrl CrossRef PubMed
34.↵
Trost, B., Thiruvahindrapuram, B., Chan, A. J. S., Engchuan, W., Higginbotham, E. J., Howe, J. L., Loureiro, L. O., Reuter, M. S., Roshandel, D., Whitney, J., Zarrei, M., Bookman, M., Somerville, C., Shaath, R., Abdi, M., Aliyev, E., Patel, R. V., Nalpathamkalam, T., Pellecchia, G., Hamdan, O., … Scherer, S. W. (2022). Genomic architecture of autism from comprehensive whole-genome sequence annotation. Cell, 185(23), 4409–4427.e18. doi:10.1016/j.cell.2022.10.009
OpenUrl CrossRef
35.↵
C Yuen, R.K., Merico, D., Bookman, M., L Howe, J., Thiruvahindrapuram, B., Patel, R.V., Whitney, J., Deflaux, N., Bingham, J., Wang, Z., et al. (2017). Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 20, 602–611. doi:10.1038/nn.4524
OpenUrl CrossRef PubMed
36.↵
Black, D.W., and Jon E. Grant, M.D., M.P.H., J.D. (2014). DSM-5 Guidebook: The Essential Companion to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (American Psychiatric Pub).
37.↵
Lord, C., Rutter, M., and Le Couteur, A. (1994). Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J. Autism Dev. Disord. 24, 659–685. doi:10.1007/BF02172145
OpenUrl CrossRef PubMed Web of Science
38.↵
Kim, S.H., and Lord, C. (2011). New Autism Diagnostic Interview-Revised Algorithms for Toddlers and Young Preschoolers from 12 to 47 Months of Age. J. Autism Dev. Disord. 42, 82–93. doi:10.1007/s10803-011-1213-1
OpenUrl CrossRef
39.↵
Lord, C., Rutter, M., Goode, S., Heemsbergen, J., Jordan, H., Mawhood, L., and Schopler, E. (1989). Austism diagnostic observation schedule: A standardized observation of communicative and social behavior. J. Autism Dev. Disord. 19, 185–212. doi:10.1007/BF02211841
OpenUrl CrossRef PubMed Web of Science
40.↵
Lord, C., Risi, S., Lambrecht, L., Cook, E.H., Jr., Leventhal, B.L., DiLavore, P.C., Pickles, A., and Rutter, M. (2000). The autism diagnostic observation schedule-generic: a standard measure of social and communication deficits associated with the spectrum of autism. J. Autism Dev. Disord. 30, 205–223.
OpenUrl CrossRef PubMed Web of Science
41.↵
Fischbach, G. D., & Lord, C. (2010). The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron, 68(2), 192–195. doi:10.1016/j.neuron.2010.10.006
OpenUrl CrossRef PubMed Web of Science
42.↵
SPARK Consortium. Electronic address: pfeliciano@simonsfoundation.org, & SPARK Consortium (2018). SPARK: A US Cohort of 50,000 Families to Accelerate Autism Research. Neuron, 97(3), 488–493. doi:10.1016/j.neuron.2018.01.015
OpenUrl CrossRef PubMed
43.↵
Byrska-Bishop, M., Evani, U.S., Zhao, X., Basile, A.O., Abel, H.J., Regier, A.A., Corvelo, A., Clarke, W.E., Musunuri, R., Nagulapalli, K., et al. (2022). High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell 185, 3426–3440.e19. doi:10.1016/j.cell.2022.08.004
OpenUrl CrossRef PubMed
44.↵
Pinese, M., Lacaze, P., Rath, E.M., Stone, A., Brion, M.-J., Ameur, A., Nagpal, S., Puttick, C., Husson, S., Degrave, D., et al. (2020). The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly. Nat. Commun. 11, 1–14. doi:10.1038/s41467-019-14079-0
OpenUrl CrossRef PubMed
45.↵
Yoo, S., Garg, E., Elliott, L.T., Hung, R.J., Halevy, A.R., Brooks, J.D., Bull, S.B., Gagnon, F., Greenwood, C.M.T., Lawless, J.F., et al. (2023). HostSeq: A Canadian Whole Genome Sequencing and Clinical Data Resource.
46.↵
Leal, T.P., French-Kwawu, J.N., Gouveia, M.H., Borda, V., Inca-Martinez, M., Mason, E.A., Horimoto, A.R., Loesch, D.P., Sarihan, E.I., Cornejo-Olivas, M.R., et al. (2023). X-Chromosome Association Study in Latin American Cohorts Identifies New Loci in Parkinson Disease. doi:10.1002/mds.29508
OpenUrl CrossRef
47.↵
Franke, L., de Kovel, C.G.F., Aulchenko, Y.S., Trynka, G., Zhernakova, A., Hunt, K.A., Blauw, H.M., van den Berg, L.H., Ophoff, R., Deloukas, P., et al. (2008). Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. Am. J. Hum. Genet. 82, 1316–1333. doi:10.1016/j.ajhg.2008.05.008
OpenUrl CrossRef PubMed Web of Science
48.↵
Karczewski, K.J., Francioli, L.C., Tiao, G., Cummings, B.B., Alföldi, J., Wang, Q., Collins, R.L., Laricchia, K.M., Ganna, A., Birnbaum, D.P., et al. (2020). The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443. doi:10.1038/s41586-020-2308-7
OpenUrl CrossRef PubMed
49.↵
Manichaikul, A., Mychaleckyj, J.C., Rich, S.S., Daly, K., Sale, M., and Chen, W.-M. (2010). Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873. doi:10.1093/bioinformatics/btq559
OpenUrl CrossRef PubMed Web of Science
50.↵
Leal, T.P., Furlan, V.C., Gouveia, M.H., Saraiva Duarte, J.M., Fonseca, P.A., Tou, R., Scliar, M. de O., Araujo, G.S. de, Costa, L.F., Zolini, C., et al. (2022). NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses. Comput. Struct. Biotechnol. J. 20, 1821–1828. doi:10.1016/j.csbj.2022.04.009
OpenUrl CrossRef
51.↵
Alexander, D.H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. doi:10.1101/gr.094052.109
OpenUrl Abstract/FREE Full Text
52.↵
Fairley, S., Lowy-Gallego, E., Perry, E., and Flicek, P. (2019). The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res. 48, D941–D947. doi:10.1093/nar/gkz836
OpenUrl CrossRef PubMed
53.↵
Price, A.L., Weale, M.E., Patterson, N., Myers, S.R., Need, A.C., Shianna, K.V., Ge, D., Rotter, J.I., Torres, E., Taylor, K.D., et al. (2008). Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83, 132–135; author reply 135–139. doi:10.1016/j.ajhg.2008.06.005
OpenUrl CrossRef PubMed Web of Science
54.
Weale, M.E. (2010). Quality control for genome-wide association studies. Methods Mol. Biol. 628, 341–372. doi:10.1007/978-1-60327-367-1_19
OpenUrl CrossRef PubMed
55.↵
Anderson, C.A., Pettersson, F.H., Clarke, G.M., Cardon, L.R., Morris, A.P., and Zondervan, K.T. (2010). Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573. doi:10.1038/nprot.2010.116
OpenUrl CrossRef PubMed Web of Science
56.↵
Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., and Lee, J.J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi:10.1186/s13742-015-0047-8
OpenUrl CrossRef PubMed
57.↵
Mägi, R., and Morris, A.P. (2010). GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11, 1–6. doi:10.1186/1471-2105-11-288
OpenUrl CrossRef PubMed
58.↵
Magi, R., Lindgren, C.M., and Morris, A.P. (2010). Meta-analysis of sex-specific genome-wide association studies. Genet. Epidemiol. 34, 846–853. doi:10.1002/gepi.20540
OpenUrl CrossRef PubMed
59.↵
Bretherton, C.S., Widmann, M., Dymnikov, V.P., Wallace, J.M., and Bladé, I. (1999). The Effective Number of Spatial Degrees of Freedom of a Time-Varying Field. J. Clim. 12, 1990– 2009.
OpenUrl
60.↵
Chen, D.Z., Roshandel, D., Wang, Z., Sun, L., and Paterson, A.D. (2023). Comprehensive whole-genome analyses of the UK Biobank reveal significant sex differences in both genotype missingness and allele frequency on the X chromosome. Hum. Mol. Genet. ddad 201. doi:10.1093/hmg/ddad201
OpenUrl CrossRef
61.↵
Wang, Z., Sun, L., and Paterson, A.D. (2022). Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD. PLoS Genet. 18, e1010231. doi:10.1371/journal.pgen.1010231
OpenUrl CrossRef
62.↵
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A.R., Bender, D., Maller, J., Sklar, P., de Bakker, P.I.W., Daly, M.J., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795
OpenUrl CrossRef PubMed
63.↵
Trost, B., Walker, S., Wang, Z., Thiruvahindrapuram, B., MacDonald, J. R., Sung, W. W. L., Pereira, S. L., Whitney, J., Chan, A. J. S., Pellecchia, G., Reuter, M. S., Lok, S., Yuen, R. K. C., Marshall, C. R., Merico, D., & Scherer, S. W. (2018). A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data. American journal of human genetics, 102(1), 142–155. doi:10.1016/j.ajhg.2017.12.007
OpenUrl CrossRef PubMed
64.↵
Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164. doi:10.1093/nar/gkq603
OpenUrl CrossRef PubMed
65.↵
Ng, P.C., and Henikoff, S. (2003). SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 31, 3812–3814. doi:10.1093/nar/gkg509
OpenUrl CrossRef PubMed Web of Science
66.↵
Adzhubei, I.A., Schmidt, S., Peshkin, L., Ramensky, V.E., Gerasimova, A., Bork, P., Kondrashov, A.S., and Sunyaev, S.R. (2010). A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249. doi:10.1038/nmeth0410-248
OpenUrl CrossRef PubMed Web of Science
67.↵
Schwarz, J.M., Cooper, D.N., Schuelke, M., and Seelow, D. (2014). MutationTaster2: mutation prediction for the deep-sequencing age. Nat. Methods 11, 361–362. doi:10.1038/nmeth.2890
OpenUrl CrossRef PubMed Web of Science
68.↵
Kircher, M., Witten, D.M., Jain, P., O’Roak, B.J., Cooper, G.M., and Shendure, J. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315. doi:10.1038/ng.2892
OpenUrl CrossRef PubMed
69.↵
Zhu, M., Need, A.C., Han, Y., Ge, D., Maia, J.M., Zhu, Q., Heinzen, E.L., Cirulli, E.T., Pelak, K., He, M., et al. (2012). Using ERDS to infer copy-number variants in high-coverage genomes. Am. J. Hum. Genet. 91, 408–421. doi:10.1016/j.ajhg.2012.07.004
OpenUrl CrossRef PubMed
70.↵
Abyzov, A., Urban, A.E., Snyder, M., and Gerstein, M. (2011). CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984. doi:10.1101/gr.114876.110
OpenUrl Abstract/FREE Full Text
71.↵
Kang, H.J., Kawasawa, Y.I., Cheng, F., Zhu, Y., Xu, X., Li, M., Sousa, A.M.M., Pletikos, M., Meyer, K.A., Sedmak, G., et al. (2011). Spatio-temporal transcriptome of the human brain. Nature 478, 483–489. doi:10.1038/nature10523
OpenUrl CrossRef PubMed Web of Science
72.↵
Watanabe, K., Taskesen, E., and van Bochoven, A. (2017). Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1–11. doi:10.1038/s41467-017-01261-5
OpenUrl CrossRef PubMed
73.↵
de Leeuw, C.A., Mooij, J.M., and Heskes, T. (2015). MAGMA: Generalized Gene-Set Analysis of GWAS Data. PLoS Comput. Biol. 11, e1004219. doi:10.1371/journal.pcbi.1004219
OpenUrl CrossRef PubMed
74.↵
Schubach, M., Maass, T., Nazaretyan, L., Röner, S., and Kircher, M. (2024). CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions. Nucleic Acids Res. 52, D1143–D1154. doi:10.1093/nar/gkad989
OpenUrl CrossRef
75.↵
Dong, S., Zhao, N., Spragins, E., Kagda, M.S., Li, M., Assis, P., Jolanki, O., Luo, Y., Cherry, J.M., Boyle, A.P., et al. (2023). Annotating and prioritizing human non-coding variants with RegulomeDB v.2. Nat. Genet. 55, 724–726. doi:10.1038/s41588-023-01365-3
OpenUrl CrossRef
76.↵
Boyle, A.P., Hong, E.L., Hariharan, M., Cheng, Y., Schaub, M.A., Kasowski, M., Karczewski, K.J., Park, J., Hitz, B.C., Weng, S., et al. (2012). Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 22, 1790–1797. doi:10.1101/gr.137323.112
OpenUrl Abstract/FREE Full Text
77.↵
1000 Genomes Project Consortium, Auton, A., Brooks, L. D., Durbin, R. M., Garrison, E. P., Kang, H. M., Korbel, J. O., Marchini, J. L., McCarthy, S., McVean, G. A., & Abecasis, G. R. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. doi:10.1038/nature15393
OpenUrl CrossRef PubMed
78.↵
Day, F.R., Loh, P.-R., Scott, R.A., Ong, K.K., and Perry, J.R.B. (2016). A Robust Example of Collider Bias in a Genetic Association Study. Am. J. Hum. Genet. 98, 392–393. doi:10.1016/j.ajhg.2015.12.019
OpenUrl CrossRef
79.↵
Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi:10.1038/ng1847
OpenUrl CrossRef PubMed Web of Science
80.↵
Zarrei, M., Burton, C.L., Engchuan, W., Young, E.J., Higginbotham, E.J., MacDonald, of genomic copy number variation across neurodevelopmental disorders. NPJ Genomic Medicine 4,. doi:10.1038/s41525-019-0098-3
OpenUrl CrossRef
81.
D’Abate, L., Walker, S., Yuen, R.K.C., Tammimies, K., Buchanan, J.A., Davies, R.W., Thiruvahindrapuram, B., Wei, J., Brian, J., Bryson, S.E., et al. (2019). Predictive impact of rare genomic copy number variations in siblings of individuals with autism spectrum disorders. Nat. Commun. 10,. doi:10.1038/s41467-019-13380-2
OpenUrl CrossRef
82.
Woodbury-Smith, M., Zarrei, M., Wei, J., Thiruvahindrapuram, B., O’Connor, I., Paterson, A.D., Yuen, R.K.C., Dastan, J., Stavropoulos, D.J., Howe, J.L., et al. (2020). Segregating patterns of copy number variations in extended autism spectrum disorder (ASD) pedigrees. Am. J. Med. Genet. B Neuropsychiatr. Genet. 183,. doi:10.1002/ajmg.b.32785
OpenUrl CrossRef
83.↵
Zarrei, M., Burton, C.L., Engchuan, W., Higginbotham, E.J., Wei, J., Shaikh, S., Roslin, N.M., MacDonald, J.R., Pellecchia, G., Nalpathamkalam, T., et al. (2023). Gene copy number variation and pediatric mental health/neurodevelopment in a general population. Hum. Mol. Genet. 32,. doi:10.1093/hmg/ddad074
OpenUrl CrossRef
84.↵
Grove, J., Ripke, S., Als, T.D., Mattheisen, M., Walters, R.K., Won, H., Pallesen, J., Agerbo, E., Andreassen, O.A., Anney, R., et al. (2019). Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444. doi:10.1038/s41588-019-0344-8
OpenUrl CrossRef PubMed
85.↵
Sollis, E., Mosaku, A., Abid, A., Buniello, A., Cerezo, M., Gil, L., Groza, T., Güneş, O., Hall, P., Hayhurst, J., et al. (2023). The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985. doi:10.1093/nar/gkac1010
OpenUrl CrossRef
86.↵
Ruth, K.S., Day, F.R., Tyrrell, J., Thompson, D.J., Wood, A.R., Mahajan, A., Beaumont, R.N., Wittemans, L., Martin, S., Busch, A.S., et al. (2020). Using human genetics to understand the disease impacts of testosterone in men and women. Nat. Med. 26, 252–258. doi:10.1038/s41591-020-0751-5
OpenUrl CrossRef
87.↵
Bilder, D.A., Worsham, W., Sullivan, S., Sean Esplin, M., Burghardt, P., Fraser, A., and Bakian, A.V. (2023). Sex-specific and sex-independent steroid-related biomarkers in early second trimester maternal serum associated with autism. Mol. Autism 14,. doi:10.1186/s13229-023-00562-5
OpenUrl CrossRef
88.↵
Piton, A., Gauthier, J., Hamdan, F.F., Lafrenière, R.G., Yang, Y., Henrion, E., Laurent, S., Noreau, A., Thibodeau, P., Karemera, L., et al. (2011). Systematic resequencing of X-chromosome synaptic genes in autism spectrum disorder and schizophrenia. Mol. Psychiatry 16, 867. doi:10.1038/mp.2010.54
OpenUrl CrossRef PubMed Web of Science
89.↵
Joel Ross, P., Zhang, W.-B., Mok, R.S.F., Zaslavsky, K., Deneault, E., D’Abate, L., Rodrigues, D.C., Yuen, R.K.C., Faheem, M., Mufteev, M., et al. (2020). Synaptic dysfunction in human neurons with autism-associated deletions in PTCHD1-AS. Biol. Psychiatry 87, 139. doi:10.1016/j.biopsych.2019.07.014
OpenUrl CrossRef
90.↵
Scala, M., Bradley, C.A., Howe, J.L., Trost, B., Salazar, N.B., Shum, C., Reuter, M.S., MacDonald, J.R., Ko, S.Y., Frankland, P.W., et al. (2023). Genetic variants in contribute to Autism Spectrum Disorder associated with the Xp22.11 locus. medRxiv. doi:10.1101/2023.12.21.23300383
OpenUrl Abstract/FREE Full Text
91.↵
Fry, A. E., Marra, C., Derrick, A. V., Pickrell, W. O., Higgins, A. T., Te Water Naude, J., McClatchey, M. A., Davies, S. J., Metcalfe, K. A., Tan, H. J., Mohanraj, R., Avula, S., Williams, D., Brady, L. I., Mesterman, R., Tarnopolsky, M. A., Zhang, Y., Yang, Y., Wang, X., Genomics England Research Consortium, … Chung, S. K. (2021). Missense variants in the N-terminal domain of the A isoform of FHF2/FGF13 cause an X-linked developmental and epileptic encephalopathy. American journal of human genetics, 108(1), 176–185. doi:10.1016/j.ajhg.2020.10.017
OpenUrl CrossRef

View the discussion thread.

Posted July 18, 2024.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Genetic and Genomic Medicine

Subject Areas

All Articles

Addiction Medicine (376)
Allergy and Immunology (691)
Anesthesia (185)
Cardiovascular Medicine (2781)
Dentistry and Oral Medicine (323)
Dermatology (237)
Emergency Medicine (418)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (989)
Epidemiology (12432)
Forensic Medicine (10)
Gastroenterology (787)
Genetic and Genomic Medicine (4301)
Geriatric Medicine (397)
Health Economics (705)
Health Informatics (2776)
Health Policy (1029)
Health Systems and Quality Improvement (1023)
Hematology (371)
HIV/AIDS (882)
Infectious Diseases (except HIV/AIDS) (13871)
Intensive Care and Critical Care Medicine (820)
Medical Education (406)
Medical Ethics (113)
Nephrology (455)
Neurology (4074)
Nursing (218)
Nutrition (603)
Obstetrics and Gynecology (768)
Occupational and Environmental Health (713)
Oncology (2156)
Ophthalmology (609)
Orthopedics (252)
Otolaryngology (313)
Pain Medicine (257)
Palliative Medicine (79)
Pathology (480)
Pediatrics (1152)
Pharmacology and Therapeutics (479)
Primary Care Research (475)
Psychiatry and Clinical Psychology (3572)
Public and Global Health (6680)
Radiology and Imaging (1457)
Rehabilitation Medicine and Physical Therapy (851)
Respiratory Medicine (889)
Rheumatology (425)
Sexual and Reproductive Health (426)
Sports Medicine (354)
Surgery (467)
Toxicology (57)
Transplantation (194)
Urology (172)

[1] 1.↵
Zeidan, J., Fombonne, E., Scorah, J., Ibrahim, A., Durkin, M.S., Saxena, S., Yusuf, A., Shih, A., and Elsabbagh, M. (2022). Global prevalence of autism: A systematic review update. Autism Res. 15, 778–790. doi:10.1002/aur.2696
OpenUrl CrossRef PubMed

[2] 2.↵
Lord, C., Elsabbagh, M., Baird, G., and Veenstra-Vanderweele, J. (2018). Autism spectrum disorder. The Lancet 392, 508–520. doi:10.1016/S0140-6736(18)31129-2
OpenUrl CrossRef PubMed

[3] 3.↵
Loomes, R., Hull, L., and Mandy, W.P.L. (2017). What Is the Male-to-Female Ratio in Autism Spectrum Disorder? A Systematic Review and Meta-Analysis. J. Am. Acad. Child Adolesc. Psychiatry 56,.doi:10.1016/j.jaac.2017.03.013
OpenUrl CrossRef PubMed

[4] 4.↵
Maenner, M.J., Warren, Z., Williams, A.R., Amoakohene, E., Bakian, A.V., Bilder, D.A., Durkin, M.S., Fitzgerald, R.T., Furnier, S.M., Hughes, M.M., et al. (2023). Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2020. MMWR Surveill. Summ. 72,. doi:10.15585/mmwr.ss7202a1
OpenUrl CrossRef

[5] 5.↵
Lai, M.C., Lombardo, M.V., Auyeung, B., Chakrabarti, B., and Baron-Cohen, S. (2015). Sex/gender differences and autism: setting the scene for future research. J. Am. Acad. Child Adolesc. Psychiatry 54,.doi:10.1016/j.jaac.2014.10.003
OpenUrl CrossRef PubMed

[6] 6.↵
Jacquemont, S., Coe, B.P., Hersch, M., Duyzend, M.H., Krumm, N., Bergmann, S., Beckmann, J.S., Rosenfeld, J.A., and Eichler, E.E. (2014). A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am. J. Hum. Genet. 94, 415–425. doi:10.1016/j.ajhg.2014.02.001
OpenUrl CrossRef PubMed

[7] 7.↵
Pinto, D., Delaby, E., Merico, D., Barbosa, M., Merikangas, A., Klei, L., Thiruvahindrapuram, B., Xu, X., Ziman, R., Wang, Z., et al. (2014). Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. Am. J. Hum. Genet. 94, 677–694. doi:10.1016/j.ajhg.2014.03.018
OpenUrl CrossRef PubMed

[8] 8.↵
Leow, K.Q., Tonta, M.A., Lu, J., Coleman, H.A., and Parkington, H.C. (2024). Towards understanding sex differences in autism spectrum disorders. Brain Res. 1833, 148877. doi:10.1016/j.brainres.2024.148877
OpenUrl CrossRef

[9] 9.
Palmer, N., Beam, A., Agniel, D., Eran, A., Manrai, A., Spettell, C., Steinberg, G., Mandl, K., Fox, K., Nelson, S.F., et al. (2017). Association of Sex With Recurrence of Autism Spectrum Disorder Among Siblings. JAMA Pediatr. 171, 1107–1112. doi:10.1001/jamapediatrics.2017.2832
OpenUrl CrossRef PubMed

[10] 10.↵
Napolitano, A., Schiavi, S., La Rosa, P., Rossi-Espagnet, M.C., Petrillo, S., Bottino, F., Tagliente, E., Longo, D., Lupi, E., Casula, L., et al. (2022). Sex Differences in Autism Spectrum Disorder: Diagnostic, Neurobiological, and Behavioral Features. Front. Psychiatry 13, 889636. doi:10.3389/fpsyt.2022.889636
OpenUrl CrossRef

[11] 11.↵
Zhang, Y., Li, N., Li, C., Zhang, Z., Teng, H., Wang, Y., Zhao, T., Shi, L., Zhang, K., Xia, K., et al. (2020). Genetic evidence of gender difference in autism spectrum disorder supports the female-protective effect. Transl. Psychiatry 10, 4. doi:10.1038/s41398-020-0699-8
OpenUrl CrossRef

[12] 12.
Antaki, D., Guevara, J., Maihofer, A.X., Klein, M., Gujral, M., Grove, J., Carey, C.E., Hong, O., Arranz, M.J., Hervas, A., et al. (2022). A phenotypic spectrum of autism is attributable to the combined effects of rare variants, polygenic risk and sex. Nat. Genet. 54,. doi:10.1038/s41588-022-01064-5
OpenUrl CrossRef

[13] 13.↵
Warrier, V., Zhang, X., Reed, P., Havdahl, A., Moore, T.M., Cliquet, F., Leblond, C.S., Rolland, T., Rosengren, A., Rowitch, D.H., et al. (2022). Genetic correlates of phenotypic heterogeneity in autism. Nat. Genet. 54,. doi:10.1038/s41588-022-01072-5
OpenUrl CrossRef

[14] 14.↵
Wigdor, E.M., Weiner, D.J., Grove, J., Fu, J.M., Thompson, W.K., Carey, C.E., Baya, N., van der Merwe, C., Walters, R.K., Satterstrom, F.K., et al. (2022). The female protective effect against autism spectrum disorder. Cell Genomics 2,. doi:10.1016/j.xgen.2022.100134
OpenUrl CrossRef

[15] 15.↵
Elsabbagh, M. (2020). Linking risk factors and outcomes in autism spectrum disorder: is there evidence for resilience? BMJ 368, l6880. doi:10.1136/bmj.l6880
OpenUrl Abstract/FREE Full Text

[16] 16.↵
Werling, D.M. (2016). The role of sex-differential biology in risk for autism spectrum disorder. Biol. Sex Differ. 7,. doi:10.1186/s13293-016-0112-8
OpenUrl CrossRef

[17] 17.↵
D., Kroll, K.L., Gutmann, D.H., Constantino, J.N., et al. (2022). Can the “female protective effect” liability threshold model explain sex differences in autism spectrum disorder? Neuron 110,. doi:10.1016/j.neuron.2022.06.020
OpenUrl CrossRef

[18] 18.↵
Leppa, V.M., Kravitz, S.N., Martin, C.L., Andrieux, J., Le Caignec, C., Martin-Coignard, D., DyBuncio, C., Sanders, S.J., Lowe, J.K., Cantor, R.M., et al. (2016). Rare Inherited and De Novo CNVs Reveal Complex Contributions to ASD Risk in Multiplex Families. Am. J. Hum. Genet. 99, 540–554. doi:10.1016/j.ajhg.2016.06.036
OpenUrl CrossRef PubMed

[19] 19.
Han, J., Walters, J.T.R., Kirov, G., Pocklington, A., Escott-Price, V., Owen, M.J., Holmans, P., O’Donovan, M.C., and Rees, E. (2016). Gender differences in CNV burden do not confound schizophrenia CNV associations. Sci. Rep. 6, 25986. doi:10.1038/srep25986
OpenUrl CrossRef

[20] 20.↵
Martin, J., Tammimies, K., Karlsson, R., Lu, Y., Larsson, H., Lichtenstein, P., and Magnusson, P.K.E. (2019). Copy number variation and neuropsychiatric problems in females and males in the general population. Am. J. Med. Genet. B Neuropsychiatr. Genet. 180, 341–350. doi:10.1002/ajmg.b.32685
OpenUrl CrossRef

[21] 21.↵
Mitra, I., Tsang, K., Ladd-Acosta, C., Croen, L.A., Aldinger, K.A., Hendren, R.L., Traglia, M., Lavillaureix, A., Zaitlen, N., Oldham, M.C., et al. (2016). Pleiotropic Mechanisms Indicated for Sex Differences in Autism. PLoS Genet. 12, e1006425. doi:10.1371/journal.pgen.1006425
OpenUrl CrossRef

[22] 22.↵
McCarthy, M.M. (2020). A new view of sexual differentiation of mammalian brain. J. Comp. Physiol. A Neuroethol. Sens. Neural Behav. Physiol. 206, 369–378. doi:10.1007/s00359-019-01376-8
OpenUrl CrossRef PubMed

[23] 23.↵
Amestoy, A., Baudrillard, C., Briot, K., Pizano, A., Bouvard, M., and Lai, M.C. (2023). Steroid hormone pathways, vitamin D and autism: a systematic review. J. Neural Transm. 130,. doi:10.1007/s00702-022-02582-6
OpenUrl CrossRef

[24] 24.↵
Lenz, K.M., Wright, C.L., Martin, R.C., and McCarthy, M.M. (2011). Prostaglandin E₂ regulates AMPA receptor phosphorylation and promotes membrane insertion in preoptic area neurons and glia during sexual differentiation. PLoS One 6, e18500. doi:10.1371/journal.pone.0018500
OpenUrl CrossRef PubMed

[25] 25.↵
VanRyzin, J.W., Marquardt, A.E., Argue, K.J., Vecchiarelli, H.A., Ashton, S.E., Arambula, S.E., Hill, M.N., and McCarthy, M.M. (2019). Microglial Phagocytosis of Newborn Cells Is Induced by Endocannabinoids and Sculpts Sex Differences in Juvenile Rat Social Play. Neuron 102, 435–449.e6. doi:10.1016/j.neuron.2019.02.006
OpenUrl CrossRef PubMed

[26] 26.↵
Abrahams, B.S., Arking, D.E., Campbell, D.B., Mefford, H.C., Morrow, E.M., Weiss, L.A., Menashe, I., Wadkins, T., Banerjee-Basu, S., and Packer, A. (2013). SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Mol. Autism 4, 36. doi:10.1186/2040-2392-4-36
OpenUrl CrossRef PubMed

[27] 27.↵
Schaaf, C.P., Betancur, C., Yuen, R.K.C., Parr, J.R., Skuse, D.H., Gallagher, L., Bernier, R.A., Buchanan, J.A., Buxbaum, J.D., Chen, C.-A., et al. (2020). A framework for an evidence-based gene list relevant to autism spectrum disorder. Nat. Rev. Genet. 21, 367–376. doi:10.1038/s41576-020-0231-2
OpenUrl CrossRef

[28] 28.↵
Hoang, N., Buchanan, J.A., and Scherer, S.W. (2018). Heterogeneity in clinical sequencing tests marketed for autism spectrum disorders. Npj Genomic Medicine 3, 1–4. doi:10.1038/s41525-018-0066-3
OpenUrl CrossRef

[29] 29.↵
Sun, L., Wang, Z., Lu, T., Manolio, T. A., & Paterson, A. D. (2023). eXclusionarY: 10 years later, where are the sex chromosomes in GWASs?. American journal of human genetics, 110(6), 903–912. doi:10.1016/j.ajhg.2023.04.009
OpenUrl CrossRef

[30] 30.↵
Chung, R.-H., Ma, D., Wang, K., Hedges, D.J., Jaworski, J.M., Gilbert, J.R., Cuccaro, M.L., Wright, H.H., Abramson, R.K., Konidari, I., et al. (2011). An X chromosome-wide association study in autism families identifies TBL1X as a novel autism spectrum disorder candidate gene in males. Mol. Autism 2, 18. doi:10.1186/2040-2392-2-18
OpenUrl CrossRef PubMed

[31] 31.↵
Gottipati, S., Arbiza, L., Siepel, A., Clark, A.G., and Keinan, A. (2011). Analyses of X-linked and autosomal genetic variation in population-scale whole genome sequencing. Nat. Genet. 43, 741–743. doi:10.1038/ng.877
OpenUrl CrossRef PubMed

[32] 32.↵
Le Guen, Y., Napolioni, V., Belloy, M.E., Yu, E., Krohn, L., Ruskey, J.A., Gan-Or, Z., Kennedy, G., Eger, S.J., and Greicius, M.D. (2021). Common X-Chromosome Variants Are Associated with Parkinson Disease Risk. Ann. Neurol. 90, 22–34. doi:10.1002/ana.26051
OpenUrl CrossRef

[33] 33.↵
Gao, F., Chang, D., Biddanda, A., Ma, L., Guo, Y., Zhou, Z., and Keinan, A. (2015). XWAS: A Software Toolset for Genetic Data Analysis and Association Studies of the X Chromosome. J. Hered. 106, 666–671. doi:10.1093/jhered/esv059
OpenUrl CrossRef PubMed

[34] 34.↵
Trost, B., Thiruvahindrapuram, B., Chan, A. J. S., Engchuan, W., Higginbotham, E. J., Howe, J. L., Loureiro, L. O., Reuter, M. S., Roshandel, D., Whitney, J., Zarrei, M., Bookman, M., Somerville, C., Shaath, R., Abdi, M., Aliyev, E., Patel, R. V., Nalpathamkalam, T., Pellecchia, G., Hamdan, O., … Scherer, S. W. (2022). Genomic architecture of autism from comprehensive whole-genome sequence annotation. Cell, 185(23), 4409–4427.e18. doi:10.1016/j.cell.2022.10.009
OpenUrl CrossRef

[35] 35.↵
C Yuen, R.K., Merico, D., Bookman, M., L Howe, J., Thiruvahindrapuram, B., Patel, R.V., Whitney, J., Deflaux, N., Bingham, J., Wang, Z., et al. (2017). Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 20, 602–611. doi:10.1038/nn.4524
OpenUrl CrossRef PubMed

[36] 36.↵
Black, D.W., and Jon E. Grant, M.D., M.P.H., J.D. (2014). DSM-5 Guidebook: The Essential Companion to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (American Psychiatric Pub).

[37] 37.↵
Lord, C., Rutter, M., and Le Couteur, A. (1994). Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J. Autism Dev. Disord. 24, 659–685. doi:10.1007/BF02172145
OpenUrl CrossRef PubMed Web of Science

[38] 38.↵
Kim, S.H., and Lord, C. (2011). New Autism Diagnostic Interview-Revised Algorithms for Toddlers and Young Preschoolers from 12 to 47 Months of Age. J. Autism Dev. Disord. 42, 82–93. doi:10.1007/s10803-011-1213-1
OpenUrl CrossRef

[39] 39.↵
Lord, C., Rutter, M., Goode, S., Heemsbergen, J., Jordan, H., Mawhood, L., and Schopler, E. (1989). Austism diagnostic observation schedule: A standardized observation of communicative and social behavior. J. Autism Dev. Disord. 19, 185–212. doi:10.1007/BF02211841
OpenUrl CrossRef PubMed Web of Science

[40] 40.↵
Lord, C., Risi, S., Lambrecht, L., Cook, E.H., Jr., Leventhal, B.L., DiLavore, P.C., Pickles, A., and Rutter, M. (2000). The autism diagnostic observation schedule-generic: a standard measure of social and communication deficits associated with the spectrum of autism. J. Autism Dev. Disord. 30, 205–223.
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Fischbach, G. D., & Lord, C. (2010). The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron, 68(2), 192–195. doi:10.1016/j.neuron.2010.10.006
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
SPARK Consortium. Electronic address: pfeliciano@simonsfoundation.org, & SPARK Consortium (2018). SPARK: A US Cohort of 50,000 Families to Accelerate Autism Research. Neuron, 97(3), 488–493. doi:10.1016/j.neuron.2018.01.015
OpenUrl CrossRef PubMed

[43] 43.↵
Byrska-Bishop, M., Evani, U.S., Zhao, X., Basile, A.O., Abel, H.J., Regier, A.A., Corvelo, A., Clarke, W.E., Musunuri, R., Nagulapalli, K., et al. (2022). High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell 185, 3426–3440.e19. doi:10.1016/j.cell.2022.08.004
OpenUrl CrossRef PubMed

[44] 44.↵
Pinese, M., Lacaze, P., Rath, E.M., Stone, A., Brion, M.-J., Ameur, A., Nagpal, S., Puttick, C., Husson, S., Degrave, D., et al. (2020). The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly. Nat. Commun. 11, 1–14. doi:10.1038/s41467-019-14079-0
OpenUrl CrossRef PubMed

[45] 45.↵
Yoo, S., Garg, E., Elliott, L.T., Hung, R.J., Halevy, A.R., Brooks, J.D., Bull, S.B., Gagnon, F., Greenwood, C.M.T., Lawless, J.F., et al. (2023). HostSeq: A Canadian Whole Genome Sequencing and Clinical Data Resource.

[46] 46.↵
Leal, T.P., French-Kwawu, J.N., Gouveia, M.H., Borda, V., Inca-Martinez, M., Mason, E.A., Horimoto, A.R., Loesch, D.P., Sarihan, E.I., Cornejo-Olivas, M.R., et al. (2023). X-Chromosome Association Study in Latin American Cohorts Identifies New Loci in Parkinson Disease. doi:10.1002/mds.29508
OpenUrl CrossRef

[47] 47.↵
Franke, L., de Kovel, C.G.F., Aulchenko, Y.S., Trynka, G., Zhernakova, A., Hunt, K.A., Blauw, H.M., van den Berg, L.H., Ophoff, R., Deloukas, P., et al. (2008). Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. Am. J. Hum. Genet. 82, 1316–1333. doi:10.1016/j.ajhg.2008.05.008
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
Karczewski, K.J., Francioli, L.C., Tiao, G., Cummings, B.B., Alföldi, J., Wang, Q., Collins, R.L., Laricchia, K.M., Ganna, A., Birnbaum, D.P., et al. (2020). The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443. doi:10.1038/s41586-020-2308-7
OpenUrl CrossRef PubMed

[49] 49.↵
Manichaikul, A., Mychaleckyj, J.C., Rich, S.S., Daly, K., Sale, M., and Chen, W.-M. (2010). Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873. doi:10.1093/bioinformatics/btq559
OpenUrl CrossRef PubMed Web of Science

[50] 50.↵
Leal, T.P., Furlan, V.C., Gouveia, M.H., Saraiva Duarte, J.M., Fonseca, P.A., Tou, R., Scliar, M. de O., Araujo, G.S. de, Costa, L.F., Zolini, C., et al. (2022). NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses. Comput. Struct. Biotechnol. J. 20, 1821–1828. doi:10.1016/j.csbj.2022.04.009
OpenUrl CrossRef

[51] 51.↵
Alexander, D.H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. doi:10.1101/gr.094052.109
OpenUrl Abstract/FREE Full Text

[52] 52.↵
Fairley, S., Lowy-Gallego, E., Perry, E., and Flicek, P. (2019). The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res. 48, D941–D947. doi:10.1093/nar/gkz836
OpenUrl CrossRef PubMed

[53] 53.↵
Price, A.L., Weale, M.E., Patterson, N., Myers, S.R., Need, A.C., Shianna, K.V., Ge, D., Rotter, J.I., Torres, E., Taylor, K.D., et al. (2008). Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83, 132–135; author reply 135–139. doi:10.1016/j.ajhg.2008.06.005
OpenUrl CrossRef PubMed Web of Science

[54] 54.
Weale, M.E. (2010). Quality control for genome-wide association studies. Methods Mol. Biol. 628, 341–372. doi:10.1007/978-1-60327-367-1_19
OpenUrl CrossRef PubMed

[55] 55.↵
Anderson, C.A., Pettersson, F.H., Clarke, G.M., Cardon, L.R., Morris, A.P., and Zondervan, K.T. (2010). Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573. doi:10.1038/nprot.2010.116
OpenUrl CrossRef PubMed Web of Science

[56] 56.↵
Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., and Lee, J.J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi:10.1186/s13742-015-0047-8
OpenUrl CrossRef PubMed

[57] 57.↵
Mägi, R., and Morris, A.P. (2010). GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11, 1–6. doi:10.1186/1471-2105-11-288
OpenUrl CrossRef PubMed

[58] 58.↵
Magi, R., Lindgren, C.M., and Morris, A.P. (2010). Meta-analysis of sex-specific genome-wide association studies. Genet. Epidemiol. 34, 846–853. doi:10.1002/gepi.20540
OpenUrl CrossRef PubMed

[59] 59.↵
Bretherton, C.S., Widmann, M., Dymnikov, V.P., Wallace, J.M., and Bladé, I. (1999). The Effective Number of Spatial Degrees of Freedom of a Time-Varying Field. J. Clim. 12, 1990– 2009.
OpenUrl

[60] 60.↵
Chen, D.Z., Roshandel, D., Wang, Z., Sun, L., and Paterson, A.D. (2023). Comprehensive whole-genome analyses of the UK Biobank reveal significant sex differences in both genotype missingness and allele frequency on the X chromosome. Hum. Mol. Genet. ddad 201. doi:10.1093/hmg/ddad201
OpenUrl CrossRef

[61] 61.↵
Wang, Z., Sun, L., and Paterson, A.D. (2022). Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD. PLoS Genet. 18, e1010231. doi:10.1371/journal.pgen.1010231
OpenUrl CrossRef

[62] 62.↵
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A.R., Bender, D., Maller, J., Sklar, P., de Bakker, P.I.W., Daly, M.J., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795
OpenUrl CrossRef PubMed

[63] 63.↵
Trost, B., Walker, S., Wang, Z., Thiruvahindrapuram, B., MacDonald, J. R., Sung, W. W. L., Pereira, S. L., Whitney, J., Chan, A. J. S., Pellecchia, G., Reuter, M. S., Lok, S., Yuen, R. K. C., Marshall, C. R., Merico, D., & Scherer, S. W. (2018). A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data. American journal of human genetics, 102(1), 142–155. doi:10.1016/j.ajhg.2017.12.007
OpenUrl CrossRef PubMed

[64] 64.↵
Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164. doi:10.1093/nar/gkq603
OpenUrl CrossRef PubMed

[65] 65.↵
Ng, P.C., and Henikoff, S. (2003). SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 31, 3812–3814. doi:10.1093/nar/gkg509
OpenUrl CrossRef PubMed Web of Science

[66] 66.↵
Adzhubei, I.A., Schmidt, S., Peshkin, L., Ramensky, V.E., Gerasimova, A., Bork, P., Kondrashov, A.S., and Sunyaev, S.R. (2010). A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249. doi:10.1038/nmeth0410-248
OpenUrl CrossRef PubMed Web of Science

[67] 67.↵
Schwarz, J.M., Cooper, D.N., Schuelke, M., and Seelow, D. (2014). MutationTaster2: mutation prediction for the deep-sequencing age. Nat. Methods 11, 361–362. doi:10.1038/nmeth.2890
OpenUrl CrossRef PubMed Web of Science

[68] 68.↵
Kircher, M., Witten, D.M., Jain, P., O’Roak, B.J., Cooper, G.M., and Shendure, J. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315. doi:10.1038/ng.2892
OpenUrl CrossRef PubMed

[69] 69.↵
Zhu, M., Need, A.C., Han, Y., Ge, D., Maia, J.M., Zhu, Q., Heinzen, E.L., Cirulli, E.T., Pelak, K., He, M., et al. (2012). Using ERDS to infer copy-number variants in high-coverage genomes. Am. J. Hum. Genet. 91, 408–421. doi:10.1016/j.ajhg.2012.07.004
OpenUrl CrossRef PubMed

[70] 70.↵
Abyzov, A., Urban, A.E., Snyder, M., and Gerstein, M. (2011). CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984. doi:10.1101/gr.114876.110
OpenUrl Abstract/FREE Full Text

[71] 71.↵
Kang, H.J., Kawasawa, Y.I., Cheng, F., Zhu, Y., Xu, X., Li, M., Sousa, A.M.M., Pletikos, M., Meyer, K.A., Sedmak, G., et al. (2011). Spatio-temporal transcriptome of the human brain. Nature 478, 483–489. doi:10.1038/nature10523
OpenUrl CrossRef PubMed Web of Science

[72] 72.↵
Watanabe, K., Taskesen, E., and van Bochoven, A. (2017). Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1–11. doi:10.1038/s41467-017-01261-5
OpenUrl CrossRef PubMed

[73] 73.↵
de Leeuw, C.A., Mooij, J.M., and Heskes, T. (2015). MAGMA: Generalized Gene-Set Analysis of GWAS Data. PLoS Comput. Biol. 11, e1004219. doi:10.1371/journal.pcbi.1004219
OpenUrl CrossRef PubMed

[74] 74.↵
Schubach, M., Maass, T., Nazaretyan, L., Röner, S., and Kircher, M. (2024). CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions. Nucleic Acids Res. 52, D1143–D1154. doi:10.1093/nar/gkad989
OpenUrl CrossRef

[75] 75.↵
Dong, S., Zhao, N., Spragins, E., Kagda, M.S., Li, M., Assis, P., Jolanki, O., Luo, Y., Cherry, J.M., Boyle, A.P., et al. (2023). Annotating and prioritizing human non-coding variants with RegulomeDB v.2. Nat. Genet. 55, 724–726. doi:10.1038/s41588-023-01365-3
OpenUrl CrossRef

[76] 76.↵
Boyle, A.P., Hong, E.L., Hariharan, M., Cheng, Y., Schaub, M.A., Kasowski, M., Karczewski, K.J., Park, J., Hitz, B.C., Weng, S., et al. (2012). Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 22, 1790–1797. doi:10.1101/gr.137323.112
OpenUrl Abstract/FREE Full Text

[77] 77.↵
1000 Genomes Project Consortium, Auton, A., Brooks, L. D., Durbin, R. M., Garrison, E. P., Kang, H. M., Korbel, J. O., Marchini, J. L., McCarthy, S., McVean, G. A., & Abecasis, G. R. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. doi:10.1038/nature15393
OpenUrl CrossRef PubMed

[78] 78.↵
Day, F.R., Loh, P.-R., Scott, R.A., Ong, K.K., and Perry, J.R.B. (2016). A Robust Example of Collider Bias in a Genetic Association Study. Am. J. Hum. Genet. 98, 392–393. doi:10.1016/j.ajhg.2015.12.019
OpenUrl CrossRef

[79] 79.↵
Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi:10.1038/ng1847
OpenUrl CrossRef PubMed Web of Science

[80] 80.↵
Zarrei, M., Burton, C.L., Engchuan, W., Young, E.J., Higginbotham, E.J., MacDonald, of genomic copy number variation across neurodevelopmental disorders. NPJ Genomic Medicine 4,. doi:10.1038/s41525-019-0098-3
OpenUrl CrossRef

[81] 81.
D’Abate, L., Walker, S., Yuen, R.K.C., Tammimies, K., Buchanan, J.A., Davies, R.W., Thiruvahindrapuram, B., Wei, J., Brian, J., Bryson, S.E., et al. (2019). Predictive impact of rare genomic copy number variations in siblings of individuals with autism spectrum disorders. Nat. Commun. 10,. doi:10.1038/s41467-019-13380-2
OpenUrl CrossRef

[82] 82.
Woodbury-Smith, M., Zarrei, M., Wei, J., Thiruvahindrapuram, B., O’Connor, I., Paterson, A.D., Yuen, R.K.C., Dastan, J., Stavropoulos, D.J., Howe, J.L., et al. (2020). Segregating patterns of copy number variations in extended autism spectrum disorder (ASD) pedigrees. Am. J. Med. Genet. B Neuropsychiatr. Genet. 183,. doi:10.1002/ajmg.b.32785
OpenUrl CrossRef

[83] 83.↵
Zarrei, M., Burton, C.L., Engchuan, W., Higginbotham, E.J., Wei, J., Shaikh, S., Roslin, N.M., MacDonald, J.R., Pellecchia, G., Nalpathamkalam, T., et al. (2023). Gene copy number variation and pediatric mental health/neurodevelopment in a general population. Hum. Mol. Genet. 32,. doi:10.1093/hmg/ddad074
OpenUrl CrossRef

[84] 84.↵
Grove, J., Ripke, S., Als, T.D., Mattheisen, M., Walters, R.K., Won, H., Pallesen, J., Agerbo, E., Andreassen, O.A., Anney, R., et al. (2019). Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444. doi:10.1038/s41588-019-0344-8
OpenUrl CrossRef PubMed

[85] 85.↵
Sollis, E., Mosaku, A., Abid, A., Buniello, A., Cerezo, M., Gil, L., Groza, T., Güneş, O., Hall, P., Hayhurst, J., et al. (2023). The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985. doi:10.1093/nar/gkac1010
OpenUrl CrossRef

[86] 86.↵
Ruth, K.S., Day, F.R., Tyrrell, J., Thompson, D.J., Wood, A.R., Mahajan, A., Beaumont, R.N., Wittemans, L., Martin, S., Busch, A.S., et al. (2020). Using human genetics to understand the disease impacts of testosterone in men and women. Nat. Med. 26, 252–258. doi:10.1038/s41591-020-0751-5
OpenUrl CrossRef

[87] 87.↵
Bilder, D.A., Worsham, W., Sullivan, S., Sean Esplin, M., Burghardt, P., Fraser, A., and Bakian, A.V. (2023). Sex-specific and sex-independent steroid-related biomarkers in early second trimester maternal serum associated with autism. Mol. Autism 14,. doi:10.1186/s13229-023-00562-5
OpenUrl CrossRef

[88] 88.↵
Piton, A., Gauthier, J., Hamdan, F.F., Lafrenière, R.G., Yang, Y., Henrion, E., Laurent, S., Noreau, A., Thibodeau, P., Karemera, L., et al. (2011). Systematic resequencing of X-chromosome synaptic genes in autism spectrum disorder and schizophrenia. Mol. Psychiatry 16, 867. doi:10.1038/mp.2010.54
OpenUrl CrossRef PubMed Web of Science

[89] 89.↵
Joel Ross, P., Zhang, W.-B., Mok, R.S.F., Zaslavsky, K., Deneault, E., D’Abate, L., Rodrigues, D.C., Yuen, R.K.C., Faheem, M., Mufteev, M., et al. (2020). Synaptic dysfunction in human neurons with autism-associated deletions in PTCHD1-AS. Biol. Psychiatry 87, 139. doi:10.1016/j.biopsych.2019.07.014
OpenUrl CrossRef

[90] 90.↵
Scala, M., Bradley, C.A., Howe, J.L., Trost, B., Salazar, N.B., Shum, C., Reuter, M.S., MacDonald, J.R., Ko, S.Y., Frankland, P.W., et al. (2023). Genetic variants in contribute to Autism Spectrum Disorder associated with the Xp22.11 locus. medRxiv. doi:10.1101/2023.12.21.23300383
OpenUrl Abstract/FREE Full Text

[91] 91.↵
Fry, A. E., Marra, C., Derrick, A. V., Pickrell, W. O., Higgins, A. T., Te Water Naude, J., McClatchey, M. A., Davies, S. J., Metcalfe, K. A., Tan, H. J., Mohanraj, R., Avula, S., Williams, D., Brady, L. I., Mesterman, R., Tarnopolsky, M. A., Zhang, Y., Yang, Y., Wang, X., Genomics England Research Consortium, … Chung, S. K. (2021). Missense variants in the N-terminal domain of the A isoform of FHF2/FGF13 cause an X-linked developmental and epileptic encephalopathy. American journal of human genetics, 108(1), 176–185. doi:10.1016/j.ajhg.2020.10.017
OpenUrl CrossRef