Chromosome X-Wide Common Variant Association Study (XWAS) in Autism Spectrum Disorder ===================================================================================== * Marla Mendes * Desmond Zeya Chen * Worrawat Engchuan * Thiago Peixoto Leal * Bhooma Thiruvahindrapuram * Brett Trost * Jennifer L. Howe * Giovanna Pellecchia * Thomas Nalpathamkalam * Roumiana Alexandrova * Nelson Bautista Salazar * Ethan Alexander McKee * Natalia Rivera Alfaro * Meng-Chuan Lai * Sara Bandres-Ciga * Delnaz Roshandel * Clarrisa A. Bradley * Evdokia Anagnostou * Lei Sun * Stephen W. Scherer ## Abstract Autism Spectrum Disorder (ASD) displays a notable male bias in prevalence. Research into rare (<0.1) genetic variants on the X chromosome has implicated over 20 genes in ASD pathogenesis, such as *MECP2*, *DDX3X*, and *DMD*. The "female protective effect" in ASD suggests that females may require a higher genetic burden to manifest similar symptoms as males, yet the mechanisms remain unclear. Despite technological advances in genomics, the complexity of the biological nature of sex chromosomes leave them underrepresented in genome-wide studies. Here, we conducted an X chromosome-wide association study (XWAS) using whole-genome sequencing data from 6,873 individuals with ASD (82% males) across Autism Speaks MSSNG, Simons Simplex Cohort SSC, and Simons Foundation Powering Autism Research SPARK, alongside 8,981 population controls (43% males). We analyzed 418,652 X-chromosome variants, identifying 59 associated with ASD (p-values 7.9×10⁻⁶ to 1.51×10⁻⁵), surpassing Bonferroni-corrected thresholds. Key findings include significant regions on chrXp22.2 (lead SNP=rs12687599, p=3.57×10⁻⁷) harboring *ASB9*/*ASB11*, and another encompassing *DDX53/PTCHD1-AS* long non-coding RNA (lead SNP=rs5926125, p=9.47×10⁻⁶). When mapping genes within 10kb of the 59 most significantly associated SNPs, 91 genes were found, 17 of which yielded association with ASD (*GRPR*, *AP1S2*, *DDX53*, *HDAC8*, *PCDH19*, *PTCHD1*, *PCDH11X*, *PTCHD1-AS*, *DMD*, *SYAP1*, *CNKSR2*, *GLRA2*, *OFD1*, *CDKL5*, *GPRASP2*, *NXF5*, *SH3KBP1*). *FGF13* emerged as a novel X-linked ASD candidate gene, highlighted by sex-specific differences in minor allele frequencies. These results reveal significant new insights into X chromosome biology in ASD, confirming and nominating genes and pathways for further investigation. ## 1. Introduction Autism Spectrum Disorder (ASD [MIM 209850]) is a neurodevelopmental condition defined by social communication atypicalities, restrictive interests and repetitive sensory–motor behaviors. It is diagnosed in ∼1% of the population worldwide1,2, with a 3-4:1 male:female prevalence ratio3,4. This difference may have demographic and social components, with one example being that some autistic traits, such as restricted interests, may be more normalized in females compared with male individuals, and consequently ASD could be underdiagnosed5. However, there is evidence for a significant biological influence on the sex-differential likelihood of ASD6–11. For example, females with neurodevelopmental disorders, including ASD, tend to have an excess of deleterious autosomal copy number variants (CNVs), and deleterious autosomal single-nucleotide variants (SNVs)6,7,10–13. Variation in steroid hormones and differential gene expression in males and females may also influence ASD likelihood and characteristics8. Another consideration, which may be influenced by the afore-mentioned observations, is that in a family with a son having ASD, the likelihood of a female sibling also being affected is 4.2%, a number that increases to 12.9% if the sib is male14. Collectively the evidence may suggest a hypothetical "female protective effect" whereby females require a quantitatively greater etiologic load than males to exhibit the same degree of clinical presentation of ASD15–17. The sex ratio contribution approaches 1:1 when considering *de novo* mutations affecting presumed ‘penetrant’ autosomal genes and copy number variants (CNVs) in ASD and other neurodevelopmental conditions18–20. However, some studies suggest that the etiology of ASD includes qualitative sex differences, particularly involving genetic variations on the X chromosome.21. Sex hormones, known influencers of typical male and female brain development22, may also contribute to sex-varied penetrance in ASD23. For example, a surge of testosterone in the male fetus, combined with XY chromosomal determinants, may impact the neuroimmune system, affecting dendritic arborization24 and the number of microglia and neurons25, hence contribute to the sex-difference biology of ASD. Currently, there are 23 SFARI26 score 1 and 36 SFARI26 score 2 genes with evidence to be involved in ASD mapping to the X chromosome26. Nine of these reach a sufficient “Evaluation of Autism Gene Link Evidence (EAGLE)” score to be considered definitively involved in more narrowingly-defined ASD27 (the SFARI and EAGLE genes are often used in diagnostic testing panels for ASD)28. Since upwards of 75% of genome-wide studies do not consider rare or common variants (including polygenic score analysis) on the sex chromosomes in their analysis 29, it is anticipated there are additional gene loci to be validated and others to be discovered (Table S1 summarizes the published genome-wide manuscripts examining the X-chromosome). One study has attempted a genetic association test for ASD using common variants on the X chromosome30, finding *TBL1X* as a candidate locus (Table S1). There are, however, complications in studying the X chromosome as it has a lower genetic diversity compared to the autosomes, because, apart from the small pseudoautosomal region, this genomic region does not recombine in males29. Thus, the X chromosome can be more sensitive to evolutionary events, such as sex-bias admixture, bottlenecks and natural selection, and it can have different mutation rates from autosomes31. Moreover, in females, the X-inactivation phenomenon can occur where a random X chromosome copy may be inactive (i.e. X chromosome dosage compensation)29,31,32. The issue of 50% reduced X-chromosome coverage in males (46XY) in microarray and sequencing experiments has also led to the understudy of this important sex chromosome29. Recent development, however, now enables more robust X-wide association studies (XWAS) by dealing with X-specific quality control, statistical tests stratified by sex, estimation of significant thresholds, and accounting for the potential heterogeneity of allelic effect between males and females and chromosome inactivation bias 29,33. Here, we conducted a comprehensive XWAS of ASD from 6,873 ASD individuals (5,639 males and 1,234 females) sourced from three different whole-genome sequencing (WGS) datasets, alongside 8,981 control individuals (3,911 males and 5,070 females), from two additional datasets (Figure 1, Figure S1). ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F1) Figure 1: XWAS workflow. A) Outline of the XWAS pipeline detailing data sources including MSSNG (Autism Speaks), SSC (Simons Simplex Cohort), SPARK (Simons Foundation Powering Autism Research), 1KGP (1000 Genome Project), HostSeq (The Host Genome Sequencing Initiative), and MGRB (Medical Genome Reference Bank). The significance threshold was determined using Bonferroni correction, individually calculated for the Male-XWAS, Female-XWAS, and Both-XWAS approaches. For Meta-XWAS, we used the threshold inferred from the Both-XWAS result. B) Replication and robustness studies conducted. ## 2. Material and methods ### 2.1 Database #### 2.1.1. ASD Datasets The Autism Speaks MSSNG resource34,35 is a dataset of genetic and phenotype information from individuals diagnosed with ASD as well as members of their families 34,35. The affected individuals were diagnosed according to the Diagnostic and Statistical Manual of Mental Disorders (DSM)36, also supported in many individuals by the Autism Diagnostic Interview-Revised (ADI-R)37,38 and/or the Autism Diagnostic Observation Schedule (ADOS) 39,40. The Province of Ontario Neurodevelopmental Network (POND) is part of MSSNG and continues to contribute with new data. We used data from 9,621 individuals for the analysis done here. The Simons Simplex Collection (SSC) includes WGS data from approximately 2,600 ASD simplex families (one affected child plus unaffected parents and siblings)41. The ASD diagnoses were performed following the University of Michigan Autism and Communication Disorders Center guidance to guarantee uniformity across the 12 university-affiliated research clinics involved in this initiative. We used 9,209 ASD participants from SSC in this analysis. Also from SFARI42, the SPARK data (Simons Foundation Powering Autism Research) is an autism research initiative that includes both WES (Whole Exome Sequence) and WGS data from US individuals, besides behaviour and phenotypic information. We used WGS from 12,519 individuals for the X-chromosome analysis. #### 2.1.2. Population/Control Datasets For ancestry inference, we used genetic information from 3,202 samples from 1000 Genomes Project of five different ancestries (Africans, Americans, East Asians, Europeans and South Asians). For this, we used the high-coverage 2020 version released by the New York Genome Center (NYGC) ([https://www.internationalgenome.org/data-portal/data-collection/30x-grch38](https://www.internationalgenome.org/data-portal/data-collection/30x-grch38))43. As ASD-controls we used data from 2,561 samples from the Medical Genome Reference Bank (MGRB)44, which is a WGS dataset from ∼4,000 healthy, elderly Australians44. The MGRB dataset includes most individuals of European ancestry but does not exclude samples from different genetic backgrounds. We also used 9,802 samples from the Host Genome Sequencing Initiative (HostSeq)45 which is a collection of 14 Canadian research studies examining responses to COVID-19. ### 2.2 Quality Control #### 2.2.1 Autosomes After selecting biallelic variants we used the SmartQC software ([https://github.com/ldgh/MosaiQC-public](https://github.com/ldgh/MosaiQC-public)) to perform the basic control quality steps to remove: (i) variants with the chromosome notation equal to “0”, (ii) remove variants with duplicated IDs, (iii) remove variants and samples with missing data greater than 10% (plink --geno 0.1; plink --mind 0.1), (iv) impute sex codes using SNP data through PLINK’s ’--impute-sex --check-sex’ functionality. (v) remove A|T and C|G variants (ambiguous SNPs), (vi) remove 100% heterozygous variants (plink --hardy) and (vii) annotate the variants for dbSNP ID and LiftOver for hg38 if necessary. Using plink --bmerge, we merged the data from MSSNG, SSC, SPARK, MGRB, HostSeq. The merged file had a total of 22,242 samples and 1,407,803 variants (Figure 1, Figure S1). For the XWAS analysis using Principal Components (PCs) based on the autosomal information as covariates for logistic regression, we cleaned our data based on the pipeline of Leal *et al* (2023)46 ([https://github.com/MataLabCCF/XWAS](https://github.com/MataLabCCF/XWAS)) in the merged file with MSSNG, SSC, SPARK, MGRB, and HostSeq. This cleaning pipeline adds the following steps; (i) removal of monomorphic SNPs, or those located in structural variants, using the list of SNPs located in structural variants from Le Guen *et al*. (2021)32 created using Tri-Typer47. (ii) remove of potential probe sites using gnomAD48, and (iii) relationship control using KING49 to calculate the kinship coefficient and NAToRA50 to remove samples with relatedness closer than second degree. After this XWAS cleaning pipeline the autosomal file had 1,075,065 SNPs and 21,089 samples (Figure S1). The final XWAS analysis was restricted to individuals with more than 75% European ancestry. To achieve this, we conducted an ancestry check utilizing ADMIXTURE software51 with five clusters. The reference populations included Europeans, Africans, East Asians, South Asians, and Americans from the 1000 Genomes Project (1KGP)52. After merging our XWAS data with samples from the 1000 Genomes Project (1KGP), which underwent the same quality control process, we obtained a dataset containing 24,291 samples (Figure S1). To enhance data quality for ancestry inference, we conducted a filtering step to exclude variants exhibiting high levels of Linkage Disequilibrium (LD), using the command ’plink --indep-pairwise 100 10 0.1’. Additionally, variants located in regions known to be under recent selection were removed from the dataset53–55. We then ran ADMIXTURE using a total of 131,291 SNPs. #### 2.2.2 X Chromosome After completing the general quality control steps described in section 2.2.1, we separated the variants on the X chromosome (coded as chromosome 23 in PLINK) from those in the pseudoautosomal regions (coded as chromosome 25 in PLINK). This separation was based on a dbSNP reference file. We also applied the XWAS cleaning pipeline (Figure 1)46 for the X chromosome, which includes; (i) selecting the remaining individuals from the autosomal cleaning process, including samples without relatedness greater than second degree and samples with more than 75% of European ancestry, (ii) removal of SNPs following the same parameters used for the autosomes, besides SNPs with differential missingness between ASD individuals and controls with p-values lower than 10-5, (iii) removal of SNPs with differential missingness between males and females with p-values lower than 10-5, (v) heterozygous SNPs found in males were assigned as missing data. For the XWAS logistic regression we used a final of 418,652 X chromosomal variants and 15,499 samples (Figure S1). ### 2.3. XWAS After data cleaning, we conducted the XWAS analysis using two input files. The first file contained autosomal data with 1,075,065 variants, intended for principal component inferences to be used as covariates in the XWAS logistic regression. The second file consisted of X chromosome data with 418,652 variants. Both files contained data from the same 15,854 samples. Among these samples, 9,550 were male (3,911 controls and 5,639 ASD individuals), and 6,304 were females (5,070 controls and 1,234 ASD individuals). The principal component inference was done with the GENESIS package stratified by sex (one PCA for males and one for females), where all samples with standard deviation greater than three from major clusters were defined as being outliers and removed from subsequent analyses. For the primary XWAS analysis, we utilized X chromosome data for principal component analysis (PCA). Additionally, we conducted a replication analysis using data from autosomal chromosomes. The resulting 10 PCs from males only and females only were employed as covariates for Male-XWAS and Female-XWAS, respectively. Non-outlier males and females were merged to create both datasets. Subsequently, this new dataset underwent another PCA, where outliers were detected and excluded based on the same parameters. The 10 resulting PCs from this process were used as covariates in “Both-XWAS” (Figure 1). The final regression analysis was performed using logistic regression in PLINK2 56 (--glm) for three approaches (Figure 1); (i) Male-XWAS: Based on 5,639 ASD male individuals and 3,911 male controls. This approach used the 10 top PCs from males as covariates; (ii) Female-XWAS: Based on 1,234 ASD female individuals and 5,070 female controls. This approach used the 10 top PCs from females as covariates; and (iii) Both-XWAS: Based on 6,873 ASD individuals and 8,981 controls. This approach used the 10 top PCs from both and sex as covariates. We also performed a meta-analysis from the sex-stratified results (Male-XWAS and Female-XWAS) implemented on GWAMA57,58. This result incorporates the "gender\_heterogeneity\_p-value," which is derived from assessing heterogeneity between sex-specific allelic effects. This result incorporates the "gender\_heterogeneity\_p-value," which is derived from assessing heterogeneity between sex-specific allelic effects using one degree of freedom. This test involved analyzing males and females separately in each XWAS. It entailed obtaining male- and female-specific allelic effect estimates in a fixed-effects meta-analysis, followed by testing for heterogeneity between the sexes 58. ### 2.4 X-Chromosome Significance Threshold Given that our association tests are conducted on a single chromosome, the number of effective tests performed is lower compared to a genome-wide analysis. Typically, in genome-wide analyses, the significance threshold is set at p-value < 5×10-8. To determine an appropriate significance threshold for our XWAS analysis, we applied a Bonferroni correction by dividing 0.05 by the number of effective tests 59. The number of effective tests (Neff) was calculated by dividing the squared number of variants by the sum of the R2 correlation coefficients between all variants present in the dataset 46: ![Formula][1] V= Total number of variants L = the R² correlation coefficient between all variants (V) present in the datasets (L is a matrix with size V by V); i and j = Matrix indexes. To generate the R2 matrix among all variants in our dataset, we utilized the command ’plink - -r2 square gz yes-really’. The sum of our corresponding matrix was: Female: 37441288.90; Male: 27721944.23 and Both: 53006792.62. Thus, the respective number of effective tests (Neff): Female: 418,6522/37441288.90 = 4,681.18; Male: 418,6522/ 27721944.23 = 6,322.40; Both: 418,6522/ 53006792.62 = 3,306.54 with the final significance threshold being; Female: 0.05/4,681.18 = 1.07×10-5; Male: 0.05/6,322.40 = 7.9×10-6; Both: 0.05/3,306.54 = 1.51×10-5. ### 2.5. sdMAF Sex differences in allele frequencies were analyzed with the sdMAF software 60,61. We initially split the pseudoautosomal regions (PAR) with the PLINK62 --split-par hg38 command. Since PLINK was not able to properly handle male homozygous in the bed file and simply assigned them all to missing, we bypassed the problem by changing the chromosome code to 22 prior to generating genotype counts. The chromosome number in the ‘gcount’ file was then re-coded back to 23 and subsequently pipelined into the sdMAF software as suggested by the sdMAF documentation61. To select the significant sdMAF results, we utilized the same conservative, Bonferroni-corrected significance level for XWAS-Both analysis (1.51×10-5), given that we are testing the same number of SNPs. ### 2.6 Rare variant analysis We further investigated the impact of rare genetic variations inside the candidate regions identified from the XWAS analysis by comparing the frequency of rare predicted damaging single nucleotide variants (SNVs; gnomAD frequency <0.1%), insertion and deletions smaller than 50bp (indels; gnomAD frequency <0.1%), and exonic copy number deletions (CNV deletions; gnomAD frequency <1%) impacting genes between ASD-probands and family members. The initial reads were aligned to the GRCh38 human genome reference. Small variants (SNVs and Indels) and CNVs were called using GATK and *in-house* CNV calling pipeline, respectively63. Standard output files were generated, including CRAMs for alignment, and VCFs for small variants, and CNVs. Per sample analysis metrics were also generated. The small variant calls were annotated using an ANNOVAR-based pipeline64. Using an in-house script, we filtered high quality small variants that were found in less than 0.1% of gnomAD samples. We then selected only damaging small variants if they result in a stop gain or a frameshift, or, they are nonsynonymous SNVs predicted to be damaging by four different in-silico tools (i.e., sift_score65 <=0.05, polyphen_score66>=0.9, mt_score67>=0.5, and CADD_phred68 >= 15). For this SNVs analysis, besides the WGS data previous described (session 2.1.1), we also used whole exome data (WES) from SPARK, given a final number of 47,840 ASD-probands (79% males), 19,820 ASD-unaffected siblings (47% males), and 63,692 ASD-parents (40% fathers). The deletions were detected using a previously described read depth-based pipeline34,63. We only considered high-quality deletions, which were tagged based on the following criteria; (i) length >= 5kb, ii. called by both ERDS69 and CNVnator70 with at least 50% reciprocally overlapped in length, (ii) having < 70% of its length overlap with repetitive or low complexity regions of the genome (i.e., telomere, centromere, and segmental duplications), and (iii). for the X chromosomal calls in males, CNVs in PAR were filtered out. For the CNV comparison we only used WGS data, and we also included data from the new MSSNG release (MSSNGdb7), resulting in a total of 9,691 ASD-probands (82% males), 5,591 ASD-unaffected siblings (38% males) and 17,470 ASD-parents (50% fathers). For both small variants and deletions, independently, we performed an association analysis using a conditional logistic regression stratifying the test by the family. For sex-combined analysis, we also used sex as covariates. ### 2.7 Brain gene Expression Analysis Exon-averaged gene expression data were obtained from BrainSpan (Allen Brain Atlas)71. With this microarray data, we further applied quantile normalization and standardization across both genes and samples for the comparative analysis. Subsequently, we generated a brain map plot wherein colors ranging from blue (indicating downregulation) to red (indicating upregulation) denote the average expression levels of the selected genes within each brain region. This visualization was created for five developmental stages: Early Fetal (less than 16 weeks), Late Fetal (more than 16 weeks to birth), Early Childhood (birth to three years old), Childhood/Teenage (three years to 20 years), and Adulthood (more than 20 years). ## 3. Results ### 3.1 Association Test After performing the four different XWAS tests (Figure 1), which included sex-stratified tests (Male-XWAS and Female-XWAS), sex-combined mega-analysis (Both-XWAS), and meta-analysis (Meta-XWAS), we identified 59 variants as significant in at least one of the four approaches (Table S2). These variants correspond to a total of 20 risk loci, encompassing 23 genes with variants in high linkage disequilibrium (r^2 > 0.7) with the lead SNP (Table 1). The genomic loci detected from the four XWAS approaches utilized are shown in Table 1. Among these, 42 were found uniquely by a unitary XWAS approach: 27 in the Male-XWAS, five in the Female-XWAS, one in the Both-XWAS (performed with males and females together, using sex as a covariate), and nine in the Meta-XWAS (a meta-analysis of Male-XWAS and Female-XWAS results using GWAMA57 software, because it includes a "meta-analysis using sex-differentiated and sex heterogeneity"57). Additionally, 17 variants showed significant p-values in more than one test (Table S2). Each test underwent visual inspection via histograms and QQ plots, revealing no distortions as indicated by the genomic inflation factor (λ=0.928 - 1.036), which measures systematic bias in the statistical test (Figure S2, Figure 2). Among the 59 variants, 30 exhibit a “gender heterogeneity p-value” (test for heterogeneity between sexes with one degree of freedom)58 below 0.05, all of them in the sex stratified approaches (26 in the Male-XWAS and four in the Female-XWAS), suggesting significant differences in allelic effects between males and females for these variants. Notably, two of these variants, identified in the Male-XWAS within the *ASB11* gene, attained a “gender heterogeneity p-value” of less than 9×10−5. In the presence of heterogeneity in allelic effects between the sexes, a loss of power for sex-combined association tests can occur if the allele has opposite direction of effect in the other sex58. This type of biological phenomena may explain why variants are not detected in Both-XWAS and Meta-XWAS. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F2) Figure 2. ASD-XWAS manhattan and qq plots. Each panel shows a Manhattan plot on the left part and qqPlot on the right part. The graphs result from XWAS testing using 6,873 ASD individuals (5,639 males and 1,234 females) and 8,981 controls (3,911 males and 5,070 females) with a total of 418,652 X chromosomal variants originated from WGS data (46 variants in PAR regions) for (A) Male-XWAS, B) Female-XWAS, C) Both-XWAS, and D) the Meta-XWAS, a meta-analysis from the sex stratified approaches implemented on GWAMA57. View this table: [Table 1.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/T1) Table 1. Genomic Risk Loci. Genomic loci detected from four XWAS analyses (Male-XWAS, Female-XWAS, Both-XWAS, Meta-XWAS). The unique ID as well as the p-value refer to the lead SNPs specified. The 23 genes in the last column are within the gene locus, and encompass variants exhibiting strong linkage disequilibrium with the lead SNP (r2 > 0.7). #### 3.1.2 Robustness Study We performed XWAS analyses using various configurations, including one ASD dataset against all controls, as well as all ASD against each control dataset, to mitigate potential bias stemming from dataset heterogeneity and to conduct robust sanity replications, (Table S3). Consequently, we obtained XWAS results for; (i) MSSNG as cases versus HostSeq and MGRB as controls, (ii) SSC as cases versus HostSeq and MGRB as controls, (iii) SPARK as cases versus HostSeq and MGRB as controls, (iv) MSSNG, SSC, and SPARK as cases versus HostSeq as controls, (v) MSSNG, SSC, and SPARK as cases versus MGRB as controls and (vi) control versus control (sanity test; MGRB was labeled as cases and HostSeq as control). Among the 27 variants exclusively found in males, all replication tests yielded a p-value below 0.05, except for eight variants solely in robustness test "v" (involving all case datasets versus MGRB controls). Notably, these eight variants reside within the first significant region identified in the Male-XWAS, spanning between 15.27 and 15.36 Mb. Even after excluding these eight variants, we retained 19 significant SNPs in this region with a p-value lower than 0.05 across all replication tests. Regarding the five variants in the detected exclusive in Female-XWAS, two did not reach a p-value lower than 0.05 across all robustness tests. One of them, rs749183760 in *ENOX2*, was not captured by test "ii". Additionally, the intergenic SNP rs182249604, located within the first significant genomic region (between 16.7Mb and 17.33Mb), did not yield a p-value lower than 0.05 on test “iii”. However, even after excluding these variants, we still observe significant SNPs in this region, including variants within the *TXLN* gene. Considering the results from the Both-XWAS replications, there is only one significant variant (rs767542284 in *PDHA1*), that also demonstrated a significant p-value in all subset (i-v) analyses; five of nine variants detected in the Meta-XWAS achieved significant p-values in all cohort tests, and these are located within *DMD*, *PABPC1L2A* and *PCDH11X*. When comparing the significant variants detected in both Meta-XWAS and Both-XWAS (8 variants located on *PTCHD1-AS*, *HDAC8*, and *LOC124905257* genes), we replicated five results across all cohort tests (i to v). Notably, the variants that did not reach a significant p-value in all tests include two variants in the *HDAC8* gene (rs5958792, rs73218354) and one intergenic variant (rs5981334), all of which were not replicated only in test "iii" (SPARK versus all control cohorts). All six results identified in both the Female-XWAS and another XWAS (Meta-XWAS, Both-XWAS) were situated within two different genes, *ENOX2* and *HTR2C*. None of these variants achieved a p-value < 0.05 in test ii (SSC versus all controls). Three significant variants were detected in both the Male-XWAS and Meta-XWAS. All three variants had significant p-values in all tests except for one variant (rs12835197 - *PCDH19*) in test "v" (All cases versus MGRB). We performed a sanity check employing logistic regression, where controls were compared against controls (Test “vi”), using MGRB as cases and Hostseq as controls. To fortify the reliability of our findings, we assessed whether our candidate variants yielded non-significant p-values (≥0.05) in this sanity test as well. At least one variant in the genes *ASB9* and *ASB11* from Male-XWAS analysis meet the criteria of the sanity test. Hence, we retained both genes in the final results. In the Female-XWAS results, three out of the five detected variants failed to pass the sanity test, resulting in only the ENOX2 gene being included among the final genes. #### 3.1.3 XWAS replication using Autosomal Principal Components as covariates Our principal component analysis (PCA) focused solely on the X chromosome due to its unique biological features (see Methods section 2.3). Therefore, the top 10 PCs were then considered as covariates46. We also implemented XWAS with autosomal PCs to assess the generalizability of findings (Table S4, Figure S3 and Figure S4). The modified model revealed a total of 58 significant loci spanning over 12 genes: *ASB9*, *ASB11*, *PIGA*, *PCDH19*, *TXLNG*, *HTR2C*, *ENOX2*, *PDHA1*, *PTCHD1-AS*, *DMD*, *HDAC8*, and *PABPC1L2A* (Table 2); 11 of these overlap with the genes detected by the primary analysis using X chromosome-only PCs (Figure S4). The *PIGA* gene was identified exclusively with the autosomal PC model, noting it is located in proximity to *ASB11*(3.8kb) and *ASB9* (48.9kb), which were detected in the Male-XWAS results using the X chromosome PCs. Among the 14 genes discovered by the XWAS using the X chromosome-only PCs only X, *LOC124905257* and *PCDH11X* were not present when using autosomal PCs in the XWAS. View this table: [Table 2.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/T2) Table 2. Significantly associated ASD genes based on our main XWAS and sdMAF results. The values in red are the p-values considered significant based on the specific Bonferroni corrections (Males: 7.9×10-6, Females: 1.07×10-5, Both: 1.51×10-5). The genomic control lambdas observed in the QQ plots ranged from 0.913 to 1.119 (Figure S4). Additionally, the correlation between the XWAS results obtained from X chromosome PCs and autosomal PCs was 0.76 for males and 0.79 for females (Figure S3). Overall, the results from the main analysis are generalizable and robust. #### 3.1.4 Annotation All XWAS results (Male-XWAS, Female-XWAS, Both-XWAS, Meta-XWAS) were annotated using both modules of FUMA72: SNP2GENE and GENE2FUNC. SNP2GENE mapped the genes corresponding to the significantly associated SNPs, while GENE2FUNC annotated gene expression and gene sets from the previously mapped genes. The 59 significant associated variants were mapped (within a 10kb distance) to a total of 93 genes (Table S5). Through the gene-based test conducted using MAGMA73, significant associations were identified for *ASB11* (p-value = 2.87×10-6) in the Male-XWAS (Figure 2, Figure 3-A), where initial SNPs were mapped to 704 genes given a significance threshold defined as 7.1×10-5 (0.05/704). This gene was also mapped in the Female-XWAS analysis, being situated within at least 10kb distance from a significantly associated SNP. Notably, *ASB11* is located within one of the most significant Linkage Disequilibrium (LD) regions identified in the Male-XWAS results, spanning between 15.27 and 15.36Mb, which also encompasses the genes *ASB9* and *PIGA* (Genomic Locus 1-Male-XWAS; Table 1). The corresponding LocusZoom plot, along with the Combined Annotation Dependent Depletion (CADD)68,74 score and RegulomeDB score75,76 plots for this region, are presented in Figure 3-A. The CADD score assesses the deleteriousness of genetic variants, while the RegulomeDB score evaluates their functional significance, aiding in the interpretation of their potential biological effects. When considering only significant SNPs falling internal to the gene rather than within a 10kb range, 13 candidates were identified: *ASB11*, *ASB9*, *DMD*, *ENOX2*, *HDAC8*, *HTR2C*, *LOC124905257*, *PABPC1L2A*, *PCDH11X*, *PCDH19*, *PDHA1*, *PTCHD1-AS* and *TXLNG* (Figure 2, Table 2). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F3) Figure 3. Annotation details for the genomic risk Locus 1\_Male-XWAS and 1\_Both XWAS. A) Details for the genomic risk locus *1_Male-XWAS*. The upper panel shows the LocusZoom plot for the correspondent region with the lead SNP rs12687599 highlighted in purple. The used LD reference panel was Europeans from 1000G data77 for both sexes together. Following the LocusZoom plot, on the left, we provide annotation results displaying CADD and RegulomeDB scores. On the right, the Manhattan plot illustrates the gene-based test computed by MAGMA in FUMA. The SNPs were mapped to 704 protein-coding genes, hence the genome-wide significance threshold (indicated by the red dashed line in the plot) was conservatively set at P = 0.05/704 = 7.10×10-5. B) LocusZoom plot of the genomic locus *1_Both-XWAS*, followed by the CADD and Regulome profiles of the same region. In the sex-stratified analysis, the majority of the SNPs found to have significant association were located in intronic regions, accounting for 68.4% of the Male-XWAS results and 60% of the Female-XWAS results. Within the Both-XWAS results, 45.8% of the SNPs were intergenic, 28.9% were non-coding RNAs, 22.9% were intronic, and an additional 2.4% located in UTR regions. ### 3.2 Sex differences in minor allele frequencies (sdMAF) Evolutionary forces can influence allele frequency on the X chromosome between sexes compared to the autosomes 78,79. To ensure the effectiveness of the quality control process, we have implemented sdMAF60,61 analysis on the same set of genomic data. Subsequently, we removed all sdMAF significant results from the XWAS findings. These signals could be capturing either true biological sex differences or genotyping error, inducing spurious association between ASD and variants. However, the sdMAF results also provided valuable insights. We applied sdMAF separately to ASD individuals and controls cohorts; and we observed scatters of statistically significant variants in both ASD individuals and controls (Figure 4). Single or few scatters were expected to be caused by genotyping error. The results from the region of *FGF13* gene was particularly prominent. Notably, *FGF13* is a previously ASD-associated gene with a SFARI score of 3S (Figure 4). Interestingly, the detection of this region is solely through sdMAF but not via logistic regressions, highlighting the potential of sdMAF being used as a tool for association studies of sex-biased diagnoses. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F4) Figure 4. sdMaf Results. A) Left, the Manhattan plot illustrates the sdMAF p-values obtained from ASD datasets exclusively. Right, the Manhattan plot represents the sdMAF p-values obtained from control datasets only. B) The LocusZoom plot displays the region identified in the sdMAF-cases results, highlighting the gene *FGF13*. The LD reference panel used was Europeans from 1000G data77 for both sexes together. ### 3.3 Rare variants analysis Recognizing the significant role of rare variants in ASD genetic architecture 34,80–83, we checked in the same ASD datasets (MSSNG, SSC and SPARK) for rare predicted damaging small variants (SNV/indels with less than 0.1% of frequency on gnomAD48) and CNV deletions (<1% frequency in gnomAD48) overlapping at least one exon of the 14 significant detected genes (13 from XWAS and one from sdMAF). Among the total of 14 XWAS genes analyzed (Figure 5), 11 exhibited rare predicted damaging SNVs. Among the remaining three genes, two were non-coding RNAs (LOC124905257 at HG38 chrX:20606477:20727481 and *PTCHD1-AS* at HG38 chrX:22193005:23293146), while the third was *PABPC1L2A* (HG38 chrX: 73077276:73079512). In the male frequency comparisons, almost all genes showed a higher frequency of these variants in ASD-probands compared to other family members, except for *PCDH11X* and *PCDH19*. In females, five genes (*ASB11*, *DMD*, *HDAC8*, *PCDH19*, and *HTR2C*) showed a higher frequency in ASD-probands. Combining both sexes, four genes (*ASB11*, *DMD*, *HDAC8*, *HTR2C*, and *FGF13*) showed a higher frequency in ASD-probands. ![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F5.medium.gif) [Figure 5.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F5) Figure 5. Rare Variant Frequency Analysis. The figure compares the frequencies of rare variants among different groups: ASD-Probands (red bars), ASD-Unaffected Siblings (green bars), and ASD-Parents (gray bars). The left panel shows the frequency of rare predicted damaging SNVs (<0.1% frequency in general population) across 11 genes (*ASB9*, *ASB11*, *TXLNG*, *PDHA1*, *PTCHD1-AS*, *DMD*, *HDAC8*, *PCDH11X*, *PCDH19*, *HTR2C*, *ENOX2*, *FGF13*) detected through XWAS common variant data analysis (Table 2). The right panel illustrates the frequency of rare CNV deletions overlapping exons (< 1% frequency in general population), found in four XWAS-genes (*PTCHD1-AS*, *DMD*, *ENOX2*, *FGF13*). In each graph, the corresponding p-value from a conditional logistic regression is shown at the bottom, conducted separately for males, females, and both sexes combined (using “sex” as covariate). We successfully identified rare deletions overlapping exons in the joint ASD datasets for the gene detected in sdMAF (*FGF13*) and for three of the 13 genes from the main XWAS results, including *PTCHD1-AS, DMD*, and *ENOX2* (Figure 5, Table 2). Comparing the frequency of these CNVs in unaffected family members, we observed an enrichment in cases compared to unaffected family members for deletions impacting *PTCHD1-AS* in males, *DMD* and *ENOX2* in females and both sexes combined, and *FGF13* in males and both sexes combined. However, none of the association test results reached a p-value lower than 0.05, but this might be expected because of sample size. ### 3.4 Brain Gene expression analysis Utilizing data from BrainSpan (Allen Brain Atlas)71, we generated a visualization to examine the mean expression patterns of 12 of 14 candidate genes detected in our previous analysis across various brain regions during distinct developmental periods (Figure 6). Data for *LOC124905257* and *PTCHD1-AS* were not available in BrainSpan. In general, the ASD-XWAS candidate genes showed different expression levels in all different time ranges when compared with the plotted controls (Figure 6 last three columns). ![Figure 6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/07/18/2024.07.18.24310640/F6.medium.gif) [Figure 6.](http://medrxiv.org/content/early/2024/07/18/2024.07.18.24310640/F6) Figure 6. Gene Expression by Brain Regions in different development times. Brain map showing the gene expression levels in different parts of the brain in five developmental stages (Early Fetal, Late fetal, Early childhood, Childhood/Teenage and Adulthood). Left to right shows the gene expression levels from all 12 ASD-candidate genes with available expression data (*ASB11, ASB9, DMD, ENOX2, FGF13, HDAC8, HTR2C, PABPC1L2A, PCDH11X, PCDH19, PDHA1, TXLNG*), followed by three genes from Male-XWAS (*ASB11, ASB9, PCDH19*), three genes from Female-XWAS (*TXLNG*, *HTR2C*, *ENOX2*) and the correspondent control comparison with all the ∼800 X chromosome genes in both sexes and also in male brains only and female brains only. The color scales go from blue (downregulated) to red (upregulated). During the early fetal stage, the 12 XWAS genes exhibit up-regulation in the cerebellum, which contrasts with the pattern observed in the Female-XWAS genes, showing notably low expression levels in the same region. In males, XWAS genes in the early fetal stage demonstrate down-regulated expression in the primary motor cortex and the primary visual cortex, alongside up-regulated expression in the prefrontal, primary somatosensory, and posteroventral parietal cortex. In this stage, the most expressed brain regions in Female-XWAS genes include the primary visual, primary auditory, and temporal cortex. Transitioning to the late fetal stage, the most pronounced pattern includes down-regulated expression of Male-XWAS genes across nearly all analyzed brain regions. In contrast, All-XWAS genes exhibit heightened expression in the primary auditory, temporal, and prefrontal cortex. In early childhood, spanning the initial three years of life, a consistent down-regulated expression pattern is observed in the cerebellum across all approaches (All XWAS genes and sex-stratified comparisons). Furthermore, during this phase, the posteroventral parietal cortex displays elevated expression levels for All-XWAS genes. From ages three to 20 (childhood to teenage years), X candidate genes remain downregulated in the cerebellum, while both sex-stratified approaches indicate up-regulation in the primary auditory and visual cortex. Additionally, the prefrontal cortex exhibits high expression levels for Male-XWAS genes. In adulthood (after 20 years), the cerebellum maintains a down regulated pattern for all XWAS genes and for the genes identified in Male-XWAS, while exhibiting slightly higher expression levels in the genes identified in Female-XWAS. Conversely, the prefrontal cortex demonstrates low expression levels for the genes identified in Male-XWAS, with an upregulation pattern observed in the genes identified in Female-XWAS. ## 4. Discussion Our XWAS analyses identified 59 SNP variants on the X chromosome that exhibited a statistically significant association with ASD (Table S2). These variants were mapped to 91 distinct genes, of which 11 had previously been associated with ASD through the detection of rare variants or CNVs, as reported in databases (Table S5). Out of the 59 significant variants identified in the main analysis, 35 were also successfully detected in our robustness study (Table S3), spanning all five different tests. Among these, 33 variants passed the sanity test by not reaching a significant value in the Control vs Control test “vi”. These 33 X-Chromosome variants were located in intergenic regions as well as in the genes *ASB9*, *ASB11*, *PDHA1*, *LOC124905257*, *PTCHD1*-*AS*, *HDAC8*, *PABPC1L2A*, and *PCDH11X* (Table 2).These new results will increase our understanding of the genes involved in ASD and provide a basis for improving polygenic risk scores (PRS), which currently are significantly underpowered regarding ASD34,84. In the Male-XWAS results we detected an LD region encompassing the genes *ASB9*, *ASB11* and *PIGA*. The lead SNP, rs12687599, is reported in the GWAS catalog85 for being associated with sex hormone-binding globulin levels86. Autism was previously associated with a decreased level of maternal serum sex hormone binding globulin87. This finding could imply an etiological association between sex hormone pathways and ASD status particularly in males 16,23. Still in the Male-XWAS, we identified the gene *PCDH19*, also found in the Meta-XWAS. This gene has the highest significance score of 1 in the SFARI database, indicating its significant relevance to ASD26. It is also classified as syndromic, primarily expressed in brain tissue and plays a role in cell adhesion88, suggesting that mutations within it are associated with an increase in ASD likelihood and are consistently linked to neurodevelopmental and neuropsychiatric characteristics beyond those necessary for an ASD diagnosis. The Both-XWAS and Meta-XWAS identified significantly associated variants in the lncRNA *PTCHD1-AS* (*PTCHD1* antisense RNA)89. This gene is part of a complex on chromosome Xp22.11, which also encompasses *DDX53*, placing this locus among the most prevalent and impactful genetic factors for ASD90 and other neurodevelopmental disorders. Ross *et al*., 202189, conducted an analysis compiling data from previously reported variants on *PTCHD1-AS*. They found that 69% of these variants associated with this long non-coding RNA (lncRNA) are linked to ASD or ASD-related features. Consequently, the EAGLE score, a metric evaluating a gene’s relevance to ASD, definitively assigns *PTCHD1-AS* a final score of 17.627. However, despite this association, the functional significance of these variants remains unknown. In the Meta-XWAS we identified significant variants associated with ASD in *DMD* and *HDAC8*. Notably, *HDAC8* was also highlighted in the Both-XWAS results. Both genes carry a syndromic status on the SFARI gene score. Both *DMD* and *HDAC8* are linked to intellectual disability, with *DMD* additionally implicated in attention-deficit hyperactivity disorder (ADHD) and extra-pyramidal syndrome (EPS). The *DMD* gene was the only gene to reach a significant enrichment p-value (0.01) when comparing rare deletions in probands against unaffected family members specifically for females. This finding suggests a potential sex-specific effect of rare deletions in the DMD gene, with females exhibiting a significant enrichment compared to unaffected family members. Our previous genomic studies of CNVs 34 further support the importance of rare deletions in *DMD*. We also applied a case-only sdMAF analysis in a complementary way to the traditional case-control association analysis. This analysis pointed out a significant peak overlapping *FGF13*91 with variants in this gene being involved in infantile-onset developmental and epileptic encephalopathy, which can be important associated features of ASD. In summary, our XWAS study of individuals with ASD and controls has generated significant new data that further validate the roles of specific genes in autism and unveil novel candidates for future research. Our approach, utilizing XWAS ’common variant’ analyses alongside parallel ’rare variant’ examinations of the same samples, provides a unique paradigm for dissecting the genomic architecture involved in ASD and potentially other complex conditions. Additionally, while the development of an X-chromosome-based Polygenic Risk Score (X-PRS) is of interest, it is beyond the scope of this paper and may require new methodologies29. ## Supporting information Supplementary Table 2 [[supplements/310640_file02.xlsx]](pending:yes) Supplementary Table 3 [[supplements/310640_file03.xlsx]](pending:yes) Supplementary Table 4 [[supplements/310640_file04.xlsx]](pending:yes) Supplementary Table 5 [[supplements/310640_file05.xlsx]](pending:yes) Supplementary Table 5 [[supplements/310640_file06.xlsx]](pending:yes) Supplementary Materials [[supplements/310640_file07.pdf]](pending:yes) ## Data Availability All data produced are available online at https://research.mss.ng/ https://base.sfari.org https://www.internationalgenome.org https://www.cgen.ca/hostseq-databank-access-request MGRB genomic data is deposited at the European Genome-Phenome Archive under study ID EGAS00001003511 [https://research.mss.ng/](https://research.mss.ng/) [https://base.sfari.org](https://base.sfari.org) [https://www.internationalgenome.org](https://www.internationalgenome.org) [https://www.cgen.ca/hostseq-databank-access-request](https://www.cgen.ca/hostseq-databank-access-request) ## Declaration of interests At the time of this study and its publication, S.W.S. served on the Scientific Advisory Committee of Population Bio. Intellectual property from aspects of his research held at The Hospital for Sick Children are licensed to Athena Diagnostics and Population Bio. These relationships did not influence data interpretation or presentation during this study but are disclosed for potential future considerations. ## Web resources Approved researchers can obtain the MSSNG dataset by applying at [https://research.mss.ng/](https://research.mss.ng/); and the SSC and SPARK datasets at [https://base.sfari.org](https://base.sfari.org). 1000 genomes data is publicly available at [https://www.internationalgenome.org/](https://www.internationalgenome.org/), HostSeq data can be also available after applying at [https://www.cgen.ca/hostseq-databank-access-request](https://www.cgen.ca/hostseq-databank-access-request) and MGRB genomic data is deposited at the European Genome-Phenome Archive under study ID EGAS00001003511. ## Author contributions Conceptualization: M.M.A, D.Z.C, T.P.L, L.S, S.W.S Data curation: M.M.A, W.E, B.Thiruvahindrapuram, B.Trost, J.L.H, G.P, T.N Formal Analysis: M.M.A, D.Z.C, W.E, E.A.M, S.B.C, D.R Funding acquisition: S.W.S Investigation: M.M.A, D.Z.C, W.E, B.Trost,, J.L.H, M.C.L, C.A.B, L.S, S.W.S Methodology: M.M.A, D.Z.C, W.E, T.P.L, B.Thiruvahindrapuram, G.P, T.N, R.A, N.B.S, E.A.M, N.R.A, S.B.C, D.R Project administration: J.L.H, L.S, S.W.S Resources: S.W.S Software: M.M.A, D.Z.C, T.P.L, L.S Supervision: L.S, S.W.S Validation: M.M.A, W.E, N.B.S, C.A.B Visualization: M.M.A, W.E Writing – original draft: M.M.A, W.E, B.Trost, M.C.L, L.S, S.W.S Writing – review & editing: M.M.A, D.Z.C, W.E, T.P.L, B.Thiruvahindrapuram, B.Trost, J.L.H, G.P, T.N, R.A, N.B.S, E.A.M, N.R.A, M.C.L, S.B.C, D.R, C.A.B, L.S, S.W.S ## Acknowledgments We thank the families participating in MSSNG, SSC, and SPARK, as well as the resources provided by Autism Speaks and The Centre for Applied Genomics. M.M.A was supported by the CGEn HostSeq/CIHR fellowship (CGE 185054) and the SickKids Restracomp Fellowship. S.B.C was supported in part by the Intramural Research Program of the NIH and the National Institute on Aging (NIA). M.C.L is supported by a Canadian Institutes of Health Research Sex and Gender Science Chair (GSB 171373). S.W.S holds the Northbridge Chair in Pediatric Research at The Hospital for Sick Children and the University of Toronto. * Received July 18, 2024. * Revision received July 18, 2024. * Accepted July 18, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/) ## References 1. 1.Zeidan, J., Fombonne, E., Scorah, J., Ibrahim, A., Durkin, M.S., Saxena, S., Yusuf, A., Shih, A., and Elsabbagh, M. (2022). Global prevalence of autism: A systematic review update. Autism Res. 15, 778–790. doi:10.1002/aur.2696 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/aur.2696&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35238171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 2. 2.Lord, C., Elsabbagh, M., Baird, G., and Veenstra-Vanderweele, J. (2018). Autism spectrum disorder. The Lancet 392, 508–520. doi:10.1016/S0140-6736(18)31129-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(18)31129-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30078460&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 3. 3.Loomes, R., Hull, L., and Mandy, W.P.L. (2017). What Is the Male-to-Female Ratio in Autism Spectrum Disorder? A Systematic Review and Meta-Analysis. J. Am. Acad. Child Adolesc. Psychiatry 56,.doi:10.1016/j.jaac.2017.03.013 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaac.2017.03.013&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28545751&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 4. 4.Maenner, M.J., Warren, Z., Williams, A.R., Amoakohene, E., Bakian, A.V., Bilder, D.A., Durkin, M.S., Fitzgerald, R.T., Furnier, S.M., Hughes, M.M., et al. (2023). Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2020. MMWR Surveill. Summ. 72,. doi:10.15585/mmwr.ss7202a1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.15585/mmwr.ss7202a1&link_type=DOI) 5. 5.Lai, M.C., Lombardo, M.V., Auyeung, B., Chakrabarti, B., and Baron-Cohen, S. (2015). Sex/gender differences and autism: setting the scene for future research. J. Am. Acad. Child Adolesc. Psychiatry 54,.doi:10.1016/j.jaac.2014.10.003 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaac.2014.10.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25524786&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 6. 6.Jacquemont, S., Coe, B.P., Hersch, M., Duyzend, M.H., Krumm, N., Bergmann, S., Beckmann, J.S., Rosenfeld, J.A., and Eichler, E.E. (2014). A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am. J. Hum. Genet. 94, 415–425. doi:10.1016/j.ajhg.2014.02.001 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2014.02.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24581740&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 7. 7.Pinto, D., Delaby, E., Merico, D., Barbosa, M., Merikangas, A., Klei, L., Thiruvahindrapuram, B., Xu, X., Ziman, R., Wang, Z., et al. (2014). Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. Am. J. Hum. Genet. 94, 677–694. doi:10.1016/j.ajhg.2014.03.018 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2014.03.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24768552&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 8. 8.Leow, K.Q., Tonta, M.A., Lu, J., Coleman, H.A., and Parkington, H.C. (2024). Towards understanding sex differences in autism spectrum disorders. Brain Res. 1833, 148877. doi:10.1016/j.brainres.2024.148877 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.brainres.2024.148877&link_type=DOI) 9. 9.Palmer, N., Beam, A., Agniel, D., Eran, A., Manrai, A., Spettell, C., Steinberg, G., Mandl, K., Fox, K., Nelson, S.F., et al. (2017). Association of Sex With Recurrence of Autism Spectrum Disorder Among Siblings. JAMA Pediatr. 171, 1107–1112. doi:10.1001/jamapediatrics.2017.2832 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamapediatrics.2017.2832&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28973142&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 10. 10.Napolitano, A., Schiavi, S., La Rosa, P., Rossi-Espagnet, M.C., Petrillo, S., Bottino, F., Tagliente, E., Longo, D., Lupi, E., Casula, L., et al. (2022). Sex Differences in Autism Spectrum Disorder: Diagnostic, Neurobiological, and Behavioral Features. Front. Psychiatry 13, 889636. doi:10.3389/fpsyt.2022.889636 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fpsyt.2022.889636&link_type=DOI) 11. 11.Zhang, Y., Li, N., Li, C., Zhang, Z., Teng, H., Wang, Y., Zhao, T., Shi, L., Zhang, K., Xia, K., et al. (2020). Genetic evidence of gender difference in autism spectrum disorder supports the female-protective effect. Transl. Psychiatry 10, 4. doi:10.1038/s41398-020-0699-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41398-020-0699-8&link_type=DOI) 12. 12.Antaki, D., Guevara, J., Maihofer, A.X., Klein, M., Gujral, M., Grove, J., Carey, C.E., Hong, O., Arranz, M.J., Hervas, A., et al. (2022). A phenotypic spectrum of autism is attributable to the combined effects of rare variants, polygenic risk and sex. Nat. Genet. 54,. doi:10.1038/s41588-022-01064-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-022-01064-5&link_type=DOI) 13. 13.Warrier, V., Zhang, X., Reed, P., Havdahl, A., Moore, T.M., Cliquet, F., Leblond, C.S., Rolland, T., Rosengren, A., Rowitch, D.H., et al. (2022). Genetic correlates of phenotypic heterogeneity in autism. Nat. Genet. 54,. doi:10.1038/s41588-022-01072-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-022-01072-5&link_type=DOI) 14. 14.Wigdor, E.M., Weiner, D.J., Grove, J., Fu, J.M., Thompson, W.K., Carey, C.E., Baya, N., van der Merwe, C., Walters, R.K., Satterstrom, F.K., et al. (2022). The female protective effect against autism spectrum disorder. Cell Genomics 2,. doi:10.1016/j.xgen.2022.100134 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.xgen.2022.100134&link_type=DOI) 15. 15.Elsabbagh, M. (2020). Linking risk factors and outcomes in autism spectrum disorder: is there evidence for resilience? BMJ 368, l6880. doi:10.1136/bmj.l6880 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE4OiIzNjgvamFuMjhfMTAvbDY4ODAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wNy8xOC8yMDI0LjA3LjE4LjI0MzEwNjQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 16. 16.Werling, D.M. (2016). The role of sex-differential biology in risk for autism spectrum disorder. Biol. Sex Differ. 7,. doi:10.1186/s13293-016-0112-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13293-016-0112-8&link_type=DOI) 17. 17. D., Kroll, K.L., Gutmann, D.H., Constantino, J.N., et al. (2022). Can the “female protective effect” liability threshold model explain sex differences in autism spectrum disorder? Neuron 110,. doi:10.1016/j.neuron.2022.06.020 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2022.06.020&link_type=DOI) 18. 18.Leppa, V.M., Kravitz, S.N., Martin, C.L., Andrieux, J., Le Caignec, C., Martin-Coignard, D., DyBuncio, C., Sanders, S.J., Lowe, J.K., Cantor, R.M., et al. (2016). Rare Inherited and De Novo CNVs Reveal Complex Contributions to ASD Risk in Multiplex Families. Am. J. Hum. Genet. 99, 540–554. doi:10.1016/j.ajhg.2016.06.036 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2016.06.036&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27569545&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 19. 19.Han, J., Walters, J.T.R., Kirov, G., Pocklington, A., Escott-Price, V., Owen, M.J., Holmans, P., O’Donovan, M.C., and Rees, E. (2016). Gender differences in CNV burden do not confound schizophrenia CNV associations. Sci. Rep. 6, 25986. doi:10.1038/srep25986 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/srep25986&link_type=DOI) 20. 20.Martin, J., Tammimies, K., Karlsson, R., Lu, Y., Larsson, H., Lichtenstein, P., and Magnusson, P.K.E. (2019). Copy number variation and neuropsychiatric problems in females and males in the general population. Am. J. Med. Genet. B Neuropsychiatr. Genet. 180, 341–350. doi:10.1002/ajmg.b.32685 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ajmg.b.32685&link_type=DOI) 21. 21.Mitra, I., Tsang, K., Ladd-Acosta, C., Croen, L.A., Aldinger, K.A., Hendren, R.L., Traglia, M., Lavillaureix, A., Zaitlen, N., Oldham, M.C., et al. (2016). Pleiotropic Mechanisms Indicated for Sex Differences in Autism. PLoS Genet. 12, e1006425. doi:10.1371/journal.pgen.1006425 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1006425&link_type=DOI) 22. 22.McCarthy, M.M. (2020). A new view of sexual differentiation of mammalian brain. J. Comp. Physiol. A Neuroethol. Sens. Neural Behav. Physiol. 206, 369–378. doi:10.1007/s00359-019-01376-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00359-019-01376-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31705197&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 23. 23.Amestoy, A., Baudrillard, C., Briot, K., Pizano, A., Bouvard, M., and Lai, M.C. (2023). Steroid hormone pathways, vitamin D and autism: a systematic review. J. Neural Transm. 130,. doi:10.1007/s00702-022-02582-6 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00702-022-02582-6&link_type=DOI) 24. 24.Lenz, K.M., Wright, C.L., Martin, R.C., and McCarthy, M.M. (2011). Prostaglandin E₂ regulates AMPA receptor phosphorylation and promotes membrane insertion in preoptic area neurons and glia during sexual differentiation. PLoS One 6, e18500. doi:10.1371/journal.pone.0018500 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0018500&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21490976&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 25. 25.VanRyzin, J.W., Marquardt, A.E., Argue, K.J., Vecchiarelli, H.A., Ashton, S.E., Arambula, S.E., Hill, M.N., and McCarthy, M.M. (2019). Microglial Phagocytosis of Newborn Cells Is Induced by Endocannabinoids and Sculpts Sex Differences in Juvenile Rat Social Play. Neuron 102, 435–449.e6. doi:10.1016/j.neuron.2019.02.006 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2019.02.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30827729&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 26. 26.Abrahams, B.S., Arking, D.E., Campbell, D.B., Mefford, H.C., Morrow, E.M., Weiss, L.A., Menashe, I., Wadkins, T., Banerjee-Basu, S., and Packer, A. (2013). SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Mol. Autism 4, 36. doi:10.1186/2040-2392-4-36 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/2040-2392-4-36&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24090431&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 27. 27.Schaaf, C.P., Betancur, C., Yuen, R.K.C., Parr, J.R., Skuse, D.H., Gallagher, L., Bernier, R.A., Buchanan, J.A., Buxbaum, J.D., Chen, C.-A., et al. (2020). A framework for an evidence-based gene list relevant to autism spectrum disorder. Nat. Rev. Genet. 21, 367–376. doi:10.1038/s41576-020-0231-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-020-0231-2&link_type=DOI) 28. 28.Hoang, N., Buchanan, J.A., and Scherer, S.W. (2018). Heterogeneity in clinical sequencing tests marketed for autism spectrum disorders. Npj Genomic Medicine 3, 1–4. doi:10.1038/s41525-018-0066-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41525-018-0066-3&link_type=DOI) 29. 29.Sun, L., Wang, Z., Lu, T., Manolio, T. A., & Paterson, A. D. (2023). eXclusionarY: 10 years later, where are the sex chromosomes in GWASs?. American journal of human genetics, 110(6), 903–912. doi:10.1016/j.ajhg.2023.04.009 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2023.04.009&link_type=DOI) 30. 30.Chung, R.-H., Ma, D., Wang, K., Hedges, D.J., Jaworski, J.M., Gilbert, J.R., Cuccaro, M.L., Wright, H.H., Abramson, R.K., Konidari, I., et al. (2011). An X chromosome-wide association study in autism families identifies TBL1X as a novel autism spectrum disorder candidate gene in males. Mol. Autism 2, 18. doi:10.1186/2040-2392-2-18 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/2040-2392-2-18&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22050706&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 31. 31.Gottipati, S., Arbiza, L., Siepel, A., Clark, A.G., and Keinan, A. (2011). Analyses of X-linked and autosomal genetic variation in population-scale whole genome sequencing. Nat. Genet. 43, 741–743. doi:10.1038/ng.877 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.877&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21775991&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 32. 32.Le Guen, Y., Napolioni, V., Belloy, M.E., Yu, E., Krohn, L., Ruskey, J.A., Gan-Or, Z., Kennedy, G., Eger, S.J., and Greicius, M.D. (2021). Common X-Chromosome Variants Are Associated with Parkinson Disease Risk. Ann. Neurol. 90, 22–34. doi:10.1002/ana.26051 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ana.26051&link_type=DOI) 33. 33.Gao, F., Chang, D., Biddanda, A., Ma, L., Guo, Y., Zhou, Z., and Keinan, A. (2015). XWAS: A Software Toolset for Genetic Data Analysis and Association Studies of the X Chromosome. J. Hered. 106, 666–671. doi:10.1093/jhered/esv059 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jhered/esv059&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26268243&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 34. 34.Trost, B., Thiruvahindrapuram, B., Chan, A. J. S., Engchuan, W., Higginbotham, E. J., Howe, J. L., Loureiro, L. O., Reuter, M. S., Roshandel, D., Whitney, J., Zarrei, M., Bookman, M., Somerville, C., Shaath, R., Abdi, M., Aliyev, E., Patel, R. V., Nalpathamkalam, T., Pellecchia, G., Hamdan, O., … Scherer, S. W. (2022). Genomic architecture of autism from comprehensive whole-genome sequence annotation. Cell, 185(23), 4409–4427.e18. doi:10.1016/j.cell.2022.10.009 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2022.10.009&link_type=DOI) 35. 35.C Yuen, R.K., Merico, D., Bookman, M., L Howe, J., Thiruvahindrapuram, B., Patel, R.V., Whitney, J., Deflaux, N., Bingham, J., Wang, Z., et al. (2017). Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 20, 602–611. doi:10.1038/nn.4524 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nn.4524&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28263302&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 36. 36.Black, D.W., and Jon E. Grant, M.D., M.P.H., J.D. (2014). DSM-5 Guidebook: The Essential Companion to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (American Psychiatric Pub). 37. 37.Lord, C., Rutter, M., and Le Couteur, A. (1994). Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J. Autism Dev. Disord. 24, 659–685. doi:10.1007/BF02172145 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02172145&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7814313&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1994PM02100009&link_type=ISI) 38. 38.Kim, S.H., and Lord, C. (2011). New Autism Diagnostic Interview-Revised Algorithms for Toddlers and Young Preschoolers from 12 to 47 Months of Age. J. Autism Dev. Disord. 42, 82–93. doi:10.1007/s10803-011-1213-1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10803-011-1213-1&link_type=DOI) 39. 39.Lord, C., Rutter, M., Goode, S., Heemsbergen, J., Jordan, H., Mawhood, L., and Schopler, E. (1989). Austism diagnostic observation schedule: A standardized observation of communicative and social behavior. J. Autism Dev. Disord. 19, 185–212. doi:10.1007/BF02211841 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02211841&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2745388&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1989AC89500001&link_type=ISI) 40. 40.Lord, C., Risi, S., Lambrecht, L., Cook, E.H., Jr., Leventhal, B.L., DiLavore, P.C., Pickles, A., and Rutter, M. (2000). The autism diagnostic observation schedule-generic: a standard measure of social and communication deficits associated with the spectrum of autism. J. Autism Dev. Disord. 30, 205–223. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1005592401947&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11055457&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000089674100004&link_type=ISI) 41. 41.Fischbach, G. D., & Lord, C. (2010). The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron, 68(2), 192–195. doi:10.1016/j.neuron.2010.10.006 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2010.10.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20955926&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000284304300007&link_type=ISI) 42. 42.SPARK Consortium. Electronic address: pfeliciano@simonsfoundation.org, & SPARK Consortium (2018). SPARK: A US Cohort of 50,000 Families to Accelerate Autism Research. Neuron, 97(3), 488–493. doi:10.1016/j.neuron.2018.01.015 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2018.01.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29420931&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 43. 43.Byrska-Bishop, M., Evani, U.S., Zhao, X., Basile, A.O., Abel, H.J., Regier, A.A., Corvelo, A., Clarke, W.E., Musunuri, R., Nagulapalli, K., et al. (2022). High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell 185, 3426–3440.e19. doi:10.1016/j.cell.2022.08.004 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2022.08.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36055201&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 44. 44.Pinese, M., Lacaze, P., Rath, E.M., Stone, A., Brion, M.-J., Ameur, A., Nagpal, S., Puttick, C., Husson, S., Degrave, D., et al. (2020). The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly. Nat. Commun. 11, 1–14. doi:10.1038/s41467-019-14079-0 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-019-13889-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31911652&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 45. 45.Yoo, S., Garg, E., Elliott, L.T., Hung, R.J., Halevy, A.R., Brooks, J.D., Bull, S.B., Gagnon, F., Greenwood, C.M.T., Lawless, J.F., et al. (2023). HostSeq: A Canadian Whole Genome Sequencing and Clinical Data Resource. 46. 46.Leal, T.P., French-Kwawu, J.N., Gouveia, M.H., Borda, V., Inca-Martinez, M., Mason, E.A., Horimoto, A.R., Loesch, D.P., Sarihan, E.I., Cornejo-Olivas, M.R., et al. (2023). X-Chromosome Association Study in Latin American Cohorts Identifies New Loci in Parkinson Disease. doi:10.1002/mds.29508 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mds.29508&link_type=DOI) 47. 47.Franke, L., de Kovel, C.G.F., Aulchenko, Y.S., Trynka, G., Zhernakova, A., Hunt, K.A., Blauw, H.M., van den Berg, L.H., Ophoff, R., Deloukas, P., et al. (2008). Detection, imputation, and association analysis of small deletions and null alleles on oligonucleotide arrays. Am. J. Hum. Genet. 82, 1316–1333. doi:10.1016/j.ajhg.2008.05.008 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2008.05.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18519066&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000256647000009&link_type=ISI) 48. 48.Karczewski, K.J., Francioli, L.C., Tiao, G., Cummings, B.B., Alföldi, J., Wang, Q., Collins, R.L., Laricchia, K.M., Ganna, A., Birnbaum, D.P., et al. (2020). The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443. doi:10.1038/s41586-020-2308-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2308-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32461654&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 49. 49.Manichaikul, A., Mychaleckyj, J.C., Rich, S.S., Daly, K., Sale, M., and Chen, W.-M. (2010). Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873. doi:10.1093/bioinformatics/btq559 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btq559&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20926424&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283919800010&link_type=ISI) 50. 50.Leal, T.P., Furlan, V.C., Gouveia, M.H., Saraiva Duarte, J.M., Fonseca, P.A., Tou, R., Scliar, M. de O., Araujo, G.S. de, Costa, L.F., Zolini, C., et al. (2022). NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses. Comput. Struct. Biotechnol. J. 20, 1821–1828. doi:10.1016/j.csbj.2022.04.009 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.csbj.2022.04.009&link_type=DOI) 51. 51.Alexander, D.H., Novembre, J., and Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664. doi:10.1101/gr.094052.109 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjE5LzkvMTY1NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzE4LzIwMjQuMDcuMTguMjQzMTA2NDAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 52. 52.Fairley, S., Lowy-Gallego, E., Perry, E., and Flicek, P. (2019). The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res. 48, D941–D947. doi:10.1093/nar/gkz836 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz836&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31584097&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 53. 53.Price, A.L., Weale, M.E., Patterson, N., Myers, S.R., Need, A.C., Shianna, K.V., Ge, D., Rotter, J.I., Torres, E., Taylor, K.D., et al. (2008). Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83, 132–135; author reply 135–139. doi:10.1016/j.ajhg.2008.06.005 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2008.06.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18606306&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000257784000020&link_type=ISI) 54. 54.Weale, M.E. (2010). Quality control for genome-wide association studies. Methods Mol. Biol. 628, 341–372. doi:10.1007/978-1-60327-367-1_19 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-1-60327-367-1_19&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20238091&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 55. 55.Anderson, C.A., Pettersson, F.H., Clarke, G.M., Cardon, L.R., Morris, A.P., and Zondervan, K.T. (2010). Data quality control in genetic case-control association studies. Nat. Protoc. 5, 1564–1573. doi:10.1038/nprot.2010.116 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nprot.2010.116&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21085122&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000281569200007&link_type=ISI) 56. 56.Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., and Lee, J.J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi:10.1186/s13742-015-0047-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13742-015-0047-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25722852&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 57. 57.Mägi, R., and Morris, A.P. (2010). GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11, 1–6. doi:10.1186/1471-2105-11-288 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2105-11-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20043860&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 58. 58.Magi, R., Lindgren, C.M., and Morris, A.P. (2010). Meta-analysis of sex-specific genome-wide association studies. Genet. Epidemiol. 34, 846–853. doi:10.1002/gepi.20540 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/gepi.20540&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21104887&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 59. 59.Bretherton, C.S., Widmann, M., Dymnikov, V.P., Wallace, J.M., and Bladé, I. (1999). The Effective Number of Spatial Degrees of Freedom of a Time-Varying Field. J. Clim. 12, 1990– 2009. 60. 60.Chen, D.Z., Roshandel, D., Wang, Z., Sun, L., and Paterson, A.D. (2023). Comprehensive whole-genome analyses of the UK Biobank reveal significant sex differences in both genotype missingness and allele frequency on the X chromosome. Hum. Mol. Genet. ddad 201. doi:10.1093/hmg/ddad201 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddad201&link_type=DOI) 61. 61.Wang, Z., Sun, L., and Paterson, A.D. (2022). Major sex differences in allele frequencies for X chromosomal variants in both the 1000 Genomes Project and gnomAD. PLoS Genet. 18, e1010231. doi:10.1371/journal.pgen.1010231 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1010231&link_type=DOI) 62. 62.Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A.R., Bender, D., Maller, J., Sklar, P., de Bakker, P.I.W., Daly, M.J., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/519795&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17701901&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 63. 63.Trost, B., Walker, S., Wang, Z., Thiruvahindrapuram, B., MacDonald, J. R., Sung, W. W. L., Pereira, S. L., Whitney, J., Chan, A. J. S., Pellecchia, G., Reuter, M. S., Lok, S., Yuen, R. K. C., Marshall, C. R., Merico, D., & Scherer, S. W. (2018). A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data. American journal of human genetics, 102(1), 142–155. doi:10.1016/j.ajhg.2017.12.007 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2017.12.007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 64. 64.Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164. doi:10.1093/nar/gkq603 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkq603&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20601685&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 65. 65.Ng, P.C., and Henikoff, S. (2003). SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 31, 3812–3814. doi:10.1093/nar/gkg509 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkg509&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12824425&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183832900117&link_type=ISI) 66. 66.Adzhubei, I.A., Schmidt, S., Peshkin, L., Ramensky, V.E., Gerasimova, A., Bork, P., Kondrashov, A.S., and Sunyaev, S.R. (2010). A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249. doi:10.1038/nmeth0410-248 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth0410-248&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20354512&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000276150600004&link_type=ISI) 67. 67.Schwarz, J.M., Cooper, D.N., Schuelke, M., and Seelow, D. (2014). MutationTaster2: mutation prediction for the deep-sequencing age. Nat. Methods 11, 361–362. doi:10.1038/nmeth.2890 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth.2890&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24681721&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000333749900007&link_type=ISI) 68. 68.Kircher, M., Witten, D.M., Jain, P., O’Roak, B.J., Cooper, G.M., and Shendure, J. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315. doi:10.1038/ng.2892 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.2892&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24487276&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 69. 69.Zhu, M., Need, A.C., Han, Y., Ge, D., Maia, J.M., Zhu, Q., Heinzen, E.L., Cirulli, E.T., Pelak, K., He, M., et al. (2012). Using ERDS to infer copy-number variants in high-coverage genomes. Am. J. Hum. Genet. 91, 408–421. doi:10.1016/j.ajhg.2012.07.004 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2012.07.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22939633&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 70. 70.Abyzov, A., Urban, A.E., Snyder, M., and Gerstein, M. (2011). CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984. doi:10.1101/gr.114876.110 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjg6IjIxLzYvOTc0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMTgvMjAyNC4wNy4xOC4yNDMxMDY0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 71. 71.Kang, H.J., Kawasawa, Y.I., Cheng, F., Zhu, Y., Xu, X., Li, M., Sousa, A.M.M., Pletikos, M., Meyer, K.A., Sedmak, G., et al. (2011). Spatio-temporal transcriptome of the human brain. Nature 478, 483–489. doi:10.1038/nature10523 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature10523&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22031440&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000296194200036&link_type=ISI) 72. 72.Watanabe, K., Taskesen, E., and van Bochoven, A. (2017). Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1–11. doi:10.1038/s41467-017-01261-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-016-0009-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28169986&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 73. 73.de Leeuw, C.A., Mooij, J.M., and Heskes, T. (2015). MAGMA: Generalized Gene-Set Analysis of GWAS Data. PLoS Comput. Biol. 11, e1004219. doi:10.1371/journal.pcbi.1004219 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1004219&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25885710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 74. 74.Schubach, M., Maass, T., Nazaretyan, L., Röner, S., and Kircher, M. (2024). CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions. Nucleic Acids Res. 52, D1143–D1154. doi:10.1093/nar/gkad989 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkad989&link_type=DOI) 75. 75.Dong, S., Zhao, N., Spragins, E., Kagda, M.S., Li, M., Assis, P., Jolanki, O., Luo, Y., Cherry, J.M., Boyle, A.P., et al. (2023). Annotating and prioritizing human non-coding variants with RegulomeDB v.2. Nat. Genet. 55, 724–726. doi:10.1038/s41588-023-01365-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-023-01365-3&link_type=DOI) 76. 76.Boyle, A.P., Hong, E.L., Hariharan, M., Cheng, Y., Schaub, M.A., Kasowski, M., Karczewski, K.J., Park, J., Hitz, B.C., Weng, S., et al. (2012). Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 22, 1790–1797. doi:10.1101/gr.137323.112 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjk6IjIyLzkvMTc5MCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzA3LzE4LzIwMjQuMDcuMTguMjQzMTA2NDAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 77. 77.1000 Genomes Project Consortium, Auton, A., Brooks, L. D., Durbin, R. M., Garrison, E. P., Kang, H. M., Korbel, J. O., Marchini, J. L., McCarthy, S., McVean, G. A., & Abecasis, G. R. (2015). A global reference for human genetic variation. Nature, 526(7571), 68–74. doi:10.1038/nature15393 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature15393&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26432245&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 78. 78.Day, F.R., Loh, P.-R., Scott, R.A., Ong, K.K., and Perry, J.R.B. (2016). A Robust Example of Collider Bias in a Genetic Association Study. Am. J. Hum. Genet. 98, 392–393. doi:10.1016/j.ajhg.2015.12.019 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2015.12.019&link_type=DOI) 79. 79.Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi:10.1038/ng1847 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng1847&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16862161&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000239325700019&link_type=ISI) 80. 80.Zarrei, M., Burton, C.L., Engchuan, W., Young, E.J., Higginbotham, E.J., MacDonald, of genomic copy number variation across neurodevelopmental disorders. NPJ Genomic Medicine 4,. doi:10.1038/s41525-019-0098-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41525-019-0098-3&link_type=DOI) 81. 81.D’Abate, L., Walker, S., Yuen, R.K.C., Tammimies, K., Buchanan, J.A., Davies, R.W., Thiruvahindrapuram, B., Wei, J., Brian, J., Bryson, S.E., et al. (2019). Predictive impact of rare genomic copy number variations in siblings of individuals with autism spectrum disorders. Nat. Commun. 10,. doi:10.1038/s41467-019-13380-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-019-13380-2&link_type=DOI) 82. 82.Woodbury-Smith, M., Zarrei, M., Wei, J., Thiruvahindrapuram, B., O’Connor, I., Paterson, A.D., Yuen, R.K.C., Dastan, J., Stavropoulos, D.J., Howe, J.L., et al. (2020). Segregating patterns of copy number variations in extended autism spectrum disorder (ASD) pedigrees. Am. J. Med. Genet. B Neuropsychiatr. Genet. 183,. doi:10.1002/ajmg.b.32785 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ajmg.b.32785&link_type=DOI) 83. 83.Zarrei, M., Burton, C.L., Engchuan, W., Higginbotham, E.J., Wei, J., Shaikh, S., Roslin, N.M., MacDonald, J.R., Pellecchia, G., Nalpathamkalam, T., et al. (2023). Gene copy number variation and pediatric mental health/neurodevelopment in a general population. Hum. Mol. Genet. 32,. doi:10.1093/hmg/ddad074 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/hmg/ddad074&link_type=DOI) 84. 84.Grove, J., Ripke, S., Als, T.D., Mattheisen, M., Walters, R.K., Won, H., Pallesen, J., Agerbo, E., Andreassen, O.A., Anney, R., et al. (2019). Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444. doi:10.1038/s41588-019-0344-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0344-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30804558&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) 85. 85.Sollis, E., Mosaku, A., Abid, A., Buniello, A., Cerezo, M., Gil, L., Groza, T., Güneş, O., Hall, P., Hayhurst, J., et al. (2023). The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985. doi:10.1093/nar/gkac1010 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkac1010&link_type=DOI) 86. 86.Ruth, K.S., Day, F.R., Tyrrell, J., Thompson, D.J., Wood, A.R., Mahajan, A., Beaumont, R.N., Wittemans, L., Martin, S., Busch, A.S., et al. (2020). Using human genetics to understand the disease impacts of testosterone in men and women. Nat. Med. 26, 252–258. doi:10.1038/s41591-020-0751-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-0751-5&link_type=DOI) 87. 87.Bilder, D.A., Worsham, W., Sullivan, S., Sean Esplin, M., Burghardt, P., Fraser, A., and Bakian, A.V. (2023). Sex-specific and sex-independent steroid-related biomarkers in early second trimester maternal serum associated with autism. Mol. Autism 14,. doi:10.1186/s13229-023-00562-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13229-023-00562-5&link_type=DOI) 88. 88.Piton, A., Gauthier, J., Hamdan, F.F., Lafrenière, R.G., Yang, Y., Henrion, E., Laurent, S., Noreau, A., Thibodeau, P., Karemera, L., et al. (2011). Systematic resequencing of X-chromosome synaptic genes in autism spectrum disorder and schizophrenia. Mol. Psychiatry 16, 867. doi:10.1038/mp.2010.54 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2010.54&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20479760&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F07%2F18%2F2024.07.18.24310640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000293001800012&link_type=ISI) 89. 89.Joel Ross, P., Zhang, W.-B., Mok, R.S.F., Zaslavsky, K., Deneault, E., D’Abate, L., Rodrigues, D.C., Yuen, R.K.C., Faheem, M., Mufteev, M., et al. (2020). Synaptic dysfunction in human neurons with autism-associated deletions in PTCHD1-AS. Biol. Psychiatry 87, 139. doi:10.1016/j.biopsych.2019.07.014 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.biopsych.2019.07.014&link_type=DOI) 90. 90.Scala, M., Bradley, C.A., Howe, J.L., Trost, B., Salazar, N.B., Shum, C., Reuter, M.S., MacDonald, J.R., Ko, S.Y., Frankland, P.W., et al. (2023). Genetic variants in contribute to Autism Spectrum Disorder associated with the Xp22.11 locus. medRxiv. doi:10.1101/2023.12.21.23300383 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMy4xMi4yMS4yMzMwMDM4M3YxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDcvMTgvMjAyNC4wNy4xOC4yNDMxMDY0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 91. 91.Fry, A. E., Marra, C., Derrick, A. V., Pickrell, W. O., Higgins, A. T., Te Water Naude, J., McClatchey, M. A., Davies, S. J., Metcalfe, K. A., Tan, H. J., Mohanraj, R., Avula, S., Williams, D., Brady, L. I., Mesterman, R., Tarnopolsky, M. A., Zhang, Y., Yang, Y., Wang, X., Genomics England Research Consortium, … Chung, S. K. (2021). Missense variants in the N-terminal domain of the A isoform of FHF2/FGF13 cause an X-linked developmental and epileptic encephalopathy. American journal of human genetics, 108(1), 176–185. doi:10.1016/j.ajhg.2020.10.017 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.AJHG.2020.10.017&link_type=DOI) [1]: /embed/graphic-2.gif