Polygenic scores clarify the relationship between mental health and gender diversity ==================================================================================== * Taylor R. Thomas * Ashton J. Tener * Amy M. Pearlman * Katherine L. Imborek * Ji Seung Yang * John F. Strang * Jacob J. Michaelson ## ABSTRACT **BACKGROUND** Gender diverse individuals are at increased risk for mental health problems, but it is unclear whether this is due to shared environmental or genetic factors. **METHODS** In two SPARK samples, we tested for 16 polygenic scores (PGS) effects on quantitative measures of gender diversity and mental health. In Study 1, *N* = 639 independent adults (59% autistic) reported their mental health with the Adult Self Report and their gender diversity with the Gender Self Report (GSR). The GSR has two dimensions: Binary (degree of identification with the gender opposite that implied by sex designated at birth) and Nonbinary (degree of identification with a gender that is neither male nor female). In Study 2 (*N* = 5, 165), we used categorical gender identity. **RESULTS** In Study 1, neuropsychiatric PGS were positively associated with mental health problems. Externalizing was positively associated with ADHD PGS (*ρ* = 0.12, *p <* 0.001, FDR = 0.10), and Internalizing was positively associated with PGS for depression (*ρ* = 0.08, *p* = 0.04, FDR = 1) and neuroticism (*ρ* = 0.11, *p* = 0.01, FDR = 0.41). Interestingly, we found no associations between gender diversity and neuropsychiatric PGS (80% powered to detect *ρ > ±*0.11). However, the GSR was positively associated with cognitive performance PGS (Binary *ρ* = 0.11, *p <* 0.001, FDR = 0.23 and Nonbinary *ρ* = 0.12, *p <* 0.001, FDR = 0.13). Binary was also positively associated with PGS for non-heterosexual sexual behavior (*ρ* = 0.09, *p* = 0.03, FDR = 0.69). In Study 2, the cognitive performance PGS effect replicated; transgender and non-binary individuals had higher PGS: *t* = 4.16, *p <* 0.001, FDR *<* 0.001. They also had higher risky behavior and anorexia PGS. **CONCLUSIONS** We show that while gender diversity as a trait is positively associated with mental health problems, the strongest PGS associations with gender diversity were with cognitive performance, not neuropsychiatric conditions. ## 1 INTRODUCTION Sex and gender can have major impacts on health [1] (see Table 1 for our definitions). This stems from both extrinsic factors (e.g., healthcare barriers [2, 3]) and biological factors, with sex and gender modulating the underlying molecular mechanisms of disease and well-being [4]. In health research, sex is a more objective and well-defined variable than gender. This is because gender is often experienced on a continuum [5] and is multidimensional with binary and nonbinary dimensions. Gender can be reported through self-endorsement of categorical gender identity labels, like transgender, cisgender, nonbinary, and genderqueer. However, categorical gender identity labels may not be ideal for health research. Gender identity labels are contextually and culturally dependent (i.e., not accessible by all), and they are often nonspecific in their meanings [6]. Furthermore, gender diversity, a fundamental aspect of human diversity, is not only expressed by individuals with gender-diverse identities. People who identify as cisgender also exhibit some variation in dimensional gender diversity [7], but this diversity would be lost in studies that only use categorical gender identity labels. Therefore, parsing datasets based on numerous and nonspecific gender identity labels would erode statistical power for research studies. A continuous, multidimensional characterization of gender that uses simple and widely accessible language will enable health researchers to appropriately incorporate gender diversity. View this table: [Table 1.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/T1) Table 1. Working definitions for the gender-related terms used in this study. Some definitions are from [7] and [54]. Gender diversity is a crucial variable to include in mental health research. Previous studies have reported higher rates of mental health problems in groups that express more gender diversity than the cisgender proportional majority, such as LGBTQ+ individuals [8, 9]. One study found LGBQ+ individuals had higher rates of anxiety, depression, and attempted suicide [10]. Recent work leveraging the All of Us cohort (*N* = 329, 038) found that the LGBQ+ participants had a higher prevalence of neuropsychiatric diagnoses [11]. The exact mechanisms are not fully understood. However, research has shown that poorer mental health is at least partially due to factors related to the experienced adversity from sexual orientation and/or gender diversity. One study found that discrimination and resilience partially mediated negative mental health outcomes in LGBTQ+ college students [12]. Another study found that access to gender-affirming hormone therapy for transgender and gender nonbinary youth was associated with reduced risk of depression and suicidality [13]. To our knowledge, no study has used genetic data, so any possible genetic mechanisms are unknown. Most behaviors are somewhat heritable, so we and others have hypothesized that gender identity and gender diversity are also susceptible to genetic influences. [14]. One twin study of *N* = 4, 426 females estimated the heritability of adult gender expression (i.e., self-reported masculinity and femininity) at 11%, and retrospective childhood gender typicality at 31% [15]. However, the searches for specific loci have been underpowered for gene discovery [16, 17]. Genome-wide association studies (GWAS) of human behavior often uncover many associated loci that each have a small effect size and contribute additively [18]. Loci associated with one trait are often associated with other traits, which suggests that the two traits have a degree of pleiotropy. However, the predictive power of the PGS depends on the GWAS power, which is driven chiefly by sample size. Among the well-powered GWAS, the most proximal trait to gender diversity is the non-heterosexual sexual behavior (NHSB) GWAS [19] performed in *N* = 408, 995 UK Biobank participants. The trait was defined as yes/no response to ever having sex with someone of the same sex (the nuance between same-sex versus same-gender is lost due to the nature of the question). The estimated heritability of NHSB ranged from 8% to 25%. It was positively genetically correlated with several neuropsychiatric conditions and personality traits. However, the interpretation of these genetic correlations is limited because of the confounding with experienced adversity and psychiatric diagnoses. In this study, we investigated whether gender diversity, like NHSB, is genetically associated with other behaviors and if this plays a role in mental health. We invited a subset of participants from SPARK [20] to complete surveys about their mental health and gender identity. SPARK is a national genetic study totalling more than 300,000 participants with and without autism. Previous studies have shown that there is an enrichment of gender diversity in autism [21], and gender diverse individuals were found to have increased levels of clinically relevant autistic traits and increased likelihoods of autism diagnoses [22]. This makes SPARK a logical choice for investigating this topic. In our sample of *N* = 696 (*N* = 639 of European genetic ancestry), we calculated PGS for 16 behavior traits. We also administered two psychometrically valid self-report tools. The first, the Adult Self Report (ASR) [23], measures several mental health outcomes and adaptive behaviors. The second, the Gender Self-Report [24] captures two quantitative dimensions of gender diversity: Binary Gender Diversity, the extent one experiences themselves as the other binary gender (i.e., different from their sex designated at birth), and Nonbinary Gender Diversity, the extent one experiences themselves as neither female nor male. We then sought to answer the following questions: First, are the ASR scores phenotypically associated with the GSR scores? Second, are behavior-related PGS associated with ASR and GSR scores? How are the PGS associations different for the ASR versus the GSR? Lastly, do the PGS findings broadly replicate in a larger sample with a categorical gender identity phenotype instead of the GSR? See Figure 1 for an overview of the study. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F1) Figure 1. Overview of the study. In *N* = 696 (*N* = 639 of European genetic ancestry), dimensional gender diversity was measured using the Gender Self Report (GSR) and mental health was measured using the Adult Self Report (ASR). The GSR and ASR scores were then tested for associations with 16 polygenic scores (PGS) for psychiatric diagnoses, personality, and cognition. We then used categorical gender identity in *N* = 5, 388 (*N* = 5, 165 of European genetic ancestry) to test for the replication of these PGS associations in the larger sample. ## 2 RESULTS ### 2.1 Phenotypic associations between gender diversity and mental health The demographic characteristics of the SPARK Research Match participants are shown in Table 2. The final sample size was *N* = 696 with *N* = 639 of European genetic ancestry. Approximately one third of the cohort identified as transgender or gender nonbinary (TGNB). Fifty-eight percent of the participants were autistic and 22% were male. View this table: [Table 2.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/T2) Table 2. Participant Demographics. Study 1 used the two continuous measures of gender diversity from the Gender Self Report, and Study 2 used categorical gender identity. The two gender diversity scores, Binary and Nonbinary Gender Diversity, were from the Gender Self Report (GSR) factor analysis [24] . GSR scores range from 0 (no gender diversity) to 1 (high gender diversity), with the modes near 0(Figure S1. The scores were adjusted for age, sex designated at birth, and autism by linear regression residualization, and then Z-scaled (*µ* = 0, *σ* = 1). The distributions of these two scores are shown in Figure 2A and are colored by sexual orientation and gender identity. The general trend shows higher gender diversity in LGBQ+ and TGNB participants. Binary and Nonbinary were positively correlated: *ρ* = 0.57, *p <* 0.001 (Figure 2B). ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F2) Figure 2. Distributions and correlations of the Gender Self Report and Adult Self Report scores. **(A)** Distribution of the two dimensional gender diversity measures from the Gender Self Report (GSR) GSR: Binary and Nonbinary Gender Diversity. The sample size was *N* = 696. The GSR scores were adjusted for age, sex, and autism. The histograms are colored by self-endorsed sexual orientation labels (top panel) and gender identity labels (bottom panel). Distributions of the GSR scores before adjusting for age, sex, and autism are shown in Figure S1. **(B)** Correlation of the two GSR scores. **(C)** Correlation of the two mental heath measures from the Adult Self Report (ASR): Externalizing and Internalizing problems. The ASR scores were also adjusted for age, sex, and autism. **(D)** Correlations between the GSR scores and ASR scores. The two mental health scores, Externalizing and Internalizing, were from the Adult Self Report (ASR) [23]. The scores were adjusted for age, sex designated at birth, and autism by linear regression residualization and then Z-scaled. Externalizing and Internalizing were positively correlated: *ρ* = 0.61, *p <* 0.001 (Figure 2C). The ASR scores were positively correlated with the GSR scores (Figure 2D). Binary was more correlated with Internalizing (*ρ* = 0.14, *p <* 0.001) than Externalizing (*ρ* = 0.10, *p* = 0.01). Nonbinary was also more correlated with Internalizing (*ρ* = 0.18, *p <* 0.001) than Externalizing (*ρ* = 0.13, *p <* 0.001). ### 2.2 Polygenic score associations with gender diversity and mental health We next assessed the associations between GSR and ASR scores with 16 polygenic scores (PGS) of behavior (Figure 3). The PGS were adjusted for the 20 principal genetic components and then age, sex designated at birth, and autism. Correlations were run in the European subset (*N* = 639). The sample size of *N* = 639 has 80% power at *α* = 0.05 to detect effects *ρ > ±*0.11, meaning that the absence of significant effects must be carefully interpreted. In addition to these partial correlations, we ran linear regression with age, sex, and autism included as covariates (Table S1). Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25], which is a conservative method for multiple testing correction. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F3) Figure 3. Polygenic score associations with the Gender Self Report and Adult Self Report scores. Polygenic score (PGS) associations with the **(A)** Adult Self Report (ASR) scores and **(B)** Gender Self Report (GSR) scores. The PGS, as well as the GSR and ASR phenotypes, were adjusted for age, sex, and autism prior to correlations by linear regression residualization. The correlations shown here were run in the European subset (*N* = 639). Correlations were also run stratified by autism (Figure S3). The PGS linear model associations with age, sex, and autism included as covariates (as opposed to residualizing the covariates from the variables) are shown in Table S1. The partial correlations not filtered on genetic ancestry (*N* = 696) are shown in Figure S2. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]. PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. The ASR scores had unsurprising correlations with psychiatric PGS. Externalizing was positively correlated with ADHD PGS (*ρ* = 0.12, *p <* 0.001, *FDR* = 0.10). Internalizing was positively correlated with depression PGS (*ρ* = 0.08, *p* = 0.04, *FDR* = 1) and neuroticism PGS (*ρ* = 0.11, *p* = 0.01, *FDR* = 0.41). As expected, non-heterosexual sexual behavior (NHSB) PGS was positively correlated with Binary: *ρ* = 0.09, *p* = 0.03, *FDR* = 0.69. NHSB PGS was also positively correlated with Nonbinary, although the correlation did not reach nominal significance: *ρ* = 0.06, *p* = 0.14, *FDR* = 1. Strikingly, cognitive performance PGS was significantly positively correlated with Binary (*ρ* = 0.11, *p <* 0.001, *FDR* = 0.23) and Nonbinary (*ρ* = 0.12, *p <* 0.001, *FDR* = 0.13), meaning that the polygenic propensity for higher cognitive performance was associated with elevated binary and nonbinary gender diversity. No psychiatric PGS were significantly correlated with the GSR scores, although autism PGS approached nominal significance with Binary (*ρ* = 0.05, *p* = 0.17, *FDR* = 1). The results were comparable when not filtering on genetic ancestry (*N* = 696, see Figure S2). Cognitive performance PGS was positively correlated with Binary *ρ* = 0.09, *p* = 0.02, *FDR* = 0.66) and Nonbinary (*ρ* = 0.11, *p <* 0.001, *FDR* = 0.16). NHSB PGS was positively correlated with Binary: *ρ* = 0.09, *p* = 0.02, *FDR* = 0.66. Autism PGS was positively correlated with Binary: *ρ* = 0.08, *p* = 0.05, *FDR* = 0.82. We tested whether the correlations trended in the same direction when run separately in the autistic subset (*N* = 376), versus those without an autism diagnosis (*N* = 263) (Figure S3). These tests were not well powered, but cognitive performance PGS was positively correlated with Binary in the autistic subset (*ρ* = 0.10, *p* = 0.04, *FDR* = 1) and the non-autistic subset (*ρ* = 0.13, *p* = 0.04, *FDR* = 1), as well as Nonbinary in the autistic subset (*ρ* = 0.12, *p* = 0.02, *FDR* = 1) and the non-autistic subset (*ρ* = 0.08, *p* = 0.19, *FDR* = 1). ### 2.3 Replication of polygenic score associations with categorical gender identity We next tested if the PGS associations with the GSR were similar in a larger sample (*N* = 5, 388, *N* = 5, 165 of European genetic ancestry) with a categorical gender identity phenotype (Study 2). We used the background history files provided by SPARK to label individuals as cisgender or TGNB by using discordance between the participant’s designated sex at birth (options: Male or Female) and their gender (options: Male, Female, or Other) to categorize the participants as TGNB or cisgender. We merged this with Study 1 for a final sample size of *N* = 5, 388, with *N* = 590 from Study 1 and the remaining *N* = 4, 798 from the background history reporting. The mean age for Study 2 was 25 years and 88% of the participants were autistic (Table 2). We tested for PGS differences between the two gender identity groups with t-tests (Figure 4). The strongest effect was observed for cognitive performance PGS, with the TGNB group being significantly higher (*xd* = 0.26 [0.14, 0.39], *t* = 4.16, *p <* 0.001, *FDR <* 0.001). The TGNB group also had significantly higher PGS for risky behavior (*xd* = 0.12 [0.01, 0.23], *t* = 2.12, *p* = 0.03, *FDR* = 0.67) and anorexia (*xd* = 0.12 [0.01, 0.24], *t* = 2.09, *p* = 0.04, *FDR* = 0.67). NHSB PGS was close to nominal significance (*xd* = 0.11 [*−*0.01, 0.23], *t* = 1.86, *p* = 0.06, *FDR* = 0.84). ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F4) Figure 4. Replication of polygenic score associations with categorical gender identity in a larger sample (Study 2). Polygenic score (PGS) difference in means (t-tests) between gender identity groups in the larger cohort of European genetic ancestry (*N* = 5, 165). The two gender identity groups are cisgender (*N* = 4, 879) versus transgender and gender nonbinary (TGNB, *N* = 286). The PGS were adjusted for age, sex, and autism prior to performing the t-tests. The results with the PGS not adjusted for age, sex, and autism are shown in Figure S5. The results not filtered on genetic ancestry (*N* = 5, 388) are shown in Figure S4. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]. PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. We repeated the tests not filtering on genetic ancestry (*N* = 5, 388) (Figure S4). The strongest effect was still cognitive performance PGS, which was higher in the TGNB group (*xd* = 0.24 [0.12, 0.36], *t* = 3, 86, *p <* 0.001, *FDR* = 0.01). The TGNB group was also higher for risky behavior PGS (*xd* = 0.12 [0.01, 0.23], *t* = 2.09, *p* = 0.04, *FDR* = 0.79) and NHSB PGS (*xd* = 0.12 [0, 0.23], *t* = 2.02, *p* = 0.04, *FDR* = 0.79). We repeated the tests by adjusting the PGS for only age (Figure S5A) and not adjusting the PGS for age, sex, or autism (Figure S5B). ### 2.4 Interactions between gender diversity, mental health, and polygenic scores Although our smaller sample size was not well-powered for interactions, having found little evidence for main effect PGS associations that explained the mental health associations with gender diversity, we decided to investigate interactions between PGS and gender diversity that could potentially explain mental health outcomes. We tested for PGS-by-GSR interactions with three linear models and performed correlations stratified by PGS (Figure S6). The first model included the covariates (age, sex, and autism) as main effects and interactions with PGS and GSR, as recommended by [26]. For the second model, we maintained consistency with the previous analysis that adjusted for covariates prior to tests. For the third model, we binarized the PGS into the upper quartile (coded as 1) and the lower quartile groups (coded as 0), with *N* = 160 in each group and the middle 50% removed. From the third model, we identified three nominally significant PGS-by-GSR interactions, specifically the schizophrenia and depression PGS. The most prominent interaction was Nonbinary and schizophrenia PGS interaction on Internalizing (*β* = 0.32, [0.1, 0.53], *p <* 0.001, *FDR* = 0.20). Within the entire sample of *N* = 639, Nonbinary and Internalizing were positively correlated: *ρ* = 0.18, *p <* 0.001. However, this apparent main effect appears to be driven by a context-specific interaction with PGS: in the subset at highest schizophrenia PGS (e.g., the upper quartile, *N* = 160), the correlation between Nonbinary and Internalizing was *ρ* = 0.36, *p <* 0.001. While in the lowest risk group (e.g. the lower quartile, *N* = 160), there was no correlation: *ρ* = 0.02, *p* = 0.81. The interaction between depression PGS and Nonbinary on Internalizing was also similar (*β* = 0.23, [0.02, 0.45], *p* = 0.04, *FDR* = 0.97). ## 3 DISCUSSION Our analyses were the first to address the relationships of multidimensional gender diversity with mental health and genetics. We used two quantitative measures of gender diversity, Binary and Nonbinary Gender Diversity, from the Gender Self-Report (GSR) in a neurodiverse sample of *N* = 696 adults in SPARK [20]. In our sample, we found greater gender diversity in female, autistic, and LGBTQ+ participants. Due to the structure of SPARK recruitment, we were only able to collect data from independent adults with autism or non-autistic immediate family members of someone with autism (mainly parents). Therefore, the elevated gender diversity in the autistic participants should be interpreted with the caveat that the non-autistic participants were older and presumed to adhere to more traditional gender roles. However, these results are in line with previous research showing the enrichment of gender diversity in autism [21]. Intriguingly, while our results showed higher gender diversity in LGBTQ+ participants, many people who were cisgender also showed evidence of gender diversity, though not enough to report TGNB identities. This underscores the value of the GSR in capturing dimensional gender diversity beyond self-endorsed identities alone. We tested 16 behavior-related PGS for association with the two GSR dimensions, and strikingly, the strongest association was cognitive performance PGS positively associated with both Binary and Nonbinary (Figure 3A). This finding was validated in our larger sample of *N* = 5, 165 with categorical gender identity; cognitive performance PGS was higher in the TGNB group compared to the cisgender group (Figure 4). This suggests that cognitive capacity may be an important component in the development of more complex and nuanced gender identities. Beyond cognitive performance, non-heterosexual sexual behavior (NHSB) PGS was positively correlated with Binary. Although gender identity and sexual orientation are distinct, the NHSB GWAS is a well-powered GWAS that is adjacent to gender diversity. Recent research has found that just within heterosexuals, the NHSB PGS is positively associated with an increased number of partners [27]). Building on this, our results suggest that gender diversity may be part of a pleiotropic ensemble of traits with adaptive advantages (e.g., cognitive performance). We expected neuropsychiatric PGS to also be positively correlated with the GSR, considering NHSB is positively genetically correlated with several neuropsychiatric conditions [19]. Our sample size was on the low end for PGS associations: *N* = 639 provides 80% power for detecting *ρ ±* 0.11, meaning we are not powered to detect small effects. However, in light of this previous research, it was surprising that we found no strong significant positive associations of neuropsychiatric PGS with the GSR. This suggests that, within the statistical power limits of our sample, gender diversity may not have a strong direct genetic relationship with adult-onset psychiatric disorders. Instead, in our sample higher gender diversity had the strongest genetic relationships with higher cognitive ability and NHSB. The lack of a main genetic effect that links psychiatric conditions and gender diversity, combined with our observation that the GSR scores nevertheless show numerous significant correlations with poorer self-reported mental health prompted us to examine the possibility of a relationship between gender diversity and mental health that depends on the level of genetic risk (i.e., an interaction between polygenic risk and gender diversity). We observed differences in correlations when stratifying by schizophrenia and depression PGS (Figure S6D). Groups with high depression and schizophrenia PGS had the strongest GSR-ASR correlations, whereas the GSR-ASR correlations in the low PGS groups were absent (i.e., not nominally significant). This suggests that PGS for depression and schizophrenia might interact with gender diversity (or related environmental factors such as discrimination and/or minority stress), ultimately affecting mental health. In other words, the observed relationship between gender diversity and mental health may not be solely environmental or genetic, but rather an interaction of the two. However, our sample size limits the ability to detect PGS main effects, let alone interactions, and therefore the interaction effects must be interpreted with the understanding that they are small and not significant after multiple testing correction. Our results and their interpretations have several limitations. The primary limitation is the small sample size, and we were only powered to detect strong PGS effects. In addition, age, sex designated at birth, and autism are entangled with other variables of interest. Autism is confounded at the genetic level, as observed in previous work showing educational attainment [28] and cognitive performance [29] are positively genetically correlated with autism. However, we repeated our analyses without adjusting the PGS and phenotypes for autism (Table S1 and Figure S5) and also stratifying by autism (Figure S3) and found the results to be robust against the inclusion or omission of autism. In summary, our findings show that gender diversity, as captured by the GSR, has dimensional properties that share common genetic factors with cognitive performance and NHSB. In agreement with previous studies, we find that higher gender diversity is correlated with poorer mental health, but our results suggest that any polygenic contribution of psychiatric risk alleles to gender diversity, if such contributions exists, are not large. Rather, one’s polygenic background may function as a risk/resilience mechanism that interacts with gender diversity (and/or the adversity that comes with it) in shaping mental health outcomes. ## 4 METHODS AND MATERIALS ### 4.1 Sample description SPARK [20] is a nationwide autism study in the United States involving more than 300,000 participants, with genetic data available for many of the participants. SPARK is approved by the Western IRB (#20151664). For Study 1, independent adults, with or without autism, were invited to participate in our SPARK Research Match. The Research Match was approved by the University of Iowa Institutional Review Board (IRB #201611784). Those consented to participate were asked to complete the Gender Self Report (GSR) [24], the Adult Self Report (ASR) [23], and additional questions on sexual orientation, gender identity, and gender expression. The sample size was *N* = 818. We removed nine individuals who had withdrawn from SPARK since the Research Match based on Version 8 (*N* = 809). *N* = 696 was the final sample size after genetic data availability and quality control filtering. For Study 2, we used the Version 8 background history files. The independent adult data was self-report, whereas the child and siblings data were parent report. We kept children 14 years or older and whose cognitive impairment status at enrollment was not significantly below age. ### 4.2 Study 1 phenotypes #### Labels of gender identity and sexual orientation Participants were able to select as many labels for gender identity and sexual orientation they found applicable. Selections of nonbinary, demigender, gender fluid, third gender, agender, gender neutral, pangender, bigender, and gender queer were categorized as nonbinary/neutral. Cisgender and transgender were each categorized separately. Participants who did not endorse any of the listed gender identities were excluded from analyses using gender identity labels (*N* = 66 of *N* = 696). For sexual orientation, participants selecting lesbian, gay, bisexual, pansexual, homosexual, queer, and/or polysexual were grouped as LGBQ+ and heterosexual orientation was classified separately. Participants who did not select any of the listed sexual orientation labels were excluded from analyses using sexual orientation labels (*N* = 73 of *N* = 696). #### Gender Self Report (GSR) The GSR itemset was developed through an iterative multi-input community driven process with autistic cisgender, autistic gender-diverse, and non-autistic cisgender and gender-diverse collaborators [24]; Open Science Framework Development Summary: https://osf.io/qh25d/?view_only=c0ce41d07bca4af1b792e074d51b7ded. The final GSR itemset is composed of 30 questions. The GSR factor analysis and generation of Binary and Nonbinary factor scores are described in [24]. In the genetic sample of *N* = 696, the GSR scores were adjusted for age, sex designated at birth, and autism by linear regression residualization and then Z-scaled. #### Adult Self Report (ASR) The ASR [23] is a questionnaire of 129 items assessing a range of adaptive behaviors and mental health outcomes. From *N* = 809, five participants were removed because they had 12 (approximately 10%) or more missing ASR items. In the remaining *N* = 804, 0.2% of the data was missing, with no item having more than five missing data points. The missing data was imputed to the median. The two ASR subscales were Externalizing and Internalizing. Externalizing is a composite score of aggressive, rulebreaking, and intrusive behavior, and Internalizing is a composite score of anxious, withdrawn depressed, and somatic complaints. In the genetic sample of *N* = 696, the ASR scores were adjusted for age, sex designated at birth, and autism by linear regression residualization and then Z-scaled. ### 4.3 Study 2 phenotypes If the participant’s designated sex at birth (options: Male or Female) did not match their gender (options: Male, Female, or Other) then the participant was classified as TGNB. We then merged with our Research Match gender identity labels. The final sample size was *N* = 5, 388 with *N* = 590 from the Research Match and the other *N* = 4, 798 from the background history. ### 4.4 Genotype quality control and imputation We used the genotype array data from SPARK integrated whole-exome-sequencing (iWES1) 2022 Release and the SPARK whole-genome-sequencing (WGS) Release 2, 3, and 4. iWES1 (*N* = 69, 592) was quality controlled on release, including removing samples due to heterozygosity or high missingness, so no further quality control was performed by us before genotype imputation. iWES1 provided genetic ancestry assignments based on the 1000 Genomes populations [30]. WGS Release 2 (*N* = 2, 365), Release 3 (*N* = 2, 871), and Release 4 (*N* = 3, 684) were not quality controlled on release, so we performed quality control using PLINK [31] before genotype imputation. First, we removed participants from the WGS releases if they were in iWES1. Second, we removed variants with missingness higher than 0.1 and participants with missingness higher than 0.2. Third, we merged the three releases and removed any participant whose heterozygosity (F statistic) was not within 3 standard deviations of the mean heterozygosity across the three releases. We then used the TopMed reference panel [32] to identify strand flips. The final sample size for WGS 2-4 was *N* = 8, 152. iWES1 and WGS 2-4 were then imputed to the TopMed [32] reference panel using the Michigan Imputation Server [33] with the phasing and quality control steps included and to output variants with imputation quality r2 *>* 0.3. After imputation, the variants were filtered to only the HapMap SNPs (*N* = 1, 054, 330 variants) with imputation quality r2 *>* 0.8 using bcftools [34]. They were lifted over from hg38 to hg19 using the VCF-liftover tool ([https://github.com/hmgu-itg/VCF-liftover](https://github.com/hmgu-itg/VCF-liftover)) and the alleles normalized to the hg19 reference genome. Finally, the files were merged and only variants with 0% missingness were retained (*N* = 914, 328). ### 4.5 Genetic ancestry Genetic principal components (PCs) were calculated using the bigsnpr package [35], specifically following the author’s recommendations [36] and their tutorial: [https://privefl.github.io/bigsnpr/articles/](https://privefl.github.io/bigsnpr/articles/) bedpca.html. In summary, we 1.) used the snp_plinkKINGQC function to identify and remove related participants at the KING threshold of 2−3.5, 2.) performed PC analysis using the bed_autoSVD on just the unrelated participants, 3.) detected PC outliers and removed them, 4.) recalculated the PCs, and 5.) projected the PCs onto the entire cohort using the bed_projectSelfPCA function. We used the 40 PCs and performed k-means clustering with K = 5 (for the five populations of 1000 Genomes [30]) and used the genetic ancestry labels from iWES1 to assign labels to the genetic population clusters. ### 4.6 Relatedness For Study 1, from the *N* = 809 Research Match participants whom completed the GSR, *N* = 804 completed the ASR and *N* = 727 had genetic data. This subset was pruned to remove related participants using GCTA [37] with a relatedness threshold of 0.125, corresponding to approximately third-degree relatives (*N* = 31 removed). For Study 2, we retained only one participant from each family, with prioritization towards TGNB identities and then removed related participants at the same threshold. ### 4.7 Polygenic scores PGS were calculated using LDpred2 [38] and the bigsnpr tools [35] in R [39]. Because SPARK is family-based, an external LD reference based on *N* = 362, 320 in the UK Biobank (provided by the authors of LDpred2) was used to calculate the genetic correlation matrix, estimate heritability, and calculate infinitesimal beta weights. PGS were calculated from the following genome-wide association studies (GWAS): ADHD [40], anorexia nervosa [41], autism [28], bipolar disorder [42], major depression [43], OCD [44], schizophrenia [45], cognitive performance [46], educational attainment [46], and non-heterosexual sexual behavior (NHSB) [19]. The public LDpred2 beta weights from the Polygenic Index Repository [47] were used to calculate PGS for extraversion [48], neuroticism [49], openness [50], risky behavior [51], number of children ever born (men) [52], and number of children ever born (women) [52]. To account for genetic ancestry, we residualized the 20 genetic PCs from the PGS. We also accounted for age, sex designated at birth, and autism using linear regression residualization. Lastly, the PGS were Z-scaled. We performed PGS processing separately for Study 1 (*N* = 696) and Study 2 (*N* = 5, 388). PGS were correlated with the GSR and ASR scores using Spearman correlations. We used the pwr.r.test() function from the pwr package [53] to determine the statistical power of the correlations. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]. We tested for PGS-by-GSR interactions with three linear models and then performed stratified correlations. The first model included the covariates (age, sex, and autism) as both main effects and interactions with the PGS and GSR scores, as recommended by [26]. The model was specified as: *ASR ∼ PGS* + *GSR* + *PGS × GSR*+*age*+*sex*+*autism*+*PGS ×age*+*PGS ×sex*+*PGS ×autism*+*GSR×age*+*GSR×sex*+*GSR×autism*. For the second model, we wanted to be consistent with the previous analyses. We tested for interactions with the variables adjusted for covariates prior to model input. This model was specified as: *ASR ∼ PGS* + *GSR* + *PGS × ASR*, with the ASR, PGS, and GSR adjusted for age, sex, and autism. For the third model, we binarized the PGS into the upper quartile (coded as 1) and the lower quartile groups (coded as 0), with *N* = 160 in each group and the middle 50% removed. This model was specified as: *ASR ∼ PGSgroup* + *GSR* + *PGSgroup × GSR*, with the ASR and GSR scores adjusted for covariates prior to model input. To further investigate the interactions from the third model, we ran ASR-GSR correlations stratified by PGS groups. ## Data Availability The SPARK genetic data can be obtained at SFARI Base. The SPARK Research Match data will be available to qualified, approved researchers through SFARI Base upon publication of this article. [https://base.sfari.org](https://base.sfari.org) ## DATA AND CODE AVAILABILITY SPARK genetic data can be obtained at SFARI Base: [https://base.sfari.org](https://base.sfari.org) SPARK Research Match data will be available to qualified approved researchers through SFARI Base after publication of this article. The code for all analyses can be found at [https://research-git.uiowa.edu/](https://research-git.uiowa.edu/) michaelson-lab-public/gsr-polygenic-scores ## FUNDING This work was supported by the National Institutes of Health and National Institute of Mental Health (R01HG012697 to JJM and JFS, MH105527 to JJM, DC014489 to JJM, and R01MH100028 to JFS), as well as grants from the Simons Foundation (SFARI 516716 to JJM), the Clinical and Translational Science Award (KL2TR001877 to JJM), the Fahs-Beck Fellow Grant to JFS, and the National Institutes of Health Predoctoral training grant (T32GM008629 to TRT). The Roy J. Carver Charitable Trust supports the work of JJM. This work was also supported by the University of Iowa Hawkeye Intellectual and Developmental Disabilities Research Center (Hawk-IDDRC) through the Eunice Kennedy Shriver National Institute of Child Health and Human Development (P50HD103556). ## DISCLOSURES The authors declare that the research was conducted in the absence of any commercial or financial relationship that could be construed as a potential conflict of interest. ## AUTHOR CONTRIBUTIONS The study was designed by TRT, JFS, and JJM. The GSR scores were generated by JSY and JFS. The polygenic scores were generated by TRT and JJM. The analyses were performed by TRT, AJT, and JJM. The manuscript writing was done by all authors. ## SUPPLEMENTARY View this table: [Table S1.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/T3) Table S1. Polygenic score associations with the Gender Self Report and Adult Self Report scores using linear regression with covariates. Linear regressions to test for the polygenic (PGS) associations with the Gender Self Report (GSR) and Adult Self Report (ASR) scores with age, sex, and autism as covariates in the model: *phenotype ∼ PGS* +*age*+*sex*+*autism*. The ASR and GSR phenotypes and PGS were not adjusted for covariates prior to model input, but they were Z-scaled (*µ* = 0, *σ* = 1). The regression were ran in the European subset (*N* = 639).PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. ![Figure S1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F5.medium.gif) [Figure S1.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F5) Figure S1. Distributions of the Gender Self Report scores before adjusting for covariates. The Gender Self Report (GSR) scores here are the unadjusted scores from the GSR factor analysis [24]. ![Figure S2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F6.medium.gif) [Figure S2.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F6) Figure S2. Polygenic score associations with the Gender Self Report and Adult Self Report scores, not filtered by genetic ancestry. Polygenic score (PGS) associations with the **(A)** Adult Self Report (ASR) scores and **(B)** Gender Self Report (GSR) scores. The PGS, as well as the GSR and ASR phenotypes, were adjusted for age, sex, and autism. The correlations shown here were run in the full cohort (*N* = 696). PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. ![Figure S3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F7.medium.gif) [Figure S3.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F7) Figure S3. Polygenic score associations with the Gender Self Report and Adult Self Report scores, stratified by autism diagnosis. (A) Polygenic score (PGS) differences by autism diagnosis within those of the European genetic ancestry cluster (Autistic *N* = 376, not diagnosed *N* = 263). PGS correlations with the **(B)** Adult Self Report (ASR) scores and **(C)** Gender Self Report (GSR) scores. The PGS, as well as the GSR and ASR phenotypes, were not adjusted for age or sex. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]. PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ**= schizophrenia. ![Figure S4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F8.medium.gif) [Figure S4.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F8) Figure S4. Replication of polygenic score associations with categorical gender identity in a larger sample, not filtered on genetic ancestry. Polygenic score (PGS) difference in means (t-tests) between gender identity groups in the larger cohort (*N* = 5, 388) with all samples. The two gender identity groups are cisgender (*N* = 5, 089) versus transgender and gender nonbinary (TGNB, *N* = 299). The PGS were adjusted for age, sex, and autism prior to performing the t-tests. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]). PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. ![Figure S5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F9.medium.gif) [Figure S5.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F9) Figure S5. Replication of polygenic score associations with categorical gender identity in a larger sample with the polygenic scores not adjusted for age, sex, or autism. Polygenic score (PGS) difference in means (t-tests) between gender identity groups in the larger cohort (*N* = 5, 165) of European genetic ancestry. The two gender identity groups are cisgender (*N* = 4, 879) versus transgender and gender nonbinary (TGNB, *N* = 286). **(A)** shows the results for the PGS adjusted for age and sex (not autism). **(B)** shows the results with the PGS not adjusted for age, sex, nor autism. Multiple testing corrections for the 16 PGS were performed with the Benjamini and Yekutieli method [25]. PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. ![Figure S6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/06/2021.11.22.21266696/F10.medium.gif) [Figure S6.](http://medrxiv.org/content/early/2023/07/06/2021.11.22.21266696/F10) Figure S6. Interaction effects between the Gender Self Report and polygenic scores on the Adult Self Report. Interactions between the Gender Self Report (GSR) scores and the polygenic scores (PGS) on the Adult Self Report (ASR) scores were tested with three models. All variables were Z-scaled prior to model input. The fill color is the *β* estimate for the PGS-by-GSR interaction term from the linear model. Nominally significant interaction terms are indicated with one star. No interactions are significant after multiple testing corrections with the Benjamini and Yekutieli method [25]. **(A)** As recommended by [26], this model included the covariates (age, sex, and autism) as both main effects and interactions with the PGS and GSR scores. *ASR ∼ PGS* + *GSR* + *PGS × GSR* + *age* + *sex* + *autism* + *PGS × age* + *PGS × sex* + *PGS × autism* + *GSR × age* + *GSR × sex* + *GSR × autism*. **(B)** To be consistent with the previous analysis, we tested for interactions with the variables adjusted for covariates prior to model input. Specifically, the PGS, GSR, and ASR scores were adjusted for age, sex, and autism. The interaction model was thus *ASR ∼ PGS* + *GSR* + *PGS × ASR*. **(C)** We binarized the PGS: the upper 75th quartile (coded as 1) and the lower 25th quartile groups (coded as 0), with *N* = 160 in each group and the middle 50% removed. The interaction model was thus *ASR ∼ PGSgroup* + *GSR* + *PGSgroup × GSR*, with the ASR and GSR scores adjusted for covariates prior to model input. **(D)** Spearman correlations of the GSR scores with the ASR scores stratified by PGS group. The ASR and GSR scores were adjusted for covariates prior to running the correlations. PGS abbreviations: **risky** = risky behavior, **NEB** = number of children ever born, **non-het** = non-heterosexual sexual behavior, **SCZ** = schizophrenia. ## Public summary The way we act (behavior) is influenced by how our brains grow and function. Some of the ways our brains grow and function are influenced by our genes (DNA). Everyone has slightly different versions of DNA. This is a normal part of human diversity. Some of these DNA differences lead to differences in our brains. Brain differences can lead to behavior differences like in personality, intelligence, or mental health. In this study, we asked whether our DNA is involved in gender identity and gender expression. We use the term “gender diversity” to mean differences in gender identity or gender expression. People with higher gender diversity are more likely to be transgender and/or nonbinary, although cisgender people can also have differences in gender expression. People with higher gender diversity are also more likely to be autistic, so we conducted this study with the help of SPARK participants (SPARK is the largest study of autism). Approximately half of our study participants were autistic adults, and the others were not autistic but do have an immediate relative who is autistic. ### What we found We found that thousands of DNA differences, when combined, are linked to differences in gender diversity. Specifically, we found that DNA differences linked to higher intelligence were also linked to higher gender diversity. We need to do more research to understand why DNA differences linked to higher intelligence are also linked to higher gender diversity. ### What our study does not show We did not identify or attempt to identify a “transgender” or “nonbinary” gene. We cannot predict a person’s gender diversity from their DNA. We found very little evidence that the DNA differences linked to major psychiatric conditions are also strongly linked to gender diversity. Larger studies in the future may be able to identify weaker effects, but our study does not support a strong genetic connection between psychiatric conditions and gender diversity. Gender diversity is not purely genetic, but genetic factors do play a role. ### Why this study is important This is one of the first genetic studies of gender diversity. Many people say that gender is a purely social construct, with no biological factors involved. Our results show that the DNA differences linked to higher intelligence are also linked to higher gender diversity. Ultimately, we believe this line of research will advance the health of gender diverse people through a higher understanding of how genetics interact with gender diversity in determining health outcomes. ## ACKNOWLEDGEMENTS We are grateful to our community advisory council, including members Elizabeth Graham, Sascha Klomp, and Jillian Nelson for their feedback throughout the research and writing process. We are also grateful to all the participants and families in SPARK, the SPARK clinical sites, and the SPARK staff. We appreciate obtaining access to genetic and phenotypic data for SPARK data on SFARI Base. ## Footnotes * Supplemental figures and tables have been added and the writing has been revised for clarity. * Received November 22, 2021. * Revision received July 5, 2023. * Accepted July 6, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## REFERENCES 1. [1]. D. Westergaard, P. Moseley, F. K. H. Sørup, P. Baldi, and S. Brunak, “Population-wide analysis of differences in disease progression patterns in men and women,” Nature Communications, vol. 10, Feb. 2019. 2. [2]. I. O. Friberg, G. Krantz, S. Määttä, and K. Järbrink, “Sex differences in health care consumption in sweden: A register-based cross-sectional study,” Scandinavian Journal of Public Health, vol. 44, pp. 264–273, Dec. 2015. 3. [3]. M. Romanelli and K. D. Hudson, “Individual and systemic barriers to health care: Perspectives of lesbian, gay, bisexual, and transgender adults.,” American Journal of Orthopsychiatry, vol. 87, no. 6, pp. 714–728, 2017. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/ort0000306&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29154611&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 4. [4]. E. A. Khramtsova, L. K. Davis, and B. E. Stranger, “The role of sex in the genomics of human complex traits,” Nature Reviews Genetics, vol. 20, pp. 173–190, Dec. 2018. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-018-0083-1&link_type=DOI) 5. [5]. A. E. Lloyd and M. P. Galupo, “What people with normative identities believe about sex, gender and sexual orientation,” Psychology & Sexuality, vol. 10, pp. 269–280, May 2019. 6. [6]. R. E. Morgan, C. Dragon, G. Daus, J. Holzberg, R. Kaplan, H. Menne, and M. Spiegelman, “Updates on terminology of sexual orientation and gender identity survey measures (fcsm 20-03),” Federal Committee on Statistical Methodology, Aug. 2020. 7. [7]. T. R. Thomas, D. Hofammann, B. G. McKenna, A. I. R. van der Miesen, M. A. Stokes, P. Daniolos, and J. J. Michaelson, “Community attitudes on genetic research of gender identity, sexual orientation, and mental health,” PLOS ONE, vol. 15, p. e0235608, July 2020. 8. [8]. B. D. Wilson and I. H. Meyer, “Nonbinary LGBTQ adults in the United States,” 2021. 9. [9]. R. A. Lippa, “Gender-related traits in gay men, lesbian women, and heterosexual men and women: The virtual identity of homosexual-heterosexual diagnosticity and gender diagnosticity,” Journal of Personality, vol. 68, pp. 899–926, Oct. 2000. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/1467-6494.00120&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11001153&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000089451900004&link_type=ISI) 10. [10]. M. P. Marshal, L. J. Dietz, M. S. Friedman, R. Stall, H. A. Smith, J. McGinley, B. C. Thoma, P. J. Murray, A. R. D’Augelli, and D. A. Brent, “Suicidality and depression disparities between sexual minority and heterosexual youth: A meta-analytic review,” Journal of Adolescent Health, vol. 49, pp. 115–123, Aug. 2011. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jadohealth.2011.02.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21783042&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000292997600003&link_type=ISI) 11. [11]. P. B. Barr, T. B. Bigdeli, and J. L. Meyers, “Prevalence, comorbidity, and sociodemographic correlates of psychiatric diagnoses reported in the All of Us research program,” JAMA Psychiatry, vol. 79, p. 622, June 2022. 12. [12]. M. R. Woodford, G. Weber, Z. Nicolazzo, R. Hunt, A. Kulick, T. Coleman, S. Coulombe, and K. A. Renn, “Depression and attempted suicide among LGBTQ college students: Fostering resilience to the effects of heterosexism and cisgenderism on campus,” Journal of College Student Development, vol. 59, no. 4, pp. 421–438, 2018. 13. [13]. A. E. Green, J. P. DeChants, M. N. Price, and C. K. Davis, “Association of gender-affirming hormone therapy with depression, thoughts of suicide, and attempted suicide among transgender and nonbinary youth,” Journal of Adolescent Health, vol. 70, no. 4, pp. 643–649, 2022. 14. [14]. T. J. C. Polderman, B. P. C. Kreukels, M. S. Irwig, L. Beach, Y.-M. Chan, E. M. Derks, I. Esteva, J. Ehrenfeld, M. D. Heijer, D. Posthuma, et al., “The biological contributions to gender identity and gender diversity: Bringing data to the table,” Behavior Genetics, vol. 48, pp. 95–108, Feb. 2018. 15. [15]. A. Burri, L. Cherkas, T. Spector, and Q. Rahman, “Genetic and environmental influences on female sexual orientation, childhood gender typicality and adult gender identity,” PLoS ONE, vol. 6, p. e21982, July 2011. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0021982&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21760939&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 16. [16]. J. G. Theisen, V. Sundaram, M. S. Filchak, L. P. Chorich, M. E. Sullivan, J. Knight, H.-G. Kim, and L. C. Layman, “The use of whole exome sequencing in a cohort of transgender individuals to identify rare genetic variants,” Scientific Reports, vol. 9, Dec. 2019. 17. [17]. A. R. Sanders, G. W. Beecham, S. Guo, K. Dawood, G. Rieger, R. S. Krishnappa, A. B. Kolundzija, J. M. Bailey, and E. R. Martin, “Genome-wide linkage and association study of childhood gender nonconformity in males,” Archives of Sexual Behavior, vol. 50, pp. 3377–3383, Sept. 2021. 18. [18]. A. Abdellaoui and K. J. H. Verweij, “Dissecting polygenic signals from genome-wide association studies on human behaviour,” Nature Human Behaviour, vol. 5, pp. 686–694, May 2021. 19. [19]. A. Ganna, K. J. H. Verweij, M. G. Nivard, R. Maier, R. Wedow, A. S. Busch, A. Abdellaoui, S. Guo, J. F. Sathirapongsasuti, P. Lichtenstein, et al., “Large-scale GWAS reveals insights into the genetic architecture of same-sex sexual behavior,” Science, vol. 365, Aug. 2019. 20. [20]. P. Feliciano, A. M. Daniels, L. G. Snyder, A. Beaumont, A. Camba, A. Esler, A. G. Gulsrud, A. Mason, A. Gutierrez, A. Nicholson, et al., “SPARK: A US cohort of 50,000 families to accelerate autism research,” Neuron, vol. 97, pp. 488–493, Feb. 2018. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2018.01.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29420931&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 21. [21]. E. Hisle-Gorman, C. A. Landis, A. Susi, N. A. Schvey, G. H. Gorman, C. M. Nylund, and D. A. Klein, “Gender dysphoria in children with autism spectrum disorder,” LGBT Health, vol. 6, pp. 95–100, Apr. 2019. 22. [22]. V. Warrier, D. M. Greenberg, E. Weir, C. Buckingham, P. Smith, M.-C. Lai, C. Allison, and S. Baron-Cohen, “Elevated rates of autism, other neurodevelopmental and psychiatric diagnoses, and autistic traits in transgender and gender-diverse individuals,” Nature Communications, vol. 11, Aug. 2020. 23. [23]. T. M. Achenbach and L. Rescorla, “Manual for the ASEBA adult forms and profiles,” 2003. 24. [24]. J. F. Strang, G. L. Wallace, J. J. Michaelson, A. L. Fischbach, T. R. Thomas, A. Jack, J. Shen, D. Chen, A. Freeman, M. Knauss, et al., “The Gender Self-Report: A multidimensional gender characterization tool for gender-diverse and cisgender youth and adults.,” The American Psychologist (accepted for publication*)*, 2023. 25. [25]. Y. Benjamini and D. Yekutieli, “The control of the false discovery rate in multiple testing under dependency,” The Annals of Statistics, vol. 29, Aug. 2001. 26. [26]. M. C. Keller, “Gene × environment interaction studies have not properly controlled for potential confounders: The problem and the (simple) solution,” Biological Psychiatry, vol. 75, pp. 18–24, Jan. 2014. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.biopsych.2013.09.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24135711&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000328265800005&link_type=ISI) 27. [27]. B. P. Zietsch, M. J. Sidari, A. Abdellaoui, R. Maier, N. Långström, S. Guo, G. W. Beecham, E. R. Martin, A. R. Sanders, and K. J. H. Verweij, “Genomic evidence consistent with antagonistic pleiotropy may help explain the evolutionary maintenance of same-sex sexual behaviour in humans,” Nature Human Behaviour, vol. 5, pp. 1251–1258, Aug. 2021. 28. [28]. J. Grove, S. Ripke, T. D. Als, M. Mattheisen, R. K. Walters, H. Won, J. Pallesen, E. Agerbo, O. A. Andreassen, R. Anney, et al., “Identification of common genetic risk variants for autism spectrum disorder,” Nature Genetics, vol. 51, pp. 431–444, Feb. 2019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0344-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30804558&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 29. [29]. T.-K. Clarke, M. K. Lupton, A. M. Fernandez-Pujals, J. Starr, G. Davies, S. Cox, A. Pattie, D. C. Liewald, L. S. Hall, D. J. MacIntyre, et al., “Common polygenic risk for autism spectrum disorder (ASD) is associated with cognitive ability in the general population,” Molecular Psychiatry, vol. 21, pp. 419–425, Mar. 2015. 30. [30]. . G. P. Consortium et al., “A global reference for human genetic variation,” Nature, vol. 526, no. 7571, p. 68, 2015. 31. [31]. S. Purcell, B. Neale, K. Todd-Brown, L. Thomas, M. A. Ferreira, D. Bender, J. Maller, P. Sklar, P. I. de Bakker, M. J. Daly, et al., “PLINK: A tool set for whole-genome association and population-based linkage analyses,” The American Journal of Human Genetics, vol. 81, no. 3, pp. 559–575, 2007. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/519795&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17701901&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 32. [32]. D. Taliun, D. N. Harris, M. D. Kessler, J. Carlson, Z. A. Szpiech, R. Torres, S. A. G. Taliun, A. Corvelo, S. M. Gogarten, H. M. Kang, et al., “Sequencing of 53,831 diverse genomes from the NHLBI TOPMed program,” Nature, vol. 590, pp. 290–299, Feb. 2021. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03205-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33568819&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 33. [33]. S. Das, L. Forer, S. Schönherr, C. Sidore, A. E. Locke, A. Kwong, S. I. Vrieze, E. Y. Chew, S. Levy, M. McGue, et al., “Next-generation genotype imputation service and methods,” Nature Genetics, vol. 48, pp. 1284–1287, Aug. 2016. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3656&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27571263&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 34. [34]. P. Danecek, J. K. Bonfield, J. Liddle, J. Marshall, V. Ohan, M. O. Pollard, A. Whitwham, T. Keane, S. A. McCarthy, R. M. Davies, et al., “Twelve years of SAMtools and BCFtools,” GigaScience, vol. 10, Jan. 2021. 35. [35]. F. Privé, H. Aschard, A. Ziyatdinov, and M. G. B. Blum, “Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr,” Bioinformatics, vol. 34, pp. 2781–2787, Mar. 2018. 36. [36]. F. Privé, K. Luu, M. G. B. Blum, J. J. McGrath, and B. J. Vilhjálmsson, “Efficient toolkit implementing best practices for principal component analysis of population genetic data,” Bioinformatics, vol. 36, no. 16, pp. 4449–4457, 2020. 37. [37]. J. Yang, S. H. Lee, M. E. Goddard, and P. M. Visscher, “GCTA: A tool for genome-wide complex trait analysis,” The American Journal of Human Genetics, vol. 88, no. 1, pp. 76–82, 2011. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2010.11.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21167468&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 38. [38]. F. Privé, J. Arbel, and B. J. Vilhjálmsson, “LDpred2: better, faster, stronger,” Bioinformatics, vol. 36, pp. 5424–5431, Dec. 2020. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 39. [39].R Core Team, R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2013. 40. [40]. D. Demontis, R. K. Walters, J. Martin, M. Mattheisen, T. D. Als, E. Agerbo, G. Baldursson, R. Belliveau, J. Bybjerg-Grauholm, M. Baekvad-Hansen, et al., “Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder,” Nature Genetics, vol. 51, pp. 63–75, Nov. 2018. 41. [41]. H. J. Watson, Z. Yilmaz, L. M. Thornton, C. Hübel, J. R. I. Coleman, H. A. Gaspar, J. Bryois, A. Hinney, V. M. Leppä, M. Mattheisen, et al., “Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa,” Nature Genetics, vol. 51, pp. 1207–1214, July 2019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/S41588-019-0439-2&link_type=DOI) 42. [42]. N. Mullins, A. J. Forstner, K. S. O’Connell, B. Coombes, J. R. I. Coleman, Z. Qiao, T. D. Als, T. B. Bigdeli, S. Børte, J. Bryois, et al., “Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology,” Nature Genetics, vol. 53, pp. 817–829, May 2021. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00857-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34002096&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 43. [43]. D. M. Howard, M. J. Adams, T.-K. Clarke, J. D. Hafferty, J. Gibson, M. Shirali, J. R. I. Coleman, S. P. Hagenaars, J. Ward, E. M. Wigmore, et al., “Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions,” Nature Neuroscience, vol. 22, pp. 343–352, Feb. 2019. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/S41593-018-0326-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 44. [44]. P. D. Arnold, K. D. Askland, C. Barlassina, L. Bellodi, O. Bienvenu, D. Black, M. Bloch, H. Brentani, C. L. Burton, B. Camarena, et al., “Revealing the complex genetic architecture of obsessive-compulsive disorder using meta-analysis,” Molecular psychiatry, vol. 23, no. 5, pp. 1181–1181, 2018. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/MP.2017.154&link_type=DOI) 45. [45]. V. Trubetskoy, A. F. Pardiñas, T. Qi, G. Panagiotaropoulou, S. Awasthi, T. B. Bigdeli, J. Bryois, C.-Y. Chen, C. A. Dennison, L. S. Hall, et al., “Mapping genomic loci implicates genes and synaptic biology in schizophrenia,” Nature, vol. 604, pp. 502–508, Apr. 2022. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 46. [46]. J. J. Lee, R. Wedow, A. Okbay, E. Kong, O. Maghzian, M. Zacher, T. A. Nguyen-Viet, P. Bowers, J. Sidorenko, R. K. Linnér, et al., “Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals,” Nature Genetics, vol. 50, no. 8, pp. 1112–1121, 2018. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0147-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30038396&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 47. [47]. J. Becker, C. A. Burik, G. Goldman, N. Wang, H. Jayashankar, M. Bennett, D. W. Belsky, R. K. Linnér, R. Ahlskog, A. Kleinman, et al., “Resource profile and user guide of the Polygenic Index Repository,” Nature Human Behaviour, pp. 1–15, 2021. 48. [48]. S. M. van den Berg, M. H. de Moor, K. J. Verweij, R. F. Krueger, M. Luciano, A. A. Vasquez, L. K. Matteson, J. Derringer, N. Amin, S. D. Gordon, et al., “Meta-analysis of genome-wide association studies for extraversion: Findings from the genetics of personality consortium,” Behavior Genetics, vol. 46, no. 2, pp. 170–182, 2016. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10519-015-9735-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26362575&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 49. [49]. M. Nagel, P. R. Jansen, S. Stringer, K. Watanabe, C. A. De Leeuw, J. Bryois, J. E. Savage, A. R. Hammerschlag, N. G. Skene, A. B. Muñoz-Manchado, et al., “Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways,” Nature Genetics, vol. 50, no. 7, pp. 920–927, 2018. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0151-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) 50. [50]. M. H. De Moor, P. T. Costa, A. Terracciano, R. F. Krueger, E. J. De Geus, T. Toshiko, B. W. Penninx, T. Esko, P. A. Madden, J. Derringer, et al., “Meta-analysis of genome-wide association studies for personality,” Molecular Psychiatry, vol. 17, no. 3, pp. 337–349, 2012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2010.128&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21173776&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F06%2F2021.11.22.21266696.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000300709300011&link_type=ISI) 51. [51]. R. K. Linnér, P. Biroli, E. Kong, S. F. W. Meddens, R. Wedow, M. A. Fontana, M. Lebreton, S. P. Tino, A. Abdellaoui, A. R. Hammerschlag, et al., “Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences,” Nature Genetics, vol. 51, no. 2, pp. 245–257, 2019. 52. [52].Neale Lab, “UK Biobank GWAS -Round 2,” 2018. 53. [53]. S. Champely, pwr: Basic Functions for Power Analysis, 2020. R package version 1.3-0. 54. [54].American Psychiatric Association, “A guide for working with transgender and gender nonconforming patients,” 2017.