Abstract
Gender diverse individuals are at higher risk for mental health problems. What remains unclear is whether this increased risk is attributable to environmental stressors (e.g., minority stress), to innate genetic factors with pleiotropic effects on gender diversity and mental health, or to gene-by-environment interactions. Here, we present a study of N=701 independent adults (58% autistic) who were thoroughly characterized for gender diversity using the Gender Self Report (GSR), a novel assessment for the continuous, multidimensional characterization of gender diversity. We calculated polygenic scores for 20 behavioral traits, and tested them for association with the continuous dimensions of the GSR: Binary Gender Diversity (degree of identification with the gender opposite that implied by sex designated at birth) and Nonbinary Gender Diversity (degree of identification with a gender that is neither man/male nor woman/female). We found no evidence of association between gender diversity and polygenic risk for adult-onset psychiatric conditions (major depression, bipolar disorder, schizophrenia). Strikingly, we instead found that both gender diversity dimensions were positively associated with polygenic scores for cognitive performance (Binary ρ = 0.09, Nonbinary ρ = 0.11, p < 0.05). We also found Binary Gender Diversity to be positively associated with polygenic scores for both autism (ρ = 0.08, p < 0.05) and non-heterosexual sexual behavior (ρ = 0.09, p < 0.05). Further, we found no association between increasing gender diversity and poorer mental health outcomes in a subsample with low genetic risk for these neuropsychiatric conditions. Only in the subsample with high genetic risk for major depression or schizophrenia did we observe a significant relationship between gender diversity and poor mental health outcomes. 
These findings suggest that minority stress experienced as a gender diverse person may act with particular potency in those who have high genetic risk for neuropsychiatric disorders. In summary, our findings challenge a pathologizing view of gender diversity, identify pleiotropic relationships with adaptive traits such as cognitive performance, and implicate environment (e.g., minority stress) as a key factor interacting with polygenic risk to generate poor mental health outcomes in gender diverse individuals.
1 Introduction
Sex and gender (see Table 1 for our definitions of terms) have major impacts on health [1]. This stems from both extrinsic factors (e.g., healthcare barriers [2, 3]) as well as biological factors, with sex and gender modulating the underlying molecular mechanisms of disease and well-being [4]. In health research, sex has been a more objective and well-defined variable than gender, which is multidimensional with binary and nonbinary components and often experienced on a continuum [5]. Gender diversity can be reported through self-endorsement of gender identity labels (e.g., transgender, nonbinary, genderqueer, demi-boy), but these labels are contextually and culturally dependent (i.e., not accessible by all) and variable and often non-specific in their meanings [6]. Further, there are numerous gender identity self-descriptors, and group-based analyses based on parsing datasets into individual descriptors erode statistical power for meaningful comparisons given the numerous individual subgroupings. Moreover, gender diversity, a fundamental aspect of human diversity, is not only expressed by individuals with transgender and/or gender nonbinary identities (TGNB). People who identify as cisgender also exhibit some variation in gender diversity that would be lost in studies only reporting categorical descriptors of gender identity [7]. Therefore, a multidimensional, continuous characterization of gender that uses simple and broadly accessible language will enable health researchers to appropriately incorporate gender diversity in their analyses.
Gender diversity is a crucial variable to include in health research, and this may be particularly true in mental health and neuropsychiatric research. Groups that express higher levels of gender diversity than the cisgender proportional majority, such as LGBTQ+ (lesbian, gay, bisexual, transgender, and queer) individuals [8, 9], often have greater rates of anxiety and depression and are more likely to attempt suicide [10]. A recent report of N = 329,038 participants in the All of Us cohort found that the non-heterosexual participants had greater prevalence of all neuropsychiatric diagnoses compared to the heterosexual participants [11]. The exact mechanisms for this are not entirely known. Research has shown that poorer mental health is due to factors related to the experienced adversity from sexual orientation and/or gender diversity minority stress; for example, discrimination and resilience partially mediate negative mental health outcomes in LGBTQ+ college students [12]. Additionally, access to gender-affirming hormone therapy for TGNB youth is associated with a reduced likelihood of depression and suicidality [13]. However, to the best of our knowledge, no study has leveraged genetic data to elucidate the relationships between gender diversity and mental health, so any possible contributions of genetic factors are unknown.
The brain is the biological seat of personal identity, including gender identity. We therefore hypothesize that gender identity is susceptible to genetic influences, like other human behavior traits [14]. Most behaviors are somewhat heritable, with genome-wide association studies (GWASs) of common genetic variants showing many loci, each of small effect, contributing additively (i.e., polygenicity) [15]. Additionally, genomic loci associated with one behavior trait are often found to be associated with another trait, suggesting the two traits have a degree of pleiotropy. One method to estimate pleiotropy is to use polygenic scores, which sum an individual's trait-associated alleles across the genome, weighted by the effect sizes estimated in a GWAS; the polygenic score for one trait is then correlated with the other trait of interest. Genetic research of gender diversity has been limited and underpowered for gene discovery [16, 17].
Among the current well-powered GWASs, the most reasonable proxy for gender diversity is the non-heterosexual sexual behavior GWAS [18] performed in N = 408,995 UK Biobank participants. The trait was defined as the yes/no response to ever having had sex with someone of the same sex (the nuance between same-sex and same-gender is lost due to the nature of the question). The heritability of non-heterosexual sexual behavior varied by age, ranging from 0.08 to 0.25, and the trait was positively genetically correlated with several neuropsychiatric conditions and personality traits. However, the interpretation of these genetic correlations is limited by confounding with experienced adversity: the positive correlation could arise because individuals engaging in non-heterosexual sexual behavior face more sexual and/or gender-based discrimination that increases risk for neuropsychiatric conditions, because of pleiotropy between non-heterosexual sexual behavior and neuropsychiatric risk, or both. Recent work has begun disentangling the confounding variables of discrimination, genetic risk, and mental health outcomes in a study of N = 1,146 participants. The authors regressed out the effects of anxiety, depression, and neuroticism polygenic scores from both their discrimination measures (not necessarily sexual or gender-based discrimination) and anxiety measures and found that the association between discrimination and anxiety persisted after controlling for these genetic liabilities [19].
In this study, we investigated whether gender diversity, like non-heterosexual sexual behavior, is pleiotropic with other behavioral traits and how this pleiotropy might play a role in mental health. Study participants were from the SPARK cohort [20], a nationwide genetic study of over 300,000 participants with and without autism. Existing research demonstrating the common intersection of autism and gender diversity makes SPARK an ideal cohort for gender diversity studies. Previous studies have shown there is an enrichment of gender diversity in autistic samples compared to the general population [21]. Likewise, general population samples of transgender and more broadly gender diverse people are more likely to be autistic or have clinically relevant levels of autistic traits [22]. In our sample of N = 701 participants, we calculated polygenic scores for 20 traits including cognitive ability, personality, and neuropsychiatric conditions and administered two psychometric self-report tools. The first, the Adult Self Report (ASR) [23], is a well-established instrument that measures several mental health outcomes and adaptive behaviors. The second, the Gender Self-Report (GSR) [25], captures two quantitative dimensions of gender diversity: Binary Gender Diversity, the extent to which one experiences themselves as the other binary gender (i.e., different from their sex designated at birth), and Nonbinary Gender Diversity, the extent to which one experiences themselves as neither female nor male. We then sought to answer the following questions: First, are behavior polygenic scores correlated with the two measures of dimensional gender diversity from the GSR? Second, is self-reported mental health correlated with gender diversity? Lastly, do polygenic scores provide additional context in our understanding of the relationship between gender diversity and mental health? An overview of our analyses is shown in Figure 1.
2 Results
2.1 Gender diversity and mental health correlations
The demographic characteristics of the SPARK Research Match participants are shown in Table 2. The final sample size was N = 701 participants, with approximately one-third of the cohort identifying as transgender or gender nonbinary (TGNB). Fifty-eight percent of participants were autistic, and 22% were male. The genetic ancestry categorization is based on the five continental populations described by 1000 Genomes [24]. Ninety-two percent of participants were in the Europe genetic group, 4% in the Americas group, 4% in the South Asia group, 1% in the East Asia group, 0% in the Africa group.
The two gender diversity values, Binary and Nonbinary Gender Diversity, are from the Gender Self Report (GSR) [25]. The GSR values range from 0 (no gender diversity) to 1 (high gender diversity), with the mode being near 0 for both; these values were then controlled for age, sex designated at birth, and autism diagnostic status by linear regression, and then standardized to a mean of 0 and a standard deviation of 1. The distributions of these two values (controlling for covariates) are shown in Figure 2A and are colored by self-endorsed sexual orientation (top panel) and gender identity (bottom panel). The overall trend shows higher gender diversity in LGBQ+ and TGNB participants. The GSR values were significantly positively correlated with each other: ρ = 0.56, p < 0.05 (Figure 2B).
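The covariate adjustment described above (linear regression on covariates, followed by standardization of the residuals) can be illustrated with a minimal sketch. This is not the study's analysis code: the data are simulated, the function names are hypothetical, and ordinary least squares with an intercept is assumed.

```python
import random
import statistics


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def residualize_and_standardize(y, covariates):
    """Regress y on the covariates (plus an intercept) by ordinary least
    squares, then scale the residuals to mean 0 and standard deviation 1."""
    design = [[1.0] + list(row) for row in covariates]
    p = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(design, y)) for i in range(p)]
    beta = solve_linear_system(xtx, xty)
    resid = [yi - sum(b * v for b, v in zip(beta, r)) for r, yi in zip(design, y)]
    mean, sd = statistics.fmean(resid), statistics.stdev(resid)
    return [(e - mean) / sd for e in resid]


# Simulated stand-ins for age, sex designated at birth, and autism status.
rng = random.Random(0)
covs = [(rng.uniform(18, 70), rng.randint(0, 1), rng.randint(0, 1))
        for _ in range(200)]
raw = [0.2 - 0.002 * age + 0.05 * sex + 0.1 * aut + rng.gauss(0, 0.1)
       for age, sex, aut in covs]
adjusted = residualize_and_standardize(raw, covs)
```

After adjustment, the values are uncorrelated with each covariate by construction, so downstream correlations reflect variation not explained by age, designated sex, or autism status.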
The two mental health outcome values, Externalizing and Internalizing, are from the Adult Self Report (ASR) [23]. Internalizing problems is a composite score of anxiety, depression, and somatic complaints, and Externalizing problems is a composite score of aggressive, rule-breaking, and intrusive behavior. These values were also controlled for age, sex designated at birth, and autism diagnostic status by linear regression, and then standardized to a mean of 0 and a standard deviation of 1. Externalizing and Internalizing were significantly positively correlated with each other: ρ = 0.61, p < 0.05 (Figure 2C). The ASR values were also significantly positively correlated with the GSR values (Figure 2D). Binary Gender Diversity was more strongly correlated with Internalizing (ρ = 0.15, p < 0.05) than with Externalizing (ρ = 0.10, p < 0.05). Nonbinary Gender Diversity was also more strongly correlated with Internalizing (ρ = 0.18, p < 0.05) than with Externalizing (ρ = 0.12, p < 0.05).
2.2 Polygenic score correlations with gender diversity and mental health
We next assessed the relationships of the GSR and ASR values with twenty polygenic scores for behavior traits spanning four behavior domains [15]. The first domain reflects traits related to cognition and socioeconomic status: cognitive performance and educational attainment [26]. The second domain is personality and well-being traits, with three traits from the Big Five personality (extraversion [27], neuroticism [28], openness [29]), as well as depressive symptoms [30], loneliness [31], risky behavior [32], and subjective well-being (SWB) [33]. The third domain is sexuality and reproduction-related traits: these include age at first birth (i.e., age at parenthood) [34], number of children ever born (NEB; scored separately for men and women) [31], and non-heterosexual sexual behavior [18]. The last domain is neuropsychiatric conditions: ADHD [35], anorexia [36], autism [37], bipolar disorder [38], major depression [39], OCD [40], and schizophrenia [41]. Polygenic scores were controlled for the effects of the first twenty genetic principal components (to account for genetic ancestry effects), as well as age, sex designated at birth, and autism diagnostic status by linear regression. We computed correlation coefficients between the polygenic scores and the GSR values (Figure 3A) and ASR values (Figure 3B).
As expected, the non-heterosexual sexual behavior polygenic score was significantly positively correlated with Binary Gender Diversity: ρ = 0.09, p < 0.05. The non-heterosexual sexual behavior polygenic score was also positively correlated with Nonbinary Gender Diversity, although the correlation did not reach nominal significance: ρ = 0.05, p = 0.18. Strikingly, the cognitive performance polygenic score was significantly positively correlated with Binary Gender Diversity (ρ = 0.09, p < 0.05) and Nonbinary Gender Diversity (ρ = 0.11, p < 0.05), meaning that polygenic propensity for greater cognitive performance was associated with elevated binary and nonbinary gender diversity. The autism polygenic score was significantly positively correlated with Binary Gender Diversity: ρ = 0.08, p < 0.05. No other neuropsychiatric polygenic scores were significantly correlated with the GSR values. However, we did observe significant neuropsychiatric polygenic score correlations with the two ASR values. Externalizing was positively correlated with the ADHD polygenic score (ρ = 0.13, p < 0.05) and negatively correlated with the anorexia polygenic score (ρ = −0.08, p < 0.05) and the openness polygenic score (ρ = −0.08, p < 0.05). Internalizing was positively correlated with the depression polygenic score (ρ = 0.09, p < 0.05), as well as the depressive symptoms polygenic score (ρ = 0.10, p < 0.05) and the neuroticism polygenic score (ρ = 0.11, p < 0.05).
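As an illustration of the correlation step, the sketch below computes a rank-based correlation and an approximate two-sided p-value. It rests on assumptions not stated in the text: that ρ denotes a Spearman correlation, and that a Fisher z normal approximation is adequate for the p-value (the exact test used in the analysis may differ). All function names are hypothetical.

```python
import math
from statistics import NormalDist


def average_ranks(values):
    """Rank values from 1 to n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Spearman correlation: the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def approx_two_sided_p(rho, n):
    """Approximate two-sided p-value via the Fisher z transformation."""
    z = math.atanh(rho) * math.sqrt(n - 3)
    return 2 * (1 - NormalDist().cdf(abs(z)))


# Under this approximation, a correlation of 0.09 in 701 participants
# falls below the nominal 0.05 threshold, while 0.05 does not.
p_nominal = approx_two_sided_p(0.09, 701)
p_not_nominal = approx_two_sided_p(0.05, 701)
```

With N = 701, correlations of the size reported here sit close to the nominal significance boundary, which is why small differences in ρ separate significant from non-significant results.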
We performed the same correlations stratified by autism diagnostic status, with the results being comparable (Figure S1). Results were also comparable when performing the correlations only in the European genetic population group of N = 644 (Figure S2).
2.3 Interactions between gender diversity, mental health, and polygenic scores
To investigate whether polygenic risk and gender diversity interact in modeling mental health outcomes, we tested for interaction effects in linear models and also performed stratified correlations. We grouped participants into one of three groups for each polygenic score: high risk (upper quartile, N = 175), neutral (2nd and 3rd quartiles, N = 351), and low risk (lower quartile, N = 175). We compared GSR-ASR associations between the polygenic score upper group (coded as 1) and the polygenic score lower group (coded as 0), excluding the neutral risk group. We first formally tested for polygenic group-by-GSR interaction effects with the linear model ASR value ∼ GSR value + polygenic group + GSR value:polygenic group. The interaction terms are shown in Figure 4A, with nominally significant interactions (p < 0.05) indicated by a white asterisk. We then performed GSR-ASR correlations stratified by the polygenic group, and Figure 4B shows the ρ for GSR-ASR correlations for the upper quartile versus lower quartile polygenic risk groups. Figure 4C shows the stratified correlations for the two strongest polygenic group-by-GSR interaction effects.
We identified four significant polygenic group-by-GSR interactions, all involving the schizophrenia and depression polygenic scores. Within the entire cohort of N = 701, Nonbinary Gender Diversity and Internalizing are positively correlated: ρ = 0.18, p < 0.05. However, this apparent main effect appears to be driven by a context-specific interaction with genetic risk: in the subset at greatest schizophrenia polygenic risk (i.e., the upper quartile, N = 175), the correlation between Nonbinary Gender Diversity and Internalizing is ρ = 0.33, p < 0.05, while in the lower risk group (i.e., the lower quartile, N = 175), there is no correlation: ρ = 0.04, p = 0.56. The effect is similar when stratifying by the depression polygenic score: the high risk correlation is ρ = 0.29, p < 0.05, while in the low risk group the correlation is not significant: ρ = 0.09, p = 0.25.
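The grouping and interaction model can be sketched as follows. The data are simulated with an arbitrary effect size, the function names are hypothetical, and the quartile boundaries are assumed to be set by rank. The sketch relies on a standard property of ordinary least squares: with a binary group and a fully interacted model, the GSR value:polygenic group coefficient equals the GSR slope in the high group minus the GSR slope in the low group.

```python
import random


def quartile_groups(scores):
    """Label the lowest quarter 'low', the highest quarter 'high',
    and everyone in between 'neutral'."""
    n = len(scores)
    k = n // 4
    order = sorted(range(n), key=lambda i: scores[i])
    labels = ["neutral"] * n
    for i in order[:k]:
        labels[i] = "low"
    for i in order[n - k:]:
        labels[i] = "high"
    return labels


def ols_slope(x, y):
    """Slope of the simple least-squares regression of y on x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


def interaction_coefficient(gsr, asr, labels):
    """Interaction term of ASR ~ GSR + group + GSR:group with the group
    coded high = 1, low = 0 and the neutral group removed."""
    hi = [(g, a) for g, a, lab in zip(gsr, asr, labels) if lab == "high"]
    lo = [(g, a) for g, a, lab in zip(gsr, asr, labels) if lab == "low"]
    return ols_slope(*zip(*hi)) - ols_slope(*zip(*lo))


# Simulated data: GSR relates to ASR only in the high polygenic group.
rng = random.Random(1)
n = 701
pgs = [rng.gauss(0, 1) for _ in range(n)]
gsr = [rng.gauss(0, 1) for _ in range(n)]
labels = quartile_groups(pgs)
asr = [(0.5 * g if lab == "high" else 0.0) + rng.gauss(0, 0.5)
       for g, lab in zip(gsr, labels)]
interaction = interaction_coefficient(gsr, asr, labels)
```

For N = 701 this rank-based split reproduces the group sizes given above (175, 351, and 175).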
3 Discussion
Our analyses are the first to address the relationships of multidimensional gender diversity with mental health and genetics. We leveraged two novel, quantitative measures of gender diversity, Binary and Nonbinary Gender Diversity, from the Gender Self-Report (GSR) in a neurodiverse sample of N = 701 adults participating in the SPARK autism study. In this sample, we found greater gender diversity in female, autistic, and LGBTQ+ participants. Due to the structure of SPARK and study recruitment, we were only able to collect data from independent adults with autism or immediate family members of someone with autism (mostly parents). Therefore, the elevated gender diversity in the autistic subset should be interpreted with the caveat that the non-autistic participants were older and presumed to adhere to more traditional gender roles. Still, these results are in line with prior research that has shown the enrichment for gender diversity in autism [21]. Intriguingly, while our results showed higher gender diversity in the LGBTQ+ participants, many people who identify as cisgender also showed evidence of gender diversity, though not enough for them to report being transgender or more broadly gender diverse. This underscores the value of the GSR in capturing dimensional gender diversity beyond self-endorsed identities alone. The formation of gender identity is a complex and multi-factorial process [42] and is contextualized by numerous factors like time (e.g., age, generation), region, and culture. Additionally, the conceptualization of these identities requires understanding of how the self relates to other points of reference. This can be different for some autistic people who may struggle with understanding social and gender norms [43].
We correlated 20 behavior polygenic scores with the two GSR measures, and strikingly, the strongest association was cognitive performance being positively associated with both Binary and Nonbinary Gender Diversity (Figure 3A). This suggests cognitive capacity may be an important component in the development of more complex and nuanced gender identities. Beyond cognitive performance, we also found the non-heterosexual sexual behavior polygenic score to be positively correlated with Binary Gender Diversity. While gender identity and sexual orientation are distinct concepts, the non-heterosexual sexual behavior genome-wide association study (GWAS) is the most well-powered GWAS that is adjacent to gender diversity. Non-heterosexual behavior is associated with reduced number of children (i.e., reduced reproductive fitness) [44], so the population endurance of alleles associated with non-heterosexual behavior is an interesting conflict. Among heterosexuals, the non-heterosexual sexual behavior polygenic score was recently shown to be positively correlated with an increased number of partners, which presumably increases reproductive fitness [45]. Building on this, our results suggest gender diversity may be part of a pleiotropic ensemble of traits with adaptive advantages (e.g., cognitive performance).
We expected neuropsychiatric polygenic scores to also be positively correlated with the GSR measures, considering non-heterosexual sexual behavior shows positive genetic correlation with several neuropsychiatric conditions [18]. In light of this prior research, it was surprising that we found no significant positive correlations with GSR values and neuropsychiatric polygenic scores, aside from Binary Gender Diversity being positively correlated with the autism polygenic score. This suggests that, within the statistical power limits of our sample, gender diversity is not in strong pleiotropic relationships with adult-onset psychiatric disorders. Instead, in our sample greater gender diversity appears to have pleiotropic relationships with higher cognitive ability, non-heterosexual sexual behavior, and autism.
The lack of a genetic main effect linking psychiatric conditions and gender diversity, combined with our observation that the GSR values nevertheless show numerous significant correlations with poorer self-reported mental health (Figure 2D), prompted us to examine the possibility of a relationship between gender diversity and mental health that depends on genetic risk level (i.e., an interaction between polygenic risk and gender diversity). To accomplish this, we used the polygenic score for each psychiatric condition to stratify our sample into high and low risk groups (upper and lower quartiles of polygenic scores, respectively, each with N = 175; see Figure 4A). We observed dramatic differences in the correlations between the genetic risk groups when stratifying by the schizophrenia and depression polygenic scores (Figure 4B, C): the groups of high depression and schizophrenia polygenic risk had the strongest GSR-ASR correlations, whereas the correlations in the low-risk groups were absent (i.e., not nominally significant). This suggests that polygenic risk for depression and schizophrenia interacts with gender diversity (or environmental factors related to gender diversity such as discrimination and/or minority stress) in determining mental health outcomes. In other words, our findings provide evidence that the robustly observed relationship between gender diversity and mental health outcomes is not solely environmental or genetic, but rather a combination of the two. Specifically, an individual’s polygenic risk for psychiatric disorders determines the extent to which their gender diversity (and/or the adversity that gender diverse individuals may experience) impacts their mental health.
This observation could also be cast in terms of resilience: the high genetic risk group is less resilient against experienced adversity that might impact mental health, while the low risk group shows more resilience against poorer mental health as gender diversity and/or associated stressors increase. This interpretation is congruent with previous work that found that the individuals at high polygenic risk for depression were more likely to have more depressive symptoms while under stress, and those in the lowest depression polygenic risk group were least likely/most resilient under stress [46].
Our results and their interpretations have several limitations. Most genetic analyses (genome-wide association studies, heritabilities, polygenic scores) require large sample sizes due to the small effects of individual common variants. Consequently, our primary limitation is the small sample size, and we were therefore only powered to detect strong polygenic score correlations. With our sample size of N = 701, we had 80% power to detect correlations with a magnitude of at least |ρ| = 0.106. Additionally, age, sex designated at birth, and autism diagnostic status are entangled with other variables of interest. Autism diagnosis is confounded at the genetic level, as observed in previous work that showed that educational attainment [37] and cognitive performance [47] are positively genetically correlated with autism. However, we repeated our analyses stratified by autism diagnostic status and found the results to be comparable (Figure S1). Future work with larger samples should analyze the interplay between autism, sex designated at birth, and polygenic scores in their associations with gender diversity by performing sufficiently powered analyses stratified by autism and designated sex.
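The stated power threshold can be approximately reproduced with a short calculation. The sketch below assumes a two-sided test at α = 0.05 and the Fisher z approximation for a correlation coefficient; neither assumption is stated in the text, and the function name is hypothetical.

```python
import math
from statistics import NormalDist


def min_detectable_correlation(n, alpha=0.05, power=0.80):
    """Smallest correlation magnitude detectable with the given power in a
    two-sided test, using the Fisher z approximation."""
    nd = NormalDist()
    z_alpha = nd.inv_cdf(1 - alpha / 2)
    z_power = nd.inv_cdf(power)
    return math.tanh((z_alpha + z_power) / math.sqrt(n - 3))


r_full = min_detectable_correlation(701)      # full sample
r_quartile = min_detectable_correlation(175)  # one polygenic quartile
print(f"N = 701: {r_full:.3f}; N = 175: {r_quartile:.3f}")
```

Under these assumptions the full-sample value rounds to 0.106, matching the figure above. The same calculation for a single quartile of N = 175 gives a threshold of roughly 0.21, which indicates that the stratified correlations are powered only for considerably larger effects.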
In summary, our findings show that gender diversity, as captured by the Gender Self Report, has dimensional properties that share common genetic factors with cognitive performance, non-heterosexual sexual behavior, and autism. In agreement with previous studies, we find greater gender diversity to be correlated with poorer mental health, but this relationship is not due to shared genetic effects between psychiatric disorders and gender diversity. Rather, one’s polygenic background is a risk/resilience mechanism that interacts with gender diversity (and/or the adversity that comes with it) in determining mental health outcomes.
4 Materials and methods
4.1 Sample description
SPARK [20] is a U.S.-based nationwide autism study of over 300,000 participants, with genetic data available for many of the participants. Independent adults, with or without autism, were invited to participate in our Research Match. Those who agreed and consented to participate were asked to complete the Gender Self Report (GSR) [25], the Adult Self Report (ASR) [23], and additional questions regarding their sexual orientation, gender identity, and gender expression, yielding N = 818 respondents. After filtering for genetic data availability and quality control, the final sample size was N = 701. This study was approved by the University of Iowa Institutional Review Board (IRB #201611784). SPARK is approved by the Western IRB (#20151664).
4.2 Measures
Self-endorsed labels of gender identity and sexual orientation
Participants were able to select as many labels for gender identity and sexual orientation as they found applicable. Selections of nonbinary, demigender, gender fluid, third gender, agender, gender neutral, pangender, bigender, and gender queer were categorized as nonbinary/neutral. Cisgender and transgender were each categorized separately. Participants who did not endorse any of the listed gender identities were excluded from analyses using gender identity labels (N = 67 of 729). For sexual orientation, participants selecting lesbian, gay, bisexual, pansexual, homosexual, queer, and/or polysexual were grouped as LGBQ+ and heterosexual orientation was categorized separately. Participants who did not select any of the listed sexual orientation labels were excluded from analyses using sexual orientation labels (N = 73 of 701).
Gender Self Report (GSR)
The Gender Self-Report (GSR) itemset was developed through an iterative multi-input community driven process with autistic cisgender, autistic gender-diverse, and non-autistic cisgender and gender-diverse collaborators [25]; Open Science Framework Development Summary: https://osf.io/qh25d/?view_only=c0ce41d07bca4af1b792e074d51b7ded. A diversified recruitment approach was employed across seven separate recruitments (N = 1,654), including the current study’s recruitment (N = 818), to optimize the breadth of the GSR calibration sample and enrich the sample based on the following key characteristics: autism, gender-diverse identities (binary and nonbinary), the intersection of autism and gender-diverse identities, transition age/young adult age, and female designation at birth within the entire sample and within autism, specifically. This sampling approach resulted in an overall calibration sample that was 37.5% autistic, 32.6% gender diverse, and 38.9% cisgender sexual minority. A two-dimensional graded response model with a normal-mixture latent density adequately fit the data and yielded two factors. The two factors are labeled Female-Male Continuum and Nonbinary Gender Diversity. A transformation of the Female-Male Continuum values based on designated sex at birth produced Binary Gender Diversity values (i.e., representing the distance on the binary gender spectrum from an individual’s designated sex at birth). GSR calibration employed differential item functioning, an equity-based psychometric method to identify and reduce bias, in this case by age as well as autism status. Empirical reliability coefficients for response pattern expected a posteriori scores were 0.75 for Nonbinary Gender Diversity and 0.85 for Binary Gender Diversity.
GSR factors performed well across the following validation metrics: (1) construct validity; GSR factor values followed expected value patterns comparing gender identity subgroups, (2) convergent validity; GSR factor values correlated with existing gender-related measures and in expected directions, and (3) ecological validity; GSR factor values aligned with report of gender-affirming medical treatment request/receipt. The final GSR itemset is composed of 30 questions, which participants answered on a four-point scale: 1 = never true, 2 = sometimes true, 3 = often true, 4 = always true. In our genetic sample of N = 701 participants, these two GSR values were controlled for age in months, sex designated at birth, and autism diagnostic status by linear regression and then standardized to a mean of 0 and a standard deviation of 1. These values were then used as the phenotypes in the subsequent correlation analyses.
Adult Self Report (ASR)
The Adult Self Report (ASR) [23] is a well-established self-report questionnaire of 129 items assessing a range of adaptive behaviors and mental health outcomes. Participants respond to each item with one of three options: 0 = not true, 1 = somewhat or sometimes true, or 2 = very true or often true. From the N = 818, five participants were removed for having 12 or more (approximately 10%) missing ASR items. In the remaining N = 813, 0.2% of the data were missing, with no item having more than five missing data points. The missing data were imputed to the median. The two measures used in our analyses were Internalizing and Externalizing problems, which are summed syndrome subscales. Externalizing problems is composed of the aggression, rule-breaking, and intrusive behavior subscales (35 items total), and Internalizing problems is composed of the anxiety, depression, and somatic complaints subscales (37 items total). In our genetic sample of N = 701 participants, these two ASR values were controlled for age, sex designated at birth, and autism diagnostic status by linear regression and then standardized to a mean of 0 and a standard deviation of 1. These values were then used as the phenotypes in the subsequent correlation analyses.
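The ASR preprocessing steps (exclusion for missingness, median imputation, and summed subscale scores) can be illustrated with a toy example. The sketch assumes that the median is taken per item across retained participants, which the text does not specify; the function name, the 15-item toy questionnaire, and the item indices are hypothetical.

```python
import statistics


def clean_and_score(responses, subscale_items, max_missing=12):
    """responses: one list per participant with item values 0, 1, 2, or None
    for a missing item. Participants with max_missing or more missing items
    are dropped, remaining gaps are filled with the per-item median, and the
    subscale score is the sum over subscale_items."""
    kept = [r for r in responses if sum(v is None for v in r) < max_missing]
    n_items = len(kept[0])
    medians = [
        statistics.median([r[j] for r in kept if r[j] is not None])
        for j in range(n_items)
    ]
    imputed = [
        [medians[j] if v is None else v for j, v in enumerate(r)]
        for r in kept
    ]
    return [sum(r[j] for j in subscale_items) for r in imputed]


# Toy data: four usable participants and one with every item missing.
toy = [
    [1] * 15,
    [None] + [2] * 14,  # one missing item, imputed to the item median (1)
    [None] * 15,        # 15 missing items, so this participant is excluded
    [0] * 15,
    [1] * 15,
]
scores = clean_and_score(toy, subscale_items=[0, 1, 2])
print(scores)  # [3, 5, 0, 3]
```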
4.3 Genotype quality control and imputation
We used the genotype array data from the SPARK integrated whole-exome-sequencing (iWES1) 2022 Release and the SPARK whole-genome-sequencing (WGS) Releases 2, 3, and 4. iWES1 (N = 69,592) was quality controlled on release, including removing samples due to heterozygosity or high missingness, so we performed no further quality control before genotype imputation. iWES1 also provided genetic ancestry assignments based on the 1000 Genomes populations [24]. WGS Release 2 (N = 2,365), Release 3 (N = 2,871), and Release 4 (N = 3,684) were not quality controlled on release, so we performed quality control using PLINK [48] before genotype imputation. First, we removed participants from the WGS releases if they were in iWES1. Second, we removed variants with missingness greater than 0.1 and participants with missingness greater than 0.2. Third, we merged the three releases and then removed any participant whose heterozygosity (F statistic) was not within 3 standard deviations of the mean heterozygosity across the three releases. We then used the TopMed reference panel [49] to identify strand flips. The final sample size for WGS 2-4 was N = 8,152. iWES1 and WGS 2-4 were then imputed to the TopMed [49] reference panel using the Michigan Imputation Server [50], with the phasing and quality control steps included and with output restricted to variants with imputation quality r² > 0.3. After imputation, the variants were filtered to only the HapMap SNPs (N = 1,054,330 variants) with imputation quality r² > 0.8 using bcftools [51]. Variants were lifted over from hg38 to hg19 using the VCF-liftover tool (https://github.com/hmgu-itg/VCF-liftover), and the alleles were normalized to the hg19 reference genome. Finally, the files were merged and only variants with 0% missingness were retained (N = 914,328).
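The missingness and heterozygosity filters applied to the WGS releases can be illustrated in plain Python. This is a toy sketch of the filtering logic only, not the PLINK commands used in the study: the function names and data are hypothetical, the order of the variant and participant filters is an assumption, and the F statistics are taken as given rather than computed from genotypes.

```python
import statistics


def filter_missingness(genotypes, max_variant_missing=0.1,
                       max_sample_missing=0.2):
    """genotypes: dict mapping participant ID to a list of genotype calls
    (0, 1, 2, or None for a missing call). Variants are filtered first,
    then participants are filtered on the remaining variants."""
    ids = list(genotypes)
    n_variants = len(genotypes[ids[0]])
    keep_variants = [
        j for j in range(n_variants)
        if sum(genotypes[s][j] is None for s in ids) / len(ids)
        <= max_variant_missing
    ]
    filtered = {}
    for s in ids:
        calls = [genotypes[s][j] for j in keep_variants]
        if sum(c is None for c in calls) / len(calls) <= max_sample_missing:
            filtered[s] = calls
    return filtered, keep_variants


def heterozygosity_inliers(f_stats, n_sd=3.0):
    """Keep participants whose F statistic lies within n_sd standard
    deviations of the mean F statistic."""
    values = list(f_stats.values())
    mean, sd = statistics.fmean(values), statistics.stdev(values)
    return {s for s, f in f_stats.items() if abs(f - mean) <= n_sd * sd}


# Toy genotypes: variant 4 is missing in 10 of 31 participants, and
# participant s30 is missing two of the four remaining variants.
genos = {f"s{i}": [i % 3, (i + 1) % 3, 0, 1, 2] for i in range(31)}
for i in range(10):
    genos[f"s{i}"][4] = None
genos["s30"][0] = None
genos["s30"][1] = None
filtered, kept_variants = filter_missingness(genos)

# Toy F statistics: 30 typical values and one extreme value.
f_stats = {f"s{i}": 0.01 * (-1) ** i for i in range(30)}
f_stats["outlier"] = 0.9
inliers = heterozygosity_inliers(f_stats)
```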
4.4 Genetic ancestry
Genetic principal components (PCs) were calculated using the bigsnpr package [52], following the author’s recommendations [53] and tutorial: https://privefl.github.io/bigsnpr/articles/bedpca.html. In summary, we 1.) used the snp_plinkKINGQC function to identify and remove related participants at the KING threshold of 2^-3.5, 2.) performed principal component analysis using the bed_autoSVD function on just the unrelated participants, 3.) detected principal component outliers and removed them, 4.) recalculated the principal components, and 5.) projected the principal components onto the entire cohort using the bed_projectSelfPCA function. We chose not to remove participants based on their genetic ancestry and instead used genetic ancestry as a continuous (rather than categorical) variable, as per recent recommendations [54]. However, to assess the robustness of our results, we used the top 40 principal components and performed k-means clustering with K = 5 (for the five populations from 1000 Genomes [24]) and used the genetic ancestry labels from iWES1 to assign labels to the genetic population clusters. We then repeated the polygenic score correlations in the European subset, with the results provided in the Supplemental Information.
4.5 Polygenic score calculations
Polygenic scores were calculated using LDpred2 [55] and the bigsnpr tools [52] in R [56]. Because SPARK is family-based, an external LD reference based on 362,320 individuals in UK Biobank (provided by the authors of LDpred2) was used to calculate the genetic correlation matrix, estimate heritability, and calculate the infinitesimal beta weights. Polygenic scores were calculated from the following genome-wide association studies performed by the Psychiatric Genomics Consortium: ADHD (2019) [35], anorexia nervosa (2019) [36], autism (2019) [37], bipolar disorder (2021) [38], major depression (2019) [39], OCD (2018) [40], and schizophrenia (2020) [41]. Polygenic scores were calculated from genome-wide association studies performed by the Social Science Genetic Association Consortium for cognitive performance (2018) and educational attainment (2018) [26] and from the UK Biobank for non-heterosexual sexual behavior [18]. The public LDpred2 beta weights from the Polygenic Index Repository [57] were used to calculate polygenic scores for depressive symptoms [30], extraversion [27], loneliness [31], neuroticism [28], openness [29], risky behavior [32], subjective well-being [33], age at first birth [34], number of children ever born (men) [31], and number of children ever born (women) [31].
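For readers unfamiliar with polygenic scoring, the final step (after LDpred2 has produced adjusted per-variant weights) reduces to a weighted sum of effect-allele dosages. The Python sketch below shows only that step. It is not LDpred2 itself, the scoring in this study was done with bigsnpr in R, and the function name and variant identifiers are hypothetical.

```python
def polygenic_score(dosages, betas):
    """Weighted sum of effect-allele dosages for one participant.

    dosages: dict of variant id -> effect-allele dosage (0 to 2).
    betas:   dict of variant id -> adjusted per-variant weight.
    Only variants present in both inputs contribute to the score.
    """
    shared = dosages.keys() & betas.keys()
    return sum(betas[v] * dosages[v] for v in shared)

# Toy example with hypothetical variant ids and weights.
example_betas = {"rs1": 0.5, "rs2": -0.25, "rs3": 0.1}
example_dosages = {"rs1": 2, "rs2": 1, "rs4": 2}
example_score = polygenic_score(example_dosages, example_betas)
```

In the toy example, rs3 and rs4 are ignored because each appears in only one of the two inputs.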
From the N = 818 Research Match participants who completed the GSR, N = 813 also had sufficient ASR data, and N = 730 had genetic data. This subset of N = 730 was pruned to remove related participants using GCTA [58] with a relatedness threshold of 0.125, corresponding to approximately third degree relatives (N = 29 individuals removed). To control for confounding of the polygenic scores by genetic ancestry, we residualized the scores on the first 20 genetic principal components using linear regression. We additionally controlled for the effects of age in months, sex designated at birth, and autism diagnostic status by linear regression. Lastly, the polygenic scores were standardized to a mean of 0 and a standard deviation of 1.
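The residualization and standardization steps described above can be sketched in Python as follows. This is an illustrative reimplementation, not the authors' R code. It obtains ordinary least squares residuals by Gram-Schmidt orthogonalization of the covariates (with an intercept), and it assumes the sample standard deviation (n - 1 denominator) for standardization. The function names are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev

def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def residualize(y, covariates):
    """Residuals of y after linear regression on covariates plus an intercept.

    covariates: list of equal-length columns (for example, principal
    components, age in months, or 0/1 indicator variables).
    """
    n = len(y)
    basis = []  # orthonormal basis spanning the intercept and covariates
    for col in [[1.0] * n] + [list(c) for c in covariates]:
        v = [float(x) for x in col]
        for b in basis:
            coef = _dot(v, b)
            v = [vi - coef * bi for vi, bi in zip(v, b)]
        norm = sqrt(_dot(v, v))
        if norm > 1e-12:  # skip columns that are linearly dependent
            basis.append([vi / norm for vi in v])
    # Remove the projection of y onto each basis vector.
    r = [float(x) for x in y]
    for b in basis:
        coef = _dot(r, b)
        r = [ri - coef * bi for ri, bi in zip(r, b)]
    return r

def standardize(values):
    """Rescale to mean 0 and (sample) standard deviation 1."""
    mu, sd = mean(values), stdev(values)
    return [(v - mu) / sd for v in values]
```

A polygenic score column would be processed as standardize(residualize(score, covariate_columns)), where covariate_columns holds the principal components and the other covariates.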
4.6 Polygenic score analyses
Polygenic scores were correlated with the two GSR values (Binary and Nonbinary) and the two ASR values (Externalizing and Internalizing) using Spearman correlations. For the correlations stratified by polygenic risk, we assigned each participant to one of three groups for each polygenic score: the upper quartile (above the 75th percentile; N = 175), the middle two quartiles (N = 351), and the lower quartile (below the 25th percentile; N = 175). We then compared Spearman correlation coefficients between the upper and lower quartiles. We tested for polygenic group-by-GSR interaction effects in association with the ASR values with linear models: ASR value ∼ GSR value + polygenic group + GSR value:polygenic group. We used the pwr.r.test() function from the R pwr package [59] to determine statistical power for the correlations.
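The stratified correlation analysis described above can be sketched in Python using only the standard library. This is an illustration rather than the authors' R code. It assumes quartile cutoffs from statistics.quantiles with its default (exclusive) method, assumes that values equal to a cutoff fall in the outer group, and uses hypothetical function names.

```python
from math import sqrt
from statistics import mean, quantiles

def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks

def pearson(x, y):
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def quartile_groups(scores):
    """Label each score as 'lower', 'middle', or 'upper'."""
    q1, _, q3 = quantiles(scores, n=4)
    return ["lower" if s <= q1 else "upper" if s >= q3 else "middle"
            for s in scores]

def stratified_spearman(pgs, gsr, asr):
    """GSR-ASR Spearman correlation within the lower and upper PGS groups."""
    groups = quartile_groups(pgs)
    result = {}
    for label in ("lower", "upper"):
        idx = [i for i, g in enumerate(groups) if g == label]
        result[label] = spearman([gsr[i] for i in idx], [asr[i] for i in idx])
    return result
```

Under these assumptions, 701 distinct scores split into groups of 175, 351, and 175, which matches the group sizes reported above. The sketch does not cover the interaction models or the power calculation.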
Data and code availability
The SPARK genetic data can be obtained at SFARI Base: https://base.sfari.org
The SPARK Research Match data will be available to qualified, approved researchers through SFARI Base upon publication of this article. The code for all analyses can be found at https://research-git.uiowa.edu/michaelson-lab-public/gsr-polygenic-scores
Funding
This work was supported by the National Institutes of Health (MH105527 and DC014489 to JJM) and the National Institute of Mental Health (R01MH100028 to JFS), as well as grants from the Simons Foundation (SFARI 516716 to JJM), the Clinical and Translational Science Award (KL2TR001877 to JJM), the Fahs-Beck Fellow Grant to JFS, and the National Institutes of Health Predoctoral training grant (T32GM008629 to TRT). This work was also supported by the University of Iowa Hawkeye Intellectual and Developmental Disabilities Research Center (Hawk-IDDRC) through the Eunice Kennedy Shriver National Institute of Child Health and Human Development (P50HD103556).
Conflicts of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Author contributions
The study was designed by TRT, JFS, and JJM. The GSR scores were generated by JSY and JFS. The polygenic scores were generated by TRT and JJM. The analyses were performed by TRT, AJT, and JJM. The manuscript writing was done by all authors.
Supplementary information
Public summary
The way we act (behavior) is influenced by how our brains grow and function. Some of the ways our brains grow and function are influenced by our genes (DNA). Everyone has slightly different versions of DNA. This is a normal part of human diversity. Some of these DNA differences lead to differences in our brains. Brain differences can lead to behavior differences like in personality, intelligence, or mental health.
In this study, we asked whether our DNA is involved in gender identity and gender expression. We use the term “gender diversity” to mean differences in gender identity or gender expression. People with greater gender diversity are more likely to be transgender and/or nonbinary, although cisgender people can also have differences in gender expression. People with greater gender diversity are also more likely to be autistic, so we conducted this study with the help of SPARK participants (SPARK is the largest study of autism). Approximately half of our study participants were autistic adults, and the others were not autistic but do have an immediate relative who is autistic.
What we found
We found that thousands of DNA differences, when combined, are linked to differences in gender diversity. Specifically, we found that DNA differences linked to higher intelligence were also linked to greater gender diversity. We need to do more research to understand why DNA differences linked to higher intelligence are also linked to greater gender diversity.
Gender diverse people are at increased risk for stress due to discrimination. Scientists have repeatedly found that gender diverse people are at a greater risk for poorer mental health. In this study, we showed that greater gender diversity is linked to poorer mental health only among people who have high genetic risk for psychiatric conditions. Greater gender diversity was not linked to poorer mental health among those with low genetic risk for psychiatric conditions. This may mean that their DNA differences help them to be resilient against minority stress.
What our study does not show
We did not identify, nor attempt to identify, a “transgender” or “nonbinary” gene. We cannot predict a person’s gender diversity from their DNA. We found very little evidence that the DNA differences linked to major psychiatric conditions are also linked to gender diversity. Larger studies in the future may be able to identify weaker effects, but our study does not support a strong genetic connection between psychiatric conditions and gender diversity. Gender diversity is not purely genetic, but genetic factors do play a role.
Why this study is important
This is one of the first genetic studies of gender diversity. Many people say that gender is a purely social construct, with no biological factors involved. Our results show that the DNA differences linked to higher intelligence are also linked to greater gender diversity. We also show that whether minority stress translates into poorer mental health depends on the person’s level of genetic risk for psychiatric conditions. Ultimately, we believe this line of research will advance the health of gender diverse people through a greater understanding of how genetics interact with gender diversity in determining health outcomes.
Acknowledgments
We are grateful to our community advisory council, including members Elizabeth Graham, Sascha Klomp, and Jillian Nelson, for all of their feedback throughout the research and writing process. We are also grateful to all of the participants and families in SPARK, the SPARK clinical sites, and SPARK staff. We appreciate obtaining access to the SPARK genetic and phenotypic data on SFARI Base.
Footnotes
A new genetic data release allowed us to perform the analyses with a larger sample size. The language of the paper has also been updated.