STRUCTURED ABSTRACT
Importance The reported phenotypes of men with 47,XXY and 47,XYY syndromes include tall stature, multisystem comorbidities, and poor health-related quality of life (HRQoL). However, knowledge about these sex chromosome aneuploidy (SCA) conditions has been derived from studies in the <15% of patients who are clinically diagnosed and also lack diversity in age and genetic ancestry.
Objectives Determine the prevalence of clinically diagnosed and undiagnosed X or Y chromosome aneuploidy among men enrolled in the Million Veteran Program (MVP); describe military service metrics of men with SCAs; compare morbidity and mortality outcomes between men with SCA with and without a clinical diagnosis to matched controls.
Design Cross-sectional, case-control
Setting United States Veterans Administration Healthcare System
Participants Biologic males enrolled in the MVP biobank with genomic identification of an additional X or Y chromosome (cases); controls matched 1:5 on sex, age, and genetic ancestry
Main Outcome(s) and Measure(s) Prevalence of men with SCAs from genomic analysis; clinical SCA diagnosis; Charlson Comorbidity Index (CCI); rates of outpatient, inpatient, and emergency encounters per year; self-reported health outcomes; standardized mortality ratio (SMR)
Results An additional X or Y chromosome was present in 145 and 125 per 100,000 males in the MVP, respectively, with the highest prevalence among men with European and East Asian ancestry. At a mean age of 61±12 years, 74% of male veterans with 47,XXY and >99% with 47,XYY remained undiagnosed. Individuals with 47,XXY (n=862) and 47,XYY (n=747) had similar military service history, all-cause SMR, and age of death compared to matched controls. CCI and healthcare utilization were higher among individuals with SCA, while several measures of HRQoL were lower. Men with a clinical diagnosis of 47,XXY had higher healthcare utilization but lower comorbidity score compared to those undiagnosed.
Conclusion and Relevance One in 370 males in the MVP cohort have SCA, a prevalence comparable to estimates in the general population. While these men have successfully served in the military, they have higher morbidity and report poorer HRQoL with aging. Longer longitudinal follow-up of this sample will be informative for clinical and patient-reported outcomes, the role of ancestry, and mortality statistics.
KEY POINTS
Comparable to the general population, approximately 1 in 370 male veterans have a sex chromosome aneuploidy, but most are undiagnosed.
Men with X or Y chromosome aneuploidy successfully complete US miliary duty with similar service history compared to their 46,XY peers.
Medical comorbidities and healthcare utilization metrics are higher in male veterans with 47,XXY and 47,XYY during aging, however life expectancy is similar to matched controls.
INTRODUCTION
Approximately 1 in 400 males have an extra X or Y chromosome based on birth cohort studies, resulting in ∼5,000 males born in the US annually with 47,XXY (Klinefelter syndrome) or 47,XYY (Double Y syndrome).1 These male sex chromosome aneuploidy (SCA) conditions are associated with broad multisystem medical and neuropsychiatric comorbidities and certain physical traits such as tall stature; however, there is substantial phenotypic variability and lack of pathognomonic dysmorphic features, resulting in profound underdiagnosis with <25% of men with 47,XXY and <10% with 47,XYY receiving a diagnosis.1 With few exceptions, SCA research has been limited to the minority of individuals who have a clinical diagnosis, leading to substantial ascertainment bias. Therefore, the full spectrum of health outcomes for individuals with 47,XXY or 47,XYY is unknown.
British and Danish population registries have informed most of our knowledge about health outcomes for individuals with SCA conditions. These studies report a decreased life span and higher incidence rates for nearly all clinical diagnostic codes in males clinically diagnosed with SCA compared to sex- and age-matched controls.2-6 In addition to increased morbidity and mortality outcomes, a recent meta-analysis summarizing the published literature on quality of life (QoL) in men with 47,XXY (13 studies with a total of 829 participants) found a negative impact on physical, psychological, and social QoL domains.7 However, these studies only reflect men with a clinical SCA diagnosis and therefore cannot be generalized to all individuals with SCA. Recently, a study using the UK Biobank identified 356 men via genotype array to have X or Y chromosome polysomy.8 While the prevalence of 47,XXY and 47,XYY was lower than previous estimates, most of their findings aligned with prior data, including an 85% non-diagnosis rate and poorer overall health outcomes compared to men with 46,XY from both self-report and medical record data sources. A significant limitation, however is that epidemiological data and most clinical trial data in SCA conditions have been based on men with European ancestry. While Western European countries have the advantage of national healthcare systems that make epidemiological research more feasible, they lack the unique ethnic and socioeconomic diversity present in the US and other parts of the world. In addition, there is a paucity of data exploring how SCA affects biological aging and the development of age-related medical disorders.
The Veteran’s Health Administration (VA) Million Veteran Program (MVP) is a voluntary population-based study of genetic determinants of medical, neurological, and psychiatric illnesses and health outcomes for individuals who have served in the US military.9 We used the MVP cohort as a novel opportunity to address limitations in existing 47,XXY and 47,XYY research. The aims of our investigation were to 1) determine the prevalence of men in MVP with X or Y chromosome aneuploidy overall, both with and without a clinical diagnosis, 2) describe military service for men with SCAs compared to matched controls, and 3) compare morbidity and mortality outcomes between men with SCAs to matched controls, as well as between those with and without a clinical diagnosis.
METHODS
Overview of study design
The MVP began enrollment in 2011 and will soon have actively recruited 1 million participants across multiple VA facilities within the US. Methodological details for MVP have been described at length elsewhere.9 Veterans enrolled in MVP consent to provide a blood sample used for genotyping and biobanking, grant access to their VA electronic health record (EHR) for research, and complete the Baseline Survey and Lifestyle Survey.10 The MVP study was approved by the VA Central Institutional Review Board (IRB), and all individuals provided informed consent prior to study enrollment. This cross-sectional, case-control study analyzed genotype, EHR, and survey responses of 658,582 men with both genotype and VA EHR data available (Figure 1a). Survey response rates in our cohort were similar to MVP metrics as a whole.11
(a) Flow diagram of MVP participants included in the current analysis. (b) Scatterplot of sex chromosome array intensity over >20,000 X probes and 157 Y probes in all male MVP participants to identify men with an additional X or Y chromosome. Orange dots are males with 47,XXY, blue dots are males with 47,XYY, and black are males with 46,XY.
Genotyping, Ancestry, and SCA determination
Blood samples are banked at the VA Central Biorepository at the Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC). DNA is extracted from peripheral blood leukocytes and analyzed on the MVP custom Affymetrix Axiom Biobank 1.0 array consisting of 723,305 unique SNPs. Details on this platform, technical processes, and quality control measures have previously been published.12 Genotype data within the GenISIS workspace were used to determine genetic ancestry using the Harmonized Ancestry and Race/Ethnicity (HARE) method.13 To identify men with an additional X or Y chromosome, the sex chromosome dosage was estimated by array intensity measurements. Normalized probeset intensities (log-R ratios) were calculated over 20,170 non-pseudoautosomal X probesets and 157 non-pseudoautosomal Y probesets. Comparison of median log-R ratios on X and Y (mLRR-X and mLRR-Y) identified clear clusters corresponding to samples with 46,XX, 46,XY, 47,XXY and 47,XYY karyotypes. A final SCA call set was then obtained by thresholding mLRR-X and mLRR-Y values around each identified cluster (Figure 1b).
Analytic Cohort
Individuals with available VA EHR and male sex who were within the 47,XXY or 47,XYY clusters defined our cases for this analysis, while those in the 46,XY cluster were controls. Analyses were not stratified by race because presence of an additional sex chromosome was considered as a single genetic variant eliminating the possibility of population stratification due to genetic drift influencing statistical tests, therefore we proceeded with a matched analysis where each male with an extra X or Y chromosome was randomly matched with five 46,XY males on age at time of enrollment and ancestry based on HARE designation.14
Outcomes
We determined the prevalence of SCA based on the number of males with a second X or Y chromosome over the total number of males in MVP expressed as the number per 100,000 males, and then stratified by genetic ancestry. To determine whether an individual with a genetically identified 47,XXY or 47,XYY karyotype had a clinical diagnosis of SCA, we queried ICD diagnosis codes consistent with SCA (International Classification of Disease 9 758.7-81; ICD10 Q98.0-9). Participants with at least one diagnostic code were determined to be clinically diagnosed, and the age first diagnosis code was retrieved.
Descriptive military service metrics were obtained from a combination of VA records (e.g. branch, time period, discharge status) and survey responses (e.g. active duty, deployment, exposures). We calculated healthcare utilization metrics including the number of outpatient, emergency, and overnight hospital encounters per year followed, with duration of follow-up determined as the difference between the initial and most recent encounter dates. The primary measure of medical morbidity was the Charlson Comorbidity Index (CCI), a calculated metric incorporating multiple medical conditions obtained from the EHR validated to estimate 10-year survival.15 Additional measures of morbidity included body mass index (BMI) from EHR, self-reported number of prescription medications, and validated summary scores from the Baseline and Lifestyle surveys as previously described10 including the Veterans RAND 12 Item Health Survey (VR-12)16,17 as a measure of physical and mental health-related QoL, Medical Outcomes Study Cognitive Functioning-Revised Scale (MOS-Cog-R)18,19 as a subjective estimate of cognitive difficulties, and the Patient Health Questionnaire-4 (PHQ-4)20 and the Posttraumatic Stress Disorder Checklist (PCL)21 as estimates of psychological functioning.
Finally, we calculated the standardized mortality ratio (SMR) as the number of deaths in cases divided by the expected number of deaths based on matched controls within the same time period. The primary cause of death was recoded into one of 39 cause of death categories according to the National Center for Health Statistics manual.22
Analytic approach
Continuous variables are reported as means and standard deviations or medians with interquartile range; categorical variables are reported as frequency/number and percentage. We conducted outcome comparisons between cases (47,XXY or 47,XYY) versus their matched controls using chi-squared tests for categorical variables and t-tests for continuous variables. Similarly, we used chi-squared and t-tests to compare those with and without a clinical SCA diagnosis. To avoid type 1 error with a large sample size and multiple comparisons, we elected a conservative alpha of 0.005 for primary outcomes. Analyses were conducted using R v4.0.3 and Prism Graphpad v9.5.1.
RESULTS
Out of 595,612 genotyped males, we identified 862 with an additional X chromosome (47,XXY) and 747 with an extra Y chromosome (47,XYY), corresponding to a prevalence of 145 per 100,000 (1 in 690) males for 47,XXY and 125 per 100,000 (1 in 800) males for 47,XYY (Figure 1). The average age of enrollment was similar to the overall MVP male population; however, genetic ancestry was significantly different (Table 1; Figure 2). This resulted in an estimated prevalence over twice as high in East Asian and European ancestry groups compared to those of African or Hispanic (p<0.001). The minority of men with 47,XXY (n=226, 26%) had a diagnosis in their EHR consistent with a clinical diagnosis of Klinefelter syndrome, and only two men with 47,XYY had a diagnosis consistent with clinical ascertainment.
Prevalence of (a) 47,XXY, (b) 47,XYY, and (c) both 47,XXY and 47,XYY combined per 100,000 males in the MVP stratified by genetic ancestry. The bars and error bars represent the calculated prevalence and 95% confidence interval (CI) for each genetic ancestry group. The horizontal line and lighter shaded area represent the calculated prevalence and 95% CI overall. The darker shaded area within the bars represent those with a clinical diagnosis in their electronic medical record.
After matching on age and ancestry, military service metrics were very similar between SCA cases and controls and nearly all (>90%) earned an Honorable Discharge (Table 2). Men with 47,XXY and 47,XYY were 4.6 and 9.2cm taller than controls respectively (p<0.001). CCI-calculated 10-year survival estimate mean was 44% in 47,XXY and 39% in 47,XYY vs ∼57% in controls (p<0.001). The rates of outpatient, emergency, and inpatient encounters within the VA system were 25-50% higher for SCA groups compared to controls (p<0.001 for all). Similarly, men with SCAs self-reported significantly more hospitalizations, prescription medications, and reproductive and neurological health concerns than matched controls on survey responses, along with lower health-related QoL (Table 3).
Between enrollment and last follow up, 110 individuals with 47,XXY (12.8%) and 112 with 47,XYY (15%) died, compared to 11.5% of age-matched controls, yielding an SMR of 1.08 (95%CI 0.9-1.3) for 47,XXY and 1.30 (95%CI 1.1-1.6) for 47,XYY. Mean age of death for all groups was between 70-72 years with no statistical differences. Supplemental Table 1 details primary causes of death.
Men who were clinically diagnosed with Klinefelter syndrome (at an average age of 52) were younger with a lower CCI and more clinical encounters than those with 47,XXY on genotyping without a Klinefelter syndrome diagnosis in their EHR, (Supplemental Table 2). All other outcome measures from the EHR and from surveys were very similar between men with 47,XXY with and without a clinical diagnosis.
DISCUSSION
In this large, ethnically diverse population-based study, we found a prevalence of 270 per 100,000 for X or Y chromosome aneuploidy among male veterans, which is similar or even higher than previous prevalence estimates in other populations.1 These men successfully completed military service many years prior to study enrollment with similar service metrics to their 46,XY peers, however, they experienced higher medical morbidity and lower HRQoL with aging. Evidence of a clinical diagnosis of 47,XXY was associated with lower medical comorbidity than those undiagnosed. An additional novel finding of this study was that the prevalence of SCA differed by genetic ancestry with half as many affected from African and Hispanic ancestries compared to those of European or East Asian ancestry. This initial overview of men with 47,XXY and 47,XYY within the MVP highlights the exciting potential of the MVP as a unique data source for investigating the effects of SCA on health outcome and quality of life during aging.
Prior epidemiologic studies, including birth cohorts and several population-based cohorts, have estimated that 47,XXY occurs in 103-174/100,000 males and 47,XYY in 69-98/100,000 males.1,23 Our estimates in the MVP population are within the expected range for 47,XXY (145 per 100,0000) and slightly higher than previously reported for 47,XYY (125 in 100,000). Not surprisingly, the minority of these individuals had a clinical diagnosis in the EHR, and the age of diagnosis in this cohort was much older than previously reported.24 If men with a SCA had been formally diagnosed prior to enlistment, certain Department of Defense codes regarding medical, learning, and psychiatric disorders could have prevented them from serving in the US military. Based on our study, however, the observed prevalence rates, along with comparable service metrics, provide evidence that men with SCAs are as capable of successful military service as their 46,XY peers.
Our study represents one of the only estimations of the prevalence of 47,XXY and 47,XYY among men in a US population, and the only one to our knowledge inclusive of diverse ancestral groups. We did not expect to see a difference in prevalence based on ancestry, as the risk of chromosomal aneuploidy is assumed to be independent of genetic or social constructs because it is caused by random errors in meiosis.25 Parental age could be a confounder, as nondisjunction errors increase with age and parental age is higher in European ancestry compared to Hispanic or African ancestry.26 However, the role of paternal age in aneuploidy risk is less clear than that of maternal age,27 and while there was a significant difference in parental age between men with 47,XXY and controls, that difference was not present between men with 47,XYY and controls. Alternatively, the cross-ancestry differences in prevalence could be secondary to sociodemographic factors. Enlistment in the US military requires conformation to a system of expectations that may be less attainable for minoritized individuals who also have physical, socioemotional, and learning differences that can be associated with SCAs6 – essentially a double hit. Undoubtedly, further research is needed to understand the basis of the ancestorial differences in the prevalence of SCAs, including prevalence estimates in non-Western countries and other diverse populations.
Our results corroborate findings from multiple other studies that a supernumerary sex chromosome is associated with excess morbidity and higher healthcare utilization.2,4,6 It is promising that a clinical diagnosis of 47,XXY, even if quite delayed, was associated with a lower CCI, possibly due to closer medical follow-up and more preventive medicine interventions in the VA Healthcare System. Future studies should examine whether the relationship between clinical diagnosis, increased outpatient encounters, and reduced morbidity is causal. Additional investigation into the specific medical conditions contributing to the increased morbidity index is underway.
We also observed poorer self-reported health and wellbeing metrics among men with SCA. Lower QoL outcomes are widely reported in SCAs, however, to our knowledge this is the only study that has evaluated HRQoL in individuals who are presumably unaware of their genetic condition. Biased stereotypes for 47,XXY and 47,XYY can continue to influence expectations and reporting for subjective measures, both by participants and researchers, highlighting the valuable contribution of our findings to the existing literature. QoL outcomes did not differ between those with and without a clinical diagnosis in our study, suggesting a true association between SCA and QoL rather than clinical ascertainment as a confounder. In one study, health status was the most significant predictor of QoL in men with 47,XXY,28 suggesting QoL may be increased if chronic health were improved. In addition, many conditions contributing to morbidity in SCAs are preventable or at least treatable.6,29 We hypothesize that a more timely diagnosis could optimize medical care, improve health, and ultimately lead to better QoL with aging.
In contrast to studies from Europe, we did not find greater mortality in men with SCAs.3,5,30 This may reflect a limitation of the MVP dataset as we cannot capture deaths occurring prior to enrollment (mean age of 61 years), and the follow up duration is relatively short for mortality disparities to emerge. Given the estimated 10-year survival was significantly lower in SCA groups, longitudinal follow up will be important to clarify the impact of SCAs on mortality. In addition, while the life expectancy of men with SCA in the MVP was similar to prior reports (∼71 years), our controls did not live as long as men in the European cohorts, possibly reflecting poorer health among US Veterans.31
This is the first US, ancestrally diverse investigation of SCAs, with rigorous cohort identification through genotyping that overcomes the clinical diagnostic bias inherent in most SCA studies to date. Despite these strengths, we caution generalizing these findings to other populations. Young men with a known SCA diagnosis may have been dissuaded from enrolling in the military, and VHA clinicians may be less likely to suspect and test for a genetic condition in men who have served in the military (reverse diagnostic suspicion bias), affecting the prevalence and diagnostic estimates within. We relied on the VHA EHR to determine if an individual had a clinical diagnosis of 47,XXY or 47,XYY, which may underestimate the true diagnostic rate if a known diagnosis was withheld from the EHR for any reason, including patient non-disclosure. Furthermore, while the MVP cohort has provided a unique opportunity to study an aging population with men with SCAs, earlier outcomes may not be captured. Finally, although the rate of survey response was congruent with the MVP cohort as a whole (67%), it is possible that respondents may not be representative of the non-responders with 47,XXY/47,XYY. Recognizing these limitations, this robust study provides important, novel data for the SCA community and introduces the potential of this valuable resource.
CONCLUSIONS
In the largest and most ethnically diverse population-based study of 47,XXY and 47,XYY to date, we observed 270 out of 100,000 men in the MVP study has an extra X or Y chromosome, with an unexpected finding of a higher prevalence among European and East Asian ancestry groups. Our analysis establishes that men with SCA successfully complete military duty with similar service performance metrics to their 46,XY peers. Thus, we urge that neither these conditions nor their treatments be exclusionary to military service. Although several multiple health and QoL are negatively impacted by the presence of an additional X or Y chromosome, these results provide evidence for the first time that a clinical diagnosis may reduce comorbidity burden. These data add to the existing literature on male SCA with a favorable perspective on their contribution to society, and highlight the need to study preventive medicine and other interventions that may ameliorate the observed negative health sequelae observed during aging.
ACKNOWLEDGEMENTS
This research used data from the Million Veteran Program (MVP) [Office of Research and Development, Veterans Health Administration] and was supported by MVP022 award CX001727 (PI: Hauger) and MVP000 award (PI: Pyarajan). RLH was additionally funded by the VISN-22 VA Center of Excellence for Stress and Mental Health (CESAMH) and National Institute of Aging R01 grants AG050595, AG05064, AG065385. SMD received salary support during this work from NICHD K23HD092588. MSP was supported on awards 1F30CA247168-01 and T32CA067754. VCM received salary support during this work from IK2 CX001952 from the VA Clinical Science Research & Development Service. This publication does not represent the views of the Department of Veterans Affairs, the National Institutes of Health, or the United States Government.