Development and Validation of a Metabolite Index for Obstructive Sleep Apnea across Race/Ethnicities
====================================================================================================

* Ying Zhang
* Debby Ngo
* Bing Yu
* Neomi A. Shah
* Han Chen
* Alberto R. Ramos
* Phyllis C. Zee
* Russell Tracy
* Peter Durda
* Robert Kaplan
* Martha L. Daviglus
* Stephen S. Rich
* Jerome I. Rotter
* Jianwen Cai
* Clary Clish
* Robert Gerszten
* Bruce S. Kristal
* Sina A. Gharib
* Susan Redline
* Tamar Sofer

## Abstract

**Background** Obstructive sleep apnea (OSA) is a common disorder characterized by recurrent episodes of upper airway obstruction during sleep resulting in oxygen desaturation and sleep fragmentation, and associated with increased risk of adverse health outcomes. Metabolites are being increasingly used for biomarker discovery and evaluation of disease processes and progression. Studying metabolomic associations with OSA in a diverse community-based cohort may provide insights into the pathophysiology of OSA. We aimed to develop and replicate a metabolite index for OSA and identify individual metabolites associated with OSA.

**Methods and Findings** We studied 219 metabolites and their associations with the apnea hypopnea index (AHI) and with moderate-severe OSA (AHI≥15) in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (n=3507) using two methods: (1) association analysis of individual metabolites, and (2) least absolute shrinkage and selection operator (LASSO) regression to identify a subset of metabolites jointly associated with OSA, and develop a metabolite index for OSA. Results were validated in the Multi-Ethnic Study of Atherosclerosis (MESA) (n=475).
When assessing the associations with individual metabolites, we identified seven metabolites significantly positively associated with OSA in HCHS/SOL (FDR p<0.05), of which four associations (glutamate, oleoyl-linoleoyl-glycerol (18:1/18:2), linoleoyl-linoleoyl-glycerol (18:2/18:2), and phenylalanine) replicated in MESA (one-sided *p*<0.05). The OSA metabolite index, composed of 14 metabolites, was associated with a 50% increase in risk for moderate-severe OSA (OR=1.50 [95% CI: 1.21-1.85] per 1 SD of OSA metabolite index, *p*<.001) in HCHS/SOL and a 55% increase in risk (OR=1.55 [95% CI: 1.10-2.20] per 1 SD of OSA metabolite index, *p*=0.013) in MESA, both adjusted for demographics, lifestyle, and comorbidities. Similar albeit less significant associations were observed for AHI.

**Conclusions** We developed a metabolite index that replicated in an independent multi-ethnic dataset, demonstrating the robustness of the metabolomic-based OSA index to population heterogeneity. Replicated metabolite associations may provide insights into OSA-related molecular and metabolic mechanisms.

## Introduction

Obstructive sleep apnea (OSA) is a common disorder characterized by recurrent episodes of upper airway obstruction during sleep resulting in oxygen desaturation and sleep fragmentation (1). While highly prevalent in the population (2), OSA is severely under-diagnosed (3), especially in women (4). For example, only 1.3% of participants in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) and 7-15% of participants in the Multi-Ethnic Study of Atherosclerosis (MESA) reported a previous OSA diagnosis; in MESA, underdiagnosis was highest among race/ethnic minorities (5,6). The pathophysiology underlying OSA is multifaceted and includes obesity, craniofacial structure, upper airway neuronal control, ventilatory control, and inflammation, among others (7).
OSA is also associated with increased risk of adverse health outcomes, including hypertension, cardiovascular disease, diabetes, and early mortality (8-11).

Metabolomics is the study of small biochemical compounds at large scales (12). As a growing number of large metabolomics datasets become available, metabolites are being increasingly leveraged for biomarker discovery and evaluation of disease processes and progression (13,14). In particular, studying metabolite associations with OSA may improve our understanding of the pathophysiology of OSA. However, research in this area has been limited by the relatively small number of participants that have undergone both overnight sleep studies and metabolomic profiling, the lack of representativeness of the study subjects selected solely in clinical encounters, and the limited metabolite panels used by many targeted metabolomics platforms (15,16).

Here, we study metabolite associations with OSA in the HCHS/SOL, one of the largest multi-center cohorts with diverse participants from a rapidly growing minority group in the US: Hispanics/Latinos. We then test these associations for replication in MESA, a multi-ethnic community-based cohort.

Our study follows the design described in Figure 1. First, we study associations of individual metabolites with a measure of moderate to severe OSA, defined by the apnea hypopnea index AHI ≥ 15, as well as by associations with continuously measured AHI. Next, we develop a metabolite index for OSA by aggregating together multiple metabolites as a potential biomarker for OSA. We then validate its association with OSA in MESA. This process is repeated for a continuous measure of OSA severity, the AHI. Because OSA has different characteristics across sexes (17), in secondary analyses we additionally studied sex-specific metabolite indices.

[Figure 1:](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/F1) Study design flow chart. Definition of abbreviations: LASSO = least absolute shrinkage and selection operator, HCHS/SOL = the Hispanic Community Health Study/Study of Latinos, MESA = the Multi-Ethnic Study of Atherosclerosis.

## Methods

### The Hispanic Community Health Study/Study of Latinos

The HCHS/SOL is a community-based cohort study of 16,415 self-identified Hispanic/Latino persons from diverse Hispanic/Latino backgrounds (Mexicans, Puerto Ricans, Cubans, Central Americans, Dominicans, and South Americans) (18). Participants 18–74 years of age at their baseline examination were recruited through a stratified multistage area probability sample design from four communities: San Diego, California; Chicago, Illinois; The Bronx, New York; and Miami, Florida. The baseline examinations occurred in June 2008–July 2011 and included assessment of OSA using a validated Type 3 home sleep apnea test (ARES Unicorder 5.2; B-Alert, Carlsbad, CA) that measured nasal airflow, position, snoring, heart rate and oxyhemoglobin saturation (19) as previously described (5) in 14,440 individuals. The current cross-sectional analysis included 3,507 of these participants who also had blood assessed for metabolomic measures. Our primary analysis focused on a metabolomics-based biomarker for moderate or severe OSA defined as AHI≥15, with events defined as apneas or hypopneas with at least 50% cannula flow reduction for a minimum duration of 10 seconds, with ≥3% oxygen desaturation. The sleep study was conducted within the week following the baseline exam in which blood was collected and used for metabolite quantification.
The study was approved by the Institutional Review Boards at each field center, where all participants provided written consent, and by the study’s Reading and Data Coordinating Centers.

### HCHS/SOL Metabolomic Profiling

Fasting blood samples were collected at the baseline examination. Of all HCHS/SOL participants from the baseline examination who also had genetic data, 3,968 individuals were selected at random for metabolomics assessment. The samples were processed, and serum has been stored at −70°C since collection. The metabolomic profiling was conducted at Metabolon (Durham, NC) with the Discovery HD4 platform in 2017. Serum metabolites were quantified with an untargeted, liquid chromatography-mass spectrometry (LC-MS)-based quantification protocol (20). The platform captured a total of 1,136 metabolites, including 782 known and 354 unknown (unidentified) metabolites.

### The Multi-Ethnic Study of Atherosclerosis

MESA is a cohort study designed to study risk factors for clinical and subclinical cardiovascular diseases in four racial/ethnic groups (21). The study began in July 2000 and recruited 6,814 adults free of clinical CVD and aged 45–84 years from 6 centers: Baltimore, MD; Chicago, IL; Los Angeles, CA; New York, NY; Saint Paul, MN; and Winston-Salem, NC. Participants continued to be studied through subsequent follow-up exams. Of the 4,077 participants who attended Exam 5 (2010-2012), 2,261 participated in the MESA Sleep ancillary study (2010-2013). As reported before (6), participants in the Sleep Exam were generally similar to non-participants. OSA was assessed with Type II in-home polysomnography. AHI was defined as the total number of apneas and hypopneas with at least 30% reduction in the nasal flow signal and with ≥ 3% oxygen desaturation per hour of sleep. The median time interval between Exam 5 fasting plasma sample collection and MESA Sleep was 301 days (range 0–1,024 days).
Metabolomic data were collected on 1,000 randomly selected participants from Exam 5. Of these, 475 participants also had sleep measures and are included in this analysis. Local institutional review boards at all the participating institutions approved study protocols, and all participants gave written informed consent.

### MESA Metabolomic Profiling

Metabolite profiling was performed using liquid chromatography tandem mass spectrometry (LC-MS). Positive ion mode profiling of water-soluble metabolites and lipids was performed using LC-MS systems composed of Nexera X2 U-HPLC (Shimadzu Corp.; Marlborough, MA) units coupled to a Q Exactive mass spectrometer (Thermo Fisher Scientific; Waltham, MA). Polar metabolites were analyzed using hydrophilic interaction liquid chromatography (HILIC) and lipids were analyzed separately using reversed phase C8 chromatography as described in detail previously (22). Raw data were processed using TraceFinder 3.1 (Thermo Fisher Scientific; Waltham, MA) and Progenesis QI (Nonlinear Dynamics; Newcastle upon Tyne, UK). To measure organic acids and other intermediary metabolites in negative ionization mode, chromatography was performed using an Agilent 1290 infinity LC system equipped with a Waters XBridge Amide column, coupled to an Agilent 6490 triple quadrupole mass spectrometer. Metabolite transitions were assayed using a dynamic multiple reaction monitoring system. LC-MS data were analyzed with Agilent Masshunter QQQ Quantitative analysis software. Isotope labeled internal standards were monitored in each sample to ensure proper MS sensitivity for quality control. Pooled plasma samples were interspersed at intervals of 10 participant samples for standardization of drift over time and between batches. Additionally, separate pooled plasma was interspersed at every 20 injections to determine the coefficient of variation for each metabolite over the run. Peaks were manually reviewed in a blinded fashion to assess quality.
For each method, metabolite identities were confirmed using authentic reference standards or reference samples. Metabolites with poor peak quality and coefficients of variation greater than 30% averaged across batches were removed from analysis.

### Quality control of metabolites in HCHS/SOL and MESA

Missing metabolite values were addressed as described in **Supplemental Figure 1**. In our discovery sample (HCHS/SOL), we excluded individuals with more than 25% missing metabolite levels, and metabolites with missing values for 75% or more individuals. For metabolites with more than 25% and less than 75% missing values, values were dichotomized as “observed” and “unobserved”. For metabolites with less than 25% missing values, we imputed the missing values using the minimum observed value of the metabolite in the sample, under the assumption that metabolites were not observed due to a technical detection limit.

Because our study design includes validation analysis, we focused on metabolites available in both HCHS/SOL and MESA. Before any quality control (QC) methods were applied, 231 HCHS/SOL metabolites were mapped to 294 metabolites in MESA. The mapping of MESA to HCHS/SOL metabolites as well as to RefMet ID was done at the Clish Lab. MESA had multiple metabolites matched to a single HCHS/SOL metabolite in multiple instances because the same metabolite was measured via more than one platform used by MESA. In some cases, a single metabolite appears as two highly correlated ion features in the same MESA platform (e.g., some neutral lipids were measured as both sodium and ammonium adducts). Therefore, a single feature was mapped to the metabolite in HCHS/SOL while the redundant features were dropped according to the following principles: features with redundant ions were excluded; features with lower missingness and lower skewness were prioritized.

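
The missingness rules described earlier in this subsection can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' pipeline (the study's analyses were run in R): the function name `qc_metabolite`, the encoding of missing values as NaN, and the handling of a metabolite with exactly 25% missing values (imputed here) are assumptions that the text does not specify.

```python
import numpy as np

def qc_metabolite(values, low=0.25, high=0.75):
    """Apply the per-metabolite missingness rule to one column (NaN = missing).

    Returns a (status, processed_values) pair. Thresholds follow the text;
    treating exactly 25% missing as "continuous" is an assumption.
    """
    values = np.asarray(values, dtype=float)
    missing = np.isnan(values)
    frac_missing = missing.mean()

    if frac_missing >= high:
        # Missing in 75% or more of individuals: drop the metabolite.
        return "excluded", None
    if frac_missing > low:
        # Between 25% and 75% missing: observed (1) versus unobserved (0).
        return "dichotomized", (~missing).astype(int)
    # Otherwise: impute with the minimum observed value, assuming values
    # are missing because they fell below a technical detection limit.
    return "continuous", np.where(missing, np.nanmin(values), values)

status_a, out_a = qc_metabolite([1.0, 2.0, np.nan, 4.0, 5.0])   # 20% missing
status_b, out_b = qc_metabolite([1.0, np.nan, np.nan, 4.0])     # 50% missing
status_c, out_c = qc_metabolite([np.nan, np.nan, np.nan, 1.0])  # 75% missing
```

Minimum-value imputation is only reasonable under the stated detection-limit assumption; values missing for other reasons would call for a different strategy.
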

After removing 60 such redundant features in MESA and applying QC methods based on the missingness in HCHS/SOL, 219 HCHS/SOL metabolites were mapped to 219 metabolites in MESA. **Table S1** provides the list of the 294 initially matched metabolites cross-referenced by RefMet ID and metabolite annotations including HMDB IDs provided by Metabolon, along with details regarding metabolite-specific QC resulting in the final list of one-to-one matched metabolites. The serum concentration values of the matched metabolites that were treated as continuous were rank-normalized.

Because MESA was a validation study, we only evaluated metabolites that were identified in the association analysis in HCHS/SOL. The missing data for these metabolites were always <25%, so we treated these as continuous variables and imputed missing values with the minimum observed value in the MESA sample.

### Statistical analysis

Association analyses were based on three conceptual regression models: Model 1 (i.e., primary model) adjusted for demographic variables: age, sex, study center, Hispanic background (Mexicans, Puerto Ricans, Cubans, Central Americans, Dominicans, and South Americans and other/multi), and body mass index (BMI) in HCHS/SOL; age, sex, study site (two sites with low sample sizes were combined), race (White versus “Non-White”, which consists of Hispanic, Black and Chinese Americans), and BMI in MESA. Model 2 (i.e., lifestyle model) adjusted for demographic and lifestyle variables: alcohol use, cigarette use, total physical activity (MET-min/day), and diet (Alternative Healthy Eating Index 2010) in HCHS/SOL; alcohol use and cigarette use in MESA.
Model 3 (i.e., lifestyle and comorbidity model) adjusted for demographic, lifestyle and comorbidity variables: indicators for diabetes, hypertension, fasting insulin, fasting glucose, HOMA-IR, HDL, LDL, total cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure in HCHS/SOL; hypertension, fasting glucose, HDL, LDL, cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure in MESA. All models used the same set of individuals with a complete set of sleep and covariate measures.

### Association analysis between individual metabolites and OSA and AHI

We tested the association of each of 219 metabolites (both continuous and dichotomized metabolites) with moderate-severe OSA and AHI in the HCHS/SOL. Each metabolite was the exposure in either linear or logistic regression (depending on the outcome) for each model. We accounted for the HCHS/SOL study design (sampling and clustering) and obtained representative effect estimates using survey regression implemented in the R survey package (4.0) (23). We controlled the false discovery rate (FDR) using the Benjamini-Hochberg procedure (24) and determined significant associations as those with FDR p-value<0.05. In the replication analysis, we tested the associations of these metabolites with OSA in logistic regression and with AHI in linear regression in MESA in Models 1-3. We computed one-sided p-values guided by the estimated direction of associations in the HCHS/SOL (25), and determined replication if the one-sided p-value was <0.05.

### LASSO regression for constructing metabolite indices

We applied a LASSO logistic regression with moderate to severe OSA versus no or mild OSA (for brevity “OSA versus no OSA”), and linear regression with AHI log transformed as log(AHI+1), adjusted for the covariates in Model 1 in HCHS/SOL. We included 209 continuous metabolites (not including the 10 dichotomized metabolites).

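
For orientation, the penalized objective behind a LASSO logistic regression of this kind can be written out explicitly. The notation is introduced here for illustration and is not taken from the paper: $y_i \in \{0,1\}$ is the OSA indicator, $z_i$ the Model 1 covariates, $x_i$ the $p = 209$ continuous metabolites, and $\lambda$ the tuning parameter. Leaving the covariate coefficients $\gamma$ unpenalized is an assumption; the text states only that the model was adjusted for these covariates.

$$
(\hat{\alpha}, \hat{\gamma}, \hat{\beta}) = \arg\min_{\alpha,\gamma,\beta} \left\{ -\frac{1}{n} \sum_{i=1}^{n} \left[ y_i \eta_i - \log\left(1 + e^{\eta_i}\right) \right] + \lambda \sum_{j=1}^{p} |\beta_j| \right\}, \qquad \eta_i = \alpha + z_i^{\top}\gamma + x_i^{\top}\beta
$$

For the AHI model, the negative log-likelihood term would be replaced by the squared-error loss $\frac{1}{2n} \sum_{i=1}^{n} \left( \log(\mathrm{AHI}_i + 1) - \eta_i \right)^2$. Metabolites with $\hat{\beta}_j \neq 0$ are the ones "selected" by the LASSO.
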

We selected the LASSO tuning parameter by minimizing the misclassification error for OSA (assuming probability of 0.5 is the cutoff for “predicting” OSA), and the prediction error for AHI, in a 10-fold cross-validation. Metabolite indices were calculated as a weighted sum of the (normalized) metabolite serum concentrations, with weights being the metabolite coefficients from the LASSO regression. To validate the metabolite index association with OSA/AHI, we constructed the indices in MESA using the weights from the LASSO regression conducted in HCHS/SOL, then assessed their associations with the corresponding sleep traits (i.e., OSA, AHI) in Models 1-3. In the secondary analyses we assessed potential sex differences via: (1) sex-stratified association analysis for sex-specific metabolite indices (constructed based on sex-stratified LASSO), as well as (2) sex-stratified association analysis for metabolite indices constructed based on combined sexes. We also assessed the associations between metabolite indices quartiles and the corresponding sleep traits. All analyses were done in R 3.6.3. The glmnet package (3.0) (26) in R was used for the LASSO logistic regression.

## Results

### Participant characteristics

**Table 1** characterizes the HCHS/SOL analytic sample and target population. The HCHS/SOL cohort included 3,507 participants, with a mean age of 41.72 years (SD=15.4), of whom 50.7% were female and 10.2% were classified with moderate or severe OSA (AHI ≥15). Participants with OSA were more likely to be male, had a higher BMI, and were less likely to be never smokers compared to those without OSA. Individuals with OSA were also more likely to have co-morbidities: 60% had hypertension and 34.3% had diabetes, compared to 27.8% hypertension and 16.9% diabetes in those without OSA. **S2 Table** characterizes the 475 MESA participants with metabolomics, sleep, and required measured covariates from the validation dataset.
MESA participants were older (mean 68.45 years, SD=9.33), with a higher proportion of females (56.2%). Reflecting their older age, more MESA participants had moderate or severe OSA (46.7%) compared to HCHS/SOL.

[Table 1.](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/T1) Characteristics of Hispanics/Latinos represented by the HCHS/SOL study population

### Metabolite associations with OSA and AHI

**Table 2** shows the odds ratios corresponding to 7 metabolites associated (FDR *P* < 0.05) with OSA in HCHS/SOL (adjusted for age, sex, BMI, study site/center, and race/ethnicity). Figure 2A and **S3 Table** show the lifestyle adjusted model and comorbidity adjusted model results. Among the 7 mapped metabolites in MESA, 4 metabolite associations had one-sided p-values < 0.05: glutamate, phenylalanine, linoleoyl-linoleoyl-glycerol (18:2/18:2), and oleoyl-linoleoyl-glycerol (18:1/18:2), all of which were associated with increased risks for OSA (Figure 2B). These associations also had FDR p-value<0.05 in MESA. No metabolite was associated with AHI after multiple testing correction in the HCHS/SOL (**Supplemental Figure 2**). Also, no metabolite associations were detected at the FDR<0.05 level in minimally adjusted sex-stratified analyses in HCHS/SOL.

[Figure 2.](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/F2) Heatmap showing estimated odds ratios of metabolites with significant FDR-adjusted p-value for OSA in HCHS/SOL and in MESA. \* indicates FDR *p*<0.05. In HCHS/SOL: Model 1 adjusted for age, gender, center, background, and BMI. Model 2 adjusted for age, gender, center, background, BMI, alcohol use, smoking status, physical activity and diet (AHEI 2010).
Model 3 adjusted for age, gender, center, background, BMI, alcohol use, smoking status, physical activity, diet, T2DM, hypertension, fasting glucose, fasting insulin, HOMA-IR, HDL, LDL, total cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure. In MESA: Model 1 adjusted for age, gender, BMI, study site (sites WFU and UCLA are combined due to low cell count), and race. Model 2 adjusted for age, gender, BMI, study site, race, alcohol use and smoking status. Model 3 adjusted for age, gender, BMI, study site, race, alcohol use, smoking status, hypertension indicator, fasting glucose, HDL, LDL, cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure.

[Table 2.](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/T2) Single metabolite associations with FDR-adjusted p-value <0.05 in the discovery step

### LASSO regression for joint selection and estimation of metabolite associations with OSA and AHI in HCHS/SOL

We used a LASSO regression to select a set of metabolites that were jointly associated with sleep apnea traits in the HCHS/SOL. Among the 14 metabolites identified for OSA by LASSO (Figure 3), there were one carbohydrate, one peptide, three amino acids, three lipids, three nucleotides, and three cofactors and vitamins, among which biliverdin and serine were unique to the OSA metabolite index while the rest were shared between the OSA and AHI metabolite indices. Forty-one metabolites were identified for AHI (**Supplemental Figure 3**), among which 29 metabolites were unique to the AHI metabolite index. Metabolites identified by sex-specific LASSO are provided in the supplemental materials (**Supplemental Figures 4-7**).

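
As described in the Methods, each metabolite index is a weighted sum of normalized metabolite levels, with the LASSO coefficients estimated in HCHS/SOL as weights, and the same weights are reused in MESA. A minimal Python sketch of that calculation follows. The weights and metabolite values are made-up placeholders, not the published coefficients; the function names are hypothetical; and scaling the index to unit SD within each cohort is an assumption about how the "per 1 SD" effects were obtained.

```python
import numpy as np

def metabolite_index(levels, weights):
    """Weighted sum of normalized metabolite levels.

    levels: (participants x metabolites) array; weights: LASSO coefficients.
    """
    return np.asarray(levels, dtype=float) @ np.asarray(weights, dtype=float)

def per_sd(index):
    """Center and scale an index to unit SD (assumed step for per-SD effects)."""
    index = np.asarray(index, dtype=float)
    return (index - index.mean()) / index.std(ddof=1)

# Hypothetical weights for three metabolites (a zero weight means "not selected").
weights = np.array([0.5, -0.2, 0.0])

# Hypothetical rank-normalized levels in a discovery and a validation cohort.
discovery = np.array([[1.0, 2.0, 3.0], [0.0, 1.0, 0.0], [-1.0, 0.5, 2.0]])
validation = np.array([[0.2, -0.4, 1.0], [1.5, 0.3, -0.7]])

raw_discovery = metabolite_index(discovery, weights)
index_discovery = per_sd(raw_discovery)
# The validation cohort reuses the discovery weights; nothing is re-estimated.
index_validation = per_sd(metabolite_index(validation, weights))
```

Because the weights are fixed in the discovery cohort, the association of the index with OSA in the validation cohort is an out-of-sample test rather than a refit.
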

[Figure 3:](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/F3) Coefficients for metabolites selected by LASSO (OSA model) in HCHS/SOL. Blue: coefficient<0; Red: coefficient>0. Definition of abbreviations: OSA = moderate to severe obstructive sleep apnea (AHI≥15), LASSO = least absolute shrinkage and selection operator.

### Metabolite indices associations with OSA and AHI in HCHS/SOL and in independent validation in MESA

We constructed OSA and AHI metabolite indices in both HCHS/SOL and MESA based on the weights from the LASSO regressions conducted in HCHS/SOL. **Table 3** provides overall and sex-stratified results. As expected by construction, the metabolite indices were associated with their phenotypes in HCHS/SOL (OSA metabolite index OR: 1.50; 95% CI 1.30-1.74; *P* < 0.001; AHI metabolite index beta=2.12 per 1 SD of the index, 95% CI 1.43-2.81; *P* < 0.001) after adjustment for demographic covariates (i.e., age, sex, BMI, study center, and Hispanic/Latino background); the associations persisted after additional adjustment for lifestyle (i.e., alcohol usage, smoking status, physical activity and diet) and comorbidity (i.e., diabetes, hypertension, fasting glucose, fasting insulin, HOMA-IR, HDL, LDL, total cholesterol, triglycerides, systolic blood pressure, diastolic blood pressure) covariates (OSA metabolite index OR: 1.50; 95% CI 1.21-1.85; *P* < 0.001; AHI metabolite index beta=1.73 per 1 SD of the index, 95% CI 0.96-2.50; *P*<0.001).
In the validation dataset, the OSA metabolite index was also associated with an increased odds ratio for OSA (OR: 1.41; 95% CI 1.13-1.77; *P* = 0.003) (**Table 3**) when adjusted for demographic (i.e., age, sex, BMI, study site, and race) covariates and remained similar when adjusted for lifestyle (i.e., alcohol usage and smoking status) and comorbidity (i.e., hypertension, fasting glucose, HDL, LDL, cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure) covariates (OR: 1.55; 95% CI 1.10-2.20; *P* = 0.013). When compared with the lowest quartile of the OSA metabolite index, the top quartile showed a more than two-fold increase in risk for OSA [OR: 2.53; 95% CI (1.37-4.70); *P* = 0.003] in the primary model and remained significant when adjusted for lifestyle and comorbidity covariates [OR: 2.63; 95% CI (1.14-6.14); *P* = 0.024] (see Figure 3 and **Supplemental Figure 3**). AHI metabolite index associations had higher p-values in MESA, compared to OSA metabolite index associations (**Table 3**). The AHI metabolite index was only replicated in women, adjusted for demographic, lifestyle, and comorbidity covariates in MESA. Notably, both OSA metabolite index and AHI metabolite index associations with their phenotypes were stronger when evaluated in women compared to the overall sample, in both HCHS/SOL and MESA.

[Table 3.](http://medrxiv.org/content/early/2022/05/26/2022.05.25.22275577/T3) Estimated associations between OSA and AHI metabolite indices and their respective phenotypes, in HCHS/SOL and MESA

Results from the secondary analysis of sex-specific metabolite indices are also provided in **Table 3**. Only the female-specific AHI metabolite index replicated in MESA in Models 1 and 2, but its association with AHI in women was weaker than that of the AHI metabolite index trained on the full HCHS/SOL sample, both in terms of p-value and of estimated effect size.

## Discussion

In this paper, we leveraged metabolomics data from two large, diverse community-based cohorts to derive the first metabolite index for moderate to severe OSA, as well as to identify individual metabolites associated with this disorder. We studied 219 metabolites and their associations with OSA and AHI in the HCHS/SOL using two methods: (1) analysis of individual metabolites, and (2) LASSO to identify a subset of metabolites that jointly predict OSA or AHI. Then, we studied the associations in an independent validation study, MESA. We used the results from LASSO to derive OSA and AHI metabolite indices. In MESA, the OSA metabolite index was significantly associated with moderate to severe OSA; e.g., individuals in the highest quartile for the OSA metabolite index had more than 2-fold increased odds of moderate to severe OSA, both in the derivation sample and in an independent sample that varied by ancestry, age, and OSA prevalence, with findings that persisted after adjusting for multiple lifestyle and health covariates. In contrast, when modeling AHI as a continuous measure of sleep apnea, weaker associations were observed, except for the top quartiles among females. In the association analysis of individual metabolites, seven metabolites were associated with OSA in HCHS/SOL (FDR *p* < 0.05), of which four associations replicated in MESA.

We implemented two approaches to study the metabolomic correlates of sleep apnea phenotypes: LASSO and individual-metabolite regression analysis. These approaches serve different purposes: single metabolite regression highlights individual metabolites associated with sleep apnea phenotypes without adjustment for other metabolites, while LASSO estimates the combined effect of multiple metabolites. A metabolite identified as associated with sleep apnea phenotypes individually may not be selected by the LASSO analysis (e.g., if a different metabolite correlated with it was selected by LASSO).
Thus, it is not surprising that two of the replicated metabolites identified in single metabolite analysis were not selected by LASSO. Similarly, a specific metabolite may be selected by LASSO, but not by individual metabolite analysis due to adjustment for multiple testing, which is not done in LASSO analysis.

The OSA metabolite index, constructed based on the LASSO results, is a single index using multiple blood biomarkers which together reflect the biochemical differences in the blood of individuals with and without OSA. The OSA metabolite index also showed stronger association with OSA than any single metabolite, in both the discovery and validation study, consistent with the influence of multiple metabolites in OSA pathophysiology (see **Table 3, S3 Table**). While future work is needed to study whether the metabolite index can be used in the clinic for OSA screening or clinical management of OSA patients, it is clearly useful from statistical and epidemiological standpoints, as analysis based on the metabolite index had evidently higher statistical power than analyses testing single metabolite associations. Therefore, it may also enable additional studies of the pathophysiology of OSA and its relationship with other cardiometabolic conditions.

Four metabolites were replicated in the single metabolite regression: glutamate, oleoyl-linoleoyl-glycerol (18:1/18:2) (DAG(36:3)), linoleoyl-linoleoyl-glycerol (18:2/18:2) (DAG(36:4)) and phenylalanine, among which glutamate and phenylalanine remained positively associated with OSA after adjusting for lifestyle and comorbidities in addition to basic demographics. Both metabolites have some previous evidence linking them to OSA or other sleep disorders, as well as cardiometabolic diseases, and suggest that elevations in glutamate and phenylalanine can be investigated as biomarkers for adverse outcomes in patients with OSA.

High plasma glutamate has been associated with total and visceral adiposity, dyslipidemia and insulin resistance (27), as well as increased risks for incident cardiovascular disease (28), type 2 diabetes (29), and subclinical atherosclerosis (30), independent of established cardiovascular risk factors. The positive association between glutamate and OSA observed in our study may suggest a shared metabolomic profile for OSA and other cardiometabolic phenotypes. Previous studies in rats and humans showed that more frequent sleep apneas led to increased levels of glutamate in the brain (31-33). Glutamate is the major excitatory neurotransmitter in the brain, and modulates brain energy metabolism and neuronal synaptic plasticity. Although the blood-brain barrier prevents plasma glutamate from freely permeating into the central nervous system (34), when glutamate increases in the brain, the brain-to-blood glutamate efflux also increases, as suggested by the correlation between the peripheral glutamate and the central nervous system glutamate levels (35). These findings support further research addressing the roles of peripheral and central glutamate in the pathophysiology of OSA.

Phenylalanine is an essential aromatic amino acid that plays a key role in the biosynthesis of other amino acids and of neurotransmitters, including dopamine and norepinephrine. Studies have shown that plasma phenylalanine level can be elevated due to inflammation (36,37), and inflammation is a common finding in OSA (38). One mechanism for the increased levels of phenylalanine may be through chronic hypoxia, which has been reported to increase both systemic and cerebral delivery of phenylalanine (39). This is in line with our evidence: peripheral phenylalanine was elevated among individuals with moderate to severe OSA (**S3 Table**).
A prior lab-based study that measured a few metabolites over the course of sleep reported that phenylalanine levels decreased less overnight among patients with OSA compared to controls (40). Phenylalanine levels were also reported to be elevated after sleep restriction (41). The downstream effects of phenylalanine have been studied more widely in other chronic conditions, with reports of associations with elevated pro-inflammatory cytokines, suppressed immunity and increased mortality among heart failure patients (42), and with more rapid telomere shortening consistent with accelerated aging (43). Recent studies reported elevations of phenylalanine associated with adverse COPD outcomes (44), which was postulated to reflect muscle breakdown and respiratory muscle insufficiency (45). Elevated plasma phenylalanine was shown to be a strong predictor for cardiovascular risk (46) and a biomarker, mediator and potentially therapeutic target for pulmonary hypertension (47). Further research on the association of OSA and phenylalanine may help identify the roles of hypoxia, inflammation, and muscle function in the pathophysiology of OSA and cardiometabolic conditions.

Our study has also shown that increased plasma levels of two diacylglycerols (DAGs), DAG(36:3) and DAG(36:4), were associated with moderate to severe OSA. Altered lipid metabolism is often observed among OSA patients (48-50); specifically, intermittent hypoxemia can stimulate lipolysis, increasing free fatty acid levels (51). Abnormalities in lipid metabolism may result in liver and skeletal muscle fat deposition, exacerbating OSA through inflammatory or muscle-related pathways (52). Therefore, the associations with these diacylglycerols may reflect mechanisms by which OSA-related hypoxemia alters fatty acid metabolism.
These associations, however, did not replicate in the validation study once adjusted for comorbidities, suggesting that the associations might be confounded by cardiometabolic conditions that often accompany OSA.

Estimated OSA associations, in both LASSO and single metabolite analysis, were generally stronger than AHI associations. Potential reasons are the variability of AHI and potentially non-linear metabolomic associations with AHI. Notably, non-linearity (i.e., a threshold effect) was previously shown for the association of AHI with hypoxemia and sympathetic nervous system activation burden (53).

Sex differences have been increasingly reported among OSA patients (17). Population-based studies have shown that the overall OSA prevalence is higher among men than women (54), while metabolic syndrome and cardiovascular conditions are more strongly associated with OSA among female patients compared to males (55,56). Indeed, when metabolite indices developed in both sexes were tested for their associations with OSA and AHI, we observed stronger associations among women than men in the MESA validation dataset (**Table 3**). However, compared to the metabolite indices developed in combined sex strata, sex-specific metabolite indices had weaker associations with OSA/AHI in the validation dataset (weaker effect size estimates and higher p-values), which may be the result of more misclassification when using a smaller dataset for discovery.

In addition to pointing to novel individual metabolites that play a role in OSA, our metabolite indices also showed moderately strong associations with OSA in an external sample, despite the marked differences in race/ethnicity, age, and OSA prevalence compared to the discovery sample. This supports the overall generalizability of the metabolite index across diverse populations.
Nonetheless, the utility of a 2-fold increased risk of OSA among individuals in the highest metabolite index quartile in helping to screen or triage patients for more comprehensive testing will need to be formally evaluated, potentially combining metabolite data with other information, such as OSA-related symptoms, to improve screening.

A strength of our study is that our population-based sample is more than 10-fold larger than those of prior studies (57), includes a high proportion of ethnic/racial minorities who have been under-represented in research but are at increased risk for adverse health outcomes, and is more representative of the general population, which includes large numbers of undiagnosed individuals. We used rigorous statistical methods, adjusted for a large number of lifestyle and health covariates, and were able to replicate the main findings despite large differences in our discovery and validation populations, which suggests relatively strong associations and generalizability of the metabolite associations with OSA.

There are several limitations in this study. The blood sample collection and sleep test were concurrent in HCHS/SOL and up to one year apart in MESA, allowing for cross-sectional associations, but limiting our ability to discern causal pathways. Although over 1000 metabolites were quantified in both populations, fewer than 300 metabolites were matched between the two platforms (after quality control, only 219 distinct metabolites were mapped). We limited our study to only the matched metabolites to allow for replication testing, which strengthens the results and conclusions. MESA metabolomic profiling was conducted using three complementary platforms measuring several broad classes of small molecules; therefore, multiple chemical compounds from MESA were mapped to the same metabolite in HCHS/SOL.
We chose a single feature to map to any HCHS/SOL metabolite based on a set of rules related to the presence of redundant ions, data missingness and skewness. In the future, other more optimal approaches may be proposed and studied. Some associations failed to replicate in MESA, potentially due to heterogeneity between populations and low power in MESA, which had a small sample size. Finally, the definitions of AHI differed slightly in the two studies: the 3% oxyhemoglobin desaturation criterion was applied to hypopneas only in MESA, whereas, owing to differences in the recording montage, it was applied to all respiratory events in HCHS/SOL.

In summary, we used two large datasets from population-based, multi-ethnic cohort studies to study metabolomic associations with OSA. We developed a metabolite index that replicated across datasets, and had a statistically significant association with OSA even after adjustment for cardiometabolic comorbidities. In future work we will study the possibility of developing an OSA screening tool based on this metabolite index. Four metabolites also replicated in an independent dataset, of which one was previously implicated in OSA, and two were previously connected to sleep disorders. Collectively, our findings support the utility of metabolomic profiling to generate metabolite indices of sleep apnea in racially diverse populations, and to inform understanding of OSA’s pathophysiology.

## Data Availability

MESA and HCHS/SOL data are available through application to dbGaP according to the study specific accessions. MESA phenotypes are available in: phs000209, and HCHS/SOL phenotypes: phs000810. MESA metabolomics data will become available on dbGaP via the "NHLBI TOPMed: Multi-Ethnic Study of Atherosclerosis (MESA)" project (accession phs001416).
HCHS/SOL metabolomics data are available via data use agreement with the HCHS/SOL Data Coordinating Center at the University of North Carolina at Chapel Hill; see the collaborators website: [https://sites.cscc.unc.edu/hchs/](https://sites.cscc.unc.edu/hchs/).

## Funding

The research was partially supported by NIH NHLBI R35 HL135818. Support for metabolomics data was graciously provided by the JLH Foundation (Houston, Texas). The Hispanic Community Health Study/Study of Latinos was carried out as a collaborative study supported by contracts from the National Heart, Lung, and Blood Institute (NHLBI) to the University of North Carolina (N01-HC65233), University of Miami (N01-HC65234), Albert Einstein College of Medicine (N01-HC65235), Northwestern University (N01-HC65236), and San Diego State University (N01-HC65237). The following Institutes/Centers/Offices contribute to the HCHS/SOL through a transfer of funds to the NHLBI: National Center on Minority Health and Health Disparities, the National Institute on Deafness and Other Communication Disorders, the National Institute of Dental and Craniofacial Research, the National Institute of Diabetes and Digestive and Kidney Diseases, the National Institute of Neurological Disorders and Stroke, and the Office of Dietary Supplements. The authors thank the staff and participants of HCHS/SOL for their important contributions.
MESA and the MESA SHARe project are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support for MESA is provided by contracts HHSN268201500003I, N01-HC-95159, N01-HC-95160, N01-HC-95161, N01-HC-95162, N01-HC-95163, N01-HC-95164, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-000040, UL1-TR-001079, UL1-TR-001420. MESA Family is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support is provided by grants and contracts R01HL071051, R01HL071205, R01HL071250, R01HL071251, R01HL071258, R01HL071259 and by the National Center for Research Resources, Grant UL1RR033176. The MESA Sleep Ancillary study was funded by NIH-NHLBI R01HL098433. The provision of genotyping data was supported in part by the National Center for Advancing Translational Sciences, CTSI grant UL1TR001881, and the National Institute of Diabetes and Digestive and Kidney Disease Diabetes Research Center (DRC) grant DK063491 to the Southern California Diabetes Endocrinology Research Center. Molecular data for the Trans-Omics in Precision Medicine (TOPMed) program was supported by the National Heart, Lung and Blood Institute (NHLBI). Metabolomics for “NHLBI TOPMed: Multi-Ethnic Study of Atherosclerosis (MESA)” (phs001416) was performed at Broad Institute and Beth Israel Metabolomics Platform (HHSN268201600038I). Core support including centralized genomic read mapping and genotype calling, along with variant quality metrics and filtering, was provided by the TOPMed Informatics Research Center (3R01HL-117626-02S1; contract HHSN268201800002I). Core support including phenotype harmonization, data management, sample-identity QC, and general program coordination was provided by the TOPMed Data Coordinating Center (R01HL-120393; U01HL-120393; contract HHSN268201800001I).

## Supporting information

**S1 Table.
Mapping between MESA and HCHS/SOL Metabolomics Platform.** The mapping was carried out before any quality control steps (e.g., assessment of missingness). 294 compounds in MESA were mapped to 231 metabolites in HCHS/SOL. 60 redundant features in MESA were dropped based on the principles described in the methods section. After discarding 12 metabolites with high missingness (≥75%) in HCHS/SOL, 209 metabolites with low missingness (<25%) were imputed and 10 metabolites with medium missingness (25-75%) were dichotomized in the HCHS/SOL metabolomics platform. We applied the same QC steps to the corresponding metabolites in MESA, i.e., we discarded, imputed, and dichotomized the same metabolites in MESA as in HCHS/SOL.

**S2 Table. Characteristics of the analytic sample from the MESA study population.** * Baseline hypertension is defined as systolic blood pressure (SBP) > 130 mmHg, diastolic blood pressure (DBP) > 80 mmHg or any history of antihypertensive medication intake.

**S3 Table. Single metabolite associations with FDR-adjusted p-value <0.05 in HCHS/SOL and MESA.** Model 1 adjusted for age, gender, BMI, study center, and Hispanic/Latino background in HCHS/SOL, and adjusted for age, gender, BMI, study site (sites WFU and UCLA are combined due to low cell count), and race in MESA. Model 2 adjusted for all covariates in model 1; in HCHS/SOL, model 2 additionally adjusted for alcohol usage, smoking status, physical activity and diet; in MESA, model 2 additionally adjusted for alcohol usage and smoking status. Model 3 adjusted for all covariates in model 2; in HCHS/SOL, model 3 additionally adjusted for diabetes, hypertension, fasting glucose, fasting insulin, HOMA-IR, HDL, LDL, total cholesterol, triglycerides, systolic blood pressure, diastolic blood pressure; in MESA, model 3 additionally adjusted for hypertension indicator, fasting glucose, HDL, LDL, cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure.
Metabolites marked with * were identified based on accurate mass data, retention time and mass spectrometry but not reference standards. Therefore, their identification is not as robust as that of metabolites without *.

**S4 Table. Estimated associations between OSA and AHI metabolite indices and their respective phenotypes in quartiles, in HCHS/SOL and MESA.** * per 1 SD increase in metabolite index. Model 1 adjusted for age, gender, BMI, study center, and Hispanic/Latino background in HCHS/SOL, and adjusted for age, gender, BMI, study site (sites WFU and UCLA are combined due to low cell count), and race in MESA. Model 2 adjusted for all covariates in model 1; in HCHS/SOL, model 2 additionally adjusted for alcohol usage, smoking status, physical activity and diet; in MESA, model 2 additionally adjusted for alcohol usage and smoking status. Model 3 adjusted for all covariates in model 2; in HCHS/SOL, model 3 additionally adjusted for diabetes, hypertension, fasting glucose, fasting insulin, HOMA-IR, HDL, LDL, total cholesterol, triglycerides, systolic blood pressure, diastolic blood pressure; in MESA, model 3 additionally adjusted for hypertension indicator, fasting glucose, HDL, LDL, cholesterol, triglycerides, systolic blood pressure and diastolic blood pressure.

## Acknowledgement

The authors thank the staff and participants of HCHS/SOL and MESA for their important contributions. We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed.

* Received May 25, 2022.
* Revision received May 25, 2022.
* Accepted May 26, 2022.
* © 2022, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1. Somers VK, White DP, Amin R, Abraham WT, Costa F, Culebras A, et al.
Sleep apnea and cardiovascular disease: an American Heart Association/American College of Cardiology Foundation Scientific Statement from the American Heart Association Council for High Blood Pressure Research Professional Education Committee, Council on Clinical Cardiology, Stroke Council, and Council on Cardiovascular Nursing. In collaboration with the National Heart, Lung, and Blood Institute National Center on Sleep Disorders Research (National Institutes of Health). Circulation. 2008 Sep 2;118(10):1080–1111.
2. Peppard PE, Young T, Barnet JH, Palta M, Hagen EW, Hla KM. Increased prevalence of sleep-disordered breathing in adults. Am J Epidemiol. 2013 May 1;177(9):1006–1014.
3. Kapur V, Strohl KP, Redline S, Iber C, O’Connor G, Nieto J. Underdiagnosis of sleep apnea syndrome in U.S. communities. Sleep Breath. 2002 Jun;6(2):49–54.
4. Lastra AC, Attarian HP. The persistent gender bias in the diagnosis of obstructive sleep apnea. Gender and the Genome. 2018 Apr;2(2):43–48.
5. Redline S, Sotres-Alvarez D, Loredo J, Hall M, Patel SR, Ramos A, et al. Sleep-disordered breathing in Hispanic/Latino individuals of diverse backgrounds. The Hispanic Community Health Study/Study of Latinos. Am J Respir Crit Care Med. 2014 Feb 1;189(3):335–344.
6. Chen X, Wang R, Zee P, Lutsey PL, Javaheri S, Alcántara C, et al. Racial/Ethnic Differences in Sleep Disturbances: The Multi-Ethnic Study of Atherosclerosis (MESA). Sleep. 2015 Jun 1;38(6):877–888.
7. Redline S. Genetics of obstructive sleep apnea. Principles and practice of sleep medicine. Elsevier; 2011. p. 1183–1193.
8. Peppard PE, Young T, Palta M, Skatrud J. Prospective study of the association between sleep-disordered breathing and hypertension. N Engl J Med. 2000 May 11;342(19):1378–1384.
9. Kasai T, Floras JS, Bradley TD. Sleep apnea and cardiovascular disease: a bidirectional relationship. Circulation. 2012 Sep 18;126(12):1495–1510.
10. Vgontzas AN, Papanicolaou DA, Bixler EO, Hopper K, Lotsikas A, Lin HM, et al. Sleep apnea and daytime sleepiness and fatigue: relation to visceral obesity, insulin resistance, and hypercytokinemia. J Clin Endocrinol Metab. 2000 Mar;85(3):1151–1158.
11. Heilbrunn ES, Ssentongo P, Chinchilli VM, Oh J, Ssentongo AE. Sudden death in individuals with obstructive sleep apnoea: a systematic review and meta-analysis. BMJ Open Respir Res. 2021 Jun;8(1).
12. Idle JR, Gonzalez FJ. Metabolomics. Cell Metab. 2007 Nov;6(5):348–351.
13. Ussher JR, Elmariah S, Gerszten RE, Dyck JRB. The Emerging Role of Metabolomics in the Diagnosis and Prognosis of Cardiovascular Disease. J Am Coll Cardiol. 2016 Dec 27;68(25):2850–2870.
14. Hunter WG, Kelly JP, McGarrah RW, Khouri MG, Craig D, Haynes C, et al. Metabolomic profiling identifies novel circulating biomarkers of mitochondrial dysfunction differentially elevated in heart failure with preserved versus reduced ejection fraction: evidence for shared metabolic impairments in clinical heart failure. J Am Heart Assoc. 2016 Jul 29;5(8).
15. Ferrarini A, Rupérez FJ, Erazo M, Martínez MP, Villar-Álvarez F, Peces-Barba G, et al. Fingerprinting-based metabolomic approach with LC-MS to sleep apnea and hypopnea syndrome: a pilot study. Electrophoresis. 2013 Oct;34(19):2873–2881.
16. Engeli S, Blüher M, Jumpertz R, Wiesner T, Wirtz H, Bosse-Henck A, et al. Circulating anandamide and blood pressure in patients with obstructive sleep apnea. J Hypertens. 2012 Dec;30(12):2345–2351.
17. Won CHJ, Reid M, Sofer T, Azarbarzin A, Purcell S, White D, et al. Sex differences in obstructive sleep apnea phenotypes, the multi-ethnic study of atherosclerosis. Sleep. 2020 May 12;43(5).
18. Lavange LM, Kalsbeek WD, Sorlie PD, Avilés-Santa LM, Kaplan RC, Barnhart J, et al. Sample design and cohort selection in the Hispanic Community Health Study/Study of Latinos. Ann Epidemiol. 2010 Aug;20(8):642–649.
19. Westbrook PR, Levendowski DJ, Cvetinovic M, Zavora T, Velimirovic V, Henninger D, et al. Description and validation of the apnea risk evaluation system: a novel method to diagnose sleep apnea-hypopnea in the home. Chest. 2005 Oct;128(4):2166–2175.
20. Evans AM, DeHaven CD, Barrett T, Mitchell M, Milgram E. Integrated, nontargeted ultrahigh performance liquid chromatography/electrospray ionization tandem mass spectrometry platform for the identification and relative quantification of the small-molecule complement of biological systems. Anal Chem. 2009 Aug 15;81(16):6656–6667.
21. Bild DE, Bluemke DA, Burke GL, Detrano R, Diez Roux AV, Folsom AR, et al. Multi-Ethnic Study of Atherosclerosis: objectives and design. Am J Epidemiol. 2002 Nov 1;156(9):871–881.
22. Paynter NP, Balasubramanian R, Giulianini F, Wang DD, Tinker LF, Gopal S, et al. Metabolic predictors of incident coronary heart disease in women. Circulation. 2018 Feb 20;137(8):841–853.
23. Lumley T. survey: analysis of complex survey samples [Internet]. 2020 [cited 2021 Nov 18]. Available from: [https://cran.r-project.org/web/packages/survey/index.html](https://cran.r-project.org/web/packages/survey/index.html)
24. Benjamini Y. Discovering the false discovery rate. J Royal Statistical Soc B. 2010 Aug 5;72(4):405–416.
25. Sofer T, Heller R, Bogomolov M, Avery CL, Graff M, North KE, et al. A powerful statistical framework for generalization testing in GWAS, with application to the HCHS/SOL. Genet Epidemiol. 2017 Apr;41(3):251–258.
26. Friedman J, Hastie T, Tibshirani R, Narasimhan B, Tay K, Simon N. Package glmnet. J Stat Softw. 2010 Feb 21;33(1).
27. Cheng S, Rhee EP, Larson MG, Lewis GD, McCabe EL, Shen D, et al. Metabolite profiling identifies pathways associated with metabolic risk in humans. Circulation. 2012 May 8;125(18):2222–2231.
28. Zheng Y, Hu FB, Ruiz-Canela M, Clish CB, Dennis C, Salas-Salvado J, et al. Metabolites of glutamate metabolism are associated with incident cardiovascular events in the PREDIMED prevención con dieta mediterránea (PREDIMED) trial. J Am Heart Assoc. 2016 Sep 15;5(9).
29. Liu X, Zheng Y, Guasch-Ferré M, Ruiz-Canela M, Toledo E, Clish C, et al. High plasma glutamate and low glutamine-to-glutamate ratio are associated with type 2 diabetes: Case-cohort study within the PREDIMED trial. Nutr Metab Cardiovasc Dis. 2019 Oct;29(10):1040–1049.
30. Lehn-Stefan A, Peter A, Machann J, Schick F, Randrianarisoa E, Heni M, et al. Elevated Circulating Glutamate Is Associated With Subclinical Atherosclerosis Independently of Established Risk Markers: A Cross-Sectional Study. J Clin Endocrinol Metab. 2021 Jan 23;106(2):e982–e989.
31. Fung SJ, Xi M-C, Zhang J-H, Sampogna S, Yamuy J, Morales FR, et al. Apnea promotes glutamate-induced excitotoxicity in hippocampal neurons. Brain Res. 2007 Nov 7;1179:42–50.
32. Macey PM, Sarma MK, Nagarajan R, Aysola R, Siegel JM, Harper RM, et al. Obstructive sleep apnea is associated with low GABA and high glutamate in the insular cortex. J Sleep Res. 2016 Aug;25(4):390–394.
33. Macey PM, Sarma MK, Prasad JP, Ogren JA, Aysola R, Harper RM, et al. Obstructive sleep apnea is associated with altered midbrain chemical concentrations. Neuroscience. 2017 Nov 5;363:76–86.
34. Hawkins RA. The blood-brain barrier and glutamate. Am J Clin Nutr. 2009 Sep;90(3):867S–874S.
35. Alfredsson G, Wiesel FA, Tylec A. Relationships between glutamate and monoamine metabolites in cerebrospinal fluid and serum in healthy volunteers. Biol Psychiatry. 1988 Apr 1;23(7):689–697.
36. Murr C, Grammer TB, Meinitzer A, Kleber ME, März W, Fuchs D. Immune activation and inflammation in patients with cardiovascular disease are associated with higher phenylalanine to tyrosine ratios: the Ludwigshafen risk and cardiovascular health study. J Amino Acids. 2014 Feb 10;2014:783730.
37. Strasser B, Sperner-Unterweger B, Fuchs D, Gostner JM. Mechanisms of Inflammation-Associated Depression: Immune Influences on Tryptophan and Phenylalanine Metabolisms. Current topics in behavioral neurosciences. 2017;31:95–115.
38. Huang T, Goodman M, Li X, Sands SA, Li J, Stampfer MJ, et al. C-reactive Protein and Risk of OSA in Four US Cohorts. Chest. 2021 Jun;159(6):2439–2448.
39. Dahl RH, Berg RMG, Taudorf S, Bailey DM, Lundby C, Christensen M, et al. Transcerebral exchange kinetics of large neutral amino acids during acute inspiratory hypoxia in humans. Scand J Clin Lab Invest. 2019 Dec;79(8):595–600.
40. Kiens O, Taalberg E, Ivanova V, Veeväli K, Laurits T, Tamm R, et al. The effect of obstructive sleep apnea on peripheral blood amino acid and biogenic amine metabolome at multiple time points overnight. Sci Rep. 2021 May 24;11(1):10811.
41. Bell LN, Kilkus JM, Booth JN, Bromley LE, Imperial JG, Penev PD. Effects of sleep restriction on the human plasma metabolome. Physiol Behav. 2013 Oct 2;122:25–31.
42. Chen W-S, Wang C-H, Cheng C-W, Liu M-H, Chu C-M, Wu H-P, et al. Elevated plasma phenylalanine predicts mortality in critical patients with heart failure. ESC Heart Fail. 2020 Oct;7(5):2884–2893.
43. Eriksson JG, Guzzardi M-A, Iozzo P, Kajantie E, Kautiainen H, Salonen MK. Higher serum phenylalanine concentration is associated with more rapid telomere shortening in men. Am J Clin Nutr. 2017 Jan;105(1):144–150.
44. Ubhi BK, Riley JH, Shaw PA, Lomas DA, Tal-Singer R, MacNee W, et al. Metabolic profiling detects biomarkers of protein degradation in COPD patients. Eur Respir J. 2012 Aug;40(2):345–355.
45. Kuo W-K, Liu Y-C, Chu C-M, Hua C-C, Huang C-Y, Liu M-H, et al. Amino Acid-Based Metabolic Indexes Identify Patients With Chronic Obstructive Pulmonary Disease And Further Discriminates Patients In Advanced BODE Stages. Int J Chron Obstruct Pulmon Dis. 2019 Sep 30;14:2257–2266.
46. Würtz P, Havulinna AS, Soininen P, Tynkkynen T, Prieto-Merino D, Tillin T, et al. Metabolite profiling and cardiovascular event risk: a prospective study of 3 population-based cohorts. Circulation. 2015 Mar 3;131(9):774–785.
47. Tan R, Li J, Liu F, Liao P, Ruiz M, Dupuis J, et al. Phenylalanine induces pulmonary hypertension through calcium-sensing receptor activation. Am J Physiol Lung Cell Mol Physiol. 2020 Dec 1;319(6):L1010–L1020.
48. Lebkuchen A, Carvalho VM, Venturini G, Salgueiro JS, Freitas LS, Dellavance A, et al. Metabolomic and lipidomic profile in men with obstructive sleep apnoea: implications for diagnosis and biomarkers of cardiovascular risk. Sci Rep. 2018 Jul 26;8(1):11270.
49. Geovanini GR, Wang R, Weng J, Jenny NS, Shea S, Allison M, et al. Association between Obstructive Sleep Apnea and Cardiovascular Risk Factors: Variation by Age, Sex, and Race. The Multi-Ethnic Study of Atherosclerosis. Annals of the American Thoracic Society. 2018;15(8):970–977.
50. Nadeem R, Singh M, Nida M, Waheed I, Khan A, Ahmed S, et al. Effect of obstructive sleep apnea hypopnea syndrome on lipid profile: a meta-regression analysis. J Clin Sleep Med. 2014 May 15;10(5):475–489.
51. Chopra S, Rathore A, Younas H, Pham LV, Gu C, Beselman A, et al. Obstructive sleep apnea dynamically increases nocturnal plasma free fatty acids, glucose, and cortisol during sleep. J Clin Endocrinol Metab. 2017 Sep 1;102(9):3172–3181.
52. Bonsignore MR, McNicholas WT, Montserrat JM, Eckel J. Adipose tissue in obesity and obstructive sleep apnoea. Eur Respir J. 2012 Mar;39(3):746–767.
53. Alvarez D, Hornero R, Abásolo D, del Campo F, Zamarrón C. Nonlinear characteristics of blood oxygen saturation from nocturnal oximetry for obstructive sleep apnoea detection. Physiol Meas. 2006 Apr;27(4):399–412.
54. Senaratna CV, Perret JL, Lodge CJ, Lowe AJ, Campbell BE, Matheson MC, et al. Prevalence of obstructive sleep apnea in the general population: A systematic review. Sleep Med Rev. 2017 Aug;34:70–81.
55. Fietze I, Laharnar N, Obst A, Ewert R, Felix SB, Garcia C, et al. Prevalence and association analysis of obstructive sleep apnea with gender and age differences - Results of SHIP-Trend. J Sleep Res. 2019 Oct;28(5):e12770.
56. Chaudhary P, Goyal A, Goel SK, Kumar A, Chaudhary S, Kirti Keshri S, et al. Women with OSA have higher chances of having metabolic syndrome than men: effect of gender on syndrome Z in cross sectional study. Sleep Med. 2021 Mar;79:83–87.
57. Zhang X, Wang S, Xu H, Yi H, Guan J, Yin S. Metabolomics and microbiome profiling as biomarkers in obstructive sleep apnoea: a comprehensive review. Eur Respir Rev. 2021 Jun 30;30(160).