Predicting prenatal depression and assessing model bias using machine learning models ===================================================================================== * Yongchao Huang * Suzanne Alvernaz * Sage J. Kim * Pauline Maki * Yang Dai * Beatriz Peñalver Bernabé ## Abstract Perinatal depression (PND) is one of the most common medical complications during pregnancy and postpartum period, affecting 10-20% of pregnant individuals. Black and Latina women have higher rates of PND, yet they are less likely to be diagnosed and receive treatment. Machine learning (ML) models based on Electronic Medical Records (EMRs) have been effective in predicting postpartum depression in middle-class White women but have rarely included sufficient proportions of racial and ethnic minorities, which contributed to biases in ML models for minority women. Our goal is to determine whether ML models could serve to predict depression in early pregnancy in racial/ethnic minority women by leveraging EMR data. We extracted EMRs from a hospital in a large urban city that mostly served low-income Black and Hispanic women (N=5,875) in the U.S. Depressive symptom severity was assessed from a self-reported questionnaire, PHQ-9. We investigated multiple ML classifiers, used Shapley Additive Explanations (SHAP) for model interpretation, and determined model prediction bias with two metrics, Disparate Impact, and Equal Opportunity Difference. While ML model (Elastic Net) performance was low (ROCAUC=0.67), we identified well-known factors associated with PND, such as unplanned pregnancy and being single, as well as underexplored factors, such as self-report pain levels, lower levels of prenatal vitamin supplement intake, asthma, carrying a male fetus, and lower platelet levels blood. Our findings showed that despite being based on a sample mostly composed of 75% low-income minority women (54% Black and 27% Latina), the model performance was lower for these communities. In conclusion, ML models based on EMRs could moderately predict depression in early pregnancy, but their performance is biased against low-income minority women. Keywords * Perinatal depression * electronic medical records * machine learning * model performance bias ## INTRODUCTION Perinatal depression (PND), depression during pregnancy and up to one year postpartum, is one of the most common complications during the perinatal period (1). In the United States (U.S.), the rate of PND is 10-20% (1) and has increased more than 3-fold from 2000 to 2015 (2, 3). The rates of PND among Black women and Latinas are 2-to 5-fold higher than non-Hispanic White Women (4–8). In our own longitudinal studies in low-income women of color, the rate of PND is 23%, while the U.S. average rate is 12%. During the COVID-19 pandemic, the prevalence of PND has risen to 27%-32% (9–11), highlighting the importance of environmental stressors in these mood disorders (12). PND confers significant obstetric risks of low birth weight (13), preterm labor (13, 14), higher maternal morbidity and mortality, longer hospital stays post-delivery and higher delivery costs (15), lower initiation and duration of breastfeeding (16), and poor maternal-fetal attachment (17). Infants born from women with PND have increased risks of stunted growth, inadequate cognitive development, altered stress response, underdeveloped social-emotional behavior, and future mental disorders (15, 18–23). In extreme cases, PND can lead to suicide, a leading cause of maternal mortality in the first year after delivery (24). Multiple individual-level factors have been linked to increased PND risk, such as lack of a partner and social support (25), unplanned pregnancy (26), young age (27), prior history of depression or trauma (27) and adverse childhood experiences (28, 29). Despite a higher prevalence of depression during pregnancy in Black women and Latinas, they are less likely to be screened (30–32) and to share their PND symptoms (33–35). Providers commonly screen for depression at least once during pregnancy and postpartum (36). Common PND screening methods include self-reported questionnaires, such as Edinburgh Postpartum Depression Score (EPDS) (37) and the Patient Health Questionnaire-9 (PHQ-9) (38). However, the utility of these self-reported questionnaires depends on the accurate disclosure of symptoms (33–35). For instance, Black women might share their PND symptoms with family but are more reluctant to seek help compared to non-Hispanic White women due to social stigma (33, 39) and they might be concerned that they are viewed as lacking strength and resilience (40–42). Fear of the consequences of symptoms disclosure, such as losing the infant’s custody (33), and medical mistrust are also important factors for which pregnant individuals might decide to underreport or completely deny their symptoms (43). Furthermore, how depressive symptoms are communicated might be influenced by cultural factors that EPDS or PHQ-9 might not be fully captured (43–47). For example, studies have shown that low-income urban Black women endorse lower levels of depressive symptoms severity in self-reported questionnaires than other populations (45–47), despite having the same clinical diagnosis, hence leading to more false negative results in Black women compared with other groups. Computational approaches, such as machine learning (ML) using Electronic Medical Records (EMRs), have been able to predict pregnancy outcomes, including gestational diabetes (48–50), preterm birth (51), and suicidal thoughts (52). EMR-based ML models for PND have generally focused on predicting postpartum depression (53–63), and rarely include racial and ethnic minorities (54, 59). In middle-class White women, EMR-based ML models to predict postpartum depression can perform relatively well, with Receiver Operating Characteristic Area Under the Curve (ROCAUC)>0.75, using various approaches such as random forest (RF) (54, 58, 59, 63–65), artificial neural networks (ANNs) (55, 56, 66), and logistic regression (54, 62, 67). However, the performance of EMR-based ML models for predicting depression in pregnancy for low-income minority women remains unexplored. Biases in prediction performance exist in ML models, which refer to the disparate levels of model prediction of the outcome of interest for certain socio-demographic variables, called protected variables, such as sex, gender, age, race, ethnicity, or social-economic status (68). Since EMRs are not collected from well-designed balanced studies, the model performance can better predict or be biased towards groups more represented in EMRs. Also, some groups might have more available data (e.g., more clinical encounters, diagnostic tests), or their EMR information is of higher quality (69). White women are commonly disproportionally overrepresented in EMR, and consequently, ML models based on EMR have lower performance at predicting PND risk in Black and Latina women compared with White individuals (59). Other reasons for poor performance include the lower quality and quantity of EMRs for racial/ethnic minorities than those of White women because of lower access to care (59), e.g., no prior documented medical history in the EMRs; underutilization of health care due to work conditions; lack of childcare or transportation; reluctance to reveal certain sensitive information (33); and/or implicit bias by health care providers (33, 43). Thus, understanding the sources of bias in ML models is essential for the equitable prediction of PND risk in diverse populations. Despite the negative consequences of depression during pregnancy for both the mother and the infant that disproportionally impact low-income women of color, no study has thus far used EMR information enriched in women of color from under-resourced communities to predict depression early in pregnancy and examined the model bias. Here, we constructed ML models to predict depression symptom severity early in pregnancy for an urban low-income women of color that received care in the same outpatient obstetric clinic, and we subsequently assessed model performance and racial and ethnic biases. ## MATERIALS AND METHODS ### Study population EMRs were extracted from patients who received obstetric care at the University of Illinois in Chicago (UIC) Hospital Health Sciences System, UIC Medical Center (UIHealth), an urban academic hospital in Chicago, U.S. The population served at UIHealth is 51% non-Hispanic Black, 29% Hispanic, 9% non-Hispanic White, and 10% Asian and Native American combined. We extracted EMR from patients who were 18 years or older and received their obstetric care and delivered within UIHealth between 2014 to 2020. We obtained information on the patient’s socio-demographic characteristics, prior medical history, mental health assessments, health behaviors (alcohol use, smoking status, substance abuse, etc.), vital records, laboratory tests, medications, obstetric complications for the prior and current pregnancy, and delivery and fetus information (weight, length, sex, Apgar scores). Depressive symptom severity was assessed using a self-reported questionnaire PHQ-9 (38). The extracted data included 694 features and 5,875 patients. The entire data dictionary is available in **Table S1 and S2.** The EMR extraction was approved by the University of Illinois Institutional Review Board (IRB # 2020-0553). ### Preprocessing of EMR data Patients that satisfy the following criteria were included in the downstream analysis: 1) patients with complete PHQ-9 (all 9 questions were recorded in the patient’s EMR); 2) patients that completed their first obstetric visit before 24 gestational weeks, as depressive symptoms later in pregnancy might be distinct from those experienced in the first and second trimesters; 3) patients with PHQ-9 scores between 1-4 or 9 and above. Patients with mild levels of depression (PHQ-9 scores between 5-8) or those who reported no depression symptoms (PHQ-9 score 0) were excluded to avoid incorporating possible false negatives in case the individual underreported or denied their symptoms, respectively. Patients with a PHQ-9 score higher than 9 were categorized as cases. Patients with a PHQ-9 score between 1-4 were categorized as controls. Prescribed medications were grouped into 29 broad classes based on their most common use, mechanism of action, and targeted organ system. These included medications/prescriptions taken for diabetes, heart pathologies, anti-inflammatory/analgesics, cancer, gastrointestinal disorders, vitamins & supplements, thyroid disease, fertility, antibiotics, and mood/anxiety medications. The complete annotation table can be found **(Table S2)**. We omitted features that were missing in more than 60% of the included patients. This cutoff was selected based on the minimum number of samples that were required to adequately impute a feature without jeopardizing model performance and independently of the ML algorithm (**Figure S1**). All qualitative features (e.g., race, insurance, and medications) were hard-coded, and continuous features were transformed using min-max normalization. Missing data were imputed using *MICE* (version 3.15.0) (70). To robustly impute missing values, we generated a total of 50 imputed datasets, with 10 sampling iterations per imputed set. For continuous variables, missing data were estimated as the average across all the 50 imputed datasets, and for qualitative variables as the majority of a given category. To partially mitigate bias, ML models were trained by excluding self-reported race and ethnicity as well as results from genetic tests to assess ancestry. We also removed features that have non-causal associations with PND but may be implicitly related to the patient’s race and ethnicity, such as preferred language to communicate with their provider other than English. ### Machine learning model selection and training We explored different ML models, including Random Forest, Elastic Net, and XGBoost, to identify the most suitable one for the prediction of cases and controls. Due to the imbalance between the numbers of cases (N = 657) and controls (N = 1,757), we selected the best model based on ROCAUC. The data were split into 80% for training and 20% for testing. The *GridSearchCV* function from the *sklearn* python package (version 1.2.0) was employed to determine the best hyperparameters for each model through a 5-fold cross-validated grid search using the training set. For the most suitable method based on their ROCAUC, Elastic Net, we used the optimized hyperparameters to generate 20 models, each of which was trained using 400 controls and 400 cases randomly sampled from the training set. The accuracy, ROCAUC, positive predictive value (PPV), negative predictive value (NPV), and specificity were calculated for model evaluation. We also computed the harmonic mean of precision and recall (F1), ![Formula][1] and the area under the Precision-Recall Curve (PRAUC), ![Formula][2] to further assess model performance. We fixed the sensitivity as 85% for each model to maximize the identification of cases—clinical objective. ### Identification of the most important features To identify the importance and directionality with respect to depressive symptom severity of the EMR features, we calculated the Shapley values using SHAP (Shapley Additive Explanations) (71). The Shapley value, *ɸ**i*, is defined as the estimated contribution of feature *i* in all samples to the depression outcome. For feature dependence analysis, we followed the procedure developed by Artzi and colleagues to convert the Shapley values into relative risk (RR) (48). Briefly, in the SHAP analysis, the log-odds of the predicted probability are calculated as *ɸ**i*, then the predicted probability of a single feature *i* is ![Formula][3] where, ![Formula][4] and *ɸ* is the ‘base’ Shapley value, i.e., the logit of the population prevalence (denoted as *P*). Therefore, ![Formula][5] ![Formula][6] Here *P*was set as 0.14, which was the prenatal depression prevalence estimated from the EMRs of the current study. The relative risk (RR) of a single feature *i* was calculated as follows: ![Formula][7] When calculating the *RR* of prenatal depression in relation to a set of features *A*, equation 7 can be extended as follows: ![Formula][8] We employed the function *dependence_plot()* within the *shap* package (version 0.39.0) to determine the correlation between the most important features and the rest of the EMR variables based on their Shapley value. ### Statistical analysis Differences between the groups were determined with the Mann-Whitney test and Bonferroni to correct for multiple comparisons. To establish the relationships between each feature and the outcome, we used Spearman correlation to estimate the direction of the effect. ### Bias assessments We employed two common metrics to assess model bias, the disparate impact (DI) (72) and the equal opportunity difference (EOD) (73): ![Formula][9] ![Formula][10] Here, *Y* and *y* denote the true and predicted outcome of depression (case=1, control=0), respectively. The privileged group was selected as the racial/ethnic group with the highest PRAUC and ROCAUC. In our study, the privileged group was non-Hispanic White women, and the unprivileged groups were non-Hispanic Black and Latina women. ## RESULTS ### EMRs were extracted from an understudied urban low-income woman of color We extracted EMRs from a total of 5,875 patients who received obstetric care at an academic hospital affiliated with a public university in Chicago, U.S., from 2014-2020. After preprocessing the data (**Figure 1**, see methods for more detail), a total sample of 2,414 pregnant women was employed for downstream analysis (**Table 1**). Individuals were adults who received obstetric care and delivered in the public hospital and whose first obstetric visit for the given pregnancy was before 24 gestational weeks. Most of the available EMRs belongs to low-income women of color, 54% non-Hispanic Black and 27% non-Hispanic White women, and more than 72% of women were in federal aid health care plans (Medicaid and Medicare). We observed statistically significant differences between racial/ethnic groups **(Tables S3-S4**), with non-Hispanic Black women being more likely to be single (p-adj < 0.01), having an unplanned pregnancy (p-adj < 0.01), and being unemployed (p-adj < 0.01). We also detected statistically significant differences between cases and controls overall as well as when segregating the data by race/ethnicity, with women reporting high levels of depressive symptoms being more likely to have an unplanned pregnancy and to smoke independently of their race/ethnicity (p-adj < 0.01). The percentage of missing data for planned pregnancy, tobacco use, pain assessment scale was similar among races/ethnicity (**Figure S2c**). ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/18/2023.07.17.23292587/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2023/07/18/2023.07.17.23292587/F1) Figure 1. Overall workflow. View this table: [Table 1.](http://medrxiv.org/content/early/2023/07/18/2023.07.17.23292587/T1) Table 1. Socio-demographic characteristics of the sample employed for model optimization. Out of self-report PHQ-9 questions, questions pertaining to changes in appetite, sleep, and tiredness were commonly scored 1 or above in around 90% of the participants with the same proportions independently of race/ethnicity. In contrast, less than 5% of the individuals reported a score of 1 or above in feature related to suicide ideation and attempt, independently of their race or ethnicity (**Figure S3**). ### Elastic Net was the best model to predict depression in early pregnancy for our dataset First, we determined the most adequate machine learning model to predict depressive symptom severity early in pregnancy in our sample. We explored multiple ML models, including Random Forest (74), XGBoost (75), and Elastic Net (76), and selected the model based on its performance at a fixed level of 85% sensitivity (**Figure 1)**. Both Elastic Net and Random Forest had similar performance when models were agnostic to patient’s race/ethnicity (**Table S5**). Given its lower computational time compared to the Random Forest, we performed all the downstream analyses with the Elastic Net models. There were no statistically significant differences between those Elastic Net models in which were agnostic to patient’s race and ethnicity (Model 1) and those that were not agnostic to patient’s racial and ethnical background (Model 2) (**Figure S4**). ### Identification of well-known and novel features associated with prenatal depressive symptom severity To identify the features that were most predictive of depressive symptom severity early in pregnancy and the directionality of their associations, we determined their Shapley value. Based on game theory, Shapley value provides an estimate of the contribution made by each feature towards the overall prediction of the model. The top 20 most predictive features (based on their mean absolute Shapley values) included well-known socio-demographic factors associated with PND, such as having an unplanned pregnancy (77), being single (78), young (27), unemployed (79), on federal aid insurance (proxy for poverty) (80) and tobacco use (81) (**Figure 2a**). Yet, our model also identified features that have not been previously associated with depressive symptom severity in pregnancy, or just reported in a few studies. For instance, we discovered that elevated depressive symptoms were positively associated with self-reported levels of pain, an asthma diagnosis, carrying a male fetus (82), using antihistamines, analgesics, or antibiotics, and with lower platelet levels in blood (83, 84). Features related to preventive care, such as prenatal vitamin intake (85) and immunization against influenza and tetanus, were negatively correlated with depressive symptoms severity. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/18/2023.07.17.23292587/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2023/07/18/2023.07.17.23292587/F2) Figure 2. Multiple known and unexplored EMR features were associated with PND status. **a,** top 20 most important features based on their Shapley value to predict PND status. The color of the bar represents the correlation between each feature and PND status; red indicates a positive association with outcome and blue indicates a negative association. **b-d,** features whose importance is significantly different by race/ethnicity in terms of medication use **(b)**, socio-demographic factors **(c)**, substance use and pain assessment scale **(d)**. NHB: non-Hispanic Black. NHW: non-Hispanic White. H: Hispanic or Latina. Inspection of the contribution of top significant features towards elevated levels of depressive symptoms as a function of patient’s race/ethnicity (**Figure 2b-d**), socio-demographic factors, such as being single, having an unplanned pregnancy, and low socio-economic status, were weaker predictors of depressive symptoms in non-Hispanic Black women compared to non-Hispanic White and Hispanic women (p < 0.01). However, tobacco use, infections, asthma diagnosis, and elevated self-report levels of pain had significantly higher contributions to the risk of prenatal depression in non-Hispanic Black women compared to the other two groups (p-value < 0.01). Compared to non-Hispanic White women, unemployment was a less predictive feature of prenatal depression risk in Black women, while experiencing nausea early in pregnancy had a significantly higher feature contribution (p-value < 0.05). ### Predicted relative risk of prenatal depression Next, we determined the relative risk associated with most predicted features of depressive symptom severity based on their Shapley value (**Figure 3**). High self-reported pain levels were positively and linearly correlated with an increased risk of prenatal depression, independently of whether the patient was receiving any pain medication (**Figure 3a**). Of note, non-Hispanic Black individuals reported more often higher levels of pain (85%, score ≥ 6) than any other group (**Figure 3a)**. A history of medication for mood disorders is associated with an increased risk of perinatal depression (mean RR=1.18, sd=0.02, **Figure 3b**). As priorly reported (77, 78, 81), tobacco use (mean RR=1.09, sd=0.01, **Figure 3c**), unplanned pregnancy (RR=1.03, sd=0.01, **Figure 3d**), and being single (RR=1.02, sd=0.01, **Figure 3e**) are associated with increased the risk of perinatal depression. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/18/2023.07.17.23292587/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2023/07/18/2023.07.17.23292587/F3) Figure 3. The predicted relative risk of top selected features. **a,** predicted relative risk of self-report pain level and its interaction with the use of anti-inflammatory/analgesic drugs. The pie chart represents the racial/ethnic distribution of pain assessment levels below and equal or above 6. **b-d,** predicted relative risk of planned pregnancy **(b)**, of being single and its interactions with vitamin supplement intake **(c),** of tobacco use **(d),** of mood/anxiety medication use and their interactions with anti-coagulants medication **(e)**. ### Despite the enrichment of minority low-income women of color, the model was biased against Black and Latina women Finally, we examined whether ML models had the same predictive capability to determine PND status (cases versus controls) without using the individual’s race/ethnicity as part of the features for model training (**Figure 4a-b**). The model performance for any patient, independent of their racial/ethnic background, was moderate based on the Precision-Recall and Receiver Operating Characteristic Area under the Curve (PRAUC and ROCAUC, 50% and 66%, respectively). However, when stratifying the prediction by racial/ethnic groups, Elastic Net predictions of PND status for non-Hispanic White women were significantly higher than those for non-Hispanic Black women (PRAUC: 70%, 50%, and 40%; ROCAUC: 85%, 62%, and 70%, respectively; p<0.001). Further, the performance of the ML model for non-Hispanic White women was better than for non-Hispanic Black and Latina women when compared by specificity, accuracy, F1, PPV, and NPV (**Figure S4**). ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/18/2023.07.17.23292587/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2023/07/18/2023.07.17.23292587/F4) Figure 4. EMR-based ML models are biased against low-income minority women. **a,** area under the precision and recall curve for different racial/ethnic groups. **b,** the area under the receiver operating characteristic curve for different race/ethnicity. **c,** disparate impact values between unprivileged and privileged groups. Privileged group: non-Hispanic White women; unprivileged groups: non-Hispanic Black and Hispanic women. Model 1: trained without race/ethnicity label (blue). Model 2: trained with race/ethnicity label (orange). To further estimate the inequality of model prediction or model bias, we calculated two common bias metrics: Equal Opportunity Difference (EOD), the true positive difference between the privileged and unprivileged group, and Disparity Impact (DI), the ratio of positive predictions between the privileged and unprivileged groups. Both DI and EOD values indicated that Elastic Net models had lower predictive performance in the unprivileged groups, non-Hispanic Black and Latina women, despite the model being trained with a sample mostly composed of low-income women of color. Removal of race/ethnicity as training features from the models moderately reduces model performance disparities as measured by DI, with more significant improvements for non-Hispanic Black women (p-value < 0.001, non-Hispanic Black versus no-Hispanic White women; p-value < 0.01, Hispanic versus no-Hispanic White women) (**Figure 4c-d**). ## DISCUSSION In this study, we examined the capability of interpretable machine learning models to predict the risk of depression during early pregnancy in low-income women of color using electronic medical records. Our results demonstrated that EMR-based ML models were moderately predictive of depressive symptom severity in early pregnancy in low-income women of color. In addition, using the SHAP analysis, our results were interpretable in that we were able to identify factors that were associated with the risk of early prenatal depression including the directionality of their associations. Our models not only revealed well-known factors, such as unplanned pregnancy, prior history of medication use to treat mood disorders, or young age, but also captured novel or underexplored markers, such as self-report pain levels, asthma diagnosis, carrying a male fetus, or platelet levels in blood. Importantly, our results highlighted the significant performance disparity in model prediction between racial and ethnic groups, with women of color at greater disadvantages. ML models to predict PND in racial and ethnic minorities are scarce. For instance, Zang and colleagues used a relatively well-balanced racial/ethnic group of urban pregnant individuals (18% non-Hispanic White, 27% Black) to validate their EMR-based ML models to predict postpartum depression (86). Park and colleagues focused on pregnant individuals at high-risk for mood disorders during pregnancy and postpartum that were covered by Medicaid (87). EMR-based ML models to predict postpartum depression have been developed in a wide variety of populations (e.g., Chinese (58), European (55, 56, 60, 63), non-Hispanic White women in the U.S.). These EMR-based ML models can perform relatively well (ROCAUC>0.75) using various ML approaches such as Random Forest(54, 63, 64, 86, 88), XGBoost (54, 57, 86), artificial neural networks (55, 56, 89), support-vector machines(56, 58, 64, 90) or logistic regression (54). However, previous studies have not demonstrated the predictive capability of EMR-based ML models to estimate PND status in low-income minority women. Our results filled this current gap and showed that EMR-based models trained with a sample primarily composed of low-income minority women moderately predict PND status. Yet, the Elastic Net model had a better prediction performance for the less prevalent group in the sample, non-Hispanic White women (ROCAUC > 0.85), compared with non-Hispanic Black women (ROCAUC > 0.6) and Latinas (ROCAUC > 0.65). Our results are interpretable, a very important capability to identify markers that providers can intervene upon. Using SHAP analysis, we revealed several established markers that increased the risk of depression in early pregnancy, such as unplanned pregnancy (77), or being single (78). Our results also suggest that individuals that do not take vitamin supplements during early pregnancy may have an increased risk of depression. This observation aligns with results from a systematic review performed by Sparling and colleagues. The authors found that lower levels of folate, vitamin D, iron, selenium, zinc, fats, and fatty acids were associated with increased risks of perinatal depression (85), but those studies were limited in their population diversity, which either failed to include minority populations, or had limited sample sizes of minority groups. Importantly, our analysis uncovered several new or underexplored markers that increased the predicted risk of depression during early pregnancy, such as self-report pain levels, mean platelet volume in blood (83, 84), carrying a male fetus (82), and having a diagnosis of asthma. Higher levels of pain have been linked to postpartum depression, and our findings suggest that this relationship also holds during the early stages of pregnancy(91) Similarly, patients with major depression exhibit a higher mean platelet volume (83, 84), but this phenomenon has not been investigated in the context of depression during pregnancy. Recently, Myers and colleagues reported that carrying a male fetus increases the odds of postnatal depression (82). While the mechanisms that link the fetus’s sex and depression status are unclear, women who carry male fetuses have lower estradiol levels in the blood than those carrying female fetuses (92) and lower levels of estrogens outside pregnancy are associated with higher levels of depressive symptoms (93). Also, women carrying male fetuses have higher levels of inflammatory markers, such as IL1B, in early pregnancy, and higher levels of inflammation have been linked to elevated levels of depression symptoms during pregnancy and postpartum (94, 95). Finally, our model indicates that an asthma diagnose increases the relative risk of early prenatal depression. While asthma may not be directly correlated with depression during pregnancy, air pollution is linked to an increased risk of asthma (96) and depression (97). Most of the women in our study live in underserved neighborhoods with high levels of outdoor pollutants and asthma prevalence. Thus, the associations between asthma diagnosis and depression during early pregnancy are most likely mediated by the negative impact of structural inequalities than by biological mechanisms. Despite our original expectation, employing samples enriched in individuals from underserved populations is not enough to reduce EMR-based ML model performance bias. Our results revealed a significant disparity in performance among non-Hispanic White women, non-Hispanic Black, and Hispanic women, and it was not driven by the under-representation of the latter groups. Even though there was no significant difference in the overall performance between models trained with or without race and ethnicity, the non-Hispanic White women, the least represented group in the sample, were predicted significantly better than non-Hispanic Black and Hispanic women, who represented more than 75% of the sample. This contradicts prior studies and indicates that although we trained the model with the majority of individuals from unprivileged groups with similar electronic medical record quality, the model was still biased. Multiple reasons might explain our observed results. For instance, due to cultural differences, social stigma, or medical mistrust, non-Hispanic Black women might underreport their symptoms more often than non-Hispanic White women. Studies indicate that urban low-income women of color report lower levels of depressive symptoms than non-Hispanic White women using self-report questionnaires despite having the same diagnosis when assessed by a clinical provider (98). Thus, a lesser number of non-Hispanic Black women with prenatal depression might be referred to mental health providers. Our results agree with this. While our model identified non-Hispanic Black women with elevated depressive symptoms with a sensitivity of 80%, the positive predictive value was low (PPV=37%, **Figure S3d, g**). In other words, the model predicted a large proportion of non-Hispanic Black women as cases instead of control, indicating that a larger proportion of non-Hispanic Black women may have endorsed higher levels of depressive symptoms. However, our results are based on self-report screening tools and should be confirmed using clinical diagnosis. Another plausible reason for the lower predictive performance of our EMR-based ML model for women of color might be due to the different use of medical services. Despite all the EMR employed in this study being collected in the same clinic, non-Hispanic Black women, who were mostly in federal aid insurance (**Figure S2b**), had their first obstetric visit later in their pregnancy (59% after the first trimester vs. 51% within the first trimester) compared with non-Hispanic White women (6% after the first trimester vs. 11% within the first trimester) (p-value < 0.001, **Figure S2a**). Another possibility is that EMRs might not contain all the necessary information to predict prenatal depression, as some other relevant factors, such as chronic stress, are not measured or recorded in them. Certain neighborhood characteristics, such as violence, living in poverty areas, or air pollution, are known to increase chronic stress thus chronic inflammation (99) and inflammation is one of the hallmarks of perinatal depression (100). Thus, chronic stress and chronic inflammation might mediate the negative effects of structural inequalities in depression during pregnancy. Therefore, future research should include neighborhood factors that could boost model prediction performance in women who are exposed to higher contextual risks, e.g., racial minority and low-income women living in highly segregated urban poverty areas (101). Our study has several strengths, including 1) the exploration of the predictive capabilities of ML models using EMR to identify depression early in pregnancy in low-income women of color, as most of the current studies aimed to predict postpartum depression in middle-class non-Hispanic White women; 2) the use of SHAP analysis to make model results interpretable and thus actionable; and 3) assessing model biases. Future studies can benefit from larger sample sizes in more diverse populations at high risk of PND (e.g., immigrants, homeless); using clinical diagnoses of prenatal depression, such as ICD-9/10 codes, instead of self-report depression questionnaires; the addition of community-level information; and the explorations of algorithms that could aid to remove model biases. ## Conclusions Interpretable machine learning models based on available electronic medical records can aid in identifying women at high risk of PND in their early pregnancy so that clinicians can intervene early enough to prevent the negative consequences of PND for both the mother and the child. However, new tools and approaches are necessary to increase the prediction performance of EMR-based machine learning models to reduce model biases so that the risk of PND can be equitably predicted for all pregnant individuals. ## Supporting information Supplemental Figure [[supplements/292587_file02.pptx]](pending:yes) Supplemental Table [[supplements/292587_file03.xlsx]](pending:yes) Supplements Caption [[supplements/292587_file04.docx]](pending:yes) ## Data availability Data is available to the research community upon approval of the University of Illinois Chicago Institutional Review Board (IRB # 2020-0553). Code is available at [https://github.com/Bealab](https://github.com/Bealab). ## Author contributions BPB and YD conceptualized the idea; BPB, YD, and YH designed the methodology for model creation and validation; BPB, YH, and SA curated and pre-processed the data; YH performed the analysis and subsequent validation; BPB, YD, and YH interpreted the results and wrote the original draft; and all the authors critically reviewed and edited the manuscript. ## Conflicts of interests None ## Acknowledgments This work is funded through a K12 BIRCWH Award (NICHD 101373-04) BPB and supported by the National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, through Grant Award Number UL1TR002003. We would like to thank Dr. Subhash Kumar Kolar Rajanna for his assistance in extracting the electronic medical records used in this manuscript. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. ## Footnotes * # Co-senior authors * Received July 17, 2023. * Revision received July 17, 2023. * Accepted July 18, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Gavin NI, Gaynes BN, Lohr KN, Meltzer-Brody S, Gartlehner G, Swinson T (2005): Perinatal depression: a systematic review of prevalence and incidence. Obstet Gynecol 106:1071–1083. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.AOG.0000183597.31630.db&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16260528&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000233018800026&link_type=ISI) 2. 2.Pino EC, Zuo Y, Schor SH, Zatwarnicki S, Henderson DC, Borba CP, et al. (2019): Temporal trends of co-diagnosis of depression and/or anxiety among female maternal and non-maternal hospitalizations: Results from Nationwide Inpatient Sample 2004-2013. Psychiatry Res 272:42–50. 3. 3.MacDorman MF, Declercq E, Cabral H, Morton C (2016): Recent Increases in the U.S. Maternal Mortality Rate: Disentangling Trends From Measurement Issues. Obstet Gynecol 128:447–455. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0000000000001556&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27500333&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 4. 4.Mukherjee S, Trepka MJ, Pierre-Victor D, Bahelah R, Avent T (2016): Racial/Ethnic Disparities in Antenatal Depression in the United States: A Systematic Review. Matern Child Health J 20:1780–1797. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 5. 5.Gavin AR, Melville JL, Rue T, Guo Y, Dina KT, Katon WJ (2011): Racial differences in the prevalence of antenatal depression. Gen Hosp Psychiatry 33:87–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.genhosppsych.2010.11.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21596200&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289183700001&link_type=ISI) 6. 6.Kuo W, Wilson TE, Holman S, Fuentes-Afflick E, O’Sullivan MJ, Minkoff H (2004): Depressive Symptoms in the Immediate Postpartum Period Among Hispanic Women in Three U.S. Cities. Journal of Immigrant Health 6:145–153. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/B:JOIH.0000045252.10412.fa&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16228697&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 7. 7.Grobman WA, Parker CB, Willinger M, Wing DA, Silver RM, Wapner RJ, et al. (2018): Racial disparities in adverse pregnancy outcomes and psychosocial stress. Obstet Gynecol 131:328. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0000000000002441&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 8. 8.Toledo-Corral CM, Gao L, Chavez T, Grubbs B, Habre R, Dunton GF, et al. (2022): Role of Race, Ethnicity, and Immigration in Perceived Stress and Depressive Symptomatology Trends During Pregnancy. J Immigr Minor Health 24:561–569. 9. 9.Adrianto N, Caesarlia J, Pajala FB (2022): Depression in pregnant and postpartum women during COVID-19 pandemic: systematic review and meta-analysis. Obstet Gynecol Sci 65:287–302. 10. 10.Lebel C, MacKinnon A, Bagshawe M, Tomfohr-Madsen L, Giesbrecht G (2020): Elevated depression and anxiety symptoms among pregnant individuals during the COVID-19 pandemic. J Affect Disord 277:5–13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2020.07.126&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 11. 11.Mateus V, Cruz S, Costa R, Mesquita A, Christoforou A, Wilson CA, et al. (2022): Rates of depressive and anxiety symptoms in the perinatal period during the COVID-19 pandemic: Comparisons between countries and with pre-pandemic data. J Affect Disord 316:245–253. 12. 12.Wenzel E, Penalver Bernabe B, Dowty S, Nagelli U, Pezley L, Gibbons R, et al. (2021): Using computerised adaptive tests to screen for perinatal depression in underserved women of colour. Evid Based Ment Health. 13. 13.Jarde A, Morais M, Kingston D, Giallo R, MacQueen GM, Giglia L, et al. (2016): Neonatal Outcomes in Women With Untreated Antenatal Depression Compared With Women Without Depression: A Systematic Review and Meta-analysis. JAMA Psychiatry 73:826–837. 14. 14.Madigan S, Oatley H, Racine N, Fearon RMP, Schumacher L, Akbari E, et al. (2018): A Meta-Analysis of Maternal Prenatal Depression and Anxiety on Child Socioemotional Development. J Am Acad Child Adolesc Psychiatry 57:645–657.e8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaac.2018.06.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 15. 15.McKee K, Admon LK, Winkelman TNA, Muzik M, Hall S, Dalton VK, et al. (2020): Perinatal mood and anxiety disorders, serious mental illness, and delivery-related health outcomes, United States, 2006–2015. BMC Womens Health 20:. 16. 16.Grigoriadis S, VonderPorten EH, Mamisashvili L, Tomlinson G, Dennis C, Koren G, et al. (2013): The impact of maternal depression during pregnancy on perinatal outcomes: a systematic review and meta-analysis. J Clin Psychiatry 74:321. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 17. 17.Lindgren K (2001): Relationships among maternal-fetal attachment, prenatal depression, and health practices in pregnancy. Res Nurs Health 24:203–217. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/nur.1023&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11526619&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 18. 18.Yonkers KA, Wisner KL, Stewart DE, Oberlander TF, Dell DL, Stotland N, et al. (2009): The management of depression during pregnancy: a report from the American Psychiatric Association and the American College of Obstetricians and Gynecologists, Obstet Gynecol 114:703–713. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0b013e3181ba0632&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19701065&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000269405300035&link_type=ISI) 19. 19.Grote NK, Bridge JA, Gavin AR, Melville JL, Iyengar S, Katon WJ (2010): A Meta-analysis of Depression During Pregnancy and the Risk of Preterm Birth, Low Birth Weight, and Intrauterine Growth Restriction. Arch Gen Psychiatry 67:1012–1024. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archgenpsychiatry.2010.111&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20921117&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282917400004&link_type=ISI) 20. 20.Henrichs J, Schenk JJ, Roza SJ, van den Berg MP, Schmidt HG, Steegers E, et al. (2010): Maternal psychological distress and fetal growth trajectories: the Generation R Study. Psychol Med 40:633–643. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0033291709990894&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19656431&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 21. 21.Li D, Liu L, Odouli R (2009): Presence of depressive symptoms during early pregnancy and the risk of preterm delivery: a prospective cohort study. Hum Reprod 24:146–153. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/humrep/den342&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18948314&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000261680600019&link_type=ISI) 22. 22.Murray L, Cooper PJ, Wilson A, Romaniuk H (2003): Controlled trial of the short- and long-term effect of psychological treatment of post-partum depression: 2. Impact on the mother-child relationship and child outcome. Br J Psychiatry 182:420–427. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImJqcHJjcHN5Y2giO3M6NToicmVzaWQiO3M6OToiMTgyLzUvNDIwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMTgvMjAyMy4wNy4xNy4yMzI5MjU4Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 23. 23.Gentile S (2017): Untreated depression during pregnancy: Short- and long-term effects in offspring. A systematic review. Neuroscience 342:154–166. 24. 24.Palladino CL, Singh V, Campbell J, Flynn H, Gold KJ (2011): Homicide and suicide during the perinatal period: findings from the National Violent Death Reporting System. Obstet Gynecol 118:1056–1063. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0b013e31823294da&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22015873&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 25. 25.Glazier RH, Elgar FJ, Goel V, Holzapfel S (2004): Stress, social support, and emotional distress in a community sample of pregnant women. J Psychosom Obstet Gynaecol 25:247–255. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/01674820400024406&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15715023&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 26. 26.Biaggi A, Conroy S, Pawlby S, Pariante CM (2016): Identifying the women at risk of antenatal anxiety and depression: A systematic review. J Affect Disord 191:62–77. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2015.11.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26650969&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 27. 27.Melville JL, Gavin A, Guo Y, Fan MY, Katon WJ (2010): Depressive disorders during pregnancy: prevalence and risk factors in a large urban sample. Obstet Gynecol 116:1064–1070. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0b013e3181f60b0a&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20966690&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283329000010&link_type=ISI) 28. 28.Racine N, Zumwalt K, McDonald S, Tough S, Madigan S (2020): Perinatal depression: The role of maternal adverse childhood experiences and social support. J Affect Disord 263:576–581. 29. 29.Atzl VM, Narayan AJ, Rivera LM, Lieberman AF (2019): Adverse childhood experiences and prenatal mental health: Type of ACEs and age of maltreatment onset. J Fam Psychol 33:304–314. 30. 30.Skaer TL, Sclar DA, Robison LM, Galin RS (2000): Trends in the rate of depressive illness and use of antidepressant pharmacotherapy by ethnicity/race: an assessment of office-based visits in the United States, 1992–1997. Clinical Therapeutics 22:1575–1589. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0149-2918(00)83055-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11192148&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000166362600019&link_type=ISI) 31. 31.Borowsky SJ, Rubenstein LV, Meredith LS, Camp P, Jackson-Triche M, Wells KB (2000): Who is at risk of nondetection of mental health problems in primary care? J Gen Intern Med 15:381–388. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1046/j.1525-1497.2000.12088.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10886472&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000087945300005&link_type=ISI) 32. 32.Sidebottom A, Vacquier M, LaRusso E, Erickson D, Hardeman R (2021): Perinatal depression screening practices in a large health system: identifying current state and assessing opportunities to provide more equitable care. Archives of Women’s Mental Health 24:133–144. 33. 33.Russell S, Lang B, Clinton J, Adams C, Lamb J (2013): Perinatal Mental Health experiences of women and health professionals. Boots Family Trust Alliance. 34. 34.Henshaw C. 2014. Screening and Risk Assessment for Perinatal Mood Disorders. In: Diana Lynn Barnes, editor. Women’s Reproductive Mental Health Across the LifespanCham: Springer International Publishing, pp 91-108. 35. 35.Chew-Graham C, Sharp D, Chamberlain E, Folkes L, Turner KM (2009): Disclosure of symptoms of postnatal depression, the perspectives of health professionals and women: a qualitative study. BMC Fam Pract 10:7. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19159478&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 36. 36.Anonymous (2018): ACOG Committee Opinion No. 757: Screening for Perinatal Depression. Obstet Gynecol 132:e208-e212. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0000000000002927&link_type=DOI) 37. 37.Cox JL, Holden JM, Sagovsky R (1987): Detection of postnatal depression. Development of the 10-item Edinburgh Postnatal Depression Scale. Br J Psychiatry 150:782-786. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImJqcHJjcHN5Y2giO3M6NToicmVzaWQiO3M6OToiMTUwLzYvNzgyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDcvMTgvMjAyMy4wNy4xNy4yMzI5MjU4Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 38. 38.Kroenke K, Spitzer RL, Williams JB (2001): The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 16:606–613. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1046/j.1525-1497.2001.016009606.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11556941&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000171184700005&link_type=ISI) 39. 39.Liu CH, Giallo R, Doan SN, Seidman LJ, Tronick E (2016): Racial and Ethnic Differences in Prenatal Life Stress and Postpartum Depression Symptoms. Arch Psychiatr Nurs 30:7–12. 40. 40.Liao KY, Wei M, Yin M (2022): The Misunderstood Schema of the Strong Black Woman: Exploring Its Mental Health Consequences and Coping Responses Among African American Women. Psychol Women Q 44:84–104. 41. 41.Woods-Giscombe C (2010): Superwoman schema: African American women’s views on stress, strength, and health. Qual Health Res 20:668–683. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1049732310361892&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20154298&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000276851100009&link_type=ISI) 42. 42.Amankwaa LC (2003): Postpartum depression, culture and African-American women. J Cult Divers 10:23–30. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12776544&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 43. 43.Conteh N, Gagliardi J, McGahee S, Molina R, Clark CT, Clare CA (2022): Medical Mistrust in Perinatal Mental Health. Harv Rev Psychiatry 30:238–247. 44. 44. Penalver Bernabe B, Maki PM, Cunningham JL, Eisenlohr-Moul T, Tussing-Humphreys L, Carroll IM, et al. (2023): Interactions between perceived stress and microbial-host immune components: two demographically and geographically distinct pregnancy cohorts. Transl Psychiatry 13:3. 45. 45.Chaudron LH, Szilagyi PG, Tang W, Anson E, Talbot NL, Wadkins HI, et al. (2010): Accuracy of depression screening tools for identifying postpartum depression among urban mothers. Pediatrics 125:609. 46. 46.Yang J, Martinez M, Schwartz TA, Beeber L (2017): What Is Being Measured? A Comparison of Two Depressive Symptom Severity Instruments with a Depression Diagnosis in Low-Income High-Risk Mothers. J Womens Health (Larchmt) 26:683–691. 47. 47.Tandon SD, Cluxton-Keller F, Leis J, Le HN, Perry DF (2012): A comparison of three screening tools to identify perinatal depression among low-income African American women. J Affect Disord 136:155–162. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21864914&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 48. 48.Artzi NS, Shilo S, Hadar E, Rossman H, Barbash-Hazan S, Ben-Haroush A, et al. (2020): Prediction of gestational diabetes based on nationwide electronic health records. Nat Med 26:71–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-019-0724-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 49. 49.Ye Y, Xiong Y, Zhou Q, Wu J, Li X, Xiao X (2020): Comparison of Machine Learning Methods and Conventional Logistic Regressions for Predicting Gestational Diabetes Using Routine Clinical Data: A Retrospective Cohort Study. J Diabetes Res 2020:4168340. 50. 50.Qiu H, Yu HY, Wang LY, Yao Q, Wu SN, Yin C, et al. (2017): Electronic Health Record Driven Prediction for Gestational Diabetes Mellitus in Early Pregnancy. Sci Rep 7:16417-y. 51. 51.Gao C, Osmundson S, Velez Edwards DR, Jackson GP, Malin BA, Chen Y (2019): Deep learning predicts extreme preterm birth from electronic health records. J Biomed Inform 100:103334. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2019.103334&link_type=DOI) 52. 52.Zhong QY, Mittal LP, Nathan MD, Brown KM, Knudson Gonzalez D, Cai T, et al. (2019): Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem. Eur J Epidemiol 34:153–162. 53. 53.Zhang Y, Tayarani M, Wang S, Liu Y, Sharma M, Joly R, et al. (2021): Identifying urban built environment factors in pregnancy care and maternal mental health outcomes. BMC Pregnancy Childbirth 21:599–1. 54. 54.Park Y, Hu J, Singh M, Sylla I, Dankwa-Mullan I, Koski E, et al. (2021): Comparison of Methods to Reduce Bias From Clinical Prediction Models of Postpartum Depression. JAMA Netw Open 4:e213909. 55. 55.Tortajada S, Garcia-Gomez J, Vicente J, Sanjuan J, de Frutos R, Martin-Santos R, et al. (2009): Prediction of postpartum depression using multilayer perceptrons and pruning. Methods Inf Med 48:291–298. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3414/ME0562&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19387507&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 56. 56.Jimenez-Serrano S, Tortajada S, Garcia-Gomez J (2015): A Mobile Health Application to Predict Postpartum Depression Based on Machine Learning. Telemed J E Health 21:567–574. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 57. 57.Hochman E, Feldman B, Weizman A, Krivoy A, Gur S, Barzilay E, et al. (2022): Development and validation of a machine learning-based postpartum depression prediction model: A nationwide cohort study. Depression and Anxiety 38:400–411. 58. 58.Zhang W, Liu H, Silenzio VMB, Qiu P, Gong W (2020): Machine Learning Models for the Prediction of Postpartum Depression: Application and Comparison Based on a Cohort Study. JMIR Med Inform 8:e15516. 59. 59.Zhang Y, Wang S, Hermann A, Joly R, Pathak J (2021): Development and validation of a machine learning algorithm for predicting the risk of postpartum depression among pregnant women. J Affect Disord 279:1–8. 60. 60.Amit G, Girshovitz I, Marcus K, Zhang Y, Pathak J, Bar V, et al. (2021): Estimation of postpartum depression risk from electronic health records using machine learning. BMC Pregnancy Childbirth 21:630–8. 61. 61.S W, J P, Y Z (2019): Using Electronic Health Records and Machine Learning to Predict Postpartum Depression. Stud Health Technol Inform 264:888–892. 62. 62.Munk-Olsen T, Liu X, Madsen KB, Kjeldsen MZ, Petersen LV, Bergink V, et al. (2022): Postpartum depression: a developed and validated model predicting individual risk in new mothers. Transl Psychiatry 12:1–10. 63. 63.Andersson S, Bathula DR, Iliadis SI, Walter M, Skalkidou A (2021): Predicting women with depressive symptoms postpartum with machine learning methods. Sci Rep 11:1–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-021-81132-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 64. 64.Shin D, Lee KJ, Adeluwa T, Hur J (2020): Machine Learning-Based Predictive Modeling of Postpartum Depression. Journal of Clinical Medicine 9:. 65. 65.Qasrawi R, Amro M, VicunaPolo S, Abu Al-Halawa D, Agha H, Abu Seir R, et al. (2022): Machine learning techniques for predicting depression and anxiety in pregnant and postpartum women during the COVID-19 pandemic: a cross-sectional regional study. F1000Res 11:390. 66. 66.Javed F, Gilani SO, Latif S, Waris A, Jamil M, Waqas A (2021): Predicting Risk of Antenatal Depression and Anxiety Using Multi-Layer Perceptrons and Support Vector Machines. J Pers Med 11:doi:10.3390/jpm11030199. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/jpm11030199&link_type=DOI) 67. 67.Weitkamp K, Klein E, Midgley N (2016): The Experience of Depression: A Qualitative Study of Adolescents With Depression Entering Psychotherapy. Global Qualitative Nursing Research 3:2333393616649548. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/2333393616649548&link_type=DOI) 68. 68.Rajkomar A, Hardt M, Howell MD, Corrado G, Chin MH (2018): Ensuring Fairness in Machine Learning to Advance Health Equity. Ann Intern Med 169:866–872. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/M18-1990&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 69. 69.Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G (2018): Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data. JAMA Internal Medicine 178:1544–1547. 70. 70.Zhang Z (2016): Multiple imputation with multivariate imputation by chained equation (MICE) package. Ann Transl Med 4:. 71. 71.Lundberg S, Lee S (2017): A Unified Approach to Interpreting Model Predictions. arXiv. 72. 72.Friedler S, Scheidegger C, Venkatasubramanian S (2014): Certifying and Removing Disparate Impact. 73. 73.Hardt M, Price E, Srebro N (2016): Equality of Opportunity in Supervised Learning. 74. 74.Breiman L (2001): Random Forests. Mach Learning 45:5–32. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1010933404324&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170489900001&link_type=ISI) 75. 75.Chen T, Guestrin C (2016): XGBoost: A Scalable Tree Boosting System. 76. 76.Zou H, Hastie T (2005): Regularization and Variable Selection Via the Elastic Net. J R Stat Soc Ser B Stat Methodol 67:301–320. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1467-9868.2005.00503.x&link_type=DOI) 77. 77.Muskens L, Boekhorst MGBM, Kop WJ, van den Heuvel MI, Pop VJM, Beerthuizen A (2022): The association of unplanned pregnancy with perinatal depression: a longitudinal cohort study. Archives of Women’s Mental Health 25:611–620. 78. 78.Bedaso A, Adams J, Peng W, Xu F, Sibbritt D (2022): An examination of the association between marital status and prenatal mental disorders using linked health administrative data. BMC Pregnancy and Childbirth 22:735. 79. 79.Aochi Y, Honjo K, Kimura T, Ikehara S, Iso H (2021): Association between maternal employment status during pregnancy and risk of depressive symptomatology 1 month after childbirth: the Japan Environment and Children’s Study. J Epidemiol Community Health 75:531. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czo4OiI3NS82LzUzMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA3LzE4LzIwMjMuMDcuMTcuMjMyOTI1ODcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 80. 80.Steenland M, Trivedi A (2023): Association of Medicaid Expansion With Postpartum Depression Treatment in Arkansas. JAMA health forum 4:e225603. 81. 81.Anastasopoulou S, Bonotis K, Hatzoglou C, Dafopoulos K, Gourgoulianis K (2022): Smoking Patterns and Anxiety Factors Among Women Expressing Perinatal Depression. Women’s Health Reports 3:198–206. 82. 82.Myers S, Johns SE (2019): Male infants and birth complications are associated with increased incidence of postnatal depression. Soc Sci Med 220:56–64. 83. 83.Ataoglu A, Canan F (2009): Mean Platelet Volume in Patients With Major Depression: Effect of Escitalopram Treatment. J Clin Psychopharmacol 29:. 84. 84.Canan F, Dikici S, Kutlucan A, Celbek G, Coskun H, Gungor A, et al. (2011): Association of mean Platelet volume with DSM-IV major depression in a large community-based population: The MELEN study. J Psychiatr Res 46:298–302. 85. 85.Sparling TM, Nesbitt RC, Henschke N, Gabrysch S (2017): Nutrients and perinatal depression: a systematic review. Journal of Nutritional Science 6:e61. 86. 86.Zhang Y, Wang S, Hermann A, Joly R, Pathak J (2021): Development and validation of a machine learning algorithm for predicting the risk of postpartum depression among pregnant women. J Affect Disord 279:1–8. 87. 87.Nemesure MD, Heinz MV, Huang R, Jacobson NC (2021): Predictive modeling of depression and anxiety using electronic health records and a novel machine learning approach with artificial intelligence. Sci Rep 11:1980–4. 88. 88.Zhang W, Liu H, Silenzio VMB, Qiu P, Gong W (2020): Machine Learning Models for the Prediction of Postpartum Depression: Application and Comparison Based on a Cohort Study. JMIR Medical Informatics 8:e15516. 89. 89.Javed F, Gilani SO, Latif S, Waris A, Jamil M, Waqas A (2021): Predicting Risk of Antenatal Depression and Anxiety Using Multi-Layer Perceptrons and Support Vector Machines. Journal of personalized medicine; J Pers Med 11:199. 90. 90.Moreira MWL, Rodrigues JJPC, Kumar N, Saleem K, Illin IV (2019): Postpartum depression prediction through pregnancy data analysis for emotion-aware smart systems. Information Fusion 47:23–31. 91. 91.Bair MJ, Robinson RL, Katon W, Kroenke K (2003): Depression and pain comorbidity: a literature review. Arch Intern Med 163:2433–2445. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archinte.163.20.2433&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14609780&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000186431000008&link_type=ISI) 92. 92.Toriola AT, Vääräsmäki M, Lehtinen M, Zeleniuch-Jacquotte A, Lundin E, Rodgers K, et al. (2011): Determinants of maternal sex steroids during the first half of pregnancy. Obstet Gynecol 118:1029–1036. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/AOG.0b013e3182342b7f&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22015870&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000296292600010&link_type=ISI) 93. 93.Dwyer JB, Aftab A, Radhakrishnan R, Widge A, Rodriguez CI, Carpenter LL, et al. (2020): Hormonal Treatments for Major Depressive Disorder: State of the Art. Am J Psychiatry 177:686–705. 94. 94.Osborne L, Monk C (2013): Perinatal Depression – the Fourth Inflammatory Morbidity of Pregnancy? Theory and Literature Review. Psychoneuroendocrinology 38:. 95. 95. Ramiro Cortijo D, Calle M, Böger R, Hannemann J, Lüneburg N, López-Gimenez R, et al. (2020): Male fetal sex is associated with low maternal plasma anti-inflammatory cytokine profile in the first trimester of healthy pregnancies. Cytokine 136:155290. 96. 96.Tiotiu AI, Novakova P, Nedeva D, Chong-Neto HJ, Novakova S, Steiropoulos P, et al. (2020): Impact of Air Pollution on Asthma Outcomes. Int J Environ Res Public Health 17:6212. 97. 97.Fu Z, Liu Q, Liang J, Weng Z, Li W, Xu J, et al. (2022): Air pollution, genetic factors and the risk of depression. Sci Total Environ 850:158001. 98. 98.Chaudron LH, Szilagyi PG, Tang W, Anson E, Talbot NL, Wadkins HI, et al. (2010): Accuracy of depression screening tools for identifying postpartum depression among urban mothers. Pediatrics 125:609. 99. 99.Giurgescu C, Misra DP, Sealy-Jefferson S, Caldwell CH, Templin TN, Slaughter-Acey J, et al. (2015): The impact of neighborhood quality, perceived stress, and social support on depressive symptoms during pregnancy in African American women. Soc Sci Med 130:172–180. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.socscimed.2015.02.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25703670&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 100.100.Leff-Gelman P, Mancilla-Herrera I, Flores-Ramos M, Cruz-Fuentes C, Reyes-Grajeda JP, García-Cuétara MDP, et al. (2016): The Immune System and the Role of Inflammation in Perinatal Depression. Neurosci Bull 32:398–420. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12264-016-0048-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F18%2F2023.07.17.23292587.atom) 101.101.Koritala BSC, Porter KI, Arshad OA, Gajula RP, Mitchell HD, Arman T, et al. (2021): Night shift schedule causes circadian dysregulation of DNA repair genes and elevated DNA damage in humans. J Pineal Res 70:e12726. [1]: /embed/graphic-1.gif [2]: /embed/graphic-2.gif [3]: /embed/graphic-3.gif [4]: /embed/graphic-4.gif [5]: /embed/graphic-5.gif [6]: /embed/graphic-6.gif [7]: /embed/graphic-7.gif [8]: /embed/graphic-8.gif [9]: /embed/graphic-9.gif [10]: /embed/graphic-10.gif