Abstract
Introduction Data on race and ethnic susceptibility to SARS-CoV-2 infection are limited. We analyzed socio-demographic factors associated with higher likelihood of SARS-CoV-2 infection and explore mediating pathways for race disparities in the SARS-CoV-2 pandemic.
Methods Cross sectional analysis of COVID-19 Surveillance and Outcomes Registry (CURATOR), which captures data for a large healthcare system comprising of one central tertiary care, seven large community hospitals, and an expansive ambulatory / emergency care network in the Greater Houston area. Nasopharyngeal samples for individuals inclusive of all ages, races, ethnicities and sex were tested for SARS-CoV-2. We analyzed, socio-demographic (age, sex, race, ethnicity, household income, residence population density) and comorbidity (hypertension, diabetes, obesity, cardiac disease) factors. Multivariable logistic regression models were fitted to provide adjusted Odds Ratios (aOR), 95% confidence intervals (CI) for likelihood of positive SARS-CoV-2 test. Structural Equation Modeling (SEM) framework was utilized to explore three mediation pathways (low income, high population density, high comorbidity burden) for association between African American race and SARS-CoV-2 infection.
Results Among 4,513 tested individuals, 754 (16.7%) tested positive. Overall mean (SD) age was 50.6 (18.9) years, 62% females and 26% were African American. African American race was associated with lower socio-economic status, higher comorbidity burden, and population density residence. In the fully adjusted model, African American race (vs. White; aOR, CI: 1.84, 1.49-2.27) and Hispanic ethnicity (vs. non-Hispanic; aOR, CI: 1.70, 1.35-2.14) had a higher likelihood of infection. Older individuals and males were also at a higher risk of SARS-CoV-2 infection. The SEM framework demonstrated a statistically significant (p = 0.008) indirect effect of African American race on SARS-CoV-2 infection mediated via a pathway that included residence in densely populated zip code.
Conclusions There is strong evidence of race and ethnic disparities in the SARS-CoV-2 pandemic potentially mediated through unique social determinants of health.
Strengths and limitations of this studyOne of the first studies to systematically evaluate race and ethnic disparities in susceptibility to SARS-CoV-2 infection, while accounting for multiple sociodemographic characteristics and comorbidities
Study population represents a large and diverse metropolitan of the U.S. with data from one of the largest healthcare providers across the greater metropolitan area
Study evaluates potential mediation pathways for race disparities and demonstrates that residence in areas with high population density may mediate race disparities in susceptibility to SARS-CoV-2 infection
Single center study with limited information about true burden of comorbidity and lifestyle factors
Introduction
The Coronavirus (COVID-19) disease caused by infection with the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus is a pandemic that has thus far resulted in over 2 million cases across 170 countries in under 4 months. At the time of this reporting, the U.S. has approximately 30% of total global cases, and has surpassed all countries in terms of absolute number of cases and fatalities.1,2 Experts project these numbers to continue rising as widespread testing is instituted. The geographic distribution of cases across the U.S. suggests that the major pandemic burden has hit metropolitan areas such as New York; however, cases of COVID-19 have now been reported across all 50 states, the District of Columbia, Guam, Puerto Rico, the Northern Mariana Islands, and the U.S. Virgin Islands.3 As of April 18, the state of Texas had 18,260 reported cases of COVID-19, with approximately one-third in the Greater Houston area.4 The greater Houston area is home to approximately 7 million individuals, is the fourth-largest metropolitan area by population in the U.S. and is considered one of the nation’s most diverse regions.
Initial reports from China and Europe indicate that specific individuals such as the elderly; males; and people with comorbidities including hypertension, diabetes, obesity, coronary artery disease and heart failure have poor COVID-19 outcomes.5-8 As the pandemic spread over the continental U.S. during the last two months, patterns of high-risk phenotypes have started to emerge, and reports of poor outcomes (particularly high case fatality) among racial minorities have surfaced in the media.9-11 Though it is important to understand the determinants of poor outcomes among COVID-19 patients, it is equally imperative, from a public health perspective, to systematically examine the likelihood of SARS-CoV-2 infection across large diverse communities in the U.S. More specifically, data on potential higher likelihood of SARS-CoV-2 infection among racial and ethnic minorities across diverse U.S. metropolitan areas outside of New York are limited. Furthermore, the mediators of SARS-CoV-2 infection among racial and ethnic minorities have not been described.
We explored socio-demographic characteristics such as age, sex, race, ethnicity, median household zip code income, population density of residents’ zip codes, and health insurance status associated with positive SARS-CoV-2 testing in an urban and diverse population served by one of the leading healthcare systems of the greater Houston area. We further examined the association between pre-existing comorbidities and higher likelihood of SARS-CoV-2 infection in our study population. We hypothesized that older age, non-white race and ethnic minority status will be associated with significantly higher likelihood of SARS-CoV-2 infection, and factors such as low socio-economic status, residence in high population density areas (proxy for potential difficulties in social distancing) and higher comorbidity burden will mediate the effect of race on SARS-CoV-2 infection.
Methods
We analyzed data being contemporaneously collected since March 5, 2020 as a part of the COVID-19 Surveillance and Outcomes Registry (CURATOR) at the Houston Methodist Hospital system (HM). The Houston Methodist CURATOR has been approved by the HM Institutional Review Board (IRB) as an observational quality of care registry for all suspected and confirmed COVID-19 patients. CURATOR is populated from multiple data sources across the HM system such as electronic medical records, electronic databanks for laboratory and pharmacy, and electronic interactive patient interface tools. The HM system comprises a flagship tertiary care hospital in the Texas Medical Center, seven large community hospitals, a continuing care hospital, and multiple emergency centers and clinics throughout the Greater Houston area. Data from various sources are curated into a harmonized format, assessed for quality and integrity, and stored on a secure institutional HIPAA-compliant server.
We flagged all individuals who were tested for the SARS-CoV-2 using the real time Reverse Transcriptase (RT) Polymerized Chain Reaction (PCR) diagnostic panels. These assays were verified for quantitative detection of novel SARS-CoV-2 isolated and purified from nasopharyngeal swab specimens obtained from individuals and immersed in universal transport medium. Testing was carried out for symptomatic individuals or for individuals who had a self-reported history of exposure to a COVID-19 case including recent travel to other countries with high infection rates or hotspots within the U.S. Socio-demographic characteristics including age, sex, race, ethnicity, and payer-status (insurance type) were obtained from the HM CURATOR for analyses. We utilized the U.S. Census Bureau’s American Community Survey (ACS) 5-year data (2014-2018) to determine median household income by individual zip code tabulation areas (ZCTA).12 The median ZCTA household income was inflation-adjusted to 2018 USD. We also utilized the same data source to obtain population estimates by ZCTA, and calculated ZCTA level population density (population per mile square) by standardizing it for area measurements of ZCTA. For the purpose of population density determination, land area estimates were obtained from the Census Bureau’s U.S. Gazetteer Files 2010.13 In the absence of granular and precise social distancing data, we have utilized population density as a proxy for potential difficulties in social distancing among crowded communities.
We provide descriptive summary data as means (standard deviations) and proportions. We fit univariable and multivariable logistic regression models to assess unadjusted and adjusted association between socio-demographic characteristics and likelihood of being tested positive for the SARS-CoV-2. We determined a priori to include all variables (age, sex, race, ethnicity, zip code household income, insurance type, zip population density and comorbidities) in our final multivariable model. We assessed the model fit utilizing the Hosmer and Lemeshow goodness of fit test and crude and adjusted odds ratios (OR and aOR), and 95% confidence intervals (CI) are reported. Age, income and population density variables were categorized to improve model fit. Post-estimation marginal probabilities of SARS-CoV-2 infection were determined from the fully adjusted model for major covariates (race, ethnicity and age). A comorbidity burden score was calculated by assigning one point each for presence of hypertension, diabetes, obesity or a combination of Coronary Artery Disease / Myocardial Infarction / Congestive Heart Failure (CAD / MI / CHF). We explored the mediation influence of comorbidity burden, socio-economic status (median income), and lack of social distancing (population density) on the relationship between African American race and high likelihood of SARS-CoV-2 infection using the Generalized Structural Equation Modeling (GSEM) framework. The GSEM framework was set up to provide estimates of direct and indirect effect of African American race on SARS-CoV-2 infectivity. Statistically significant (p < 0.05) indirect effects represent full or partial mediation by a tested covariate. We included all individuals tested for SARS-CoV-2 across our healthcare system and did not perform formal sample size calculations.
Patient and public involvement: There was no direct patient involvement in the design and conduct of this study.
Results
From the HM CURATOR, during an approximate 5-week (37-day) time period, we identified a total of 4,513 presumed cases tested for SARS-CoV-2, among whom 754 (16.7%, 95% CI: 15.6 - 17.8) tested positive. Figure 1 represents temporal course of total, positive, and negative SARS-CoV-2 tests across the 37-day timeline in our hospital system.
Socio-Demographic and Comorbidity Characteristics of the Study Population
Overall, the mean (SD) age of the study population was 50.6 (18.9) years; 62% were female and 58% were Caucasian. The overall median (IQR) household income was USD $70,324 ($53,116-$97,747), and 39.8% of the study population had private or employer-based insurance. In our univariate analysis, African American race (vs. White; OR, CI: 1.52, 1.281.82), Hispanic (vs. non-Hispanic; OR, CI: 1.26, 1.04-1.54), and males (vs. females; OR, CI: 1.30, 1.11-1.51), were associated with significantly higher likelihood of testing positive for SARS-CoV-2. Furthermore, among the SARS-CoV-2 positive patients, 44% were in the age category of 51-75 years, and 11% were greater than 75 years. These proportions were significantly higher than the reference group (up to 35 years; OR, CI for 51-75 years vs. up to 35 years: 1.76, 1.42-2.18 and for >75 years vs. up to 35 years: 1.35, 1.01-1.79). Furthermore, individuals in higher pentiles of socio-economic status had significantly lower likelihood; whereas, those residing in higher population density ZCTAs had higher likelihood of SARS- CoV-2 infection. Among comorbidities, a significantly greater proportion of diabetic individuals had SARS-CoV-2 positive results (OR, CI: 1.40, 0.17 - 1.68). The socio-demographic characteristics and comorbidity profiles for the overall and SARS-CoV-2 positive and negative patients are summarized in Table 1.
Socio-demographic and comorbidity characteristics associated with African American Race
In order to understand the association between African American race and other sociodemographic factors, we compared age, sex, median income, population density, and comorbidity profile between African American and non-African American race. Although African Americans had higher proportion of younger individuals and greater proportion of females. A significantly higher proportion of African Americans had lower socio-economic status, resided in ZCTAs with higher population density, and had high comorbidity burden for hypertension, diabetes, obesity and CAD / MI / CHF. Table 2 provides univariable comparison of African Americans vs. Non-African Americans across various socio-demographic and comorbidity characteristics.
Multivariable Model and Marginal Probabilities for likelihood of SARS-Co V-2 infection
The significantly higher likelihood of SARS-CoV-2 infection among African Americans (compared to White) persisted after controlling for other demographics, insurance type, median household income, population density, and comorbidities. Adjusted odds ratios (CI) for African American vs. White: 1.84 (1.49-2.27). Our fully adjusted model estimated that Asians (vs. White) were also at a significantly higher risk of SARS-CoV-2 infection (aOR, CI: 1.46, 1.09–1.95). Furthermore, we also observed a statistically significant association between SARS-CoV- 2 infection and Hispanic ethnicity, aOR (CI): 1.70 (1.35-2.14). Higher risk of infection among males (compared to females) and higher likelihood of SARS-CoV-2 infection among elderly also remained statistically significant. Detailed output of the fully adjusted logistic regression model is presented in Table 3. The influence of African American race (vs. White) and Hispanic (vs. Non-Hispanic) ethnicity was observed uniformly across the age spectrum of 10 - 80 years. In other words, we did not observe effect modification by age for relationship between race / ethnicity and SARS-CoV-2 infection. However older age in itself remains significantly associated with higher likelihood of SARS-CoV-2 infection. Based on the marginal probabilities obtained from our fully adjusted model, the probability of SARS-CoV-2 infection in a 40-year-old African American is 18.8% whereas it is 12.9% in a 40-year-old White individual, all other adjusted variables being constant. At the age of 75 this probability is 29.0% for an African American, and 20.8% for a Caucasian. A similar relationship differential was observed for Hispanic vs. non-Hispanic. Probability of SARS-CoV-2 infection for African American vs. White, and for Hispanic vs. Non-Hispanic across age spectrum is presented in Figure 2 and Figure 3.
Generalized Structural Equation Modeling for Mediation by Income, Population Density and Comorbidity
Utilizing the GSEM framework, we determined the direct and indirect effects of African American race on SARS-CoV-2 infection with median income, population density and comorbidity score modeled as mediators in three separate equations. The indirect effect of African American race mediated through population density was statistically significant (p = 0.008); however, the indirect effects mediated via median income and comorbidity scores were not statistically significant (p = 0.31 and p = 0.38 respectively).
Discussion
There is emerging evidence of race disparities in the evolving COVID-19 pandemic across the continental U.S. Most reports indicate higher case fatality among African Americans across major U.S. metropolitan areas.9-11 However, robust insights on the racial and ethnic differences for SARS-CoV-2 infection are limited. This is perhaps because of comparatively homogenous populations in non-U.S. regions of the world. Houston, as an exceptionally ethnically diverse population center,14 is well suited for an investigation of racial, ethnic, and socioeconomic gradients in COVID-19 test positivity.
Our study adds to the current literature by analyzing emerging data for individuals being tested across one of the largest healthcare systems in the Greater Houston area; we report that racial minorities (non-Hispanic African American and Asian) are approximately 50-80% more likely to test positive for SARS-CoV-2 than the non-Hispanic White population. Our data also indicate that the Hispanic population is almost 70% more likely than non-Hispanics to be susceptible to SARS-CoV-2 infection. These findings illuminate systematic racial / ethnic disparities in testing positive for SARS-CoV-2 infection. Though there are limited prior SARS- CoV-2 data, such race and ethnic disparities have previously been described for the U.S. H1N1 influenza pandemic.15 These data indicated that Spanish-speaking Hispanics were at a greater risk of H1N1 infection primarily attributable to lack of healthcare access. Black people were also more susceptible to complications of H1N1 infection.15
We explored three possible mechanisms of race disparities in our data. These included lower socio-economic status, residence in higher population dense areas, and higher level of comorbidities. We demonstrate that African American race is significantly associated with all three potential disparity pathways, and in the traditional multivariable analyses, race and ethnic disparities persisted even after controlling for these pathways. However, our mediation analyses highlighted the potential influence of residence in high population density areas as a viable pathway that at least partially explains race disparity. Pathways mediating the influence of median income and comorbidity status did not demonstrate a significant effect. We utilized population density as a marker for potential inability to maintain adequate social distancing as it has been indicated that maintaining the WHO recommended safe distance between people becomes challenging with high population densities.16 Furthermore, overall effects of population density and disease spread has been previously described in literature.17,18 In addition to lack of social distancing, higher population density may also be associated with several other behavioral and socio-demographic attributes that may predispose to both viral spread and increased susceptibility. For example, there are reports linking obesity, lack of physical activity, and higher mortality with residence in densely populated neighborhoods.19,20
As reported, our data also corroborate that older populations may be more susceptible to SARS-CoV-2 infection;8 however, younger populations still have cause for concern, as nearly 1 in 4 of the infected cases in our sample were between 36-50 years. Finally, our data demonstrate that males may be approximately 30% more likely to test positive for the SARS-CoV-2 infection. Potential sex differences in infectivity to SARS-CoV-2 and intersectionality with racial and ethnic socioeconomic factors need to be explored further in future analyses. Additional policy-oriented research should prioritize study on the intersectionality of these vulnerable economic statuses and racial disparities in COVID infection indicated by the present study.
Findings of our study need to be interpreted in the light of certain limitations. First, our data are from a single center and may not be generalizable to the wider U.S. population. These findings need to be replicated in larger data sets across other large heterogenous U.S. metropolitans. However, the Houston metropolitan area is one of the most diverse and representative in the U.S.,14 and our healthcare system is one of the largest systems providing care to COVID-19 patients in the Greater Houston area. Our sample was composed of 26% Black, 19% Hispanic, and 62% female population. Second, we did not have information on certain demographic covariates such as education. Educational status has been linked to healthcare awareness and may be important to adjust for in analyses of potential disparities. However, we obtained and adjusted for zip code income data from the U.S. Census, as income has previously been shown to have strong correlation with educational attainment.21 Third, since testing was based on suspicion of infection and may have been influenced by factors such as access to care, the potential for selection bias cannot be ruled out. Finally, we did not have detailed information on comorbidities and their management in the study population. However, we did control for major comorbidities which are being reported as associated with COVID-19 outcomes.22
Conclusions
The strong association between racial and ethnic minorities and SARS-CoV-2 infection demonstrated in our data, even after adjustment for other important socio-demographic and comorbidity factors, highlight a potential catastrophe of inequality within the existential crisis of a global pandemic. Our data, representing a large heterogeneous U.S. metropolitan area, also provide preliminary evidence into the potential pathways for this disparity. It is highly likely that higher comorbidity burden and detrimental effects of adverse social determinants, including those that may not adequately permit safe practices of social distancing, mediate higher SARS- CoV-2 infectivity among racial and ethnic minorities.
As the pandemic continues to spread and evolve across the continental U.S., emerging data on association between SARS-CoV-2 infection and various socio-demographic factors will continue to enhance our understanding of targeted risks related to SARS-CoV-2 infection, and such data would enable us to comprehend healthcare services and access factors related to development and outcomes of COVID-19 among minority populations.
Data Availability
Data requests can be made to the corresponding author and will be evaluated by the appropriate institutional committees on a case by case basis.
IRB Approval
This work was carried out under an approved protocol for the Houston Methodist COVID-19 Surveillance and Outcomes Registry (HM CURATOR) by the Houston Methodist Research Institute Institutional Review Board (HMRI IRB).
Contribution Statement
FV: design, data analysis and interpretation, drafting the manuscript, critical revision for important intellectual content, final approval
JCN: data acquisition, data analysis, drafting the manuscript, final approval
OK: data acquisition, data analysis, drafting the manuscript, final approval
SLJ: data acquisition, data interpretation, critical revision for important intellectual content, final approval
FNM: critical revision for important intellectual content, final approval
HDS: critical revision for important intellectual content, final approval
RAP: critical revision for important intellectual content, final approval
JDA: critical revision for important intellectual content, final approval
BK: critical revision for important intellectual content, final approval
KN: design, interpretation of data, critical revision for important intellectual content, final approval
Competing Interests
No authors declare a competing interest to this work
Funding
This work was supported internally by the Houston Methodist Academic Institute.
Date sharing statement
All requests for de-identified data should be made to the corresponding author. All reasonable requests will be evaluated by the CURATOR Data Governance and Sharing Committee comprising of FV, SLJ, BK and KN in the light of institutional policies and guidelines.
Acknowledgment
The authors thank Jacob M. Kolman, Senior Scientific Writer of the Houston Methodist Center for Outcomes Research, for reviewing the language and format of the manuscript.