Contributions of occupation characteristics and educational attainment to racial/ethnic inequities in COVID-19 mortality ======================================================================================================================== * Ellicott C. Matthay * Kate A. Duchowny * Alicia R. Riley * Marilyn Thomas * Yea-Hung Chen * Kirsten Bibbins-Domingo * M. Maria Glymour ## Abstract **Background** Racial/ethnic inequities in COVID-19 mortality are hypothesized to be driven by education and occupation, but limited empirical evidence has assessed these mechanisms. **Objective** To quantify the extent to which educational attainment and occupation explain racial/ethnic inequities in COVID-19 mortality. **Design** Observational cohort. **Setting** California. **Participants** Californians aged 18-65 years. **Measurements** We linked all COVID-19-confirmed deaths in California through February 12, 2021 (N=14,783), to population estimates within strata defined by race/ethnicity, sex, age, USA nativity, region of residence, education, and occupation. We characterized occupations using measures related to COVID-19 exposure including essential sector, telework-ability, and wages. Using sex-stratified regressions, we predicted COVID-19 mortality by race/ethnicity if all races/ethnicities had the same education and occupation distribution as White people and if all people held the safest educational/occupational positions. **Results** COVID-19 mortality per 100,000 ranged from 15 for White and Asian females to 139 for Latinx males. Accounting for differences in age, nativity, and region, if all races/ethnicities had the education and occupation distribution of Whites, COVID-19 mortality would be reduced for Latinx males (−22%) and females (−23%), and Black males (−1%) and females (−8%), but increased for Asian males (+22%) and females (+23%). Additionally, if all individuals had the COVID-19 mortality associated with the safest educational and occupational position (Bachelor’s degree, non-essential, telework, highest wage quintile), there would have been 57% fewer COVID-19 deaths. **Conclusion** Educational and occupational disadvantage are important risk factors for COVID-19 mortality across all racial/ethnic groups, especially Latinx individuals. Eliminating avoidable excess risk associated with low-education, essential, on-site, and low-wage jobs may reduce COVID-19 mortality and inequities, but is unlikely to be sufficient to achieve equity. ## Introduction As of July 16, 2021, individuals identifying as Latinx were 2·3 times more likely to have died from COVID-19 than non-Latinx White persons in the United States.(1) Inequities for Black and American Indian/Alaska Native groups were similarly large.(1) The drivers of these inequities are poorly understood.(2) Research to date probing the drivers of racial/ethnic inequities in COVID-19 outcomes has focused primarily on risk factors such as age, sex, health status, and comorbidities, all of which inadequately explain racial/ethnic differences in COVID-19 cases, severe morbidity, and mortality.(3–6) A systematic review identified 20 studies examining factors underlying racial/ethnic inequities in COVID-19 outcomes and found they may be partially explained by differences in health care access and factors affecting exposure, but the strength of the evidence was low.(3) One potential driver of racial/ethnic inequities in COVID-19 outcomes is the unequal distribution of occupations across racial/ethnic groups—one manifestation of structural racism.(7,8) In the US, people of color disproportionately hold jobs that place them at higher risk for poor health outcomes.(8,9) This same pattern has prevailed during the COVID-19 pandemic. Black and Latinx workers were substantially more likely than White and Asian workers to hold low-wage essential jobs and jobs with the greatest risk of COVID-19 exposure relative to population size.(10) In turn, the risk of COVID-19 exposure, infection, and death varies notably by occupation. In California, many non-essential workers experienced no discernable increase in mortality during the pandemic while mortality increased nearly 60% for those in more vulnerable occupations such as sewing machine operators, cooks, and miscellaneous agricultural workers.(11) Little is known about the contribution of occupation to racial/ethnic disparities in COVID-19 outcomes. One study found that Black and Latinx adults with health conditions that place them at high risk for severe COVID-19 illness were more likely to live in households with essential workers and workers whose jobs cannot be done from home.(12) Several ecological studies suggest that racial/ethnic inequities in COVID-19 outcomes are related to the community-level density of workers in service or low-wage essential occupations, commuting on public transportation, or residing in crowded housing with an essential worker.(13–16) Workplace outbreaks of COVID-19 also appear to disproportionately affect non-White workers.(17) Yet, to our knowledge, no research has directly linked occupation to racial/ethnic inequities in COVID-19 outcomes. We tested the extent to which racial/ethnic inequities in COVID-19 mortality among the working-age population can be explained by differences in education and occupations. That is, we estimate the magnitude of inequities that would remain if all racial/ethnic groups had similar education and were equally likely to hold low-risk and high-risk jobs. We focus on California, a state of 40 million residents, the most COVID-19 deaths of any USA state, and large racial/ethnic inequities in COVID-19 mortality.(18) Because structural racism manifests not only in the jobs individuals hold, but also in the educational opportunities that determine job opportunities, we also consider the role of educational attainment in racial/ethnic inequities in COVID-19 outcomes. Because both the types of work available and the risk associated with work differ for males and females within racial/ethnic groups, we stratify all analyses by sex.(19,20) ## Methods ### Death data and measures From California death records, we identified all COVID-19-confirmed deaths occurring between January 1, 2020 and February 12, 2021. Records included the decedent’s race/ethnicity, sex (male or female), date of birth, date of death, and open text fields for primary occupation and industry, described as “type of work done during most of working life”. COVID-19 deaths were those with a primary International Classification of Diseases, Revision 10 diagnostic code of U071. We conceptualized race/ethnicity as socially defined categories that govern the distribution of risk, opportunities, and discrimination according to each group’s socially-assigned value and power.(21) We classified race/ethnicity as: Asian, Black, Latinx, White, and Other (including American Indian, Alaskan Native, Native Hawaiians, other Pacific Islanders, multi-race, and unspecified). Apart from Latinx, all racial/ethnic groups were non-Latinx. We used the National Institute for Occupational Safety and Health’s Industry and Occupation Computerized Coding System, an automated machine learning-based system, to convert the open text fields for occupation and industry to standardized 2010 Census Codes. Educational attainment categories were: no high school degree and no GED; high school degree or GED; some college or Associate’s degree; and Bachelor’s degree or beyond. Nativity was USA-born or foreign-born. Place of residence was grouped into ten California regions (Appendix Table 1). Given our focus on workers, we restricted to decedents aged 18-65 years at death, in alignment with prior research.(11) We calculated age using dates of birth and death and defined age groups: 18-24 years, 5-year age groups between ages 25 and 59 years, and 60-65 years. We defined population strata by cross-classifying all categories of all variables selected from the death records. The cross-classification of race/ethnicity, sex, age group, nativity, region, education, and occupational category (see below) created 3,672,000 total possible strata, of which 12,850 were represented in the death data. We then created a dataset comprised of stratum-level COVID-19 death counts by summing the number of COVID-19 deaths in each stratum. ### Population data and measures To characterize the population at risk of death, we used the 2019 American Community Survey (ACS) California person-level microdata. ACS is the primary national population surveillance system, surveying 3 million individuals annually, including 380,000 in California. Residents are legally required to respond. We defined strata using the same set of variables as in the death records. We converted 2018 Census occupation codes in the ACS to 2010 Census occupation codes using the Census Bureau crosswalk. Restricting to the same ages (18-65 years), we created population counts by summing the ACS person weights representing the number of people in each stratum (174,315 strata had non-zero populations). ### Occupational sector and characteristics We characterized occupations using multiple measures hypothesized to be related to SARS-CoV-2 exposure risk (Appendix “Occupational measures”; Appendix Table 2). First, as in previous research,(11) a team of three researchers manually categorized the 529 unique 2010 Census occupation codes into 9 occupational sectors based on the California official definition of essential work(22): facilities, food/agriculture, government/community, health/emergency, manufacturing, retail, transportation/logistics, not essential, and unemployed/not in labor force/missing. Second, we used Dingel and Neiman’s classification of which jobs can be done at home during the COVID-19 pandemic to link occupation codes to their telework-ability. This classification was based on a composite of job characteristics measures in the Bureau of Labor Statistics O*NET database. In secondary analyses, we also considered 13 individual O*NET measures (Appendix “Occupational measures”). Third, we linked each occupation code to its median annual wages as reported by the Bureau of Labor Statistics in May 2019. Individuals with lower incomes have less ability to forgo work or income when faced with undesired COVID-19 exposure risk. In secondary analyses, we considered other quantiles of annual and hourly wages for each occupation code. To merge the telework, O*NET, and wages measures to the death/population data, we used multiple occupation code crosswalks (Appendix “Occupation code crosswalks”). ### Construction of analytic dataset We treated the state of California as a cohort and appended (i.e. stacked) the death and ACS population data to create one strata-level dataset representative of the cohort experience. The outcome was COVID-19 death: rows derived from the death data were assigned a value of 1 and rows derived from the ACS data were assigned a value of 0. Each row was assigned a weight corresponding to the number of individuals it represented, i.e., within a given stratum, the number of individuals who died or who lived in California. These weights were included in all statistical analyses. Because the ACS does not indicate which individuals subsequently died of COVID-19, this data structure implies that the <0·1% of Californians aged 18-65 who died of COVID-19 are represented twice. Records with missing education or occupational measures were excluded (<1% of study population). The final analytic dataset included 187,165 rows representing 25,235,092 individuals aged 18-65 years. ### Statistical analysis Consistent with prior research studying sources of racial/ethnic health inequities, we treat race/ethnicity as the exposure, educational and occupational variables as potential mediators, and COVID-19 death as the outcome.(23) We conducted sex-stratified analyses using g-computation.(24) This approach allowed us to predict COVID-19 mortality for hypothetical alternative distributions of covariates and educational/occupational positions while flexibly modeling COVID-19 mortality risk. We fit three separate linear probability models:(25) First, we fit unadjusted linear regression models, regressing COVID-19 death on race/ethnicity. Second, we adjusted for measured compositional differences between racial/ethnic groups (age group, nativity, region). Third, we further adjusted for the hypothesized mediating variable(s) (education, essential sector, telework, wages, or all four measures simultaneously). To allow for non-additive and non-linear associations, we converted all continuous measures to quintile-based categorical measures and included all possible first-order interaction terms between independent variables. For occupational mediators, nonworkers (16% of the study population) were included as a distinct category. Using the fitted models, we predicted the COVID-19 mortality risk for each racial/ethnic group under four scenarios: 1. Unadjusted: the observed COVID-19 mortality risk predicted using the unadjusted models 2. Composition-adjusted: the COVID-19 mortality risk if all racial/ethnic groups had the same distribution of covariates (age, nativity, and region) as Whites (predicted using the covariate-adjusted models) 3. Composition- and mediator-adjusted: the COVID-19 mortality risk if all racial/ethnic groups had the same distribution of covariates and education/occupation mediator(s) as Whites (predicted using the covariate- and mediator-adjusted models) 4. Composition-adjusted with safest education/occupation: the COVID-19 mortality risk if all racial/ethnic groups had the same distribution of covariates as Whites and if all individuals, regardless of race/ethnicity, had been in the lowest-risk categories of education/occupation (i.e. Bachelor’s degree or higher, non-essential, telework-able, highest quintile of median annual wages) (predicted using the covariate- and mediator-adjusted models). This sequence of estimates quantifies how COVID-19 mortality would change for each racial/ethnic group under each hypothetical scenario. To convert to potential deaths averted, we multiplied differences in risk by corresponding population counts. We do not report confidence intervals because the study population constitutes the entire population of interest, not a sample from the population to which we made inferences. Because linear probability models can predict risks outside of [0,1], we tested the sensitivity of our results to using logistic regression. ### IRB approval This study was approved by the institutional review boards of the California Department of Public Health and the University of California, San Francisco. ### Role of the funding source This study was supported by the National Institutes of Health. The funders had no role in the study’s design, conduct, or reporting. ## Results Among people in California aged 18-65 years, there were 14,783 deaths attributed to COVID-19 between January 1, 2020 and February 12, 2021. With an estimated 25 million people at risk, this implies a COVID mortality risk of 59 per 100,000 persons. By design, the demographic composition of the study population matched that of the 2019 California population (Table 1). Compared to the general population, COVID-19 decedents were disproportionately older, male, and Latinx, with lower education and occupational positions (Table 1). For example, those with a high school education or less made up 36% of the study population but comprised 69% of COVID-19 deaths, while those holding jobs that are likely to be telework-able also made up 36% of the population but comprised only 18% of COVID-19 deaths. View this table: [Table 1:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T1) Table 1: Demographic, educational, and occupational characteristics of 2019 California population and California COVID-19 decedents ages 18-65 years, January 1, 2020-February 12, 2021 Table 2 presents unadjusted COVID-19 mortality by race/ethnicity and sex, and predicted mortality if all groups had the same composition (age, nativity, region of residence), education, or occupational risk as White people of the same sex (for risk differences, see Appendix Table 3). In unadjusted analyses, Latinx, Black, and other race/ethnicity groups had greater COVID-19 mortality than White people of the same sex. Gaps were particularly pronounced for Latinx people. For Asian females, COVID-19 mortality matched that of White females. Large differences in mortality persisted even after accounting for compositional differences in age, nativity, and region of residence. View this table: [Table 2:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T2) Table 2: Predicted COVID-19 mortality risks and percent change in mortality risk after accounting for racial/ethnic differences in composition, education, and occupational mediators, for individuals aged 18-65, per 100,000 persons, by race/ethnicity and sex, California, January 1, 2020-February 12, 2021 If all groups had the same composition and educational attainment as White people of the same sex, we predicted that COVID-19 mortality would be reduced (and therefore more equal) for Latinx females (−21%) and males (−18%), Black females (−8%) and males (−11%), and Other race/ethnicity females (−11%) and males (−7%). However, COVID-19 mortality inequalities would be exacerbated for Asian females (+31%) and males (+26%), reflecting that on average Asian people in California are more educated than Whites (Appendix Table 4). If all groups had the same composition and work sector or telework capacity as Whites, we predicted little change in COVID-19 mortality for females, but reduced mortality for Latinx males (6-10%) and increased mortality for Black males (+10%). The latter finding is consistent with the pattern that although Black people aged 18-65 in California on average hold far fewer non-essential jobs than Whites (26% vs. 40%), they are also more likely to be unemployed or not in the labor force (23% vs. 14%) (Appendix Table 4) which appears protective against COVID-19 mortality (Table 1). When equalizing wages, we predicted reduced COVID-19 mortality for Latinx females and males and other race/ethnicity females, but greater mortality for Asian males and little change for other groups. Models adjusting for other occupational measures showed attenuated but similar patterns (Appendix Table 5). Taken together, if each group had the education, essential sector, telework, and wages of their White counterparts, COVID-19 mortality would be reduced by 23% for Latinx females, 22% for Latinx males, and smaller amounts for Black and other/race ethnicity individuals, but increased by 23% for Asian females and 22% for Asian males (Table 2). Findings based on logistic regression rather than linear probability models were consistent with the main results (Appendix Table 6). Figure 1 summarizes the unadjusted and predicted COVID-19 mortality risk by race/ethnicity and sex under alternative distributions education and occupation. If all groups held the safest educational and occupational positions (Bachelor’s degree or higher, non-essential, telework-able occupation, highest quintile of median annual wages), we predict that COVID-19 mortality risk would be substantially reduced for all groups, with reductions ranging from 3 per 100,000 among Asian and White females to 75 per 100,000 among Latinx males. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/10/30/2021.10.29.21265628/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/F1) Figure 1: Unadjusted and predicted COVID-19 mortality for individuals aged 18-65, per 100,000 persons, by race/ethnicity and sex, under alternative compositional, educational, and occupational distributions, California, January 1, 2020-February 12, 2021 Legend: Estimates present the predicted COVID-19 mortality risk for each racial/ethnic-sex group under four scenarios: Unadjusted (the COVID-19 mortality risk as-observed); if all racial/ethnic groups had the same distribution of age, nativity, and region of residence as Whites; if all racial/ethnic groups had the same distribution of age, nativity, and region of residence, education, essential sector work, telework-able occupation, and wages as Whites; and if all racial/ethnic groups had the same distribution of age, nativity, and region of residence as Whites and if all individuals regardless of race/ethnicity had a Bachelor’s degree or higher and worked in non-essential, telework-able occupations in the highest quintile of median annual wages. Table 3 presents predicted deaths among working-age Californians averted by placing everyone in the safest educational and occupational positions. If every working-age Californian had had the COVID-19 mortality risk associated with the safest education and occupational strata, mortality would have decreased substantially for all groups, with the largest relative benefits for White females (−80%) and White males (−65%). The smallest relative benefits would have accrued to Black women, for whom mortality would have been only 8% lower. Overall, this shift was associated with 8,441 COVID-19 deaths averted, a 57% reduction, with the vast majority of this risk reduction among Latinx males (3,755 potential deaths averted) and females (2,329 potential deaths averted). View this table: [Table 3:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T3) Table 3: Predicted COVID-19 mortality reductions and potential deaths averted, for individuals aged 18-65, per 100,000 persons, by race/ethnicity and sex, if all groups held the safest educational and occupational positions, California, January 1, 2020-February 12, 2021 ## Discussion In this population-based analysis of working-age Californians, we found that large inequities in COVID-19 mortality by race/ethnicity can be partially explained by differences in educational attainment and measured occupational risk factors. The fraction of inequality in comparison to White people that was explained by education and occupation varied by race/ethnicity and sex. For Latinx males, 22% of their excess risk compared to White males might have been eliminated if they shared the risk associated with the education and occupational conditions of White males; for Latinx females, this number was 23%. For Asian individuals, existing educational and occupational differences apparently conferred an advantage compared to Whites. In all racial/ethnic groups, COVID-19 mortality would have been reduced substantially reduced if excess risk associated with educational and occupational disadvantage were eliminated. If COVID-19 mortality were no higher in low-education, essential, on-site, or low-wage jobs than in high-education, non-essential, telework-able, high-paying jobs, we estimate that around 57% of California’s working-age COVID-19 deaths would have been prevented. In terms of absolute deaths averted, this risk reduction would confer the most benefit for Latinx males and females. Despite these reductions, substantial racial/ethnic inequities in COVID-19 mortality remained, suggesting that eliminating excess risk associated with educational and occupational disadvantage alone is insufficient, and should be considered in complement to other steps to reduce inequities. Although these findings suggest that education and occupation play important roles in COVID-19 risk and inequities, the estimated percentage of inequity explained was smaller than we anticipated. Compared to measured occupational characteristics, education played a larger role in racial/ethnic inequities in COVID-19 mortality. Prior work has shown that lower educational attainment is associated with greater COVID-19 infection and mortality.(26–28) Inequities in education are driven by residential segregation, educational access, structural racism,(29) and immigration patterns. Education may influence COVID-19 mortality inequities because lower-educated individuals have less access to accurate information on COVID-19 prevention compared to people with more education.(30) Additionally, in the USA, education is tied to employment opportunities, which influence access to healthcare.(31) Healthcare access has been linked to racial/ethnic disparities in COVID-19 outcomes,(3) and despite reductions in racial/ethnic gaps in healthcare coverage after the Affordable Care Act, Black and Latinx adults remain less likely to be covered.(32) Education also conveys social status, social influence, and labor market resources. As a result, individuals with more education may be able to negotiate for safer environments at work or elsewhere, and refuse to participate in unsafe workplaces.(33) Our findings also underscore the importance of safeguarding essential workers in the highest-risk, lowest-wage occupations. The unequal distribution of risky work contributed to racial/ethnic mortality differences, particularly affecting Latinx individuals. We know how to protect workers: in California, those with high exposure risk but adequate protective policies, procedures, and equipment (e.g. physicians) did not experience increases in pandemic-era mortality.(11) This is especially important as novel pathogenic variants continue to evolve and many remain unvaccinated: despite highly efficacious vaccine availability, as of June 30, 2021, Latinx Californians constituted 63% of COVID-19 cases but only 29% of vaccinations received.(34) The particular relevance of occupational factors (e.g., essential work, telework, wages) for Latinx people may reflect occupational segregation in California, where the diverse Latinx population is highly differentiated by type of work and corresponding COVID-19 risk. Prior research has shown that Latinx people, in particular, are overrepresented in low-wage, precarious work in the essential sectors that saw the highest COVID-19 mortality in the state.(35) In contrast, our occupational measures did little to explain Black-White inequities. Given the pervasiveness of anti-Black racism, other social factors such as residential segregation, lack of healthcare access, intergenerational wealth inequalities, and less-measurable influences may contribute more to underlying risk for Black workers. Our findings underscore the importance of considering structural drivers of racial/ethnic inequities in COVID-19 outcomes. A major barrier to research on the role of upstream social and economic factors is the lack of surveillance data that incorporates information on race/ethnicity, education/occupation, and COVID-19 outcomes simultaneously. California death certificates are one of few sources with complete data on all three. Adequate surveillance data on modifiable determinants of inequities such as occupational exposures is urgently needed to advance research on COVID-19 disparities and guide public health efforts to prevent unnecessary COVID-19 deaths. This study has several limitations. First, we may underestimate the relevance of education and occupation because we used indirect measures of risk that were indexed by codes for primary occupation in life rather than direct measures of occupational exposure. Second, we did not account for within-household transmission initiated by an occupational exposure. We may underestimate the importance of occupation since household members with different occupation codes share COVID-19 risk associated with each other’s work. Third, we only included confirmed COVID-19 deaths, conceivably leading to an underestimate of true inequities: some racial/ethnic groups may be more likely to die at home without COVID-19 testing and therefore not be counted as a COVID-19-confirmed death.(36) Fourth, we could only control for potential confounders measured in both the death and ACS records. Absent further covariate adjustment, education and occupation may proxy for other factors such as social class, intergenerational wealth/debt, parental education, and non-citizen legal status.(37) Unmeasured potential confounders of particular concern are comorbidities, housing composition and density (including housing instability, homelessness, and incarceration), access to high-quality healthcare, and undocumented legal status. However, these factors may also be part of the causal pathway from structural racism, education, and occupation to COVID-19 death. ## Conclusions Educational attainment and occupation should be considered important risk factors for COVID-19 mortality. Even in one of America’s most progressive states, many people of color have been relegated to high-risk, devalued, unprotected jobs characterized by inadequate COVID-19 protections and limited resources and power to advocate for better protections. Unnecessary COVID-19 mortality is just one of many potential health and social consequences of this racial stratification.(7,8) Future COVID-19 mitigation strategies should include policies, protections, and vigilant monitoring such that workers in low-education, essential, on-site, and low-wage jobs are ensured no greater risk of COVID-19 mortality than workers with greater power and privilege. These steps are not a panacea, but they have the potential to save lives and reduce some racial/ethnic inequities in COVID-19 mortality. ## Data Availability Death data used for this analysis include person identifiers and cannot be shared publicly. They can be obtained directly from the California Department of Public Health with appropriate approvals. American Community Survey microdata are publicly available online from https://data.census.gov. ## Appendix ### Supplemental Methods View this table: [Appendix Table 1:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T4) Appendix Table 1: Definition of California regions #### Occupational measures We characterized occupations using multiple measures hypothesized to be related to SARS-CoV-2 exposure risk (Appendix Table 1). First, as in previous research,1 a team of three researchers manually categorized the 529 unique 2010 Census occupation codes into 9 occupational sectors based on the California official definition of essential work2 and retail work. The categories were: facilities, food/agriculture, government/community, health/emergency, manufacturing, retail, transportation/logistics, not essential, and unemployed/not in labor force/missing. Second, we used the O*NET database to link occupation codes to characteristics of each occupation. Overseen by the Bureau of Labor statistics, O*NET is based on surveys completed by employees, employers, and job experts, and includes measures of required knowledge and skills, typical tasks, exposures encountered, and the workplace environment for nearly 1,000 occupations. We used 13 O*NET measures deemed relevant to COVID-19 exposure risk in previous research3–6—for example, the importance of assisting and caring for others, or the importance of working with computers (Appendix Table 2). We also considered Dingel and Neiman’s classification of which jobs can be done at home during the COVID-19 pandemic (telework), which was based on a composite of O*NET measures.5 Third, individuals with lower incomes have less ability to not work or forgo income when faced with undesired COVID-19 exposure risk. We therefore characterized mean, median, 10th, 25th, 75th, and 90th percentiles of annual and hourly wages within each occupation code, based on the Bureau of Labor Statistics May 2019 report of wages for each occupation code. To merge the O*NET, telework, and wages measures to the death and population data, we used multiple available crosswalks from the occupational coding schemes used in each source to 2010 Census occupation codes (see Appendix “Occupation code crosswalks”). In our presentation of the results, we focus on the occupational measures of essential sector, telework, and median annual wages, because within the three major categories of measures (sector, O*NET-based measures, and wages) these measures were the strongest predictors of COVID-19 death in the study population. Results for the remaining occupational measures are presented in Appendix Table 5. View this table: [Appendix Table 2:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T5) Appendix Table 2: Occupation-based measures related to COVID-19 exposure risk #### Occupation code crosswalks ##### Death and American Community Survey records Death records included open text fields for primary occupation and primary industry, described as “type of work done during most of working life”. We used the National Institute for Occupational Safety and Health’s Industry and Occupation Computerized Coding System, an automated machine-learning based system, to convert the open text fields for occupation and industry to standardized 2010 Census Codes (529 unique codes). To align the 2010 Census occupation codes in the death records with American Community Survey (ACS) which used 2018 Census occupation codes (570 unique codes), we used the Census crosswalk.7 When two 2018 codes were assigned to the same 2010 code, we assigned them the same 2010 code. When one 2018 code was assigned to multiple 2010, we arbitrarily selected the first, recognizing that the ultimate occupational measures used in the analysis (telework, wages, etc.) would not distinguish between these sub-categories. ##### O*NET data To merge the O*NET data to the death/ACS data, we applied multiple crosswalks: from 2019 O*NET SOC codes (1016 unique codes) to 2018 SOC codes (867 unique codes),8 then from 2018 SOC codes to 2010 SOC codes (840 unique codes),9 then from 2010 SOC codes to 2010 Census codes (529 unique codes).7 At each stage, we applied the same procedures as above to assign codes that split or combined when transitioning from one coding scheme to the next. ##### Telework data To merge the Dingel and Neiman telework data to the death/ACS data, we applied multiple crosswalks: from 2010 O*NET SOC codes (1110 unique codes) to 2010 SOC codes (840 unique codes),10 then from 2010 SOC codes to 2010 Census codes (529 unique codes).7 At each stage, we applied the same procedures to assign codes that split or combined when transitioning from one coding scheme to the next. ##### Wages data To merge the US Bureau of Labor Statistics telework data to the death/ACS data, we applied multiple crosswalks: from the OES hybrid SOC coding scheme (824 unique codes) to 2010 SOC codes (840 unique codes),11 then from 2010 SOC codes to 2010 Census codes (529 unique codes).7 At each stage, we applied the same procedures to assign codes that split or combined when transitioning from one coding scheme to the next. #### Supplemental results View this table: [Appendix Table 4:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T6) Appendix Table 4: Predicted risk differences for individuals aged 18-65 per 100,000 population for the association of race/ethnicity with COVID-19 death accounting for each hypothesized mediator, by race/ethnicity and sex, California, January 1, 2020-February 12, 2021 View this table: [Appendix Table 4:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T7) Appendix Table 4: Demographic, educational, and occupational characteristics of study population by race/ethnicity View this table: [Appendix Table 5:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T8) Appendix Table 5: Predicted COVID-19 mortality risks for individuals aged 18-65, per 100,000 persons, by race/ethnicity and sex, if all groups had the same composition and occupational risks as Whites, California, January 1, 2020-February 12, 2021 View this table: [Appendix Table 6:](http://medrxiv.org/content/early/2021/10/30/2021.10.29.21265628/T9) Appendix Table 6: Predicted COVID-19 mortality risks for individuals aged 18-65, per 100,000 persons, by race/ethnicity and sex, if all groups had the same composition and educational/occupational risks as Whites, sensitivity analysis results based on logistic regression, California, January 1, 2020-February 12, 2021 ## Footnotes * Funding: This study was funded by the National Institute on Alcohol Abuse and Alcoholism (grant number K99AA028256). * Disclaimer: Death data for this study were provided by the California Department of Public Health. We acknowledge financial support from the National Institutes of Health. The analyses, interpretations, and conclusions of this article are attributable to the authors, and not to the California Department of Public Health or the National Institutes of Health. * Data availability: Death data used for this analysis include person identifiers and cannot be shared publicly. They can be obtained directly from the California Department of Public Health with appropriate approvals. American Community Survey microdata are publicly available online from [https://data.census.gov](https://data.census.gov). * Conflicts of interest: None declared. * Received October 29, 2021. * Revision received October 29, 2021. * Accepted October 30, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Centers for Disease Control and Prevention. Risk for COVID-19 Infection, Hospitalization, and Death By Race/Ethnicity [Internet]. Centers for Disease Control and Prevention. 2020 [cited 2021 Aug 2]. Available from: [https://www.cdc.gov/coronavirus/2019-ncov/covid-data/investigations-discovery/hospitalization-death-by-race-ethnicity.html](https://www.cdc.gov/coronavirus/2019-ncov/covid-data/investigations-discovery/hospitalization-death-by-race-ethnicity.html) 2. 2.Tai DBG, Shah A, Doubeni CA, Sia IG, Wieland ML. The Disproportionate Impact of COVID-19 on Racial and Ethnic Minorities in the United States. Clinical Infectious Diseases. 2021 Feb 15;72(4):703–6. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 3. 3.Mackey K, Ayers CK, Kondo KK, Saha S, Advani SM, Young S, et al. Racial and Ethnic Disparities in COVID-19–Related Infections, Hospitalizations, and Deaths. Ann Intern Med. 2020 Dec 1;174(3):362–73. 4. 4.Azar KMJ, Shen Z, Romanelli RJ, Lockhart SH, Smits K, Robinson S, et al. Disparities In Outcomes Among COVID-19 Patients In A Large Health Care System In California. Health Affairs. 2020 May 21;39(7):1253–62. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 5. 5.Escobar GJ, Adams AS, Liu VX, Soltesz L, Chen Y-FI, Parodi SM, et al. Racial Disparities in COVID-19 Testing and Outcomes. Ann Intern Med [Internet]. 2021 Feb 9 [cited 2021 Feb 8]; Available from: [https://www.acpjournals.org/doi/10.7326/M20-6979](https://www.acpjournals.org/doi/10.7326/M20-6979) 6. 6.Muñoz-Price LS, Nattinger AB, Rivera F, Hanson R, Gmehlin CG, Perez A, et al. Racial Disparities in Incidence and Outcomes Among Patients With COVID-19. JAMA Network Open. 2020 Sep 25;3(9):e2021892–e2021892. 7. 7.McClure ES, Vasudevan P, Bailey Z, Patel S, Robinson WR. Racial Capitalism Within Public Health—How Occupational Settings Drive COVID-19 Disparities. American Journal of Epidemiology. 2020 Nov 2;189(11):1244–53. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwaa126&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 8. 8.Laster Pirtle WN. Racial Capitalism: A Fundamental Cause of Novel Coronavirus (COVID-19) Pandemic Inequities in the United States. Health Educ Behav. 2020 Aug 1;47(4):504–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1090198120922942&link_type=DOI) 9. 9.Bhala N, Curry G, Martineau AR, Agyemang C, Bhopal R. Sharpening the global focus on ethnicity and race in the time of COVID-19. The Lancet. 2020 May 30;395(10238):1673–6. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 10. 10.Hawkins D. Differential occupational risk for COVID-19 and other infection exposure according to race and ethnicity. American Journal of Industrial Medicine. 2020;63(9):817–20. 11. 11.Chen Y-H, Glymour M, Riley A, Balmes J, Duchowny K, Harrison R, et al. Excess mortality associated with the COVID-19 pandemic among Californians 18–65 years of age, by occupational sector and occupation: March through November 2020. PLOS ONE. 2021 Jun 4;16(6):e0252454. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0252454&link_type=DOI) 12. 12.Selden TM, Berdahl TA. COVID-19 And Racial/Ethnic Disparities In Health Risk, Employment, And Household Composition. Health Affairs. 2020 Jul 14;39(9):1624–32. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1377/hlthaff.2020.00897&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 13. 13.Benitez J, Courtemanche C, Yelowitz A. Racial and Ethnic Disparities in COVID-19: Evidence from Six Large Cities. J Econ Race Policy. 2020 Dec 1;3(4):243–61. 14. 14.Figueroa JF, Wadhera RK, Lee D, Yeh RW, Sommers BD. Community-Level Factors Associated With Racial And Ethnic Disparities In COVID-19 Rates In Massachusetts. Health Affairs. 2020 Nov 1;39(11):1984–92. 15. 15.Figueroa JF, Wadhera RK, Mehtsun WT, Riley K, Phelan J, Jha AK. Association of race, ethnicity, and community-level factors with COVID-19 cases and deaths across U.S. counties. Healthcare. 2021 Mar 1;9(1):100495. 16. 16.Reitsma MB, Claypool AL, Vargo J, Shete PB, McCorvie R, Wheeler WH, et al. Racial/Ethnic Disparities In COVID-19 Exposure Risk, Testing, And Cases At The Subcounty Level In California. Health Affairs. 2021 Jun 1;40(6):870–8. 17. 17.Bui DP. Racial and Ethnic Disparities Among COVID-19 Cases in Workplace Outbreaks by Industry Sector — Utah, March 6–June 5, 2020. MMWR Morb Mortal Wkly Rep [Internet]. 2020 [cited 2021 Jun 28];69. Available from: [https://www.cdc.gov/mmwr/volumes/69/wr/mm6933e3.htm](https://www.cdc.gov/mmwr/volumes/69/wr/mm6933e3.htm) 18. 18.Chen Y-H, Glymour MM, Catalano R, Fernandez A, Nguyen T, Kushel M, et al. Excess Mortality in California During the Coronavirus Disease 2019 Pandemic, March to August 2020. JAMA Internal Medicine. 2021 May 1;181(5):705–7. 19. 19.Laster Pirtle WN, Wright T. Structural Gendered Racism Revealed in Pandemic Times: Intersectional Approaches to Understanding Race and Gender Health Inequities in COVID-19. Gender & Society. 2021 Apr 1;35(2):168–79. 20. 20.Hawkes S, Buse K. COVID-19 and the gendered markets of people and products: explaining inequalities in infections and deaths. Canadian Journal of Development Studies / Revue canadienne d’études du développement. 2020 Oct 23;():1–18. 21. 21.Jones CP. Invited Commentary: “Race,” Racism, and the Practice of Epidemiology. American Journal of Epidemiology. 2001 Aug 15;154(4):299–304. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/154.4.299&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11495851&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000170557700002&link_type=ISI) 22. 22.State of California. Essential workforce [Internet]. 2020 [cited 2021 Jul 6]. Available from: [https://covid19.ca.gov/essential-workforce/](https://covid19.ca.gov/essential-workforce/) 23. 23.Naimi AI, Schnitzer ME, Moodie EEM, Bodnar LM. Mediation Analysis for Health Disparities Research. American Journal of Epidemiology. 2016 Aug 15;184(4):315–24. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwv329&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27489089&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 24. 24.Snowden JM, Rose S, Mortimer KM. Implementation of G-Computation on a Simulated Data Set: Demonstration of a Causal Inference Technique. Am J Epidemiol. 2011 Apr 1;173(7):731–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwq472&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21415029&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289301200002&link_type=ISI) 25. 25.Angrist J, Pischke J-S. 3.1 Regression Fundamentals. In: Mostly Harmless Econometrics. Princeton, New Jersey: Princeton University Press; 2008. 26. 26.Hawkins RB, Charles EJ, Mehaffey JH. Socio-economic status and COVID-19–related cases and fatalities. Public Health. 2020 Dec 1;189:129–34. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.puhe.2020.09.016&link_type=DOI) 27. 27.Chadeau-Hyam M, Bodinier B, Elliott J, Whitaker MD, Tzoulaki I, Vermeulen R, et al. Risk factors for positive and negative COVID-19 tests: a cautious and in-depth analysis of UK biobank data. International Journal of Epidemiology. 2020 Oct 1;49(5):1454–67. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/9je/dyaa134&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 28. 28.Abedi V, Olulana O, Avula V, Chaudhary D, Khan A, Shahjouei S, et al. Racial, Economic, and Health Inequality and COVID-19 Infection in the United States. J Racial and Ethnic Health Disparities. 2021 Jun 1;8(3):732–42. 29. 29.Bailey ZD, Krieger N, Agénor M, Graves J, Linos N, Bassett MT. Structural racism and health inequities in the USA: evidence and interventions. The Lancet. 2017 Apr 8;389(10077):1453–63. 30. 30.Alobuia WM, Dalva-Baird NP, Forrester JD, Bendavid E, Bhattacharya J, Kebebew E. Racial disparities in knowledge, attitudes and practices related to COVID-19 in the USA. Journal of Public Health. 2020 Aug 18;42(3):470–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/pubmed/fdaa069&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 31. 31.Zuvekas SH, Taliaferro GS. Pathways To Access: Health Insurance, The Health Care Delivery System, And Racial/Ethnic Disparities, 1996–1999. Health Affairs. 2003 Mar 1;22(2):139–53. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToiaGVhbHRoYWZmIjtzOjU6InJlc2lkIjtzOjg6IjIyLzIvMTM5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMTAvMzAvMjAyMS4xMC4yOS4yMTI2NTYyOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 32. 32.Chen J, Vargas-Bustamante A, Mortensen K, Ortega AN. Racial and Ethnic Disparities in Health Care Access and Utilization Under the Affordable Care Act. Med Care. 2016 Feb;54(2):140–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/MLR.0000000000000467&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26595227&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 33. 33.Patler C, Gleeson S, Schonlau M. Contesting Inequality: The Impact of Immigrant Legal Status and Education on Legal Knowledge and Claims-Making in Low-Wage Labor Markets. Social Problems [Internet]. 2020 Oct 14 [cited 2021 Aug 18];(spaa029). Available from: [https://doi.org/10.1093/socpro/spaa029](https://doi.org/10.1093/socpro/spaa029) 34. 34.Pham O, Jun 30 NPP, 2021. Latest Data on COVID-19 Vaccinations by Race/Ethnicity [Internet]. KFF. 2021 [cited 2021 Jul 7]. Available from: [https://www.kff.org/coronavirus-covid-19/issue-brief/latest-data-on-covid-19-vaccinations-race-ethnicity/](https://www.kff.org/coronavirus-covid-19/issue-brief/latest-data-on-covid-19-vaccinations-race-ethnicity/) 35. 35.Riley AR, Chen Y-H, Matthay EC, Glymour MM, Torres JM, Fernandez A, et al. Excess death among Latino people in California during the COVID-19 pandemic. medRxiv. 2021 Jan 25;2020.12.18.20248434. 36. 36.Wrigley-Field E, Garcia S, Leider JP, Robertson C, Wurtz R. Racial Disparities in COVID-19 and Excess Mortality in Minnesota. Socius. 2020 Jan 1;6:2378023120980918. 37. 37.Braveman PA, Cubbin C, Egerter S, Chideya S, Marchi KS, Metzler M, et al. Socioeconomic Status in Health ResearchOne Size Does Not Fit All. JAMA. 2005 Dec 14;294(22):2879–88. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.294.22.2879&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16352796&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000233826700029&link_type=ISI) ## References 1. Chen Y-H, Glymour M, Riley A, et al. Excess mortality associated with the COVID-19 pandemic among Californians 18–65 years of age, by occupational sector and occupation: March through November 2020. PLOS ONE 2021; 16: e0252454. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0252454&link_type=DOI) 2. State of California. Essential workforce. 2020. [https://covid19.ca.gov/essential-workforce/](https://covid19.ca.gov/essential-workforce/) (accessed July 6, 2021). 3. Hawkins D. Differential occupational risk for COVID-19 and other infection exposure according to race and ethnicity. American Journal of Industrial Medicine 2020; 63: 817–20. 4. Baker MG. Nonrelocatable Occupations at Increased Risk During Pandemics: United States, 2018. Am J Public Health 2020; 110: 1126–32. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 5. Dingel JI, Neiman B. How many jobs can be done at home? Journal of Public Economics 2020; 189: 104235. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F10%2F30%2F2021.10.29.21265628.atom) 6. Baker MG. Who cannot work from home? Characterizing occupations facing increased risk during the COVID-19 pandemic using 2018 BLS data. medRxiv 2020; : 2020.03.21.20031336. 7. US Census Bureau. 2018 Census Occupation Code List with Crosswalk. 2020; published online Aug 31. [https://www2.census.gov/programs-surveys/demo/guidance/industry-occupation/2018-occupation-code-list-and-crosswalk.xlsx](https://www2.census.gov/programs-surveys/demo/guidance/industry-occupation/2018-occupation-code-list-and-crosswalk.xlsx) (accessed July 21, 2021). 8. O*NET Resource Center. Crosswalk O*NET-SOC 2019 to 2018 SOC - Occupational Listings at O*NET Resource Center. [https://www.onetcenter.org/taxonomy/2019/soc.html](https://www.onetcenter.org/taxonomy/2019/soc.html) (accessed July 21, 2021). 9. US Bureau of Labor Statistics. Crosswalk from the 2010 SOC to the 2018 SOC. 2017; published online Dec 6. [https://www.bls.gov/soc/2018/soc\_2010\_to\_2018\_crosswalk.xlsx](https://www.bls.gov/soc/2018/soc\_2010_to_2018_crosswalk.xlsx) (accessed July 21, 2021). 10. O*NET Resource Center. Updating the O*NET-SOC Taxonomy: Incorporating the 2010 SOC Structure at O*NET Resource Center. 2010; published online Dec. [https://www.onetcenter.org/reports/Taxonomy2010.html](https://www.onetcenter.org/reports/Taxonomy2010.html) (accessed July 21, 2021). 11. US Bureau of Labor Statistics. OES 2019 Hybrid Structure. 2020; published online March 31. [https://www.bls.gov/oes/oes\_2019\_hybrid\_structure.xlsx](https://www.bls.gov/oes/oes_2019_hybrid_structure.xlsx).