Impact of individual-level characteristics and transmission mitigation behaviors on SARS-CoV-2 infection and seroprevalence in a large Northern California Bay Area cohort ========================================================================================================================================================================== * Cameron Adams * Mary Horton * Olivia Solomon * Marcus Wong * Sean L. Wu * Sophia Fuller * Xiaorong Shao * Indro Fedrigo * Hong L. Quach * Diana L. Quach * Michelle Meas * Luis Lopez * Abigail Broughton * Anna L. Barcellos * Joan Shim * Yusef Seymens * Samantha Hernandez * Magelda Montoya * Darrell M. Johnson * Kenneth B. Beckman * Michael P. Busch * Josefina Coloma * Joseph A. Lewnard * Eva Harris * Lisa F. Barcellos ## Abstract Comprehensive data on transmission mitigation behaviors and SARS-CoV-2 infection and serostatus are needed from large, community-based cohorts to identify SARS-CoV-2 risk factors and impact of public health measures. From July 2020 to March 2021, ≈5,500 adults from the East Bay Area, California were followed over three data collection rounds. We estimated the prevalence of antibodies from SARS-CoV-2 infection and COVID-19 vaccination, and self-reported COVID-19 test positivity. Population-adjusted SARS-CoV-2 seroprevalence was low, increasing from 1.03% (95% CI: 0.50-1.96) in Round 1 (July-September 2020), to 1.37% (95% CI: 0.75-2.39) in Round 2 (October-December 2020), to 2.18% (95% CI: 1.48-3.17) in Round 3 (February-March 2021). Population-adjusted seroprevalence of COVID-19 vaccination was 21.64% (95% CI: 19.20-24.34) in Round 3. Despite >99% of participants reporting wearing masks, non-Whites, lower-income, and lower-educated individuals had the highest SARS-CoV-2 seroprevalence and lowest vaccination seroprevalence. Our results demonstrate that more effective policies are needed to address these disparities and inequities. ## Introduction The first confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in the California Bay Area was reported on February 28, 2020.(1) In response, measures to prevent transmission were implemented soon after and included shelter-in-place orders, mask mandates, business and school closures, and social distancing recommendations. Despite these measures, spikes in reported infections occurred from July to September 2020 and December 2020 to February 2021.(2) Local public health and healthcare systems experienced major challenges in preventing infections, identifying COVID-19 cases, and ensuring adherence to transmission mitigation strategies. Furthermore, disparities and inequities in COVID-19 have been observed in the United States(3) and there are few studies investigating the association of risk behaviors with infection outcomes at the individual level.(4–6) A detailed understanding of the effectiveness of transmission mitigation behaviors and sociodemographic factors that contribute to the dipropionate impact of COVID-19 in vulnerable communities is critical. Public health and policy directives aimed at controlling ongoing transmission, developing future prevention strategies, and targeting health disparities and inequities must evidence-based. Large population-representative cohorts with individual-level data on social and behavioral factors(7) associated with COVID-19 and SARS-COV-2 infection and serostatus(8–12) are needed. To address this need, we investigated individual-level characteristics and mitigation behaviors that contributed to SARS-CoV-2 seroprevalence, self-reported infection, and viral infection, and other outcomes in a large, population-based sample of over 5,500 individuals from 12 East Bay cities in Northern California followed longitudinally. During three time periods from July 2020 to April 2021, we estimated the population-adjusted prevalence of SARS-CoV-2 outcomes and differences by age, sex, race/ethnicity, ZIP code, and other demographic strata as well as the effect of transmission mitigation behavior on SARS-CoV-2 prevalence. ## Methods ### Recruitment and participants Recruitment and selection of study participants was completed using a screening phase followed by a longitudinal study phase with three rounds of data collection (Figure 1). In the screening phase, all residential addresses within the East Bay cities and communities of Albany, Berkeley, El Cerrito, El Sobrante, Emeryville, Hercules, Kensington, Oakland, Piedmont, Pinole, Richmond, and San Pablo (∼307,000 residential households) were mailed an invitation to complete a consent form and screening questionnaire. In a household, the individual aged 18 years or older with the next birthday was eligible to participate. Additional eligibility criteria included living within the study region, willingness to provide biospecimens and questionnaire responses, ability to read and speak English or Spanish, and having internet access and an email address. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/05/2021.12.02.21266871/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/F1) Figure 1. Study flow chart. The target sample size was 5,500 participants. The distribution of racial and ethnic identification in the screening questionnaire responses was more White and non-Hispanic than the region population. To obtain a sample that resembled the racial and ethnic proportions in the 2018 American Community Survey (ACS) for the study region, we ranked assigned ranks for order of inclusion to all eligible individuals who responded to the screening questionnaire. Black and/or Hispanic individuals were ranked the highest followed by other non-Whites. Order of inclusion for Whites was randomly sampled. In Round 1, individuals ranked between 1 and 5,500 were offered study enrollment. Those who declined to enroll were replaced with next highest ranked individuals who had not been offered study entry. In subsequent rounds, individuals who had participated in the previous round(s) were offered participation in the next round. If participation was declined, individuals from the pool of participants who had not participated in a study round were invited. Approximate dates for each round were July-September 2020, October-December 2020, and February-March 2021. All participants provided their informed consent for the screening phase. All participants in the study phase provided their informed consent for each study round. The study was approved by the University of California, Berkeley Committee on Protection of Human Subjects (Protocol #2020-03-13121). ### Study procedures At the start of each round, eligible participants were invited to participate. Those who agreed to participate received a kit via FedEx containing materials for self-collection of biospecimens, pre-paid return shipping labels, and instructions to complete an online-administered questionnaire at the same time as biospecimen collection. ### Questionnaire The questionnaire addressed sex, gender, age, race/ethnicity, income, employment, physical and mental health, as well as symptoms potentially related to COVID-19 within the previous 2 weeks, and SARS-CoV-2 testing outside of the study. Participants were also asked about behaviors that might affect risk of SARS-CoV-2 infection including physical distancing practices, close contacts with others, and mask wearing. Questionnaires were available in English or Spanish. ### SARS-CoV-2 viral and antibody testing Anterior nasal nare swabs for viral RNA testing and dried blood spots (DBS) for antibody testing were collected from participants at each study round. Quantitative reverse transcription PCR (RT-qPCR) was used to identify SARS-CoV-2 viral infection. Three tests were used to assess anti-SARS-CoV-2 antibodies in DBS: Ortho VITROS® Anti-SARS-CoV-2 Total IgG and spike IgG ELISA targeted antibodies against the SARS-CoV-2 spike protein (indicating prior natural infection or vaccination), and Roche-NC Total IgG targeted antibodies against the nucleocapsid (NC) protein (indicating prior natural infection only).(13) Before COVID-19 vaccines were available in the study region during rounds 1 and 2, detection of antibodies against the SARS-CoV-2 spike protein was considered evidence of SARS-CoV-2 infection. During Round 3, vaccinations were widely available, therefore detection of antibodies against the NC protein were considered evidence of SARS-CoV-2 infection, while detection of antibodies against the spike protein were considered evidence of SARS-CoV-2 infection or COVID-19 vaccination (Supplement S-4.2, Figure S-1). ### SARS-CoV-2 outcomes The following outcomes were investigated in each round: (1) cumulative SARS-CoV-2 antibody positivity, (2) participants’ self-reported history of SARS-CoV-2 positivity from RT-qPCR testing outside the study, (3) and a surveillance definition of “probable COVID-19 case” derived from self-reported symptoms and close contact with infected individuals(14), and (4) viral SARS-CoV-2 positivity from RT-qPCR testing of nasal swabs (Supplement 4.2). We also investigated antibodies induced by COVID-19 vaccination only in Round 3. SARS-CoV-2 antibody positivity was defined as having detectable antibodies in the current and/or previous round(s). DBS samples that tested both negative for anti-NC antibodies and positive for anti-spike antibodies in Round 3 were considered to have antibodies from COVID-19 vaccination alone. Self-reported COVID-19 viral positivity prevalence was defined as the proportion of participants reporting a positive viral test among all participants who reported being tested within a study round. Probable COVID-19 case prevalence was defined as the proportion of participants identified as a probable COVID-19 case among all participants who provided valid responses within a study round. Viral positivity prevalence was defined as testing positive by RT-qPCR from nasal swab samples. ## Statistical analysis ### Population-adjusted seroprevalence and other SARS-CoV-2 outcomes Bayesian multilevel regression and poststratification (MRP) was used to estimate population-adjusted cumulative seroprevalence, self-reported SARS-CoV-2 viral positivity prevalence at each study round, and “probable COVID-19” prevalence at each study round. MRP is a regression-based method for estimating population and sub-population averages from survey data that has been shown to perform better than survey weighting, particularly with sparse data (15,16). In addition to estimation of regional prevalence of our outcomes, we estimated prevalence within demographic groups and geographic areas in the study region. Variables of interest were gender, age, race, Hispanic ethnicity, income, education, household size, and ZIP code. We used a method described by Leeman and colleagues to generate a synthetic population for poststratification using data from the 2018 American Community Survey (ACS) and the Public Use Microdata Sample (PUMS).(16) Poststratification was done using binary sex because gender is not reported by the ACS. Race and ethnicity were combined into a single variable to reduce the number of poststratification strata. At each study round, binary SARS-CoV-2 outcomes were modeled as a function of geographic and demographic characteristics using multilevel logistic regression models. Participant sex was included as a fixed effect. Vectors of random intercepts were defined for each category of race/ethnicity, age, education, income, household size, and ZIP code and two-way interactions between ZIP code, race/ethnicity, educational attainment, income, and age. To improve estimation of geographic effects we allowed for spatial correlations using the modified Besag-York-Mollié model and included the proportion of Spanish speaking households within the ZIP code of residence from the ACS as a fixed effect (Supplement S-6).(17) ### SARS-CoV-2 outcome prevalence and measures of association We report populated-adjusted prevalence of SARS-CoV-2 outcomes across the study region and within geographic and demographic groups of interest. To calculate prevalence estimates, posterior distributions of the relevant poststratification stratum were aggregated. We also estimated prevalence differences (PD) and prevalence ratios (PR) for the association between populated-adjusted SARS-CoV-2 outcomes and race/ethnicity, education, and sex. For each parameter of interest, the mean of the posterior distribution was the point estimate, and the 95% credible interval (CI) was the 2.5% and 97.5% quantiles of a posterior distribution. ### SARS-CoV-2 test-kit bias corrected seroprevalence We estimated cumulative SARS-CoV-2 seroprevalence at each study round adjusted for the net sensitivity and specificity of the SARS-CoV-2 antibody testing algorithm in each study round (Supplement S-6). ### Transmission mitigation behavior analyses In each round, participants were asked about physical distancing practices, recent close contacts with others, mask wearing, and other behaviors and activities that might affect the risk of SARS-CoV-2 infection. We classified participants into two behavior categories, “high-risk” and “low-risk”, with those responses using latent class analysis.(18) Crude associations between behavior categories and characteristics such as sex, age, race/ethnicity, education, and income were assessed with *χ*2tests. Associations between high-risk vs. low-risk behavior and within round SARS-CoV-2 seroprevalence and self-reported test positivity were estimated using the MRP model described above with random intercepts for behavior categories and interactions between the behavior categories and ZIP code, age, race/ethnicity, education, and income (Supplement S-5 and S-6). Statistical analyses were completed in R 4.0.2. NIMBLE was used to implement MRP models.(19) Detailed descriptions of MRP methods and code are provided in Supplementary Methods and at github.com/adams-cam/ebcovid_prev. ## Results ### Enrollment and characteristics of study participants Of the 16,115 residents who consented and completed the screening procedures between May-July 2020, 1,777 did not satisfy inclusion criteria and were excluded (Figure 1). Characteristics of participants are presented in Table 1 and Table S-1. Participation rates were high (Round 1: 76.8%, Round 2: 89.8%, and Round 3: 87.3%), and participants identified predominantly as female (∼63%). Those aged 45-64 years were the largest age group of participants across all study rounds (ranging from 37.3% to 39.4%). Most participants identified as White (52.5% to 63.3%), followed by Asian/Pacific Islander (13.9% to 15.7%), Hispanic (11.0% to 15.6%), two or more races (6.9% to 9.1%), African American or Black (3.0% to 4.9%), and Native American/Alaska Native or other (1.7% to 2.2%). View this table: [Table 1.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/T1) Table 1. Characteristics of participants at each round of the study compared to study region population. Of those who completed the questionnaire, 87.3%, 95.3%, and 96.6% provided DBS and 93.6%, 98.1% and 98.% provided nasal swabs in rounds 1, 2, and 3, respectively (Table 2). Antibodies against the SARS-CoV-2 spike protein were detected in 29 (0.6%) and 33 (0.6%) of DBS in rounds 1 and 2, respectively. In Round 3, NC antibodies from natural infection alone were detected in 84 participants (1.8% of 4,806) and spike antibodies from natural infection or vaccination were detected in 1,452 participants (31.3% of 4,806). Viral infection was detected in less than three nasal swabs in each round. The proportion of participants reporting both receiving a SARS-CoV-2 test outside the study and testing positive within a study round increased over the study period; 10/1,030 (1.0%) participants reporting a positive COVID-19 result in Round 1, 19/2,059 (0.9%) in Round 2, and 53/1892 (2.8%) in Round 3. In all rounds, few participants met the criteria for being a COVID-19 probable case (<0.5%) (Table S-2). View this table: [Table 2.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/T2) Table 2. Prevalence of SARS-CoV-2 related outcomes among participants per study round. ### Population adjusted SARS-CoV-2 outcome prevalence in study region Population adjusted SARS-CoV-2 seroprevalence, self-reported COVID-19 test positivity, and probable COVID-19 cases are reported in Table 3. Overall, populated-adjusted SARS-CoV-2 natural infection seroprevalence was low across the study region: Round 1 (July-September 2020) 1.03% (95% CI 0.50-1.96), Round 2 (October-December 2020 1.37% (0.75-2.39), and Round 3 (February-March 2021) 2.18% (95% CI 1.48-3.17). In Round 3 the populated-adjusted seroprevalence of COVID-19 vaccination was 21.64% (95% CI: 19.2, 24.34). View this table: [Table 3.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/T3) Table 3. Estimated population-adjusted prevalence (%) and 95% credible intervals within study region of SARS-CoV-2 outcomes. Models incorporating sensitivity and specificity of the antibody assays yielded lower SARS-CoV-2 seroprevalence estimates in Round 1, similar estimates in Rounds 2 and 3, and a higher estimate of COVID-19 vaccine seroprevalence in Round 3. Population adjusted self-reported test positivity to SARS-CoV-2 was similar in Rounds 1 (1.11%, 95 CI: 0.39-2.40) and 2 (1.29%, 95% CI: 0.55, 2.17) to seroprevalence estimates and increased to 4.58% (95% CI: 2.56-7.64) in Round 3. Population-adjusted prevalence of being a COVID-19 probable case was <1% across rounds (Figure S-2). ### SARS-CoV-2 seroprevalence and infection vary by geographic area There was evidence for spatial differences in both populated-adjusted seroprevalence and self-reported test positivity (Figure 2). The northern areas (Richmond, San Pablo, Pinole, and Hercules) and southern areas (East Oakland) of the study region had higher seroprevalence than Berkeley, El Cerrito, and North/Downtown Oakland. Self-reported test positivity was also higher in the northern and southern areas. These trends were consistent across study rounds. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/05/2021.12.02.21266871/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/F2) Figure 2. Population adjusted, A) cumulative seroprevalence of SARS-CoV-2 antibodies to natural infection, and B) self-reported test positivity prevalence for (July 2020-April 2021) across the study region. ### Population adjusted seroprevalence and self-reported test positivity prevalence within subgroups Differences in populated-adjusted seroprevalence were observed among demographic groups (Figure 3). In Round 1, those aged less than 45 years had higher likelihood of antibody positivity than those aged 45 or older, but this relationship reversed over time (Figure 3B). In general, non-Whites consistently had a higher likelihood of antibody positivity than Whites (Figure 3C). The likelihood of antibody positivity also differed by household income and educational attainment, with those reporting a household income <$100,000 and less than a college degree having a higher likelihood of antibody positivity than their comparison groups (Figures 3D-E). Those in households with more than four people had a higher likelihood of antibody positivity than those in households with four or less people (Figure 3F). There was no clear relationship between sex and populated-adjusted seroprevalence (Figure 3A). Similar relationships were seen between demographic groups and self-reported test prevalence (Figure S-3). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/05/2021.12.02.21266871/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/F3) Figure 3. Cumulative seroprevalence of SARS-CoV-2 anti-S antibodies (Rounds 1 and 2) and anti-NC antibodies (Round 3) (July 2020-April 2021) among demographic subgroups, A) sex, B) age, C) race/ethnicity, D) income, E) education, and F) household size. Abbreviations: AA, African American; AMI, American Indian; PI, Pacific Islander. ### COVID-19 vaccination seroprevalence differs by race/ethnicity and age In Round 3, populated-adjusted COVID-19 vaccination seroprevalence was lower in all non-White groups compared to Whites, (White vs. Non-White, PD: -4.35%, 95% CI: (−9.32, - 0.35), Figure 4, Table 4). The difference between racial/ethnic groups was largest among older participants; non-Whites had a lower COVID-19 vaccination seroprevalence compared to Whites among those aged between 65-74 (PD: -11.67%, 95% CI: (−20.98, -2.49)) and those aged 75 or older (PD: -10.15%, 95% CI: (−22.25, 0.43)). View this table: [Table 4.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/T4) Table 4. Round 3 seroprevalence of antibodies from COVID-19 vaccination within race/ethnicity and age groups and prevalence differences between non-Whites and Whites. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/05/2021.12.02.21266871/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/F4) Figure 4. Round 3 population-adjusted nucleocapsid (natural infection) seroprevalence compared to spike (natural infection or vaccination) seroprevalence within strata of race/ethnicity and age. Each point represents a stratum of race/ethnicity and age within a ZIP code, the size of the point is proportional to American Community Survey population size. ### Mask-wearing and association between high-risk vs. low-risk behavior and seroprevalence More than 99% of participants reported ever wearing a mask 99% reported wearing a mask during leisure and exercise activities, >91% reported wearing a mask at work, and >88% reported wearing a mask while shopping (Table S-3). After clustering participants into “low-risk” and “high-risk” groups according to self-reported mitigation behaviors, most participants were considered low-risk across the study rounds (70%, 82%, and 77%, respectively; Figure 5A). Behaviors with the largest differences in high-versus low-risk behavior were reporting “yes” to: left the house for work, medical/healthcare, care of relative, or other, worked with potential COVID-19 contact, attended a gathering, and traveled to county outside of residence within last two weeks (Figure 5B). Age, race/ethnicity, and education were all associated (p<0.001) with high-vs. low-risk groups (Table S-4), but we did not observe strong evidence for an association between mitigation behavior and either seroprevalence or self-reported test positivity (Table 5), although the point estimates indicate that “low-risk” behavior was associated with lower seroprevalence. View this table: [Table 5.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/T5) Table 5. Association between seroprevalence and self-reported test positivity and high-risk vs. low-risk mitigation behavior within each study round. ![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/05/2021.12.02.21266871/F5.medium.gif) [Figure 5.](http://medrxiv.org/content/early/2021/12/05/2021.12.02.21266871/F5) Figure 5. Proportion of participants stratified by high- and low-risk mitigation behaviors in each study round, A) overall, and B) according to specific behavior. ## Discussion In the current study, we investigated individual-level characteristics and behaviors that contributed to SARS-CoV-2 related outcomes, including seroprevalence and self-reported infection, in a large, population-based sample of over 5,500 participants from 12 East Bay (Northern California) cities. During three time periods from July 2020 to March 2021, we estimated the population-adjusted prevalence of SARS-CoV-2 outcomes across the study region and within strata of age, sex, race/ethnicity, ZIP code, and household size. We then characterized behaviors to mitigate transmission of SARS-CoV-2 and their effects on related outcomes. Overall, prevalence of SARS-CoV-2 outcomes for natural infection were low which may be attributable to the high percentage of mask-wearing and other risk-mitigating behaviors among our participants. COVID-19 vaccination seroprevalence estimates in Round 3 was greater than 20%, with non-Whites having much lower seroprevalence than Whites. Despite the low overall SARS-CoV-2 seroprevalence and infection observed in our study, ZIP code of residence, age, racial/ethnic identification, education, and income all contributed to differences in seroprevalence. Specifically, non-Whites, those without a college degree, and lower income households, had higher seroprevalence. Further, ZIP codes with higher proportions of Spanish speakers had higher populated-adjusted seroprevalence estimates (Figure S-4). These differences persisted despite the high rates of mask wearing reported by our study sample, further adding to strong evidence that the risk of COVID-19 is distributed unequally and that structural inequities play an important role in COVID-19 risk.(20–23) Moreover, during Round 3 (February-March 2021), COVID-19 vaccines were widely available across the study region. We found that Whites had much higher prevalence of antibodies against the SARS-CoV-2 spike protein in Round 3, indicative of vaccination status, than non-Whites, as reported elsewhere.(3,24) Notably, this difference was largest among those aged 65 or older. These findings demonstrate that in the first few months of vaccine availability in the Bay Area, large disparities in vaccination rates by race/ethnicity were observed among older persons, and Whites, the group with the lowest prevalence of SARS-CoV-2 infection, were more likely to be vaccinated, further underscoring the inequities that exist surrounding the coronavirus pandemic. Another key finding was that almost all participants reported wearing masks. This contrasts with models of mask usage reported by the Institute for Health Metrics and Evaluation which reported mask usage ranging between 75-82% from December 2020 through March 2021 in California.(25,26) Mask wearing is one the most effective behaviors for controlling community spread of SARS-CoV-2 infection.(27) The high rate of mask usage by study participants may partially explain why we did not detect a differences between high-risk and low-risk mitigation behavior and SARS-CoV-2 prevalence, and partially explain why our estimates of SARS-CoV-2 seroprevalence and self-report test positivity were lower than public case reports. However, a recent study estimating SARS-CoV-2 seroprevalence from blood donors reported seroprevalence in the Bay Area region of California that were similar to our estimates through December 2020.(28) A major strength of this study was the longitudinal design and collection of individual-level data, including biospecimens for antibody and virus testing, which is challenging but much needed in current studies of the pandemic. Comprehensive data on social distancing, self-quarantine, mask wearing, working from home, and other public health and SARS-CoV-2 mitigation efforts are needed to inform current and future strategies of prevention and for whom to target prevention efforts. At-home collection of biospecimens, including dried blood spots for antibody testing, made it possible to regularly test participants without any in-person interaction. This was a critical feature, particularly early in the pandemic, when recommendations were to travel only for essential purposes and to limit in-person interactions. At-home sample collection was used to obtain more than 30,000 biospecimens and is a feasible approach for even larger populations and geographic regions. One limitation of this study was the under-representation of certain demographics in our sample. Supplementary mailings of recruitment postcards in Spanish were sent to residences in ZIP codes with high proportions of Spanish speaking households. We also placed recruitment flyers in local grocery stores and conducted outreach to community organizations, local government officials, and school districts in the study region. Despite these efforts, non-Whites, males, lower income households, and individuals with lower education and from lower socioeconomic ZIP codes were underrepresented in our sample. Unhoused individuals were also not captured in our sample. This was important given evidence that individuals who identify as Hispanic or Black, and other underrepresented groups, are at the highest risk for COVID-19.(23) These discrepancies may have led to underestimation of overall SARS-CoV-2 seroprevalence and self-reported infection, although the MRP models may have somewhat minimized this effect by pooling information across similar observed characteristics in the sample data. Our seroprevalence estimates were consistent with weekly case prevalence reported by public health agencies in ZIP codes with high response rates (Figure S-5).(29) Additionally, there may be unmeasured confounding from variables not included in the analyses and self-selection bias from some participants in COVID-19 research who may be more fervent adherers to COVID-19 public health measures. Finally, some individuals do not seroconvert after infection or vaccination. Our results underscore the substantial and persistent inequities that exist surrounding the coronavirus pandemic. Non-Whites, lower-income, and lower-educated individuals had the highest SARS-CoV-2 seroprevalence. This disparity in seroprevalence was observed despite near universal rates of mask wearing in our sample. We also observed large differences in COVID-19 vaccination seroprevalence between racial and ethnic groups. More work must be done to address these disparities and inequities, such as allocation of resources for high-risk communities and strategies to mitigate the structural barriers posed by social and structural determinants of health. ## Supporting information Supplementary Methods, Tables, and Figures [[supplements/266871_file03.pdf]](pending:yes) ## Data Availability De-identified individual participant data that underlie the results reported in this article (text, tables, figures may be shared for up to 36 months following publication after investigators whose proposed use of the data has been approved by an independent review committee. For individual participant data analysis or meta-analysis, proposals should be directed to Lisa Barcellos (lbarcellos{at}berkeley.edu) and Lynn Hollyer (lhollyer{at}berkeley.edu). Requests will be reviewed by an independent review committee and the UC Berkeley Institutional Review Board. Data may be shared upon approval. Data requestors will need to sign a data access agreement. ## Acknowledgments We want to acknowledge the contributions of the following individuals to this study: Ella Parsons, Jordan Keen, Janine Solomon, Jose Salinas, Kevin Duong, Joseph Egbunikeokye, Maya Talavera, Riya Shrestha, Colin Warnes, José Victor Zambrana, Nicholas Lo, Parnal Narvekar, Fausto Bustos, Gregorio Dias, Reinaldo Mercado-Hernandez, Julia Huffaker, Raymond Montes, Alexandra Zermeno, Alejandra Zeiger, William Dow, Michael Lu, Lila Krop, Kelly Lam, Yan Zhang, Sarah Folkmanis, Sophie Zhai, Dingjun Chen, Ruben Vargas Ethan Garcia, Oliver Li, Manisha Sahoo, Raina Walencewicz, Sophia Wang, Antonia Gibbs, Amrita Ramanathan, Catherine Livelo, Taylor Worley, Amanda Tanaka, Savinnie Ho, Jane Liu Ryan Allen, Sofia Soltero, Victoria Van Metter, Madeleine Fraix, Allie Coyne, Subeksha Sharma Lydia Yu, Shreeya Garg, Sanjeet Paluru, Malika Saxena, Talia Panadero, Ayra Rahman, Joshua Calangian, Dharaa Upadhyaya, Sophia Kemp, Ruhi Parikh, Amy Rich, Sophie Manoukian, Nola Vu, Crystal Nguyen, Jordyn Pinochi, Alma Kuc, Siri Ylenduri, Manvir Kaur, Angikaar Chana, and Sannidhi Sarvadhavabhatla, Benjamin T. Auch, Dinesha Walek, Evan Forsberg, Jerry Daniel, Veronica Tonnell, Ji Hyun (Jay) Kim, Mary Nieuwenhuis. Creative Testing Solutions: Valerie Green, Sherri Cyrus, Phillip Willamson, Brett Hirsch, Paul Contestable, Mars Stone, Joe Derisi, Emily Crawford, Emily Ahlvin, Armando Diaz, and Favianna Rodriquez. * Received December 2, 2021. * Revision received December 2, 2021. * Accepted December 5, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Eby K. Coronavirus Timeline: Tracking major moments of COVID-19 pandemic in San Francisco Bay Area [Internet]. ABC7 San Francisco. 2020 [cited 2021 Nov 29]. Available from: [https://abc7news.com/6047519/](https://abc7news.com/6047519/) 2. 2.United States. Agency for Toxic Substances and Disease Registry. Geospatial Research A and SP, editor. CDC COVID data tracker. 2020 Sep 15; Available from: [https://stacks.cdc.gov/view/cdc/96118](https://stacks.cdc.gov/view/cdc/96118) 3. 3.Njoku A, Joseph M, Felix R. Changing the Narrative: Structural Barriers and Racial and Ethnic Inequities in COVID-19 Vaccination. International Journal of Environmental Research and Public Health. 2021 Jan;18(18):9904. 4. 4.Qualls N. Community Mitigation Guidelines to Prevent Pandemic Influenza — United States, 2017 MMWR Recomm Rep [Internet]. 2017 [cited 2021 Nov 10];66. Available from: [https://www.facebook.com/CDCMMWR](https://www.facebook.com/CDCMMWR) 5. 5.CDC. Interim pre-pandemic planning guidance : community strategy for pandemic influenza mitigation in the United States : early, targeted, layered use of nonpharmaceutical interventions [Internet]. [cited 2021 Nov 10]. Available from: [https://stacks.cdc.gov/view/cdc/11425](https://stacks.cdc.gov/view/cdc/11425) 6. 6.Bell DM, World Health Organization Working Group on International and Community Transmission of SARS. Public health interventions and SARS spread, 2003. Emerg Infect Dis. 2004 Nov;10(11):1900–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid1011.040729&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15550198&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000224997700002&link_type=ISI) 7. 7.Hutchins HJ. COVID-19 Mitigation Behaviors by Age Group — United States, April–June 2020. MMWR Morb Mortal Wkly Rep [Internet]. 2020 [cited 2021 Nov 10];69. Available from: [https://www.cdc.gov/mmwr/volumes/69/wr/mm6943e4.htm](https://www.cdc.gov/mmwr/volumes/69/wr/mm6943e4.htm) 8. 8.Havers FP, Reed C, Lim T, Montgomery JM, Klena JD, Hall AJ, et al. Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States, March 23-May 12, 2020. JAMA Internal Medicine. 2020 Dec 1;180(12):1576–86. 9. 9.Bendavid E, Mulaney B, Sood N, Shah S, Ling E, Bromley-Dulfano R, et al. COVID-19 Antibody Seroprevalence in Santa Clara County, California. medRxiv. 2020 Apr;2020.04.14.20062463-2020.04.14.20062463. 10. 10.Sood N, Simon P, Ebner P, Eichner D, Reynolds J, Bendavid E, et al. Seroprevalence of SARS-CoV-2–Specific Antibodies Among Adults in Los Angeles County, California, on April 10-11, 2020. JAMA. 2020 Jun 16;323(23):2425–7. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) 11. 11.Self WH. Seroprevalence of SARS-CoV-2 Among Frontline Health Care Personnel in a Multistate Hospital Network — 13 Academic Medical Centers, April–June 2020. MMWR Morb Mortal Wkly Rep [Internet]. 2020 [cited 2021 Nov 10];69. Available from: [https://www.cdc.gov/mmwr/volumes/69/wr/mm6935e2.htm](https://www.cdc.gov/mmwr/volumes/69/wr/mm6935e2.htm) 12. 12.Rosenberg ES, Tesoriero JM, Rosenthal EM, Chung R, Barranco MA, Styer LM, et al. Cumulative incidence and diagnosis of SARS-CoV-2 infection in New York. Ann Epidemiol. 2020 Aug;48:23-29.e4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.annepidem.2020.06.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32648546&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) 13. 13.Wong MP, Meas MA, Adams C, Hernandez S, Green V, Montoya M, et al. Development and Implementation of Dried Blood Spot-based COVID-19 Serological Assays for Epidemiologic Studies. medRxiv. 2021 Jan 1;2021.11.25.21266786. 14. 14.CSTE Interim Position Statement: Update to COVID-19 Case Definition - Council of State and Territorial Epidemiologists [Internet]. [cited 2021 Oct 13]. Available from: [https://www.cste.org/news/520707/CSTE-Interim-Position-Statement-Update-to-COVID-19-Case-Definition.htm](https://www.cste.org/news/520707/CSTE-Interim-Position-Statement-Update-to-COVID-19-Case-Definition.htm) 15. 15.Gelman A, Little TC. Poststratification Into Many Categories Using Hierarchical Logistic Regression. Survey Research. 1997 Dec;23:127–35. 16. 16.Leemann L, Wasserfallen F. Extending the Use and Prediction Precision of Subnational Public Opinion Estimation: EXTENDING USE AND PRECISION OF MrP. American Journal of Political Science. 2017 Oct;61(4):1003–22. 17. 17.Simpson D, Rue H, Riebler A, Martins TG, Sørbye SH. Penalising Model Component Complexity: A Principled, Practical Approach to Constructing Priors. Statist Sci [Internet]. 2017 Feb 1 [cited 2021 Sep 15];32(1). Available from: [https://projecteuclid.org/journals/statistical-science/volume-32/issue-1/Penalising-Model-Component-Complexity--A-Principled-Practical-Approach-to/10.1214/16-STS576.full](https://projecteuclid.org/journals/statistical-science/volume-32/issue-1/Penalising-Model-Component-Complexity--A-Principled-Practical-Approach-to/10.1214/16-STS576.full) 18. 18.Linzer DA, Lewis JB. poLCA: An R Package for Polytomous Variable Latent Class Analysis. Journal of Statistical Software. 2011 Jun 14;42:1–29. 19. 19.de Valpine P, Turek D, Paciorek CJ, Anderson-Bergman C, Lang DT, Bodik R. Programming with models: writing statistical algorithms for general model structures with NIMBLE. Journal of Computational and Graphical Statistics. 2017 Apr 3;26(2):403–13. 20. 20.Reitsma MB, Claypool AL, Vargo J, Shete PB, McCorvie R, Wheeler WH, et al. Racial/Ethnic Disparities In COVID-19 Exposure Risk, Testing, And Cases At The Subcounty Level In California. Health Affairs. 2021 Jun 1;40(6):870–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1377/hlthaff.2021.00098&link_type=DOI) 21. 21.Paremoer L, Nandi S, Serag H, Baum F. Covid-19 pandemic and the social determinants of health. BMJ. 2021 Jan 29;372:n129. 22. 22.Abrams EM, Szefler SJ. COVID-19 and the impact of social determinants of health. Lancet Respir Med. 2020 Jul;8(7):659–61. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) 23. 23.Magesh S, John D, Li WT, Li Y, Mattingly-app A, Jain S, et al. Disparities in COVID-19 Outcomes by Race, Ethnicity, and Socioeconomic Status: A Systematic-Review and Meta-analysis. JAMA Network Open. 2021 Nov 11;4(11):e2134147. 24. 24.Agarwal R, Dugas M, Ramaprasad J, Luo J, Li G, Gao G (Gordon). Socioeconomic privilege and political ideology are associated with racial disparity in COVID-19 vaccination. PNAS [Internet]. 2021 Aug 17 [cited 2021 Nov 10];118(33). Available from: [https://www.pnas.org/content/118/33/e2107873118](https://www.pnas.org/content/118/33/e2107873118) 25. 25.Reiner RC, Barber RM, Collins JK, Zheng P, Adolph C, Albright J, et al. Modeling COVID-19 scenarios for the United States. Nat Med. 2021 Jan;27(1):94–105. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33097835&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) 26. 26.IHME | COVID-19 Projections [Internet]. Institute for Health Metrics and Evaluation. [cited 2021 Oct 1]. Available from: [https://covid19.healthdata.org/](https://covid19.healthdata.org/) 27. 27.Brooks JT, Butler JC. Effectiveness of Mask Wearing to Control Community Spread of SARS-CoV-2. JAMA. 2021 Mar 9;325(10):998–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2021.1505&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33566056&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F05%2F2021.12.02.21266871.atom) 28. 28.Jones JM, Stone M, Sulaeman H, Fink RV, Dave H, Levy ME, et al. Estimated US Infection- and Vaccine-Induced SARS-CoV-2 Seroprevalence Based on Blood Donations, July 2020-May 2021. JAMA [Internet]. 2021 Sep 2 [cited 2021 Sep 30]; Available from: [https://doi.org/10.1001/jama.2021.15161](https://doi.org/10.1001/jama.2021.15161) 29. 29.The Los Angeles Times’ open-source archive of California coronavirus data [Internet]. Los Angeles Times Data and Graphics Department; 2021 [cited 2021 Oct 12]. Available from: [https://github.com/datadesk/california-coronavirus-data](https://github.com/datadesk/california-coronavirus-data)