Abstract
Background Infectious disease surveillance systems, which largely rely on diagnosed cases, underestimate the true incidence of SARS-CoV-2 infection, due to under-ascertainment and underreporting. We used repeat serologic testing to measure N-protein seroconversion in a well-characterized cohort of U.S. adults with no serologic evidence of SARS-CoV-2 infection to estimate the incidence of SARS-CoV-2 infection and characterize risk factors, with comparisons before and after the start of the SARS-CoV-2 vaccine and variant eras.
Methods We assessed the incidence rate of infection and risk factors in two sub-groups (cohorts) that were SARS-CoV-2 N-protein seronegative at the start of each follow-up period: 1) the pre-vaccine/wild-type era cohort (n=3,421), followed from April to November 2020; and 2) the vaccine/variant era cohort (n=2,735), followed from November 2020 to June 2022. Both cohorts underwent repeat serologic testing with an assay for antibodies to the SARS-CoV-2 N protein (Bio-Rad Platelia SARS-CoV-2 total Ab). We estimated crude incidence and sociodemographic/epidemiologic risk factors in both cohorts. We used multivariate Poisson models to compare the risk of SARS-CoV-2 infection in the pre-vaccine/wild-type era cohort (referent group) to that in the vaccine/variant era cohort, within strata of vaccination status and epidemiologic risk factors (essential worker status, child in the household, case in the household, social distancing).
Findings In the pre-vaccine/wild-type era cohort, only 18 of the 3,421 participants (0.53%) had >1 vaccine dose by the end of follow-up, compared with 2,497/2,735 (91.3%) in the vaccine/variant era cohort. We observed 323 and 815 seroconversions in the pre-vaccine/wild-type era and the vaccine/variant era and cohorts, respectively, with corresponding incidence rates of 9.6 (95% CI: 8.3-11.5) and 25.7 (95% CI: 24.2-27.3) per 100 person-years. Associations of sociodemographic and epidemiologic risk factors with SARS-CoV-2 incidence were largely similar in the pre-vaccine/wild-type and vaccine/variant era cohorts. However, some new epidemiologic risk factors emerged in the vaccine/variant era cohort, including having a child in the household, and never wearing a mask while using public transit. Adjusted incidence rate ratios (aIRR), with the entire pre-vaccine/wild-type era cohort as the referent group, showed markedly higher incidence in the vaccine/variant era cohort, but with more vaccine doses associated with lower incidence: aIRRun/undervaccinated=5.3 (95% CI: 4.2-6.7); aIRRprimary series only=5.1 (95% CI: 4.2-7.3); aIRRboosted once=2.5 (95% CI: 2.1-3.0), and aIRRboosted twice=1.65 (95% CI: 1.3-2.1). These associations were essentially unchanged in risk factor-stratified models.
Interpretation In SARS-CoV-2 N protein seronegative individuals, large increases in incidence and newly emerging epidemiologic risk factors in the vaccine/variant era likely resulted from multiple co-occurring factors, including policy changes, behavior changes, surges in transmission, and changes in SARS-CoV-2 variant properties. While SARS-CoV-2 incidence increased markedly in most groups in the vaccine/variant era, being up to date on vaccines and the use of non-pharmaceutical interventions (NPIs), such as masking and social distancing, remained reliable strategies to mitigate the risk of SARS-CoV-2 infection, even through major surges due to immune evasive variants. Repeat serologic testing in cohort studies is a useful and complementary strategy to characterize SARS-CoV-2 incidence and risk factors.
SHORT ABSTRACT This study used repeat serologic testing to estimate infection rates and risk factors in two overlapping cohorts of SARS-CoV-2 N protein seronegative U.S. adults. One mostly unvaccinated sub-cohort was tracked from April to November 2020 (pre-vaccine/wild-type era, n=3,421), and the other, mostly vaccinated cohort, from November 2020 to June 2022 (vaccine/variant era, n=2,735). Vaccine uptake was from 0.53% and 91.3% in the pre-vaccine and vaccine/variant cohorts, respectively. Corresponding seroconversion rates were 9.6 and 25.7 per 100 person-years. In both cohorts, sociodemographic and epidemiologic risk factors for infection were similar, though new risks emerged in the vaccine/variant era, such as having a child in the household. Despite higher incidence rates in the vaccine/variant cohort, vaccine boosters, masking, and distancing likely reduced infection risk, even through major variant surges. Repeat serologic testing in cohorts is a useful and complementary strategy to characterize incidence and risk factors.
Funding The work was supported by the CUNY Institute for Implementation Science in Population Health, the U.S. National Institutes of Allergy and Infectious Diseases (NIAID), Pfizer, Inc., and the U.S. National Institute of Mental Health (NIMH).
INTRODUCTION
Infectious disease surveillance systems, which largely rely on diagnosed case counts, emergency department visits, hospital admissions, and deaths, underestimate the true incidence of infection due to asymptomatic infections, under-ascertainment/diagnosis, and underreporting to health departments. This has proved to be a challenge during the COVID-19 pandemic in the United States (US)1,2, where the national surveillance system relied on the reporting of positive SARS-CoV-2 real-time reverse-transcription polymerase chain reaction (RT-PCR) test results by providers and laboratories. These data have effectively been used as a proxy for the incidence and prevalence of SARS-CoV-2 infection in the US, including to inform pandemic policy decisions, often with no efforts to adjust for underestimation due to asymptomatic infections, testing and healthcare access, evolving testing practices, behaviors, and policies.2–5 Studies have shown that estimates of infection based on seroprevalence far exceed the number of diagnosed and reported cases.2,6–9 These issues have posed major challenges to using surveillance data to assess the true burden of infection and related risk factors and inform decision-making in an evolving pandemic10 These issues are magnified with the end of the national emergency declaration in May 2023, as surveillance for SARS-CoV-2 further dismantles, testing behaviors change and official case counts continue to fall.11
The risk of SARS-CoV-2 infection is determined by multiple factors, including frequency of exposure, underlying medical conditions, vaccination status, virus properties and their evolution, levels of community transmission, and individual behaviors, such as the use of non-pharmaceutical interventions (NPIs) like masking and social distancing.12–15 Vaccines, which were authorized by the FDA on December 11, 2020 and started to become more widely available in the United States in March 202116, have dramatically reduced the risk of severe disease and death from SARS-CoV-217, including during the Alpha (March-June 2021), Delta (June-December 2021) and Omicron (December 2021-present) variant eras.18 However, with the emergence of the Delta variant19 and particularly during the Omicron era, vaccine effectiveness against infection reduced and waned quickly after vaccination, including after a booster.18,20–23,15–18 Recent population-representative, cross-sectional studies during successive Omicron surges have shown a high point prevalence of SARS-CoV-2 infection among those who were previously vaccinated and boosted.2,24
Alongside variant surges, starting in late 2021, the United States relaxed public health policies and guidelines recommending or requiring quarantine and isolation, masking, social distancing, and remote K-12 schooling in response to the increasing availability and uptake of vaccines and the availability of effective therapeutics.25–29 National policies have increasingly emphasized the importance of using vaccines and boosters to reduce the burden of severe disease and healthcare system strain, with less emphasis on NPIs for preventing infection.14 These policy choices and related public health messaging likely impacted individual-level and community-level risk factors and behavior, resulting in less utilization of NPIs such as masking, social distancing, and more. 30,31
Beyond basic demographics and geography, risk factors for SARS-CoV-2 infection (vs. diagnoses), including asymptomatic infection, have not been well-characterized in the vaccine and variant eras, either via routine case-based surveillance or by cross-sectional seroprevalence studies in population-based samples.10,32,33 Moreover, the degree of the potential protective effect of vaccines for preventing SARS-CoV-2 infection has not been examined in a population-based prospective cohort with systematically gathered, time-updated information on risk factors, behaviors, and infection status derived from infection-induced seroconversions.
Within a well-characterized national prospective cohort of US adults (for whom we previously characterized risk factors for seroincident SARS-CoV-2 infection in the pre-vaccine/wild-type era15), we used repeat serologic testing to assess the incidence of SARS-CoV-2 infection and risk factors among those who were N protein seronegative by March 2021 (the start of the variant era), with vaccine uptake and infection status assessed through June 2022.
METHODS
Recruitment
We used internet-based strategies34–36 to recruit a geographically and socio-demographically diverse cohort of 6,740 adults into longitudinal follow-up with at-home, dried blood spot (DBS) specimen collection. To be eligible for inclusion in the cohort, individuals had to: 1) reside in the United States or a U.S. territory; 2) be ≥18 years of age; 3) provide a valid email address for follow-up; and 4) demonstrate early engagement in study activities (provision of a baseline specimen for serologic testing or completion of >1 recruitment/enrollment visit). Details of the study design and recruitment procedures36 and a pre-vaccine/wild-type era serology-based incidence study in this cohort15 are described elsewhere. Cohort enrollment was completed between March 28 and August 21, 2020, during which baseline demographic data collection took place. The full cohort includes participants from all 50 U.S. states, the District of Columbia, Puerto Rico, and Guam.
Study population
The study population (n=3,582) was divided into two overlapping sub-groups (henceforth cohorts, Figure 1) which broadly correspond to those with a seronegative specimen during Serology Period 1 with at least one follow-up specimen (pre-vaccine/wild-type era cohort, n=3,421), and those with a seronegative specimen during Serology Period 2 with at least one follow-up specimen (vaccine/variant era cohort, n=2,735) (Figure 2).37,38 The vaccine/variant era cohort included those in the pre-vaccine/wild-type era cohort who remained seronegative on their specimen from Serology Period 2 (Figure 2).
Follow-up data collection
From 14 follow-up study encounters occurring approximately quarterly between August 2020 and July 2022, we obtained repeated measurements of epidemiologic risk factors, COVID-19 symptoms, non-study-related SARS-CoV-2 testing (PCR or rapid, at-home rapid), hospitalizations, use of NPIs, public health strategies (i.e., quarantine, isolation), and contact tracing encounters.
Serologic testing
Figure 2 shows the three periods of serologic testing in the cohort – from April through September 2020 (Serology Period 1), November 2020 through March 2021 (Serology Period 2), and March 2022 through June 2022 (Serology Period 3). During these periods, participants were invited to complete serologic testing using an at-home self-collected dried blood spot (DBS) specimen collection kit. DBS cards were sent from and returned to the study laboratory (Molecular Testing Laboratories [MTL], Vancouver, WA) via the U.S. Postal Service using a self-addressed, stamped envelope containing a biohazard bag.
To assess infection-induced seroconversion, all DBS specimens were tested by the study laboratory for total antibodies to the SARS-CoV-2 nucleocapsid protein (total nucleocapsid Ab) using the Bio-Rad Platelia test for IgA, IgM, and IgG (manufacturer sensitivity 98.0%, specificity 99.3%).39 Other studies have independently validated this assay and found average sensitivity and specificity of 91.7% and 98.8%, respectively.40–42 This assay was also validated for use with DBS by the study laboratory, which found 100% sensitivity and 100% specificity (MTL, personal communication).
Outcome (infection-induced seroconversion)
Among those individuals with two total nucleocapsid Ab tests, the outcome of infection-induced SARS-CoV-2 seroconversion was defined as having a negative total nucleocapsid Ab test in Serology Period 1 followed by a positive total nucleocapsid Ab test in Serology Period 2 (within the pre-vaccine/wild-type era cohort) or a negative total nucleocapsid Ab test in Serology Period 2 followed by a positive total nucleocapsid Ab test in Serology Period 3 (within the vaccine/variant era cohort). We estimated person-years of follow-up in each cohort using the collection dates for each specimen. Participants could contribute person-time to each cohort (pre-vaccine/wild-type or vaccine/variant era). When the specimen collection date was missing, we used the date the laboratory received the sample. For those who seroconverted, the seroconversion date was assigned as the midpoint between the initial seronegative specimen collection date and earlier of: 1) the date of the follow-up seropositive specimen collection; or 2) the reported date of a positive SARS-CoV-2 test result (PCR or rapid test) in between the initial and follow-up specimen collection dates.
Exposures
Timing of exposure data collection on risk factors, behaviors, and vaccination status
All exposure measurements were derived from time-updated questionnaire data collected during each era, and only those measures taken prior to outcome measurement for a given era were used. For the pre-vaccine/wild-type era cohort, we used exposure data from the questionnaire for study visits 1 through 4 (V1-V4; Figure 2). For the vaccine/variant era cohort, we used data from V4-V10 questionnaires (Figure 2).
Individual-Level COVID-19 Risk Factors
We collected time-updated information on an array of epidemiologic risk factors for SARS-CoV-2 infection reported by participants, including the following: essential worker status (those working in healthcare, emergency response, law enforcement, delivery of food/goods, transportation), household factors (household crowding defined as ≥4 people living in a single unit of a multi-unit dwelling, having a child in the household, and having a confirmed COVID-19 case in a household member before participant tested positive); spending time in public places (attending mass gatherings, indoor dining in a restaurant or bar, outdoor dining at a restaurant or bar, visiting places of worship, or visiting public parks or pools); mask use indoors (for grocery shopping, visiting non–household members, at work, and in salons or gyms); mask use outdoors; gathering in groups with 10 or more people; travel during the pandemic (air travel and public transit use); and individual-level factors that may increase the risk of infection and/or severe COVID-19 (comorbid conditions, binge drinking, regular cannabis use or un-prescribed opioid use). Binge drinking was defined as six or more drinks in one sitting during the last month, asked as part of the Alcohol Use Disorders Identification Test questions on select questionnaires.43 As a measure of susceptibility to severe COVID-19, we used comorbid conditions or exposures that CDC identified as increasing the risk for COVID-19 complications, given SARS-CoV-2 infection: age >=60 years, daily smoking, chronic lung disease, including chronic obstructive pulmonary disease, emphysema, chronic bronchitis, serious heart conditions, current asthma, type 2 diabetes, kidney disease, immunocompromised condition, or an HIV diagnosis.44
Risk groups
We hypothesized that some participants may be at higher risk of SARS-CoV-2 infection in the vaccine/variant era because of membership in a group more directly affected by policy changes, including changes to guidelines and public health messaging. These groups included essential workers, those living in crowded households, and those with children in the household who might attend childcare or school. For essential worker status, household factors, and other binary variables, we assigned exposure status based on any vs. no exposure (e.g., having a confirmed COVID-19 case in the household or not) that occurred within each era.
Risk behaviors
We also hypothesized that some participants may have a higher risk of SARS-CoV-2 infection in the vaccine/variant era relative to the pre-vaccine/wild-type era due to the de-implementation of policies that may change risk factors and behaviors. These risk factors/behaviors included: mask use indoors while visiting non-household members, mask use at work, social distancing with individuals the participant knows, and social distancing with individuals the participant does not know. For time-dependent exposure variables (e.g., social distancing and masking), we assigned exposure status based on a hierarchy of exposure risk during follow-up. Specifically, participants were classified according to the highest risk strata (e.g., never masking>sometimes masking>always masking) that they reported at one or more follow-up assessments.
Composite risk score
We computed a composite COVID-19 risk score, as many of the above COVID-19 risk groups and behaviors are likely to be highly correlated. We applied least absolute shrinkage selection operator (LASSO) regression to select the set of risk factors that best-predicted seroconversion in the pre-vaccine/wild-type era.45 The LASSO model selected household crowding, having a confirmed COVID-19 case in a household member, indoor dining in a bar/restaurant, gathering with groups of ≥10, and no mask use indoors in salons or gyms as the most predictive of seroconversion in our cohort during the pre-vaccine/wild-type era. Scores were assigned to each participant based on their responses for each of the risk factors selected by the LASSO model. Sores were normalized between 0 and 100, with higher scores indicating more engagement in high-risk activities (details in Statistical Appendix). The composite score was divided into tertiles for analysis.
Vaccination status
For the pre-vaccine/wild-type cohort, vaccination status at the start of follow-up was assigned as unvaccinated for all participants in the cohort, since vaccines were not available during Serology Period 1 (i.e., 100% were unvaccinated at the start). Vaccination status at the end of follow-up was assigned at the start of Serology Period 2 (based on responses to the V4 questionnaire in Figure 2), at which time almost the entire cohort (99%) remained unvaccinated. For the vaccine/variant era cohort, vaccination status at the start of follow-up was assigned based on vaccination status as of February 10, 2021, which corresponded with the first questionnaire after Serology Period 2 specimen collection (V6 in Figure 2). Individuals in the vaccine/variant-era cohort were classified according to their vaccination/booster status as of the end of follow-up (June 2022), with categories as follows: un/undervaccinated (unvaccinated or did not complete primary vaccine series), completed primary vaccine series, completed primary vaccine series with one booster, and completed primary vaccine series with two or more boosters. Completing a primary vaccine series was defined as one dose for participants who indicated that they had received the Johnson & Johnson vaccine and two doses for any other COVID vaccine type specified.
Statistical analysis
Seroincidence of SARS-CoV-2 infection was calculated within each cohort and across strata of sociodemographic factors and epidemiologic risk factors, selected based on the literature and on previous published pre-vaccine/wild-type era analyses of SARS-CoV-2 incidence in this cohort.15 Crude associations of each factor with SARS-CoV-2 infection were reported as rate ratios. A multivariable mixed effects Poisson model with random coefficients, using the log of total person-time as the offset and an unstructured covariance matrix, was used to estimate the rate ratio of incident SARS-CoV-2 infection stratified by vaccination status (un/undervaccinated, vaccinated, boosted once, boosted more than once) in the vaccine/variant era cohort. The entire pre-vaccine/wild-type era cohort was used as the referent group in these models. We ran a crude and multivariable overall model. To assess the association of vaccination status within different risk factor strata, we ran 12 multivariable models, one for each stratum of five different risk factor groups [essential workers, household children, household cases, social distancing with those you know, and social distancing with those you don’t know]. All multivariable models were adjusted for age, sex, and the presence of comorbidities. All mixed models accounted for repeated measures among participants by including a random intercept for subject. All data were cleaned and analyzed in R and SAS.
Ethical Approval
The study protocol was approved by the Institutional Review Board at the City University of New York (CUNY).
RESULTS
Sample characteristics
The characteristics of the study cohorts are shown in Table 1. Seventy-two percent of subjects (n=2,574) were represented in both cohorts, 24% (n=847) were represented only in the pre-vaccine/wild-type era cohort and 4% (n=161) were represented only in the vaccine/variant era cohort (Figure 1). Participants in each cohort were very similar on measured characteristics, except for employment status, where a slightly lower proportion was unemployed in the vaccine/variant era than in the pre-vaccine/wild-type era (6.5% vs 11.1%, respectively).
Vaccination status. In the pre-vaccine/wild-type era, none of the 3,421 seronegative participants were vaccinated at the start of the follow-up (March 28, 2020), and only 18 participants (0.53%) had any vaccine doses as of November 17, 2020 (Table 1). In the vaccine/variant era, 282 (10.3%) of the 2,735 seronegative participants had completed a primary vaccine series as of February 10, 2021 (V6 questionnaire in Figure 1); 2,497 (91.3%) were fully vaccinated by the end of follow-up, including 2,246 (82.1%) boosted at least once and 723 (26.4%) boosted twice. In terms of the timing of vaccination, 87% percent of the vaccine/variant-era cohort had completed their primary series within 6 months of the start of follow-up in the vaccine/variant era (i.e., within 6 months of their seronegative specimen).
Seroincidence of SARS-CoV-2 Infection
The seroincidence rate of SARS-CoV-2 infection in the vaccine/variant era cohort was nearly three times higher than in the pre-vaccine/wild-type era cohort. Specifically, we observed a SARS-CoV-2 infection rate of 9.61 per 100 person-years (95% CI 8.3-11.1) and 25.74 per 100 person-years (95% CI 24.2-27.3) in the pre-vaccine/wild-type era cohort and vaccine/variant era cohort, respectively (Table 2).
Sociodemographic factors
Table 2 also shows the SARS-CoV-2 incidence rate and crude SARS-CoV-2 incidence rate ratios by sociodemographic factors and cohort era. Across the two cohorts, crude incidence rates were substantially higher in all sociodemographic subgroups in the vaccine/variant era cohort compared with the pre-vaccine/wild-type era cohort. Within each of the two cohorts, the SARS-CoV-2 infection rate varied substantially by sociodemographic factors, with lower SARS-CoV-2 infection in those aged 60 and older compared with 18-29 year-olds in both cohorts (IRRpre-vaccine/wild-type, 0.53 [95% CI, 0.31-0.90]; IRRvaccine/variant, 0.43 [95% CI, 0.34 -0.55]) and in women compared to men in the pre-vaccine/wild-type era cohort (IRRpre- vaccine/wild-type, 0.69 [95% CI, 0.51-0.95]). Some associations appeared to be protective only in the vaccine/variant era cohort (household income above $100,000 vs. less than $35,000, retired vs. employed, and higher vs. lower risk of severe COVID). In each cohort, we observed higher seroincidence of SARS-CoV-2 infection among Hispanic (IRRpre-vaccine/wild-type, 2.06 [95% CI, 1.41-3.01]; IRRvaccine/variant, 1.49 [95% CI: 1.24-1.80]) and non-Hispanic Black (IRRpre-vaccine/wild-type, 1.69 [95% CI, 0.99-2.87] ; IRRvaccine/variant, 1.68 [95% CI, 1.34-2.11]) participants compared with non-Hispanic White participants, essential workers compared with non-essential workers (IRRpre- vaccine/wild-type, 1.69 [95% CI, 1.19-2.39]; IRRvaccine/variant, 1.28 [95% CI, 1.08-1.52]), and in the South compared with the Northeast (IRRpre-vaccine/wild-type, 1.69 [95% CI, 1.10-2.60]; IRRvaccine/variant, 1.32 [95% CI 1.10-1.58]). However, some of these associations (Hispanic vs. non-Hispanic Whites, living in the Southern United States vs. the Northeast), became less pronounced in the vaccine/variant era cohort. Individuals who were at high risk for severe COVID-19 (vs. lower riks) also had significantly lower SARS-CoV-2 infection risk in the vaccine/variant era cohort (IRRvaccine/variant, 0.66 [95% CI: 0.56-0.79]), but not in the pre-vaccine/wild-type era cohort.
Vaccination status
In the vaccine/variant era cohort, the highest crude rates of SARS-CoV-2 infection were in un/under-vaccinated (51.34, 95% CI: 45.02-57.63) and fully vaccinated but not boosted participants (48.94, 95% CI: 42.83–55.09) (Table 2). Those who were fully vaccinated and received a booster had substantially lower rates of SARS-CoV-2 infection than the un/under-vaccinated, including those with one booster (IRRvaccine/variant, 0.47 [95% CI, 0.39-0.57]) and two or more boosters (IRRvaccine/variant, 0.28 [95% CI, 0.22-0.36]).
Epidemiologic risk factors
Table 3 shows the SARS-CoV-2 infection and crude SARS-CoV-2 infection rate ratios by epidemiologic risk factors that were present prior to or between serologic tests for each cohort. Crude incidence rates were substantially higher in most subgroups of epidemiologic risk factors in the vaccine/variant era cohort compared with the pre-vaccine/wild-type era cohort. In both cohorts, never social distancing with people you do not know (IRRpre-vaccine/wild-type, 3.16 [95% CI 1.47-6.77; IRRvaccine/variant, 2.00 [95% CI, 1.62-2.48] and never masking or sometimes masking in a variety of settings were associated with a higher risk of SARS-CoV-2 infection. Binge drinking (IRRpre-vaccine/wild-type, 1.47 [95% CI: 1.07-2.03]; IRRvaccine/variant, 1.45 [95% CI: 1.26-1.67]) and recent air travel (IRRpre-vaccine/wild-type, 1.49 [95% CI, 1.04-2.13]; IRRvaccine/variant, 1.20 [95% CI: 1.03-1.39]) were associated with higher risk of SARS-CoV-2 infection. A higher composite measure of risk was significantly associated with a higher risk of incident infection in both eras (Table 2).
Changes in epidemiologic risk factors between pre-vaccine and vaccine/variant eras
Some new associations emerged that were not present in the pre-vaccine/wild-type era cohort (Table 3). In the pre-vaccine/wild-type era cohort, people living with <4 household members in a single unit of a multi-unit dwelling and those living with 4 or more household members in a single-family dwelling had similar incidence as those living in single-family dwellings with <4 household members. But in the vaccine/variant era cohort, those living with <4 household members in a single unit of a multi-unit dwelling and those living with 4 or more household members in a single-family dwelling saw their risk increase compared with those living in single-family dwellings with <4 household members (IRRvaccine/variant, 1.39 [95% CI, 1.18-1.63] for multi-unit dwelling with <4 household members; IRRvaccine/variant, 1.59 [95% CI: 1.32-1.92] for single-family dwelling with 4+ household members). Similarly, as shown in Table 3, having a child in the household was not associated with a higher risk of SARS-CoV-2 infection in the pre-vaccine/wild-type era cohort, but was in the vaccine/variant-era cohort (IRRvaccine/variant, 1.41 [95% CI, 1.22-1.64]). Social distancing ‘with people you don’t know’ sometimes (vs. always) was not associated with a higher risk of SARS-CoV-2 infection in the pre-vaccine/wild-type era cohort, but was significantly associated with a higher risk in the vaccine/variant era cohort (IRRvaccine/variant, 1.25 [95% CI, 1.03–1.50]). Associations for some risk factors persisted but became less pronounced in the vaccine/variant era cohort compared with the pre-vaccine/wild-type era cohort (Table 3). Specifically, having a confirmed case in the household had the highest absolute incidence rate and was the strongest risk factor in each cohort, but the strength of the association decreased in the vaccine/variant-era cohort (IRRvaccine/variant, 8.35 [95% CI, 7.22-9.66]) vs. the pre-vaccine/wild-type era cohort (IRRpre-vaccine/wild-type, 22.34 [95% CI, 14.77– 33.77]). Social distancing ‘with people you know’ never (vs. always) was associated with a higher risk of SARS-CoV-2 infection in the pre-vaccine/wild-type era cohort (IRRpre-vaccine/wild-type, 3.16 [95% CI, 1.47-6.77]) than in the vaccine/variant era cohort (IRRvaccine/variant, 2.00 [95% CI, 1.62-2.48]). The risk ratio for SARS-CoV-2 infection was also higher in the pre-vaccine/wild-type era cohort than the vaccine/variant era cohort for indoor dining, visiting a place of worship, gathering indoors with 10 or more persons, and social distancing with ‘people you do not know’ never (vs. always).
Associations of mask use with incidence depended on the context. For mask use while grocery shopping, at the salon or gym, or on public transit, risk for sometimes use (vs. always) was elevated in both cohorts but was less pronounced in the vaccine/variant era cohort (Table 3). No mask use (vs. always) while indoors visiting non-household members was associated with an elevated risk of infection in both cohorts, but less so in the vaccine/variant era cohort. Mask use sometimes or never (vs. always) while indoors at work was associated with a higher risk in the vaccine/variant era, but not in the pre-vaccine/wild-type era, as was no mask use (vs. always) while at the salon or gym or while on public transit.
Poisson models of SARS-CoV-2 seroconversion in strata of risk factors
Table 4 shows adjusted IRRs and 95% CIs from the overall multivariable model and the 12 risk-factor group-specific multivariate Poisson models stratified by vaccine status in the vaccine/variant era, with the pre-vaccine/wild-type cohort as the referent group, adjusting for age, gender, and presence of co-morbidities. Relative to the pre-vaccine/wild-type cohort, the risk of SARS-CoV-2 infection was similar between un/undervaccinated and fully vaccinated participants in the vaccine/variant era. However, the risk of infection tended to decrease with an increasing number of boosters. Specifically, adjusted incident rate ratios (aIRR) with the pre-vaccine/wild-type cohort as the referent group were: aIRRun/undervaccinated=5.3 (95% CI 4.2-6.7); aIRRprimary series only=5.1 (95%CI: 4.1-6.4); aIRRboosted once=2.5 (95% CI 2.1-3.0), and aIRRboosted twice=1.65 (95%CI: 1.3-2.1) (Table 4; Figure 3). These associations were essentially unchanged within risk factor-stratified models, except for those with a confirmed case of SARS-CoV-2 in the household, where the relative change in SARS-CoV-2 infection between the pre-vaccine/wild-type cohort and the vaccine/variant-era cohort was smaller than that in other risk groups.
Self-reported testing
Table 5 shows the number and proportion of participants who tested positive on serologic tests as part of our study as well as on self-reported PCR or rapid tests taken by participants outside of the study for each cohort. Compared to serologic test results, participants had lower rates of test positivity on self-reported viral PCR or rapid tests. Specifically, in the pre-vaccine/wild-type era cohort, 4.0% (n=137) of participants self-reported a positive PCR or rapid test outside of the study, compared to 4.7% (n=161) that tested positive on serologic testing (ratio 85%). In the vaccine/variant era cohort, 21% (n=561) self-reported a positive test, compared to 30% (n=815) from serologic testing (ratio 69%). Thus, the proportion of participants with an infection detected outside the study declined from 85% in the pre-vaccine/wild-type era cohort to 69% in the vaccine/variant era cohort (Table 5).
DISCUSSION
In a community-based prospective study with repeat serologic testing of SARS-CoV-2 n protein seronegative individuals, we observed a nearly 3-fold increase in the incidence of n protein seroconversion, coinciding with the SARS-CoV-2 vaccine/variant era (25.74 per 100 person-years) as compared with the pre-vaccine/wild-type era (9.61 per 100 person-years). This corresponds to an increase in SARS-CoV-2 infection risk from 10% to 26% of participants infected per year. The large increase in SARS-CoV-2 incidence coincided with a relaxing of guidelines (e.g., around social distancing, masking, school attendance, in-person school attendance) and with surges of increasingly transmissible, immune evasive variants: Alpha (March–June 2021)46, Delta (June–December 2021) and Omicron variant and subvariants (December 2021-present)47, all emerged as vaccines were being more widely taken up. Our cohort findings are consistent with widespread community transmission in the general population, particularly during the Delta and Omicron surges, including in workplaces and households with children in the vaccine/variant era compared with the pre-vaccine/wild-type era.48–52 Despite the large increase in community transmission in the vaccine/variant era, being more up-to-date on vaccines (i.e., being boosted) was associated with a lower risk of SARS-CoV-2 infection compared with being un/undervaccinated or only receiving the primary vaccine series. While there are likely differences in people who were boosted compared with those who were not, these associations were maintained across several epidemiologic risk strata. Although being boosted was associated with a reduced incidence in the vaccine/variant era, except for the groups reporting a confirmed case in the household, the incidence was still generally 1.3-2 times higher among individuals with 2+ boosters compared with those in the pre-vaccine/wild-type era cohort (Table 4). While this highlights the potential for new variants to cause breakthrough infections even among those who are more up-to-date on vaccines, it also suggests that being up-to-date on SARS-CoV-2 vaccines can greatly reduce the risk of infection during surges. In addition, many non-pharmaceutical interventions used by cohort participants (e.g., masking in many different settings, social distancing) remained associated with substantially lower SARS-CoV-2 incidence rates in the vaccine/variant-era cohort, despite large increases in absolute incidence rates (Table 3).
In multivariate models, those who only received the primary vaccine series and no booster doses had similar SARS-CoV-2 infection risk as those who were un/undervaccinated in the vaccine/variant era. In both groups, the risk was approximately 5 times higher than in the pre-vaccine/wild-type era cohort. However, receipt of booster doses beyond the primary vaccine series was associated with a lower risk of infection compared with other vaccine status groups in the vaccine/variant era cohort. In fact, the risk of infection became progressively lower as the number of vaccine booster doses increased. This could be because 87% of the vaccine/variant-era cohort was fully vaccinated by August 2021 (∼six months into cohort follow-up, Table 1), and enough time had passed such that boosters would have been needed for most participants in order to offer some protection against infection from the more immune evasive variants.
There also may be differences in behaviors and other factors among these groups. Importantly, however, these associations were observed in each stratum across various risk factors, with models that adjusted for age, gender, and presence of comorbidities.
Our study showed substantial increases in SARS-CoV-2 incidence rates in the vaccine/variant era cohort compared to the pre-vaccine/wild-type era cohort within every sociodemographic subgroup and epidemiologic risk factor that we examined. Importantly, many factors that appeared protective against SARS-CoV-2 in the pre-vaccine/wild-type era cohort (e.g., NPIs such as masking and social distancing) remained protective in the vaccine/variant era cohort, despite the major increases in community transmission that occurred. This suggests that NPIs may play an important role in limiting SARS-CoV-2 transmission even during major surges of new variants, and in protecting those most vulnerable to SARS-CoV-2 infection.
However, it should be noted that while the incidence rates in those engaging in protective behaviors in the vaccine/variant era cohort were lower than those who didn’t engage in protective behaviors, absolute incidence rates were still very high in most instances in the vaccine/variant era cohort compared with the pre-vaccine/wild-type era cohort, highlighting that protective behaviors can reduce but not eliminate risk when community transmission rates are high. For example, there was a lower infection risk associated with wearing masks at work in the vaccine/variant era cohort, but the absolute incidence rate in those wearing masks at work was also much higher than in the pre-vaccine/wild-type era cohort. This could be due to higher levels of exposure inside the workplace, more individuals returning to in-person work, or higher levels of exposure from other sources (such as at home or on public transportation) or other locations that may not have been open during the pre-vaccine/wild-type era (e.g., gyms, indoor dining).
We also noted some new epidemiologic risk factors that emerged in the vaccine/variant cohort (e.g., having a child in the household), likely reflecting the higher incidence of infection in the general population, including new sources of exposure such as children attending in-person school. Lastly, looking at the composite risk score, there was a clear dose-response in the pre-vaccine/wild-type era cohort, but in the vaccine/variant era, the highest level of the composite risk score was lower than that of the medium risk score. This could be because the risk associated with having a case in the household, by far the strongest risk factor in both eras (Table 3), cannot increase as much in comparison with that of other risk factors when moving from the pre-vaccine/wild-type era to the vaccine/variant era (i.e., a ceiling effect).
Our study aligns with recent cross-sectional, population-representative surveys that were conducted during Omicron variant surges in some ways and not others. For example, similar to our study, recent NYC-based and national surveys conducted during major surges found high absolute point prevalence of SARS-CoV-2 infection but substantially lower relative point prevalence estimates among older (vs. younger) adults, those with comorbidities (vs. those without), and higher relative point prevalence estimates among those in households with school-aged children (vs. those without).53,54 However, in contrast to our study, these surveys found that vaccinated and boosted respondents had similar point prevalence estimates of SARS-CoV-2 infection to unvaccinated respondents. Reasons for this discrepancy could be that the surveys captured self-reported infection (positive point of care test, home test, or symptoms plus close contact) during the two weeks prior to the survey, while our study examined SARS-CoV-2 infection prospectively, as measured by serology over a longer time frame. In our cohort, compared with the seropositivity rate, the positivity rate on self-reported PCR/rapid tests over the same time period was lower in both the pre-vaccine/wild-type (85%) and vaccine/variant era cohorts (69%; Table 5). The reasons for the lower ratio in the vaccine/variant era cohort are not clear, but may be due to the fact that this was a highly vaccinated cohort, and fully vaccinated individuals with breakthrough infections may be less symptomatic, have a lower viral load, and experience a shorter duration of infection/illness.55–57 As such, these participants may not have recognized signs of SARS-CoV-2 infection and/or felt the need to test for SARS-CoV-2.
As the COVID-19 pandemic evolves, it remains important to monitor the incidence of SARS-CoV-2 infections. It may become more difficult, however, to identify cases through routine provider/laboratory reporting of PCR or rapid antigen tests. Individuals are increasingly less likely to be required to test in certain scenarios, may choose not to test, or may exclusively use at-home tests that are not captured in routine surveillance. Thus, using serologic testing in cohort studies is a useful strategy to characterize SARS-CoV-2 incidence and risk factors.
Strengths of our study include its prospective nature, with time-updated exposure measurement prior to outcome ascertainment. We also used repeat serologic testing to examine SARS-CoV-2 nucleocapsid seroconversion as the outcome measure of incident infection. Our design, which compared incidence in the two cohorts, leveraged the use of individuals as their own controls, since 75% of the overall sample was represented in both cohorts (Figure 1), helping to reduce confounding when comparing incidence in the two eras. Finally, comparing incidence rates within models specific to different strata of risk factors also helps to limit confounding of the association of vaccination status with incidence by risk behaviors.
Our study also has limitations worth noting. The observed cumulative incidence in our cohort may be lower than the true cumulative incidence in our cohort because of the imperfect nature of serologic testing and the potential waning of SARS-CoV-2 antibodies,58 particularly for milder infections.59,60 Studies of SARS-CoV-2 antibody persistence have suggested the waning of antibodies to both nucleocapsid and spike proteins.61,62 Boosted individuals who have an infection after vaccination may experience a more rapid waning of nucleocapsid antibodies.63 Our study required total nucleocapsid seronegativity for inclusion. Because of the timing of specimen collection relative to infection in our cohort (median of 191 days in the pre-vaccine/wild type era cohort and 476 days in the vaccine/variant era cohort), this could mean that we have underestimated the true cumulative incidence due to waning. Additionally, immunocompromised status for study participants was not collected, and fully vaccinated status could therefore not be accurately assigned using three doses among this subgroup. For these participants, the third primary series dose may have been misidentified as a booster dose or skipped entirely, resulting in a fully vaccinated status. We did, however, adjust for the presence of comorbidities.
Crude associations between SARS-CoV-2 risk factors and incidence are subject to confounding. For example, behaviors between risk groups likely differ, with interpretation for some associations further hampered by small sample sizes in some exposure strata. Some risk behaviors may have been underreported (e.g., due to social desirability), which would bias observed associations toward the null. While our study was prospective, because we used a midpoint method to infer the timing of infection between a negative and positive serologic test it is possible that some measured exposures, including vaccination, did not temporally precede infection. Also, some infections may have occurred in between vaccine doses. Finally, while our study was able to examine the role of behavioral risk factors over time, because of the timing of serologic testing, we could not distinguish the variant-specific effects (e.g., wild-type vs. Alpha, Delta vs. Omicron).
Conclusion
Increases in the incidence of infection and newly emerging risk factors in the vaccine/variant era likely resulted from multiple co-occurring factors related to policy changes, individual- and community-level behavior changes (due to the availability of vaccines and relaxation of restrictions), and changing virus properties (i.e., more transmissible, immune evasive variants). While SARS-CoV-2 incidence increased markedly in most groups in the vaccine/variant cohort, being up to date on vaccines and the use of NPIs (masking, distancing) likely reduced the risk of SARS-CoV-2 infection during major surges, making them relevant strategies to mitigate the impact of future SARS-CoV-2 surges, including those due to new variants that may evade existing vaccine-induced and hybrid immunity. As the COVID-19 pandemic transitions to the endemic phase, serologic testing is a useful strategy to characterize SARS-CoV-2 incidence and risk factors.
CONTRIBUTORS
DN, MMR, AS conceptualized the study. AS performed statistical analyses. DN and MSR wrote the first draft of the paper. All authors contributed to interpreting the data and revising of the manuscript. SGK, WY, MMR AB, SK, and DN contributed to data collection, cleaning and management. DN, MMR, SGK, and CG contributed to obtaining funding for the research.
Data Availability
There is a public-use dataset for the CHASING COVID Cohort Study. Other data are available upon reasonable request from the authors. DOI: 10.5281/zenodo.7305435
DISCLOSURES
DN reports receiving consulting fees from Abbvie and Gilead.
FUNDING
Funding for this project is provided by The National Institute of Allergy and Infectious Diseases (NIAID), award number 3UH3AI133675-04S1 (MPIs: D Nash and C Grov), the CUNY Institute for Implementation Science in Population Health (cunyisph.org) and the COVID-19 Grant Program of the CUNY Graduate School of Public Health and Health Policy; Pfizer, Inc (PI: Nash), and the National Institute of Mental Health (NIMH), award number 1RF1MH132360 (MPIs: D Nash and A Parcesepe).
ACKNOWLEDGEMENTS
The authors wish to thank the participants of the CHASING COVID Cohort Study. We are grateful to you for your contributions to the advancement of science around the SARS-CoV-2 pandemic. We are grateful to MTL Labs for serologic testing of our cohort’s specimens.
STATISTICAL APPENDIX
LASSO. The study samples of 3,421 and 2,735 participants for the pre-vaccine/wild type and post-vaccine variant eras respectively were split randomly into equally sized training and test data sets. A grouped LASSO regression was fit using training data and 10-fold cross validation was used to obtain the minimum value of lambda (the tuning parameter). The variables included in the LASSO model were household crowding, having children in the household, confirmed COVID-19 case in a household member, attending mass gatherings, indoor dining in a restaurant/bar, outdoor dining in a restaurant/bar, visiting places of worship, visiting public park or pool, gathering with groups of 10 or more indoors and/or outdoors, mask use while grocery shopping, mask use while visiting non-household members, mask use indoors in gym/salons, mask use indoors at work, mask use outdoors, using public transit, recent air travel, alcohol use, and substance use. Using the minimum lambda value, a grouped LASSO model was run on the entire dataset (training + test) to obtain factors that best predicted seroconversion. The LASSO model selected having a confirmed COVID-19 case in a household member, indoor dining in a bar/restaurant, gathering with groups of 10 or more outdoors, and mask use indoors in salons/gyms as the most predictive of seroconversion in our cohort. A logistic regression model was fitted using these variables selected by LASSO regression with seroconversion (Y/N) as the outcome. Coefficients of the variables were multiplied by 10 and rounded up to the nearest integer to create a score associated with that variable/risk factor. If a participant engaged in a given risk factor, they were assigned the score associated with that risk factor. Scores were then summed across risk factors, normalized between 0 and 100, and a total composite risk score was created for each participant.
Footnotes
DISCLOSURES: Dr. Nash received consulting fees from Gilead Sciences, Inc. and Abbvie, Inc.
REFERENCES
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
- 24.
- 25.
- 26.
- 27.
- 28.
- 29.
- 30.
- 31.
- 32.
- 33.
- 34.
- 35.
- 36.
- 37.
- 38.
- 39.
- 40.
- 41.
- 42.
- 43.
- 44.
- 45.
- 46.
- 47.
- 48.
- 49.
- 50.
- 51.
- 52.
- 53.
- 54.
- 55.
- 56.
- 57.
- 58.
- 59.
- 60.
- 61.
- 62.
- 63.