Abstract
Recent studies have provided key information about SARS-CoV-2 vaccines’ efficacy and effectiveness (VE). One important question that remains is whether the protection conferred by vaccines wanes over time. However, estimates over time are subject to bias from differential depletion of susceptibles between vaccinated and unvaccinated groups. Here we examine the extent to which biases occur under different scenarios and assess whether serologic testing has the potential to correct this bias. By identifying non-vaccine antibodies, these tests could identify individuals with prior infection. We find that the main determinant of bias is the proportion of the population that has been infected since vaccination began, which is influenced by several factors. In scenarios with high baseline VE, differential depletion of susceptibles creates minimal bias in VE estimates. However, if VE is lower, the bias for leaky vaccines (that reduce individual probability of infection given contact) is larger and should be corrected if the mechanism is known to be leaky. Conducting analyses both unadjusted and adjusted for past infection could give lower and upper bounds for the true VE. Studies of VE should therefore enroll individuals regardless of prior infection history but also collect information, ideally through serologic testing, on this critical variable.
Vaccines are a critical tool for combatting the COVID-19 pandemic. Clinical trials and observational studies have provided key information about the vaccines’ efficacy and effectiveness (VE). One important question that remains to be answered is whether or not the protection conferred by vaccines wanes over time. However, estimates of effectiveness over time are subject to bias from differential depletion of susceptibles between vaccinated and unvaccinated groups. This bias occurs when individuals who are no longer at risk of infection due to protection from past infection are included in the analysis; assuming the VE is greater than zero, these individuals with prior infection are more likely to be unvaccinated than vaccinated. Therefore, over time, more uninfected and unvaccinated individuals who are not at risk of infection are included in the analysis, biasing VE estimates downward. This bias grows as infection spreads and makes the VE incorrectly appear to wane over time (i.e. spurious waning) (1,2). Additionally, in a population with individuals who have heterogeneous risk of infection (for example due to occupational exposure or choice to wear a face covering), the riskiest individuals will be depleted preferentially among the unvaccinated group when the vaccine is effective, leading to the same bias downwards in VE, growing over time and thus seemingly showing waning of VE (1).
Serologic testing for SARS-CoV-2 antibodies has the potential to help correct the first bias. By identifying non-vaccine antibodies (e.g. N-protein), these tests could be used to identify individuals with prior infection and exclude them from studies of VE over time. Likewise, adjustment for individual-level risk of infection (in practice, for proxies such as occupation or behavior) can address the second bias.
While each of these issues can in principle affect VE estimates and induce a spurious impression of waning VE, the magnitude of this bias under various assumptions about baseline VE is not clear, nor has it been shown before to our knowledge how adjustments can solve the problems. Here we examine the extent to which these biases occur under different scenarios and assess approaches to alleviate bias under various assumptions.
Methods
Network and epidemic
We first create a network model of 20,000 individuals, similar to models described previously (2,3). The probability of connections between individuals in the network is calibrated in combination with the parameter for the probability of infection given contact to result in a reproduction number (R) of 1.25 or 1.50 (see Table 1 for a full list of parameters) (4). We seed an epidemic of a SARS-CoV-2-like pathogen with ten exposed individuals. Each day, each susceptible individual has a daily probability of infection from their infected connections in the network. A random half of the population is high risk, and the other half is low risk. High risk individuals have a daily probability of infection three times that of low risk individuals. This binary risk status is a simplified proxy for multiple factors that could affect individuals’ risks for infection, such as occupation, demographics, geography, or behavioral patterns (5–7).
We assume that half of those who are infected become symptomatic and that people are infectious for seven days. We assume that symptomatic, pre-symptomatic, and asymptomatic infected individuals have the same level of infectiousness. After individuals recover, we assume that complete protection from natural immunity lasts for 90 days (8), after which individuals can be reinfected; we then assume recovered individuals’ susceptibility is 95% lower than those without prior infection, resulting in low numbers of reinfection during the study period examined in the simulations (9). It is unknown exactly how VE differs for recovered individuals, so for simplicity we assume vaccinated recovered individuals’ susceptibility is further decreased by the same amount as for vaccinated susceptible individuals; in a sensitivity analysis, we assume vaccinated individuals with and without prior infection have the same susceptibility (i.e. natural immunity does not provide additional protection over vaccination in vaccinated individuals).
Scenarios
We simulate random vaccination (to prevent unmeasured confounding) of 2500 individuals, or 12.5% of the population, on the first day of the simulation. Another 2500 unvaccinated individuals are also randomly selected for potential follow-up over the course of the simulations. We compare four primary scenarios (Table 2). In the first scenario, vaccine efficacy against susceptibility to infection (VES) is 0.90, and vaccine efficacy against progression to symptoms (VEP) is 0.5. These measures combine to give a vaccine efficacy against symptomatic disease (VESP), the primary outcome of most SARS-CoV-2 vaccine trials (10–13), of 0.95, under the formula VESP = 1 − (1 − VES) (1 − VEp)(14) These values are similar to those that have been observed in the trials (10,12) and observational studies (15) of the mRNA vaccines. In the second scenario, we assume VES and VESP are 0.7, similar to the findings from the Janssen vaccine trial (11).
In the first two scenarios, we assume the vaccine is “leaky”, meaning it reduces the probability of infection given contact to an equal degree, but not perfectly, in all vaccinated individuals (16). However, in the third scenario, we model an all-or-nothing vaccine, meaning it protects a certain proportion of vaccinated individuals completely and provides no protection to the rest. In this scenario, VES and VESP are both 0.9. In supplementary scenarios, we also examine an all-or-nothing vaccine with lower VES and VESP, as well as a leaky vaccine with VESP = 0.95, similar to Scenario 1, but with lower VES and higher VEP. (17).
Finally, in scenario 4, we examine a setting with a leaky vaccine with VESP = 0.95 in which some of the population has already been infected and recovered before the simulations and vaccination begin. We explore a range from 0-30% of individuals with prior infection under a higher R than in the other scenarios (R=2.0) to prevent herd immunity from prior infections from substantially slowing the epidemics before spurious waning can be observed. In these simulations, 20 individuals are exposed on the first day and 100 individuals are infectious (except in the simulations with 0 individuals previously infected).
Analyses
We then simulate sampling of cases, or individuals with COVID-19 (symptoms and positive virologic test), on a given day and a random 1:4 sample of controls (i.e. individuals without COVID-19), similar to a test-negative design (TND). We repeat this sampling for seven different time periods, every 25 days from day 75 to day 225, treating each day independently. Given the faster epidemics in scenario 4 with the higher R, we examine every 25 days from day 25 to day 150. We then estimate VESP -- the estimand that is in practice estimated in a standard TND, although the progression to symptoms aspect is not always acknowledged (18) -- using four analyses. We focus on VESP as it was the primary outcome in vaccine trials and due to potential biases that can arise in TNDs when estimating VE against all infection when vaccines affect disease severity (19).
In the first analysis (baseline), we estimate VESP by calculating the odds ratio (OR), using data from all individuals sampled:
, where D is disease (symptoms and a positive virologic test) and V is vaccine.
In the second analysis, we estimate the OR using logistic regression, controlling for risk (i.e. the binary measure described above for increased or decreased susceptibility to infection).
In the third analysis, we simulate serologic testing for non-vaccine antibodies (i.e. evidence of past infection) and then restrict the analysis to individuals who had not previously been infected. In the fourth analysis, we both restrict to those without evidence of previous infection and also control for risk. In the primary analyses, we assume perfect sensitivity and specificity of the serologic test for prior infection, but we relax these assumptions in sensitivity analyses. We examine lower sensitivity for cases and controls and lower specificity for cases only, as antibodies detected could reflect either current or prior infection.
As a comparison to the TND, we repeat the same four analyses to estimate VE using a cohort design, where the time of symptomatic infection is known for the 5000 people under follow-up. We again examine different lengths of follow-up for this study design. We assume no unmeasured confounding: that is, no common causes of vaccination and infection, as would be true in a randomized trial or with adequate control for confounders. In practice, this study could be done using an electronic health records database using stratification, matching, or modeling for example to control for confounding factors such as occupation, age, insurance, and other factors affecting both vaccination and the likelihood of infection given vaccination.
Ethics
This activity was reviewed by CDC and was conducted consistent with applicable federal law and CDC policy.
Results
In scenario 1 with high VES and VESP, we find that for most time points, all four TND analyses return estimates of VESP close to the true value of 0.95 (Figure 1). However, in the simulations with R=1.5, the first two analyses that do not exclude prior infection result in downward biases for days further from vaccination (i.e. as low as 0.89 200 days from vaccination). This bias occurs due to differential depletion of susceptibles between vaccinated and unvaccinated individuals over time and is alleviated by excluding those with prior infection from the analysis. Similar results are found in an additional analysis with the same VESP but different VES and VEP (Figure S1).
In scenario 2, with a lower value of VES and VESP of 0.7, the first and second analyses that do not exclude prior infection are biased further downwards than in scenario 1 (i.e. as low as 0.36 225 days from vaccination when R = 1.5); this bias also occurs earlier than in scenario 1 and for both R values (Figure 2). In addition, in this scenario, the third analysis that excludes those with prior infection but does not control for risk has a more pronounced bias (lowest value of 0.64 compared to true VESP of 0.7 on day 225 when R=1.5) than in scenario 1 (lowest value of 0.94 compared to true VESP of 0.95 on day 200 when R=1.5).
In scenario 3, which models an all-or-nothing vaccine mechanism instead of a leaky mechanism, excluding prior infections results in a bias upward away from the true VESP of 0.95 (Figure 3), with some values approaching 1. This bias is more pronounced in the higher R simulations, on later days, and for lower values of VESP; for example, on day 200 in the R=1.5 simulations, the VESP from the analysis excluding prior infection and adjusting for risk is 0.83, compared to the true value of 0.7 (Figure S2)
In scenario 4, we see that the number of cumulative cases since vaccination is the primary driver of the spurious waning bias as opposed to the total number of people ever infected (Figure 4A). While there are more cumulative cases overall at most timepoints in the simulations with 20% and 30% of individuals infected prior to vaccination than in the simulations with 10% (Figure 4B), the bias is greater in the simulations with 10% due to the larger number of cases since vaccination. This trend occurs because, assuming prior infection doesn’t affect vaccination, in the simulations where more individuals are infected prior to vaccination, the distribution of vaccination status among those with prior infection becomes more balanced. Vaccination status of individuals who are included in the sample in the baseline analysis and who are not at risk for infection is not as heavily skewed towards unvaccinated as in the simulations where the epidemic and vaccination begin concurrently. The cumulative cases since vaccination is a function of many variables, including timing of vaccination relative to the epidemic, force of infection, and VE values.
In sensitivity analyses, we relax assumptions of perfect tests for prior infection and examine lower sensitivity for both cases and controls and lower specificity for cases. We find that while lower sensitivity results in slight downward biases of the estimates, which are more pronounced in scenario 2 than scenario 1 (Figures S3-S4), lower specificity for cases does not induce a bias (Figure S5). This is because imperfect specificity only reduces the sample size but we assume it does not do so differentially by vaccination status; however, if the speed of antibody response differed between vaccinated and unvaccinated individuals, this assumption would be violated, and imperfect specificity would induce bias.
In the cohort study analysis, we find similar trends to those observed in the TND (Figures S6-8); however, the magnitude of the bias in the analyses that do not exclude those with prior infection is lower than the TND analyses for each scenario. This bias is less in cohort studies because in the cohort study analysis, those with past symptomatic infection are censored at the time of infection, meaning fewer people are incorrectly treated as still at risk in the analysis.
Finally, in analyses that assume equal susceptibility between vaccinated individuals who have and have not been previously infected, we find similar results to the baseline scenario, likely due to the low number of reinfections in the simulations (Figure S9).
Discussion
We find that in scenarios with high VE, differential depletion of susceptibles creates minimal bias in VE estimates and in the time trend of these estimates, thus little spurious suggestion of waning from comparing later to earlier VE estimates. While it is important to control for known predictors of risk, estimates that do not account for prior infection status will likely not be far off from the truth. In fact, without knowledge of the vaccine mechanism (i.e. leaky or all-or-nothing), it may be better to not condition on prior infection status: if the vaccine is leaky, the baseline estimates may be slightly underestimated, but if the vaccine is all-or-nothing, the adjusted estimates will overestimate the true VE because in this case, the depletion of susceptible over time is not differential by vaccine status. Evaluating how the estimates from different analyses change over time could give potential insight into the type of vaccine mechanism. Because the bias from failing to exclude prior infection in the analysis of a leaky vaccine with high initial VE is expected to be small under the null hypothesis of no waning VE, if the VE appears to wane substantially, this finding is likely not entirely due to bias. Evidence on the effectiveness of the mRNA vaccines to date suggests spurious waning is likely not a major concern. However, if true waning occurs, spurious waning bias may become a relevant consideration.
If VE is lower, the bias for leaky vaccines is larger and ideally should be corrected if the mechanism is known to be leaky. However, at present, evidence for the leaky vs. all-or-nothing mechanism is scant for any known vaccine. In the absence of other sources of bias, conducting analyses both unadjusted and adjusted for past infection could give lower and upper bounds for the true VE. Studies of VE should therefore enroll individuals regardless of prior infection history but also collect information on this critical variable for use in the analysis; when possible, prior infection status should be assessed using serology as even an imperfect serologic test will improve sensitivity over self-report alone.
This study has several limitations. First, we make many simplifying assumptions in the model. For example, we assume all individuals are grouped into one large community and do not examine the potential impact of geographic heterogeneity. Other studies have shown epidemic dynamics due to differences in geography are important to control for in vaccine (20) and serologic (3) studies. We also assume perfect sensitivity and specificity of virologic tests, as implications of these parameters have been explored in detail previously (21,22). While we incorporate heterogeneity in risk of acquiring infection, we do not model differences in risk of transmitting infection (e.g. due to host factors). Additionally, as described above, the combined effects of immunity from prior infection and vaccination are uncertain; we model two simplified assumptions of this relationship but do not incorporate further complexity such as boosting of immunity from repeated exposure. Second, using serologic tests to identify prior infection is subject to error from imperfect test characteristics and waning of antibodies over time. However, we find only small biases in VE estimates from imperfect sensitivity, and information on past infection can also be obtained through self-report or medical records. Third, as described above, we assume random vaccination and no unmeasured confounding; the strategies discussed here alone do not address most other sources of potential confounding, which are important to account for in analyses, particularly given that vaccine rollout in some cases prioritized those at highest risk to receive vaccines first. Finally, we simulated epidemics with higher R values than much of the United States experienced during most of the pandemic to uncover scenarios where spurious waning might be of concern (23). These values should not affect the conclusions from the simulations, as we find that the main determinant of the extent and magnitude of bias is the proportion of the population that has been infected since vaccination, which is influenced by a combination of several factors, including R values, the point in the epidemic trajectory when the vaccine was introduced, and prevalence of high vs low risk individuals in the population.
Assessing duration of protection from COVID-19 vaccines is important for anticipating future dynamics of this pandemic. Here we have outlined circumstances under which bias can arise in these estimates and identified approaches to alleviate these biases.
Funding
This work was supported by the U.S. National Cancer Institute Seronet cooperative agreement U01CA261277. The content is solely the responsibility of the authors and does not necessarily represent the official views of the Department of Health and Social Care or the National Institutes of Health.
Conflicts of interest
Dr. Lipsitch reports consulting/honoraria from Bristol Myers Squibb, Sanofi Pasteur, and Merck, as well as a grant through his institution, unrelated to COVID-19, from Pfizer. He has served as an unpaid advisor related to COVID-19 to Pfizer, One Day Sooner, Astra-Zeneca, Janssen, and COVAX (United Biomedical). Dr. Kahn discloses consulting fees from Partners In Health.
Acknowledgments
We thank Rachel Slayton and Matt Biggerstaff for helpful comments and discussion.
Footnotes
Disclaimer: The findings and conclusions in this report are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention.