Abstract
Background SARS-CoV-2 vaccines are highly effective in preventing severe COVID-19 but require boosting to maintain protection. Changes to circulating variants and prevalent natural immunity may impact on real-world effectiveness of boosters in different time periods and in different populations.
Methods With NHS England approval, we used linked routine clinical data from >24 million patients to evaluate the effectiveness of the 2022 combined COVID-19 autumn booster and influenza vaccine campaign in non-clinically vulnerable 50-year-olds in England using a regression discontinuity design. Our primary outcome was a composite of 6-week COVID-19 emergency attendance, COVID-19 unplanned hospitalisation, or death. The secondary outcomes were: respiratory hospitalisations or death; any unplanned hospitalisation; and any death.
Results Our study included 1,917,375 people aged 45-54 years with no evidence of being in a high-risk group prioritised for vaccination. By 26 November 2022, booster vaccine coverage was 11.1% at age 49.75 years increasing to 39.7% at age 50.25 years. The estimated effect of the campaign on the risk of the primary outcome in 50-year-olds during weeks 7-12 after the campaign start was -0.4 per 100,000 (95% CI -7.8, 7.1). For the secondary outcomes the estimated effects were: -0.6 per 100,000 (95%CI -13.5, 12.3) for respiratory outcomes; 5.0 per 100,000 (95%CI -40.7, 50.8) for unplanned hospitalisations; and 3.0 per 100,000 (95%CI -2.7, 8.6) for any death. The results were similar when using different follow-up start dates, different bandwidths, or when estimating the effect of vaccination (rather than the campaign).
Conclusion This study found little evidence that the autumn 2022 vaccination campaign in England was associated with a reduction in severe COVID-19-related outcomes among non-clinically vulnerable 50-year-olds. Possible explanations include the low risk of severe outcomes due to substantial pre-existing vaccine- and infection-induced immunity. Modest booster coverage reduced the precision with which we could estimate effectiveness. The booster campaign may have had effects beyond those estimated, including reducing virus transmission and incidence of mild or moderate COVID-19.
Background
The SARS-CoV-2 vaccines are highly effective at preventing severe COVID-19 outcomes, including hospitalisation and mortality.(1,2) However, their benefits wane over time(3) and so booster vaccinations are needed to sustain protection. A first booster vaccination reduces the incidence of severe COVID-19, but that protection also wanes and may be reduced against new variants.(4–9) High prevalence of immunity resulting from prior SARS-CoV-2 infection may impact on booster vaccine effectiveness.
Following the primary course of two doses, the UK offered first COVID-19 booster vaccinations from autumn 2021 to high-risk individuals and people 50 years and older. A spring 2022 booster was subsequently offered to people 75 years or older and those considered clinically vulnerable. An autumn booster was available from September 2022, initially targeting people considered at high risk, such as those aged over 65 years, care home residents and their staff, and immunosuppressed people.(10,11) On 15 October 2022, people aged 50-64 years who were not considered high risk became eligible for booster vaccination.(12) This coincided with the 2022/23 rollout of influenza vaccination, which was available from the same date to people who would turn 50 by March 2023.(12)
In this study, we estimated the effectiveness of the combined 2022 autumn COVID-19 booster vaccination campaign, coinciding with the influenza vaccination campaign, in reducing COVID-19 outcomes among non-clinically vulnerable people aged 50 years in England using a regression discontinuity design and the OpenSAFELY-TPP database.
Methods
Study objective and population
Our primary objective was to estimate the average effect of the combined COVID-19 booster and influenza vaccination campaign on severe COVID-19 outcomes in non-clinically vulnerable people aged 50 years in England who had previously received at least two COVID-19 vaccinations. This population became eligible for the COVID-19 booster and influenza vaccine on 15 October 2022. Our secondary objective was to estimate the effect of the booster vaccine itself among compliers; that is, people who take up the vaccine only when eligible.
We included all adults aged 45-54 years during the study period who were registered at one GP practice for at least 90 days prior to 3 September 2022, and had complete information on age and sex. People considered high risk or clinically vulnerable, who were eligible for booster vaccination earlier in the year were excluded.(10) This included people who were(13): identified as a health or social care workers at time of vaccination; resident in a care or nursing home; or part of any other clinically vulnerable group, specifically with: chronic respiratory disease, chronic heart/vascular disease, chronic kidney disease, chronic liver disease, chronic neurological disease, diabetes, immunosuppression, asplenia, morbid obesity, or severe mental illness; or with evidence of having received a third primary dose of the COVID-19 vaccine which may be a marker of immunosuppression.
We also excluded people who were otherwise ineligible for the booster or unlikely to be vaccinated, including people who: received another COVID-19 vaccine within 90 days prior to 15 October 2022; did not receive the first two primary doses of the COVID-19 vaccine; were housebound; or were receiving end of life care. Supplementary Table 1 lists the exclusion criteria and their definitions. Clinically vulnerable individuals were identified using primary care data using the same approach as described previously.(3,14)
Data Source
All data were linked, stored and analysed securely within the OpenSAFELY platform: https://opensafely.org/. With the approval of NHS England, primary care records managed by the GP software provider TPP were linked, using NHS numbers, to accident and emergency attendance (A&E) and in-patient hospital spell records via NHS Digital’s Hospital Episode Statistics (HES), and national death registry records from the Office for National Statistics (ONS). COVID-19 vaccination history is available in the GP record directly via the National Immunisation Management System (NIMS). The dataset analysed within OpenSAFELY is based on 24 million people currently registered with GP surgeries using TPP SystmOne software.
Study Measures
Outcomes
The primary outcome was a composite of COVID-19-related unplanned hospitalisation, COVID-19-related accident and emergency attendance, or any death from 6-weeks after start of the vaccination campaign. For unplanned hospitalisations, the COVID-19 diagnosis could be either an underlying or contributing cause in any position. Only unplanned admissions were included as these are more likely to be due to incident COVID-19 disease. We did not include a positive SARS-CoV-2 test as an outcome because free testing in England ended in April 2022.
Due to potential misdiagnosis of COVID-19 related outcomes, we also included a composite outcome of respiratory unplanned admission or death. We also examined any unplanned hospitalisations and any death. The definition and codes used to identify outcomes are in Supplementary Table 2.
Exposure
The exposure was booster eligibility, defined as being 50 years or older on the start of follow-up, for vaccination in the autumn 2022 booster campaign. Age was categorised in 3-month intervals. As only month of birth is available in OpenSAFELY, date of birth was set to the 15th of the month. We defined receipt of the autumn booster as a record of a third or fourth COVID-19 vaccination on or after 5 September 2022 (the date autumn boosters first became available), as some people may have been vaccinated prior to eligibility. We constructed cumulative incidence curves of booster coverage to display how separation above/below the threshold changed over the study period.
Study design
We used regression discontinuity, a study design that takes advantage of the threshold of being aged 50 years or older for booster vaccination eligibility and estimates the effectiveness of booster vaccination at this threshold. Threshold-based eligibility mimics randomisation, as the distribution of confounding variables among people just above and below the threshold is expected to be similar.(15) Regression discontinuity has previously been used to estimate the effectiveness of vaccines and vaccination campaigns, such as the first COVID-19 vaccine dose on COVID-19 mortality in England(16) and influenza vaccination in England and Wales.(17)
The population-level effect of booster vaccination is likely to evolve over time, as the proportion of people vaccinated increases, and the prevalence of SARS-CoV-2 infection changes. We therefore included multiple index dates for the start of follow-up as supplementary analyses: 3 September 2022 (before the start of the campaign; negative control); 15 October 2022 (start of the campaign; negative control); and each day between 26 November (6 weeks after the start of the campaign) and 9 December 2022. Follow-up was for 6 weeks after each index date. For each index date, we excluded people who had died or deregistered before that date.
Assumptions
Key assumptions of regression discontinuity are “continuity” (the risk of the outcome is expected to change smoothly at the threshold in absence of the intervention) and “exchangeability” (similar distribution of confounders just below and above the threshold).(15) To test the latter assumption, we plotted the distribution of the following characteristics by age in 3 month intervals: sex (male, female); deprivation, measured by the English Index of Multiple Deprivation (IMD), grouped by quintile of national rank; ethnicity (White, Mixed, Asian/Asian British, Black/Black British, Other, Unknown); practice region (East, East Midlands, London, North East, North West, South East, South West, Yorkshire and The Humber); number of previous COVID-19 vaccine doses. We also quantified receipt of influenza vaccine in the 2022/23 season from July 2022 onwards to examine its discontinuity at threshold. However, ascertainment of influenza vaccination was incomplete, because vaccination delivered by pharmacists outside of general practice is not routinely recorded in electronic health records.
We used both “sharp” and “fuzzy” regression discontinuity designs, which address different research questions.(18,19) The sharp design estimates the effect of the vaccination campaign (rather than receipt of vaccination itself) at the threshold age. As both the second booster and the influenza vaccine became available on the same date at the same age threshold, and thus there is an expected discontinuity in receipt of flu vaccination, this represents the effect of the combined vaccination campaigns.
The fuzzy design estimates the effectiveness of receiving booster vaccination during autumn 2022: those boosted commonly received influenza vaccination as well. Fuzzy regression discontinuity uses vaccine eligibility (being age 50 years or older) as an instrumental variable and estimates the “local average treatment effect” (LATE) or “complier average causal effect” (CACE) at the threshold.(15) This is the effect of vaccination among “compliers” - the subset of the population who are vaccinated only when eligible. The analysis assumes no “defiers” - people who are vaccinated only when ineligible. This assumption is also known as monotonicity. The monotonicity assumption is untestable, but likely to hold in our study. Other assumptions are that the instrumental variable is associated with the outcome only through vaccination at the threshold, that there are no common causes of eligibility and the outcome.(20) These assumptions are also plausible for our study.
Statistical methods
Outcomes were expressed as 6-week risks per 100,000 population. To prevent disclosure, all event counts presented were rounded to the nearest 5 and risks calculated from rounded counts. However, unrounded counts and risks were used in regression modelling. For each index date and follow-up period, only the first event for each individual was counted. The primary bandwidth was 5 years (20 data points representing three-month age intervals) either side of the threshold.
To estimate the effect of being eligible for the COVID-19 autumn booster/influenza vaccine among people aged 50 years (sharp regression discontinuity), we estimated the discontinuity in the risk of each outcome at the threshold by fitting a regression model with age in 3-month intervals (continuous), a binary variable representing the vaccine age threshold (50+ years), and an interaction between the two, allowing the slope of the change in the probability of the outcome by age to vary above and below the threshold. Age was centred, so that “0” represented the threshold.
We next used an instrumental variable approach to estimate the effect of vaccination in people aged 50 who were vaccinated (fuzzy regression discontinuity) with the instrument being 50 years or older on the index date (eligible for vaccination). We used the two-stage least squares estimator. In the first stage, booster coverage before the index date is predicted based on age. The second stage uses the predicted values from the first stage to predict the outcome.(15)
Sensitivity analyses
For the sharp regression discontinuity analysis, we conducted two sensitivity analyses: first, we excluded people born in the index month due to the imprecision in recorded birth date. Second, given the potential for bias with wide bandwidths, we repeated the analysis using progressively smaller bandwidths (4 years, 3 years, 2 years, 1 year). For the fuzzy regression discontinuity analysis, we repeated the analysis including influenza vaccination prior to the index date in the model, and also using different bandwidth periods.
Software and Reproducibility
Data management was performed using Python 3.8, with analysis carried out using R. Code for data management and analysis as well as codelists archived online https://github.com/opensafely/vax-fourth-dose-RD.
Patient and Public Involvement
We have developed a publicly available website https://opensafely.org/ through which we invite any patient or member of the public to contact us regarding this study or the broader OpenSAFELY project.
Ethics approval
This study was approved by the Health Research Authority (Research Ethics Committee reference 20/LO/0651) and by the London School of Hygiene and Tropical Medicine Ethics Board (reference 21863).
Results
We identified 3,254,000 people aged 45-54 years on 3 September 2022. Of these, 1,336,625 (41.1%) were excluded (Table 1). The most common reasons for exclusion were not having received a second primary COVID-19 dose (521,165, 16.0%) or being in a clinically vulnerable group (762,650, 23.4%), most frequently due to diabetes (229,905, 7.1%), chronic heart disease (160,165, 4.9%) or severe obesity (143,720, 4.4%).
All people 45-54 years on 3 September 2022 and reasons for exclusion from the study. People could have multiple exclusion criteria and so may appear in multiple categories. All counts rounded to nearest 5.
Among 1,917,375 people included in analyses, 465,730 (51.4%) of people 45-49 years and 516,310 (51.1%) of people 50-54 years were male and 661,145 (73.0%) and 776,560 (76.8%) were of White ethnicity, respectively (Table 2). Most (743,485 [82.1%] and 885,095 [87.5%]) had received three previous COVID-19 vaccine doses. The median time since the previous dose at the start of the campaign (15 October) was 304 days (interquartile range [IQR], 297-321 days) and 310 days (IQR, 301-328), respectively.
Vaccination coverage
A small number of people in our study population received the booster vaccine prior to its wider availability on 15 October 2022 (Figure 1A). Coverage increased rapidly thereafter and started to plateau by late November (Figure 1A). Six weeks after the wider availability (26 November), booster coverage ranged from 6.1% of people 45.00 years to 51.8% of people aged 54.75 years (Figure 1B). A large discontinuity was observed at the threshold, with the proportion receiving booster vaccination increasing from 11.1% of people aged 49.75 years to 31.1% and 39.7% of people aged 50.00 and 50.25 years respectively.
Coverage (%) of COVID-19 autumn booster by age. A) Cumulative coverage based on age in years at 3 September 2022. The booster vaccination became available to non-clinically vulnerable people aged 50-64 years on 15 October 2022. B) Coverage at 26 November 2022 by age in 3-month intervals.
No discontinuity was observed for any of the demographic variables (sex, IMD quintile, ethnicity, region) (Supplementary Figures 1-4) or number of prior COVID-19 vaccine doses (Supplementary Figure 5). However, we observed a discontinuity in recorded receipt of the 2022/23 influenza vaccine, increasing from 9.7% of people aged 49.25 years to 25.5% of people aged 50.00 years (Supplementary Figure 6). Recorded receipt of influenza vaccine was much more common among people who received COVID-19 booster vaccination. In people aged 45-49 years, 36,515 (53.2%) of those receiving booster vaccination also received influenza vaccination, compared with 46,820 (5.6%) of those who did not receive booster vaccination. Corresponding figures in people aged 50-54 years were 277,745 (64.7%) and 68,340 (11.8%).
Regression discontinuity analysis
Overall, COVID-19 related outcomes (unplanned admission, A&E attendance, or death) were rare; the risks within 6 weeks of 26 November 2022 were 12.7 and 14.4 per 100,000 for people aged 45-49 and 50-54 years respectively (Table 3). During the same time period, the 6-week risks of respiratory unplanned admission or death were 52.0 and 53.5 per 100,000 respectively, any unplanned admission 410.5 and 447.0 per 100,000 respectively, and any death 6.1 and 13.4 per 100,000 respectively.
By age, during the six weeks from 26 November 2022 the risk of the COVID-19 composite outcome was relatively constant between ages 45 and 55 years, with a slight negative slope before age 50 and a slight positive slope thereafter (Figure 2). The estimated effect of the booster vaccination campaign in 50-year-olds on the 6-week risk was -0.4 per 100,000 (95% CI -7.8 to 7.1) (Supplementary Table 3). Similarly, for the respiratory composite outcome the estimated effect on 6-week risk -0.6 per 100,000 (95% CI -13.5 to 12.3) and for any unplanned admission 5.0 per 100,000 (95% CI -40.7 to 50.8). For any death the effect was 3.0 per 100,000 (95% CI -2.7 to 8.6). Figure 3 shows corresponding estimates of the effect of the booster campaign for each index date and each outcome for people at the threshold: the estimates for index date 26 November correspond to those shown in Figure 2. Results were similar using different index dates for all outcomes (Supplementary Table 3). These results also were little changed after excluding people born in the index month (Supplementary Table 4). The results were also robust to different bandwidths, but as expected the confidence intervals were wider for shorter bandwidths (Supplementary Table 5).
Predicted values (dashed line) and 95% confidence interval (shaded area) from sharp regression discontinuity analysis of 6-week outcomes by age in 3-month intervals with 26 November as the index date. The dots are the observed values; to prevent disclosure, rates are calculated from rounded counts and for the COVID-19 composite outcome and any death outcome, observed values for 6-month intervals are presented instead of 3 months for visualisation only.
Estimated difference in 6-week outcome risk per 100,000 among people aged 50 years using different index dates. A positive estimate indicates a higher risk among people eligible to receive the booster vaccination compared with those ineligible. Whiskers represent 95% confidence intervals.
The instrumental variable analysis (fuzzy regression discontinuity) estimates the effectiveness of the booster vaccine in compliers at the threshold. Based on the first stage of the instrumental variable analysis, the estimated proportion of compliers among the population at the threshold is 28%. This analysis found a difference of 2.1 per 100,000 compliers for the COVID-19 composite outcome (95%CI -11.3 to 15.4) (Supplementary Table 6). This is expected given the absence of evidence for a population-level effect of the booster vaccination campaign. Controlling for prior receipt of influenza vaccination did not change the conclusions of the instrumental variable analyses, although confidence intervals became wider (Supplementary Table 7). Similarly, using different bandwidths did not change the findings (Supplementary Table 8).
Discussion
Summary
In this study of over 1.3 million people aged 45-54 years in England, there was a low risk of severe COVID-19 outcomes among non-clinically vulnerable people aged 45-54 years who had received at least two COVID-19 vaccine doses, and only moderate coverage of booster vaccination in people aged ≥50 years in autumn/winter 2022/23. Using a regression discontinuity design, we found little evidence of a population effect of the combined COVID-19 booster and influenza vaccination campaign on severe COVID-19 related events, unplanned respiratory or all-cause admissions or death in people aged 50 years. However, confidence intervals were wide and do not exclude a small effect.
Strengths and weaknesses
The OpenSAFELY-TPP database covers approximately 40% of registered English primary care patients(21) enabling analysis of a large cohort of people aged within five years of the booster vaccination age eligibility threshold. These data include reliable ascertainment of COVID-19 vaccination status, COVID-19-related outcomes, and other clinical characteristics.
Establishing causality using observational data is challenging.(22) Unlike many observational study designs, regression discontinuity analyses are unlikely to be impacted by unmeasured confounding. Vaccination campaigns are well-suited to regression discontinuity designs, as eligibility is often based on a threshold, usually age. There is a reasonable assumption of “exchangeability” of people on either side of this threshold. We checked this assumption and did not find discontinuities for several potentially confounding variables (e.g. deprivation), a notable exception being receipt of influenza vaccination which was subject to the same age eligibility on the same date. Thus, the results should be considered to estimate the combined effect of the COVID-19 booster and influenza vaccination campaigns at age 50 years as we cannot disentangle the effects of the two vaccines.
Our analysis also does not account for non-independence within and across exposure groups, arising from transmissibility of SARS-CoV-2. People below age 50 may indirectly benefit from booster vaccination amongst their slightly older peers due to reduced transmission rates, resulting in lower estimates of effectiveness in the sharp discontinuity design.
Findings in context
Various studies worldwide have demonstrated the effectiveness of a second booster against SARS-CoV-2 infection or severe COVID-19 outcomes when the Omicron variant predominates, with many identifying rapid waning.(23–25) However, the majority of studies focussed on older populations (60+ years), or do not report age-specific effectiveness estimates. Therefore, there are few studies with which to compare our findings. One previous study evaluated the 2022 autumn booster in England using a test-negative design: this found that the effectiveness of three or more COVD-19 vaccine doses against COVID-19 hospitalisation in people 18-64 years was 61.5% compared with unvaccinated people in the first two weeks post-vaccination, falling to 38.9% by 6 months.(26) Additionally, the incremental effectiveness of the booster against hospitalisation among people 50+ years who had previously received at least two doses was estimated to be 43.7% and 47.5% at 5-9 weeks post-vaccination for the Pfizer and Moderna boosters respectively.
Policy implications and interpretation
The effects of vaccination campaigns must be considered in the context in which they are implemented, due to changes in circulating variants, prevalence of previous infections, and changes in vaccine coverage. Our study was conducted during a time of high substantial pre-existing immunity. All participants had received at least two COVID-19 vaccine doses, with 85% having received three prior doses, and natural infection during successive omicron waves in 2022 was widespread. We previously estimated absolute vaccine effectiveness (compared with unvaccinated individuals) against COVID-19 hospitalisation of >80% at 6 months after primary vaccination with ChAdOx1-S among individuals aged 40–64 years in England, and this protection would have been enhanced among individuals who received a vaccine booster dose.(3) The resulting high baseline immunity in our study population may have reduced the effectiveness of the autumn 2022 booster campaign.
The autumn 2022 booster campaign in England was initiated to mitigate a probable winter COVID-19 wave. Vaccination campaigns are planned based on the best available knowledge at the time and cannot necessarily anticipate how widespread the wave will be. During our study period, the estimated prevalence of SARS-CoV-2 infection was between 2% and 3% among people 35-69 years, meaning that the risk of severe COVID-19 was low.(27) Our study estimated the impact of the autumn booster campaign in 50 year olds who were not clinically vulnerable, a group already at low risk of severe COVID-19. Based on the confidence intervals for the estimated effect, we cannot exclude a small reduction in the risk of severe outcomes. Booster vaccine effectiveness during autumn 2022 may have been higher in older adults, as was reported for booster doses administered in autumn 2021.(7)
There are other potential benefits of booster vaccination that we were not able to estimate. Illness that does not result in hospitalisation or death can still lead to loss of productivity and potentially long-term post-acute sequelae of COVID-19.(28,29) Further, vaccination of non-high risk individuals can indirectly benefit the rest of the community by limiting transmission. However, booster coverage was modest (40% in people aged 50.25 years). These findings may also not hold in future waves, with different SARS-CoV-2 transmissibility, circulating variants, and infection prevalence.
While COVID-19 booster vaccinations are extremely safe,(30) no pharmacological intervention is without risk. The relative risks and benefits of health interventions need to be considered, and the balance is likely to shift over time as the pandemic matures. For instance, in March 2023 the WHO Strategic Advisory Group of Experts on Immunization updated their guidance and does not recommend second boosters for non-high risk individuals, which includes healthy people aged <60 years.(31) Thus, there may be a case for changing eligibility thresholds in future campaigns, especially if vaccine costs and availability are a consideration, and there are expectations of low vaccine coverage. Continued monitoring of vaccination campaigns for emerging signals is warranted.
Conclusion
In this study, we found little evidence that the autumn 2022 COVID-19 booster vaccination (coinciding with the influenza vaccination campaign) in England reduced severe COVID-19-related outcomes among non-clinically vulnerable 50-year-olds. We cannot rule out other impacts of the booster vaccination campaign on the 2022/23 winter COVID-19 wave, such as reducing virus transmission and reducing rates of mild or moderate COVID-19.
Data Availability
Access to the underlying identifiable and potentially re-identifiable pseudonymised electronic health record data is tightly governed by various legislative and regulatory frameworks, and restricted by best practice. The data in OpenSAFELY is drawn from General Practice data across England where TPP is the Data Processor. TPP developers initiate an automated process to create pseudonymised records in the core OpenSAFELY database, which are copies of key structured data tables in the identifiable records. These are linked onto key external data resources that have also been pseudonymised via SHA-512 one-way hashing of NHS numbers using a shared salt. DataLab developers and PIs holding contracts with NHS England have access to the OpenSAFELY pseudonymised data tables as needed to develop the OpenSAFELY tools. These tools in turn enable researchers with OpenSAFELY Data Access Agreements to write and execute code for data management and data analysis without direct access to the underlying raw pseudonymised patient data, and to review the outputs of this code. All code for the full data management pipeline, from raw data to completed results for this analysis, and for the OpenSAFELY platform as a whole is available for review at github.com/OpenSAFELY.
Administrative
Conflicts of Interest
BG has received research funding from the Bennett Foundation, the Laura and John Arnold Foundation, the NHS National Institute for Health Research (NIHR), the NIHR School of Primary Care Research, NHS England, the NIHR Oxford Biomedical Research Centre, the Mohn-Westlake Foundation, NIHR Applied Research Collaboration Oxford and Thames Valley, the Wellcome Trust, the Good Thinking Foundation, Health Data Research UK, the Health Foundation, the World Health Organisation, UKRI MRC, Asthma UK, the British Lung Foundation, and the Longitudinal Health and Wellbeing strand of the National Core Studies programme; he is a Non-Executive Director at NHS Digital; he also receives personal income from speaking and writing for lay audiences on the misuse of science.
Funding
The OpenSAFELY Platform is supported by grants from the Wellcome Trust (222097/Z/20/Z) and MRC (MR/V015757/1, MC_PC-20059, MR/W016729/1). In addition, development of OpenSAFELY has been funded by the Longitudinal Health and Wellbeing strand of the National Core Studies programme (MC_PC_20030: MC_PC_20059), the NIHR funded CONVALESCENCE programme (COV-LT-0009), NIHR (NIHR135559, COV-LT2-0073), and the Data and Connectivity National Core Study funded by UK Research and Innovation (MC_PC_20058), and Health Data Research UK (HDRUK2021.000, 2021.0157). EPKP is funded by the NIHR Health Protection Research Unit in Vaccines and Immunisation (NIHR200929), a partnership between the UK Health Security Agency and the London School of Hygiene & Tropical Medicine.
The views expressed are those of the authors and not necessarily those of the NIHR, NHS England, UK Health Security Agency (UKHSA) or the Department of Health and Social Care.
Funders had no role in the study design, collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the article for publication.
Information governance
NHS England is the data controller of the NHS England OpenSAFELY COVID-19 Service; TPP is the data processor; all study authors using OpenSAFELY have the approval of NHS England.(1) This implementation of OpenSAFELY is hosted within the TPP environment which is accredited to the ISO 27001 information security standard and is NHS IG Toolkit compliant.(2)
Patient data has been pseudonymised for analysis and linkage using industry standard cryptographic hashing techniques; all pseudonymised datasets transmitted for linkage onto OpenSAFELY are encrypted; access to the NHS England OpenSAFELY COVID-19 service is via a virtual private network (VPN) connection; the researchers hold contracts with NHS England and only access the platform to initiate database queries and statistical models; all database activity is logged; only aggregate statistical outputs leave the platform environment following best practice for anonymisation of results such as statistical disclosure control for low cell counts.(3)
The service adheres to the obligations of the UK General Data Protection Regulation (UK GDPR) and the Data Protection Act 2018. The service previously operated under notices initially issued in February 2020 by the Secretary of State under Regulation 3(4) of the Health Service (Control of Patient Information) Regulations 2002 (COPI Regulations), which required organisations to process confidential patient information for COVID-19 purposes; this set aside the requirement for patient consent.(4) As of 1 July 2023, the Secretary of State has requested that NHS England continue to operate the Service under the COVID-19 Directions 2020.(5) In some cases of data sharing, the common law duty of confidence is met using, for example, patient consent or support from the Health Research Authority Confidentiality Advisory Group.(6)
Taken together, these provide the legal bases to link patient datasets using the service. GP practices, which provide access to the primary care data, are required to share relevant health information to support the public health response to the pandemic, and have been informed of how the service operates.
(1) NHS Digital. The NHS England OpenSAFELY COVID-19 service - privacy notice [Internet]. 2023 [cited 2023 Jul 5]. Available from: https://digital.nhs.uk/coronavirus/coronavirus-covid-19-response-information-governance-hub/the-nhs-england-opensafely-covid-19-service-privacy-notice
(2) NHS Digital. NHS Digital. 2023 [cited 2023 Jul 5]. Data Security and Protection Toolkit. Available from: https://digital.nhs.uk/data-and-information/looking-after-information/data-security-and-information-governance/data-security-and-protection-toolkit
(3) NHS Digital [Internet]. [cited 2023 Mar 6]. ISB1523: Anonymisation Standard for Publishing Health and Social Care Data. Available from: https://digital.nhs.uk/data-and-information/information-standards/information-standards-and-data-collections-including-extractions/publications-and-notifications/standards-and-collections/isb1523-anonymisation-standard-for-publishing-health-and-social-care-data
(4) UK Department of Health and Social Care. GOV.UK. 2022 [cited 2023 Jul 5]. [Withdrawn] Coronavirus (COVID-19): notice under regulation 3(4) of the Health Service (Control of Patient Information) Regulations 2002 – general. Available from: https://www.gov.uk/government/publications/coronavirus-covid-19-notification-of-data-controllers-to-share-information/coronavirus-covid-19-notice-under-regulation-34-of-the-health-service-control-of-patient-information-regulations-2002-general--2
(5) NHS Digital. NHS Digital. 2022 [cited 2023 Jul 5]. Secretary of State for Health and Social Care: COVID-19 Public Health Directions 2020. Available from: https://digital.nhs.uk/about-nhs-digital/corporate-information-and-documents/directions-and-data-provision-notices/secretary-of-state-directions/covid-19-public-health-directions-2020
(6) NHS Health Research Authority. Health Research Authority. [cited 2023 Jul 5]. Confidentiality Advisory Group. Available from: https://www.hra.nhs.uk/about-us/committees-and-services/confidentiality-advisory-group/
Supplementary files
List of exclusion criteria and their definition. Unless otherwise stated, all criteria were identified using primary care data. Codelsits with all codes used to define criteria area available in the Github repository: https://github.com/opensafely/vax-fourth-dose-RD
Sex relative frequency distribution by age in 3-month intervals based on age at 3 September 2022.
IMD relative frequency distribution by age in 3-month intervals based on age at 3 September 2022
Ethnicity relative frequency distribution by age in 3-month intervals based on age at 3 September 2022
Region relative frequency distribution by age in 3-month intervals based on age at 3 September 2022
Relative frequency distribution of previous number of COVID-19 vaccine doses by age in 3-month intervals based on age at 3 September 2022
Coverage of 2022/23 influenza vaccination by 26 November 2023 by age in 3-month intervals
Estimates from sharp regression discontinuity analysis estimating change in 6-week outcomes at threshold (50 years) using different index dates. Bold indicates primary analysis.
Estimates from sharp regression discontinuity analysis excluding people born in the index month, estimating change in 6-week outcomes at threshold (50 years) using different index dates. Bold indicates primary analysis.
Estimates from sharp regression discontinuity analysis estimating change in 6-week outcomes at threshold (50 years) using different bandwidths for primary index date (26 November 2022).
Instrumental variable analysis (fuzzy regression discontinuity) estimating local average treatment effect (LATE) for 6-week outcomes with different index dates. Bold indicates primary analysis.
Instrumental variable analysis (fuzzy regression discontinuity) estimating local average treatment effect (LATE) for 6-week outcomes with different index dates including receipt of influenza vaccination in the model. Bold indicates primary analysis.
Estimates from fuzzy regression discontinuity analysis estimating change in 6-week outcomes at threshold (50 years) using different bandwidths for primary index date (26 November 2022).
Acknowledgements
We are very grateful for all the support received from the TPP Technical Operations team throughout this work, and for generous assistance from the information governance and database teams at NHS England and the NHS England Transformation Directorate.
Members of the The OpenSAFELY Collaborative include: Sebastian CJ Bacon, Lucy Bridges, Benjamin FC Butler-Cole, Simon Davy, Iain Dillingham, David Evans, Ben Goldacre, Amelia Green, Liam Hart, George Hickman, Peter Inglesby, Steven Maude, Jessica Morley, Amir Mehrkar, Thomas O’Dwyer, Rebecca M Smith, Tom Ward, Jon Massey, Milan Wiedemann, Christopher Bates, Jonathan Cockburn, Sam Harper, Frank Hester, John Parry.
Footnotes
↵^ Membership of the OpenSAFELY Collaborative is included in the Acknowledgements