Abstract
Background England, UK has experienced a large outbreak of SARS-CoV-2 infection. As in USA and elsewhere, disadvantaged communities have been disproportionately affected.
Methods National REal-time Assessment of Community Transmission-2 (REACT-2) seroprevalence study using self-administered lateral flow immunoassay (LFIA) test for IgG among a random population sample of 100,000 adults over 18 years in England, 20 June to 13 July 2020.
Results Completed questionnaires were available for 109,076 participants, yielding 5,544 IgG positive results and adjusted (for test performance), re-weighted (for sampling) prevalence of 6.0% (95% CI: 5.8, 6.1). Highest prevalence was in London (13.0% [12.3, 13.6]), among people of Black or Asian (mainly South Asian) ethnicity (17.3% [15.8, 19.1] and 11.9% [11.0, 12.8] respectively) and those aged 18-24 years (7.9% [7.3, 8.5]). Care home workers with client-facing roles had adjusted odds ratio of 3.1 (2.5, 3.8) compared with non-essential workers. One third (32.2%, [31.0-33.4]) of antibody positive individuals reported no symptoms. Among symptomatic cases, the majority (78.8%) reported symptoms during the peak of the epidemic in England in March (31.3%) and April (47.5%) 2020. We estimate that 3.36 million (3.21, 3.51) people have been infected with SARS-CoV-2 in England to end June 2020, with an overall infection fatality ratio of 0.90% (0.86, 0.94).
Conclusion The pandemic of SARS-CoV-2 infection in England disproportionately affected ethnic minority groups and health and care home workers. The higher risk of infection in these groups may explain, at least in part, their increased risk of hospitalisation and mortality from COVID-19.
Introduction
England has experienced a large outbreak of SARS-CoV-2 infection leading to the highest excess mortality in Europe by June 2020. (1) The first COVID-19 death occurred on 28 February, with in-hospital deaths peaking at 800 per day within 6 weeks. (2) Hospital admission and mortality data show an asymmetrical burden of COVID-19 in England, with high rates in older people including those living in long-term care, and in people of minority ethnic groups, particularly Black and Asian (mainly South Asian) individuals. (3–6) It is unclear how much of this excess is due to differences in exposure to the virus, e.g. related to workplace exposures and structural inequality, and how much is due to differences in outcome, including access to health care. (7,8)
Antibody data provide a long-lasting measure of SARS-CoV-2 exposure, enabling analyses of the recent epidemic. Most infected people mount an IgG antibody response detectable after 14-21 days although levels may start to wane after ~90 days. (9,10) Uncertain validity of the antibody tests used, variation in sampling, limited numbers and use of selected groups have made many studies difficult to interpret (11), although different criteria may apply for community-based studies where population-wide rather than individual results is the focus. (10–15). Self-administered lateral flow immunoassay (LFIA) tests done at home offer a method for obtaining community-wide seroprevalence estimates rapidly and at scale, at reasonable cost. While there have been questions about their use for individual care, (16–18) reliable population prevalence estimates can be obtained by adjusting for known test performance.(19)
We aimed here to i) estimate the cumulative community seroprevalence of IgG antibodies for SARS-CoV-2 and ii) identify those at most risk of infection, from a large representative sample in England up to early July 2020. We use these data to iii) estimate the total number of infected individuals In England and the infection fatality ratio (IFR).
Methods
The REal-time Assessment of Community Transmission-2 (REACT-2) programme is evaluating community seroprevalence of SARS-CoV-2 infection in England. (20) We obtained a random population sample of adults in England, using the National Health Service (NHS) patient list, which includes name, address, age and sex of everyone registered with a general practitioner (almost the entire population). Personalized invitations were sent to 315,000 individuals aged 18 years and above to achieve similar numbers in each of 315 lower-tier local authority areas. Participants registered via an online portal or by telephone with registration closed after ~120,000 people had signed up.
Those registered were sent a test kit, including a self-administered point-of-care LFIA test and instructions by post, with link to an on-line video. The LFIA (Fortress Diagnostics, Northern Ireland) was selected following evaluation of performance characteristics (sensitivity and specificity) against pre-defined criteria for detection of IgG, (21) and extensive public involvement and user testing. (22) Compared to results from at least one of two in house ELISAs, sensitivity and specificity of finger-prick blood (self-read) were 84.4% (70.5, 93.5) in PCR confirmed cases and 98.6% (97.1, 99.4) in 500 pre-pandemic sera. (21) Participants completed a short registration questionnaire (online/telephone) and a further survey upon completion of their self-test. This included information on demographics, household composition, recent symptoms and a photo of the result. A validation study using photographs showed substantial concordance between participant- and clinician-interpreted results in over 500 tests (kappa: 0.89, 95% Cl: 0.88-0.92) (22).
Seroprevalence was calculated as the proportion of individuals with a positive IgG result, adjusted for test performance using: where p = adjusted proportion positive, q = observed proportion positive. (19) In addition, prevalence estimates at national level were weighted for age, sex, region, ethnicity and deprivation to account for differential sampling (geographic) and for variation in response rate so as to be representative of the population (18+ years) of England. Logistic regression models were adjusted for age, sex and region, and additionally for ethnicity, deprivation, household size and occupation. We used complete case analysis without imputation.
We estimated total number of SARS-CoV-2 infections since start of the epidemic until 17 July 2020 by multiplying the antibody prevalence, adjusted for test characteristics and re-weighted for sampling, by mid year population size at ages 18+ years in England (23). To correct for survival bias we further added to the seropositive population the deaths that mentioned COVID-19 on the death certificate during this period. We then estimated the IFR, dividing the total number of COVID-19 deaths excluding care home residents.(14) We obtained an overall IFR estimate and estimates stratified by age and gender.(24) Confidence bounds were obtained by using the Delta method. As a sensitivity analysis we calculated IFR and total infections including care home residents and with all-cause excess deaths (for data sources see supplementary table 2).
We obtained research ethics approval from the South Central-Berkshire B Research Ethics Committee (IRAS ID: 283787), and received MHRA approval for use of the LFIA for research purposes
Data were analysed using the statistical package R version 4.0.0. (25)
Results
Of the 121,976 people who were sent LFIA test kits, 109,076 (89.4%) completed the questionnaire of whom 105,651 also completed the test; 5,743 (5.4%) reported an invalid or unreadable result leaving 99,908 (94.6%) individuals, 5,544 IgG positive and 94,364 IgG negative, giving a crude prevalence of 5.6% (95% Cl 5.4-5.7). After adjusting for the performance characteristics of the test and re-weighting, overall prevalence for England was 6.0% (95% Cl: 5.8-6.1) during the period 20 June to 13 July 2020. This equates to 3.36 (3.22, 3.51) million adults in England who had been infected with SARS-CoV-2 in England to end June 2020.
Prevalence was highest for ages 18-24 years and in London (Table 1). Highest prevalence by ethnic group was found in people of Black (includes Black Caribbean, African and Black British) (17.3%, 95% Cl 15.8,19.1) and Asian (mainly South Asian) ethnicities (11.9%, 95% Cl 11.0,12.8), compared to 5.0% (4.8, 5.2) in people of white ethnicity. (Table 1). The association of seroprevalence with non-white ethnicities was partially but not fully explained by the covariates. For example, in the unadjusted model, compared to white ethnicity, Black ethnicity was associated with a three-fold increase in odds of being antibody positive (OR 3.2, 95%CI 2.7, 3.9) which reduced to OR 2.0 (1.6, 2.4) after adjustment for age, sex, ethnicity, region, deprivation, household size and employment (Figure 1, Table S1). Essential workers, particularly those with public-facing roles, also had increased seroprevalence. Those working in residential care facilities (care homes) with client-facing roles had a prevalence of 16.5% (95% Cl 13.7, 19.8) and health care workers with patient contact a prevalence of 11.7% (95% Cl 10.5-13.1), with 3-fold (3.09; 2.51,3.80) and 2-fold (2.09; 1.86,2.35) odds of infection respectively compared with non-essential workers (Table 2, Figure 1). Those in the more deprived areas or living in larger households had higher prevalence than those in more affluent areas or who lived alone, although these gradients were partially attenuated in the adjusted models (Figure 1, Table S2). People who currently smoked had a lower prevalence (3.2%, 2.8, 3.7) than those who did not (5.2%, 5.0, 5.4), OR 0.64 [0.58,0.71).
Figure 2 shows how the epidemic evolved between January and June 2020, generating an epidemic curve based on data from the present study, based on those with antibodies who reported suspected or confirmed COVID-19 and provided a date (n=3,493). The top left plot (A) shows this alongside national mortality for England by date of death which tracks 2-3 weeks behind our epidemic curve, which peaked in the first week of April. The other panels show the proportionate distribution of cases from our data by (B) region, (C) ethnicity and (D) work type. The epidemic was widely distributed across regions; as the epidemic grew there was a shift during lockdown towards a greater proportion of cases in minority ethnic groups, and in essential workers, particularly those in people-facing roles in care homes and health care. These data exclude asymptomatic individuals and symptomatic people whose date of infection was unknown.
Clinical presentation
Of the 5,544 IgG positive people, 3,406 (61.4%; 60.1, 62.7) reported one or more typical symptoms (fever, persistent cough, loss of taste or smell), 353 (6.4%; 5.8, 7.0) reported atypical symptoms only, and 1,785 (32.2%; 31.0, 33.4) reported no symptoms. This varied by age, with people over 65 being more likely to report no symptoms (392/801, 48.9%, 45.4, 52.4) than those aged 18-34 (418/1,393, 30.0%, 27.6, 32.4) or 35-64 years (975/3,350, 29.1%, 27.6, 30.6), (p<0.001) (Table SI and S2); prevalence was higher in those with more severe symptoms, and who had contact with a confirmed or suspected case (Figure 1, Table S2). Those who were overweight or obese had higher prevalence than those with normal weight, and current smokers had a lower prevelance than non-smokers (3.2% vs. 5.2% (OR 0.64 [0.58,0.71]).
Infection Fatality Ratio
The estimated community IFR (excluding care homes) was 0.9% (0.86, 0.94), increasing with age and higher in males (1.07%, 1.00,1.15) than females (0.71%, 0.67, 0.75) (Table 3). Sensitivity analyses indicate an IFR as high as 1.58% (1.51%, 1.65%) if excess rather than COVID-specific deaths are used and care home deaths are included (Table S2).
Discussion
This is to our knowledge, the largest community-based evaluation of SARS-CoV-2 antibody prevalence, and the only nationwide study based on unsupervised use of LFIA tests at home. It shows an overall prevalence of 6.0% in England, with the epidemic widely dispersed geographically. Overall we estimate that 3.4 million adults had been infected with SARS-CoV-2 virus in England to the end of June 2020, with the majority of people who developed antibodies reporting symptoms during the peak of the epidemic in March and April 2020. This is broadly consistent with other surveillance systems and from a smaller national study of serological surveillance which reported a 26) Our findings are also broadly in line with large prevalence surveys in other countries which have shown a relatively low overall prevalence, despite apparently large outbreaks, and far below what would be necessary to confer some level of herd immunity. (14,27)
We have shown that as the epidemic increased in March and April it became more concentrated in specific groups including Black, Asian and other minority ethnic groups, essential workers, particularly those working in long-term residential care and health care workers, those living in larger households and deprived areas. While these associations were partially attenuated in the adjusted analyses, they remained important predictors, and reflect starkly uneven experience of the COVID-19 epidemic across society. This key finding of unequal burden of infection supports evidence emerging from other countries, mostly from hospital-based studies and mortality data. In the USA local and clinical studies indicate a higher rate of infection in Black and Hispanic populations. One clinical study in Louisiana reported a high proportion of cases but no increased mortality in Black non-Hispanic people; a similar pattern was observed in US Veterans and in national estimates of differential mortality. (28–30)
Our study has the advantage of including ethnicity data alongside information about employment, deprivation, household size and other potential confounders. This allows a more nuanced exploration of the underlying mechanisms for the unequal risks (8). We found that while some of the increased risk of infection in minority ethnic groups was explained by these other factors, odds of infection remained two-fold and 1.4-fold higher for people of Black and Asian ethnicities respectively compared with white ethnicity after adjusting for these factors, indicating that higher rates of infection are a component of the higher hospitalisation and mortality rates. However, more research is needed to better understand whether underlying structural inequalities, occupational or other factors may have contributed to this pattern of risks in non-white populations. (31)
In common with some other studies we found that current smokers have a lower prevalence of SARS-CoV-2 infection than non-smokers. (6,32–33) Reasons for this observation are unclear; further research is needed as to whether it relects unmeasured confounding, differential adoption of preventive behaviours or whether there may be some biological basis (34).
Our estimated IFR of 0.9% is in line with a recent large study in Spain which reported 0.83% to 1.07%, lower than described in Italy at 2%, and higher than the 0.38% reported from a German study. (35–37) The overall IFR is dependent on the age and sex distribution of infection, and stratified estimates maybe more informative. In estimating the IFR, we may have underestimated the number of infected individuals (leading to higher estimates of IFR), as a result of weakened or absent antibody response in some people, and waning antibody over time. (6) Like many countries, England experienced high numbers of cases and deaths in care home residents which have accounted for just under half of all deaths. (38) While the high rates of infection in care home workers may have spilled over into the community, the care home and community epidemics were somewhat separate. We excluded care home mortality from our estimates of community IFR, especially as only a relatively small number of care home residents were included in our study.
The clinical spectrum of infection is wide, with just under one third (32%) of people with antibodies reporting no symptoms, and this proportion was higher in people over 65 years (49%) as also reported for individuals in long-term care. (40). The national seroprevalence study in Spain re[ported that 28.5% or 32.7% were asymptomatic depending on the test (14), similar to our findings overall, although a systematic review of 16 clinical studies puts the figure at 40-50% (39).
The high prevalence in individuals with people-facing roles in social and health care suggests that infection control in the early months of the epidemic was inadequate in these settings and likely placed both staff and residents/ patients at risk. Together with our finding that almost 50% of infections in people over 65 were asymptomatic, this emphasises the need for expanded testing in the care home setting in particular among staff and residents.
Our study has a number of limitatons. As in almost all population surveys, our study showed unequal participation, with lower response from ethnic minority groups and people in more deprived areas. We re-weighted the sample to account for differential response, although this may not have overcome unknown participation biases. An important limitation was the exclusion of children for regulatory reasons as the tests were approved for research use in adults only. We used self-administered home LFIA tests as opposed to “gold standard” laboratory tests based on a blood draw. However, we carried out extensive evaluation of the selected LFIA whch showed it to have acceptable performance in terms of both sensitivity and specificity in comparison with the confirmatory laboratory tests (21). We also took steps to measure and improve usability, including ability to perform and read an LFIA test, through public involvement and evaluation in a national study of 14,000 people. (22).
Use of the LFIA enabled us to obtain antibody tests on large numbers, without the need for laboratory or health care personnel, over an 18-day period. Antibodies were strongly associated with clinical history of confirmed or suspected COVID-19, providing face validity; although there was a theoretical potential for reporting bias as respondents were not blinded to their test results, there was high concordance of self-reported with clinician-read results from the uploaded photos. Our results closely tracked other indicators of the epidemic curve and we believe that use of home-based self-tests is a sustainable model for community-based seroprevalence studies in other populations - to provide reliable estimates of the timing, extent and at risk groups of the epidemic, avoiding the biases of studies carried out in e.g. health care or work settings.
In conclusion, our finding of substantial inequalities in SARS-CoV-2 prevalence by ethnicity and social deprivation shows the underlying importance of excess risk of exposure in these groups, and counters suggestions that the excess risk is due predominantly to comorbidities or other biological factors. The higher risk of infection in these groups may explain, at least in part, their increased risk of hospitalisation and mortality from COVID-19.
Data Availability
The original datasets generated or analysed, or both, during this study are not publicly available because of governance restrictions and the identifiable nature of the data.
Contributions
HW, CA, PE designed the study and drafted the manuscript MW, KECA, JE, LO, RR conducted the analyses HW, CA, GC, DA, CAD, WB, AD, GC, SR, PE, provided study oversight AD and PE obtained funding
All authors have reviewed and approved the final manuscript
Funding
This work was funded by the Department of Health and Social Care in England.
The content of this manuscript and decision to submit for publication were the responsibility of the authors and the funders had no role in these decisions.
Acknowledgements
HW is a NIHR Senior Investigator and acknowledges support from NIHR Biomedical Research Centre of Imperial College NHS Trust, NIHR School of Public Health Research, NIHR Applied Research Collaborative North West London, Wellcome Trust (UNS32973).
GC is supported by an NIHR Professorship. WSB is the Action Medical Research Professor, AD is an NIHR senior investigator and DA is an Emeritus NIHR Senior Investigator.
SR acknowledges support from MRC Centre for Global Infectious Disease Analysis, National Institute for Health Research (NIHR) Health Protection Research Unit (HPRU), Wellcome Trust (200861/Z/16/Z, 200187/Z/15/Z), and Centres for Disease Control and Prevention (US, U01CK0005-01-02)
PE is Director of the MRC Centre for Environment and Health (MR/L01341X/1, MR/SO 19669/1). PE acknowledges support from the NIHR Imperial Biomedical Research Centre and the NIHR HPRUs in Environmental Exposures and Health and Chemical and Radiation Threats and Hazards, the British Heart Foundation Centre for Research Excellence at Imperial College London (RE/18/4/34215) and the UK Dementia Research Institute at Imperial (MC_PC_17114).
We thank key collaborators on this work -- Ipsos MORI: Stephen Finlay, John Kennedy, Kevin XXXX, Duncan Peskett, Sam Clemens and Kelly Beaver; Institute of Global Health Innovation at Imperial College: Gianluca Fontana, Dr Hutan Ashrafian, Sutha Satkunarajah and Lenny Naar; NHS Digital for access to the NHS Register.