Abstract
Background England has experienced one of the highest rates of confirmed COVID-19 mortality in the world. SARS-CoV-2 virus has circulated in hospitals, care homes and the community since January 2020. Our current epidemiological knowledge is largely informed by clinical cases with far less understanding of community transmission.
Methods The REal-time Assessment of Community Transmission (REACT) study is a nationally representative prevalence survey of SARS-CoV-2 virus swab-positivity in the community in England. We recruited participants regardless of symptom status.
Results We found 159 positives from 120,610 swabs giving an average prevalence of 0.13% (95% CI: 0.11%,0.15%) from 1st May to 1st June 2020. We showed decreasing prevalence with a halving time of 8.6 (6.2, 13.6) days, implying an overall reproduction number R of 0.57 (0.45, 0.72). Adults aged 18 to 24 yrs had the highest swab-positivity rates, while those >64 yrs had the lowest. Of the 126 participants who tested positive with known symptom status in the week prior to their swab, 39 reported symptoms while 87 did not, giving an estimate that 69% (61%,76%) of people were symptom-free for the 7 days prior testing positive in our community sample. Symptoms strongly associated with swab-positivity were: nausea and/or vomiting, diarrhoea, blocked nose, loss of smell, loss of taste, headache, chills and severe fatigue. Recent contact with a known COVID-19 case was associated with odds of 24 (16, 38) for swab-positivity. Compared with non-key workers, odds of swab-positivity were 7.7 (2.4, 25) among care home (long-term care facilities) workers and 5.2 (2.9, 9.3) among health care workers. However, some of the excess risk associated with key worker status was explained by recent contact with COVID-19 cases. We found no strong evidence for geographical variability in positive swab results.
Conclusion Our results provide a reliable baseline against which the impact of subsequent relaxation of lockdown can be assessed to inform future public health efforts to control transmission.
Introduction
To date, the UK has experienced the third highest number of reported COVID-19 deaths after the United States and Brazil1. In response to rapid growth in health care demand, lockdown was implemented nationally on 23rd March and daily mortality in the UK peaked around 11th April. As of 9th July 2020, 44,517 deaths in people with a positive test had occurred in the UK2.
Until recently, testing for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2, cause of COVID-19) has focused mainly on health care3,4 and care home settings5,6. While there is evidence of transmission of SARS-CoV-2 virus and high levels of mortality in these settings, less attention has been paid to levels of the virus circulating in the community, especially among those not exhibiting symptoms. This is a critical knowledge gap since community transmission will play an increasingly important part in the epidemiology of the COVID-19 pandemic as lockdown measures are lifted7.
Understanding patterns of infection in the population and the risk by social, demographic and geographical factors is therefore crucial to help control the epidemic and prevent a second wave of infection. We report here findings from the REal-time Assessment of Community Transmission (REACT) study, a large population-based programme designed to establish prevalence of current SARS-CoV-2 infection across England toward the end of the lockdown period. In addition, it provides an opportunity to measure the growth rate of prevalence of infection in the community and hence infer the reproduction number (R).
Methods
Recruitment of participants
We sent a letter to a nationally representative sample of the population among the 315 lower-tier local authorities (LTLAs) in England. Participants were randomly selected from the
National Health Service (NHS) list of patients registered with a general practitioner, aged five yrs and over.
Covariates
Data on age, sex, address and postcode were available from the NHS register. Information on key worker status, ethnicity, smoking, contact with known or suspected COVID-19 cases, and symptoms were obtained by online questionnaire or telephone. Participants could either report no symptoms or select one or more from a list of 26 symptoms in the past six months, and whether or not they had experienced symptoms in the last week. Among those who reported symptoms in the last week, we were not able to identify which of their symptoms in the previous 6 months had occurred in the previous week. For analysis, we assumed that any of the reported symptoms could have occurred in the last week.
Nose and throat swab collection, sample handling and testing
Participants were given written and video instructions to obtain a self-administered nose and throat swab (parent /guardian administered for children aged 5-12 years). Initially swabs in viral transport medium were used and sent to one of four Public Health England (PHE) laboratories for analysis (n= 8,595 swabs with reported result). Subsequently we switched to obtaining dry swabs (n= 112,025). We asked participants to refrigerate the sample and book pick-up of the swab within 24 hours. Swabs were collected by courier and delivered to a commercial laboratory using a cold chain (4° to 8°C) to maintain sample integrity. Samples were tested using reverse-transcription--polymerase-chain-reaction (RT-PCR). A calibration study was undertaken to ensure comparability of test results across study laboratories and results for the commercial laboratory and PHE samples were then combined.
PCR Calibration experiments
We observed that the proportion of positive results from the commercial laboratory was substantially higher than from the Public Health England (PHE) laboratories. It was apparent that the commercial laboratory was routinely reporting as positive, on testing by RT-PCR, samples with high Ct values for the N-gene target, although the E-gene target was not detected.
To reconcile these differences, we conducted three separate calibration experiments. First, 10 RNA extraction plates were sent from the commercial laboratory to two NHS accredited laboratories for blinded re-analysis. Results were concordant for 919 negative samples and all 40 controls. We detected viral RNA in 11 of the 19 samples reported positive by the commercial laboratory (N-gene Ct-value range 16.5 to 40.7); 10 of these 11 samples had an N-gene Ct value < 37. Second, the commercial laboratory conducted a serial dilution experiment of known positive samples with high viral load to assess Ct thresholds at the limit of detection. Third, a further 40 unblinded positive samples (on 19 plates) with Ct values (N-gene) > 35 (range 35.7 to 46.8) and without a signal for the E-gene were selectively re-analysed in a PHE reference laboratory; SARS CoV0-2 RNA was detected in 15/40 (38%) (2/4 with N-gene Ct value < 37). As a result of these calibration experiments, we report swab-positivity for positive samples reported by the commercial laboratory where N-gene Ct values < 37 or where virus was detected by both N-gene and E-gene targets.
Analysis
To investigate association of different covariates with swab-positivity, we performed univariate logistic regression to obtain unadjusted odds ratios and 95% confidence intervals. Test result (positive/negative) was used as the outcome variable. We also used multi-variable models to adjust for age and sex and then additionally ethnicity, region, and key worker status. When assessing the difference in test result by age, we compared models with a categorical term for age (Table 1) as well as a smooth term.
To investigate trends in swab positivity rate over time, we used an exponential decay model, assuming that the number of positive samples each day arose from a binomial distribution. Posterior credibility intervals were obtained using a bivariate Metropolis-Hasting algorithm with uniform priors. We used day of swabbing (reported for 72% of participants for whom a laboratory result was available) or, if unavailable, day of collection (increasing the number of participants for which a date was available to 92%). To estimate R8, we assumed a generation time distribution of six days, comprising exponential latent and infectious periods of three days each.
Clustering of prevalence at the level of LTLAs was assessed by testing the null hypothesis that the number of positive swabs from each LTLA arose from a binomial process with fixed probability q from n samples, where n was the number of tested swabs from that LTLA. We assessed our power to detect clustering in LTLAs by hypothesising that (1-q) LTLAs would have no swab-positive with the remaining infection being distributed evenly among the remaining proportion q of LTLAs. If that were the case, our study was powered to detect q < ∼0.6 (that is clustering in up to 60% of LTLAs) with 80% power (Figure 1).We present results of the main data using complete case analysis without imputation. Analysis was conducted in R version 4.0.09. Logistic regression models were implemented in the base function glm or with smooth function using the mgcv package10. We used changes in Akaike information criterion (AIC) to assess improvement in model performance.
Ethics
We obtained research ethics approval from the South Central-Berkshire B Research Ethics Committee (IRAS ID: 283787).
Results
Sample and response rate
Letters were sent to 394,970 persons (base sample), of whom 161,771 (41%) (or parent/guardian) registered for the study, were sent a nose and throat swab kit, and were asked to complete a questionnaire. Swab results were available for 120,620 individuals (achieved sample, 31% of base sample). The achieved sample was broadly representative of the base sample although males, adults aged 18 to 34 yrs and London region were to an extent under-represented (Table 2).
Overall prevalence of swab-positivity
Out of 120,610 swab results, 159 were positive giving an overall unadjusted prevalence of 0.13% (0.11%,0.15%) for the period 1st May to 1st June 2020. This corresponds to an estimated 74,000 (63,000, 86,000) prevalent infections of SARS-CoV-2 virus in England on average which was unchanged when we adjusted for the age-sex distribution of the base sample.
Decline in prevalence and estimate of reproduction number
We obtained strong evidence that prevalence was decreasing during the study period with a halving time of 8.6 (6.2, 14) days, corresponding to a decay rate of 0.081 (0.051, 0.11) per day (Figure 2). We estimated the reproduction number R to be 0.57 (0.45, 0.72).
We used a regression framework to test that the rate of decay did not vary substantially over the time of our study: a linear univariate model for day of swab as a covariate for swab-positivity had the same AIC as a univariate model with a smooth term for day of swab, with the smooth term producing a straight line (not shown).
Trends with sex, ethnicity and age
Prevalence of swab positivity for males of 0.13% (0.11%,0.17%) was similar to that for females of 0.13% (0.11%, 0.16%) (Table 2). Asian participants (predominantly South Asian) were more likely to be swab-positive than white participants, with an unadjusted odds ratio of 1.9 (1.0, 3.5), although this reduced to 1.7 (0.86, 3.5) on adjustment (Table 1, Table 3). Adults aged 18 to 24 yrs had highest rates of swab positivity while those >64 yrs had lowest rates (Figure 3A, Table 4 in Supplement). A smooth term for age was better supported by the data than a categorical term in the adjusted model (Table 1, ΔAIC = 8) and its shape was largely unchanged when compared with a univariate model of age (also using a smooth term, Figure 3B). Unadjusted rates in children and young people aged 5 to 17 yrs were similar to those in adults aged 25 to 34, and remained so with a smooth term for age in the adjusted model.
The trend in age was not explained by key worker status (see below). The adjusted model including a smooth term for age and key worker status was preferred to the model with a smooth age term alone (ΔAIC = 37) but the shape of the trend was conserved (Figure 3B).
Contact with COVID-19 cases
Information was provided about possible contact with COVID-19 cases for 92,941 participants. Compared to a person who did not report contact with a COVID-19 case, those who reported contact with a confirmed case had odds of 24 (16, 38) of testing positive; odds were 5.3 (2.2, 12) for contact with a suspected case. These findings were essentially unchanged after adjustment for age and sex (Table 1).
Key workers
The prevalence of infection was highest among care home workers, 0.71% (0.24%, 2.06%), followed by health care workers 0.47% (0.30%, 0.75%) and other key workers 0.17% (0.12%, 0.25%) (Table 3). Compared with non-key workers, unadjusted odds of testing positive were 7.7 (1.8, 22) in care home workers, 5.2 (2.8, 9.2) in health care workers and 1.8 (1.1, 3.1) in other key workers. Odds were similar or larger after adjustment for covariates not including contact with a COVID-19 case (Table 1). However, care home workers and health care workers were much more likely to report a recent contact with a COVID-19 case (Table 5), which contributed to the excess risk in those groups (not shown).
Symptoms and swab-positivity
Of the 126 participants who tested positive with known symptom status in the week prior to their swab, 39 reported symptoms while 87 did not, giving an estimate that 69% (61%,76%) of people were symptom-free for the 7 days prior testing positive in our community sample. For those reporting symptoms in the last week (see Methods), the following were strongly associated with a positive swab result; nausea and/or vomiting, diarrhoea, blocked nose, loss of smell, loss of taste, headache, chills and severe fatigue (Table 6).
Spatial patterns
We detected differences in crude prevalence at the regional scale, highest in London at 0.20% (95% CI, 0.13%, 0.31%) and lowest in the South West at 0.06% (95% CI, 0.03%, 0.12%) (Figure 4A, Table 7 in Supplement). However higher risk in London was no longer evident after adjustment for age, sex, ethnicity and key worker status (Table 1).
For smaller geographical units, we did not find evidence of spatial clustering. The distribution of LTLAs with 0, 1, and 2 or more cases was consistent with equal prevalence in every LTLA (Figure 4B). Further, the distributions of pairwise distances between the home locations of people testing positive and those testing negative were similar (Figure 4C).
Sensitivity analysis
We estimated prevalence and decay rate for different Ct values for samples from the commercial laboratory (appendix supplementary methods). The overall unadjusted prevalence of swab positivity decreased from 0.13% to 0.11% (0.095%, 0.13%) for a Ct threshold of 35 and increased to 0.15% (0.13%, 0.18%) for a Ct threshold of 38. The estimated rate of decay of prevalence of 0.081 was similar at 0.083 (0.050, 0.12) per day for a Ct threshold of 35 and reduced to 0.073 (0.044, 0.10) for a Ct threshold of 38.
Discussion
In this large, nationally representative survey of SARS-CoV-2 infection in England, prevalence was approximately 1 in 1000 and decreasing at the end of the initial lockdown period in May 2020. These observations are in contrast with high model-estimated prevalence at the peak of the epidemic11, but are consistent with a recent household-based longitudinal community study12. Our results confirm the efficacy throughout lockdown of measures to contain the spread of SARS-CoV-2, with an estimate of R below 0.6 during May.
Although we saw no differences in viral prevalence by sex, we did find age differences with highest rates among young adults (ages 18-24 years) and lowest rates among older people (ages 65+). These patterns were not fully explained by adjustment for sex, key worker status, ethnicity or region, suggesting that differences in social contact behaviour across ages were important. Young adults appear to have maintained higher levels of social contact than other age groups during the lockdown period in England, while older age groups may have effectively shielded13. Children and young people aged 5 to 17 yrs had similar rates of infection to adults aged 25 to 44 yrs, indicating that children are similarly susceptible to being infected with SARS-CoV-214. The higher rates of infection in people of Asian (mainly South Asian) ethnicity suggest that some of the increased burden of disease in minority ethnic populations may be due to increased rates of infection15.
We found that key workers – and especially care home and health care workers – had markedly increased odds of infection, up to 7-8 fold higher than non-key workers. Thus, care homes and hospitals, even during the period of our study at the end of lockdown, remained an important source of infection. In addition, we found nearly 25-fold greater odds of infection for those individuals who reported coming into contact with a confirmed case of COVID-19, with much of the excess risk experienced by care home and health care workers being explained by their having reported contact with COVID-19 cases. Assuming that many of these contacts were work-related, these results suggest a continuing need for improved infection control in these settings.
We found that a high proportion of swab-positive participants did not report symptoms during the week prior to swabbing, nearly 70%. Current “test and trace” efforts in the UK are focused on care home, health care settings, and symptomatic individuals so will not be triggered by non-symptomatic people. On a given day, they comprise the bulk of infections in the community and an ongoing source of transmission. Therefore, social distancing needs to remain an important component of infection control measures.
We found apparent regional differences in rates, with the highest crude prevalence observed in London, which was a focus of transmission early in the UK epidemic.This may reflect its role as a major transport, business and tourism hub, similar to other global cities such as New York16. However, the higher rates in London appear to have been confounded somewhat by other factors including ethnicity and health care worker status. We found no evidence for geographical clustering at a sub-regional or local scale, suggesting that the epidemic in England at this stage was still showing generalized transmission and had not broken down into disconnected sub-epidemics.
Our study has limitations. Although we selected a representative sample of the population from the lists of patients registered with a general practitioner in England (covering almost the entire population) our response rate overall was 31% which may have affected our estimates of prevalence.
Our study involved the use of self-administered nose and throat swabs to obtain evidence of infection. A number of factors need to be considered when interpreting swab-positivity prevalence. First, viral RNA may have been present but not detected during the process of swabbing, transport and analysis. Estimates of diagnostic sensitivity are in the region of 70%17–19. Therefore, our estimate of prevalence of swab positivity suggests that up to 115,000 (95% CI 99,000, 134,000) people in England may have been infected withSARS-CoV-2 virus on any given day. Second, viral RNA may persist in the nose and mouth after viable virus is no longer present20 and therefore not be indicative of active infection.
Third, it is possible that self-swabbing efficacy may vary according to age and other demographic characteristics. Fourth, we had evidence to suggest that there was differential reporting of a positive result across laboratories with the commercial laboratory initially reporting a higher positivity rate than the PHE laboratories. However, we mitigated the potential impact of this difference through a calibration exercise resulting in similar swab-positivity rates between laboratories. Also, our key findings were robust to different thresholds of RT-PCR Ct-value suggesting that our results were not unduly influenced by Ct threshold.
In conclusion, we suggest nationally-representative population-based surveys of SARS-CoV-2 infection may greatly improve situational awareness. An important feature of the data presented here is that they are independent of service-oriented testing processes, the representativeness of which varies substantially over time and space. Repeated rounds of studies similar to that reported here will enable continued monitoring of key epidemic properties, including R estimates at regional and local levels, to guide locally-optimized interventions.
Data Availability
The original datasets generated or analysed, or both, during this study are not publicly available because of governance restrictions and the identifiable nature of the data.
Contributors
SR and PE conceptualised and designed the study and drafted the manuscript. SR, KECA, OE, BJ and CEW undertook the data analysis. PJD, DA and CAD provided statistical advice. GC, WB, HW, CA and AD provided study oversight. GT provided oversight of the laboratory calibration study. AD and PE obtained funding. SR, KECA, OE, BJ, CEW, CA, PJD, DA, CAD, GC, WB, HW, GT, AD and PE critically reviewed the manuscript. All authors read and approved the final version of the manuscript. PE is the guarantor for this paper. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted, had full access to all the data in the study, and had final responsibility for the decision to submit for publication.
Declaration of interests
We declare no competing interests.
Data sharing
The original datasets generated or analysed, or both, during this study are not publicly available because of governance restrictions and the identifiable nature of the data.
Funding
The study was funded by the Department of Health and Social Care in England. SR acknowledges support: MRC Centre for Global Infectious Disease Analysis, National Institute for Health Research (NIHR) Health Protection Research Unit (HPRU), Wellcome Trust (200861/Z/16/Z, 200187/Z/15/Z), and Centres for Disease Control and Prevention (US, U01CK0005-01-02). GC is supported by an NIHR Professorship. PE is Director of the MRC Centre for Environment and Health (MR/L01341X/1, MR/S019669/1). PE acknowledges support from the NIHR Imperial Biomedical Research Centre and the NIHR HPRUs in Environmental Exposures and Health and Chemical and Radiation Threats and Hazards, and the British Heart Foundation Centre for Research Excellence at Imperial College London (RE/18/4/34215).
Acknowledgements
We thank key collaborators on this work -- Ipsos MORI: Kelly Beaver, Sam Clemens, Gary Welch, Nick Gilby and Kelly Ward; Institute of Global Health Innovation at Imperial College: Gianluca Fontana, Dr Hutan Ashrafian, Sutha Satkunarajah and Lenny Naar; MRC Centre for Environment and Health, Imperial College London: Daniela Fecht; North West London Pathology and Public Health England for help in calibration of the laboratory analyses; NHS Digital for access to the NHS register; and the Office for Life Sciences and Department of Health and Social Care for logistic support. For comments on drafts of this report, we thank: Michelle Heys, Julia Gog, Adam Kucharski and Christophe Fraser.