SARS-CoV-2 lineage dynamics in England from January to March 2021 inferred from representative community samples

Oliver Eales; Andrew J. Page; Sonja N. Tang; Caroline E. Walters; Haowei Wang; David Haw; Alexander J. Trotter; Thanh Le Viet; Ebenezer Foster-Nyarko; Sophie Prosolek; Christina Atchison; Deborah Ashby; Graham Cooke; Wendy Barclay; Christl A. Donnelly; Justin O’Grady; Erik Volz; The COVID-19 Genomics UK (COG-UK) Consortium; Ara Darzi; Helen Ward; Paul Elliott; Steven Riley

doi:10.1101/2021.05.08.21256867

Abstract

Genomic surveillance for SARS-CoV-2 lineages informs our understanding of possible future changes in transmissibility and vaccine efficacy. However, small changes in the frequency of one lineage over another are often difficult to interpret because surveillance samples are obtained from a variety of sources. Here, we describe lineage dynamics and phylogenetic relationships using sequences obtained from a random community sample who provided a throat and nose swab for rt-PCR during the first three months of 2021 as part of the REal-time Assessment of Community Transmission-1 (REACT-1) study. Overall, diversity decreased during the first quarter of 2021, with the B.1.1.7 lineage (first identified in Kent) predominant, driven by a 0.3 unit higher reproduction number over the prior wild type. During January, positive samples were more likely B.1.1.7 in younger and middle-aged adults (aged 18 to 54) than in other age groups. Although individuals infected with the B.1.1.7 lineage were no more likely to report one or more classic COVID-19 symptoms compared to those infected with wild type, they were more likely to be antibody positive 6 weeks after infection. Viral load was higher in B.1.1.7 infection as measured by cycle threshold (Ct) values, but did not account for the increased rate of testing positive for antibodies. The presence of infections with non-imported B.1.351 lineage (first identified in South Africa) during January, but not during February or March, suggests initial establishment in the community followed by fade-out. However, this occurred during a period of stringent social distancing and targeted public health interventions and does not immediately imply similar lineages could not become established in the future. Sequence data from representative community surveys such as REACT-1 can augment routine genomic surveillance.

Introduction

Since the emergence of SARS-CoV-2 in late 2019 [1] there has been a continuous accumulation of mutations leading to a genetically diverse phylogeny [2]. Although most mutations are neutral, having no effect on the epidemiology of the virus, some have been found to affect transmissibility [3] and antigenicity [4], and have arisen on multiple occasions in independent lineages [5]. Lineages that are judged likely to have increased transmissibility or severity relative to current dominant lineages in the UK are termed ‘Variants of Concern’ (VOC) [6–8].

The B.1.1.7 lineage (Pango nomenclature [9], used for the rest of the paper) VOC was first detected in Kent, England 20th September 2020 [10,11]. Since its emergence, it has risen to become the dominant lineage in the United Kingdom (UK), and has increased in frequency in many other countries [12]. Previous studies have estimated that this lineage is more transmissible than previously dominant lineages, as measured by the reproduction number (R) [6,13].

The B.1.351 lineage was first detected in South Africa [7] in October 2020 and by March 2021 there had been 291 detections in the UK [14]. The lineage is associated with the E484K single nucleotide polymorphism (SNP) in the spike protein, which has been found to reduce the neutralizing activity of post-vaccination sera [15], triggering fears of lowered vaccine efficacy. This SNP has also been detected in a cluster of B.1.1.7 cases In England, predominantly in the South West [16]. Two further lineages, A.23.1 [17] and B.1.525, are currently described as ‘Variants under Investigation (VUI)’ [18] due to the presence of several SNPs of biological significance. Both of these lineages have been detected in low numbers in the United Kingdom [19,20]. A cluster of A.23.1 that exhibits the E484K SNP has been detected in Liverpool, England [16].

Extensive genomic surveillance has been undertaken in the UK by the COVID-19 Genomics UK Consortium (COG-UK) [21]. From its inception in March 2020 to the end of March 2021, COG-UK sequenced over 320,000 positive cases [22] representing 45% [23] of all uploaded sequences to GISAID, a global open-access database for coronavirus and influenza genomic data [24], with UK coverage of all detected samples varying from 2.5% [25] to 17.1% [26]. Samples included in COG-UK data are taken from several different sources: hospital cases, routine community surveillance, outbreak investigations and border screening.

The REal-time Assessment of Community Transmission study obtains throat and nose swabs from a random sample of the population in England (REACT-1) [27]. From the beginning of May 2020 to the end of March 2021, there were 10 rounds of REACT-1 with between 140,000 and 175,000 swab tests each round. Here we present the results of the genome sequencing performed on the positive swabs in early 2021, from round 8 (6 January - 22 January), round 9 (4 February - 23 February) and round 10 (11 March −30 March).

Results

Lineage diversity

In round 8 we were able to reliably determine lineages for 1,088 out of 2,282 positive samples, of which 83% (80%, 85%, n=898) were the B.1.1.7 lineage, 0.37% (0.14%, 0.94%, n=4) were the B.1.351 lineage, 0.18% (0.05%, 0.67%, n = 2) were the A.23.1 lineage, 0.09% (0.02%, 0.52%, n=1) were the B.1.525 lineage, and 0.18% (0.05%, 0.67%, n=2) were B.1.1.7 lineage with the E484K SNP (B.1.1.7+E484K, first identified in a cluster of cases in Bristol, UK) (Supplementary Table 1). The remaining 17% (15%, 19%, n=181) of lineages were classified as wildtype and comprised 35 distinct lineages, the main constituent of which was B.1.177 (n=105). In round 9, 236 lineages out of 689 positives were determined, of which 96% (92%, 98%, n=226) were the B.1.1.7 lineage and the remaining 4.2% (2.3%, 7.6%, n=10) were classified as wildtype. In round 10, all 73 lineages determined from 227 positive samples were B.1.1.7. Despite the reduced number of samples in round 10, the data across all three rounds securely describe a decrease in diversity (P < 0.001 for reduction in proportion not B.1.1.7).

Quantifying transmissibility of B.1.1.7

Fitting a logistic regression model to whether a sample was identified as B.1.1.7 or not allowed us to estimate a difference in daily growth rate between B.1.1.7 and all other lineages of 0.049 (0.034, 0.067) (Figure 1). This corresponds to an additive R advantage of 0.31 (0.21, 0.42), (assuming a mean generation time of 6.29 days, see Methods) and suggests a smaller difference than estimates based on sequences collected during November and December 2020 in england (between 0.4 and 0.7) [28]. Our lower estimate of the difference in R at a later time is consistent with a decreasing selection coefficient reported in the earlier study.

Figure 1.

B.1.1.7 lineage in England from January to March 2021. (A) Proportion of B.1.1.7 lineage over time. Points show raw data with error bars representing the 95% confidence interval. Shaded region shows best fit Bayesian logistic regression model with 95% credible interval. (B) Odds ratio of a determined lineage being B.1.1.7 by age group for logistic models including just age group (purple) and both age group and region (green) fit to data from round 8 only. (C) Proportion of positive tests that are from the B.1.1.7 lineage by region of England. Error bars show the 95% confidence intervals. (D) Proportion of positive tests that are from the B.1.1.7 lineage by age group. Error bars show the 95% confidence intervals.

Proportions of B.1.1.7 showed marked spatial heterogeneity in January (Round 8), with regions in the Midlands and the North of England showing lower proportions of B.1.1.7 compared to regions in the South (Figure 1, Supplementary Table 2). Sub-regional analysis showed a similar trend with a smoothed term regression model (see Methods) showing lower proportions of B.1.1.7 in areas of the Midlands, Yorkshire and The Humber, and the North West (Figure 2). This pattern is consistent with prior observations of B.1.1.7 lineage emerging in the South East [11] leading to earlier seeding events in the South of England. By February (round 9), spatial heterogeneity was substantially reduced with B.1.1.7 accounting for over 80% of lineages in all regions.

Figure 2.

Geospatial patterns of lineage frequency. (A) Location of all positive samples for which we have identified their lineage for each round (each point moved randomly a small distance). (B) Modelled proportion of B.1.1.7 lineage across space for round 8, round 9 and round 10. Regions: NE =North East, NW = North West, YH =Yorkshire and The Humber, EM = East Midlands, WM =West Midlands, EE = East ofEngland, L = London, SE = South East, SW = South West.

During round 8 there were higher proportions of B.1.1.7 in 18-to 54-year olds compared with other age groups (Figure 1). Using a logistic regression model (see Methods), this pattern was not explained by regional confounding (Figure 1). In contrast, for round 9, albeit based on fewer positive samples, the proportion of B.1.1.7 was similar in all age groups. Case data from November and December 2020 show a higher proportion of B.1.1.7 in school aged children than in other age groups [6], however schools were open for face-to-face teaching for all children during this time unlike during our study when school attendance was greatly limited [29,30].

Rates of symptom reporting

The percentage of people infected with B.1.1.7 reporting no symptoms (Figure 3, Supplementary Table 3) in the week prior to providing a swab was 33.3% (30.4%, 36.3%), compared with 38.2% (30.8%, 46.1%) for wildtype (P=0.24). Looking at the percentage of people reporting COVID-19 like symptoms (loss or change of sense of taste, loss or change of sense of smell, new persistent cough, fever) in the last week we found similar percentages exhibiting these symptoms between lineages with 45.7% (42.6%, 48.8%) for B.1.1.7, compared with 45.4% (37.7%, 53.3%) for the wildtype (P=0.94). Our results describe only the lower part of the severity pyramid -- the fraction of those infected who develop symptoms -- and contrast with a previous study using clinical cases as the denominator which found that the B.1.1.7 lineage caused more severe illness with increased relative mortality [31]. However, other previous work found no difference in symptomatology for B.1.1.7 against other circulating lineages [32]. We also note that participants in REACT-1 were not followed up, other than a small subset in round 8. Therefore, some participants will have developed symptoms after filling out the questionnaire.

Figure 3.

Symptoms and antibody positivity. (A) Proportion of those infected testing positive for antibodies 6 weeks after swab test (for all samples and for those that had both N- and E-gene detected), displaying any symptoms in the week prior to their swab test, displaying classic COVID-19 symptoms (loss of sense of taste, loss of sense of smell, new persistent cough, fever) in the week prior to their test and displaying no symptoms. (B) Odds ratios of the covariates of multiple logistic regression models. Each model had the result of the LFIA antibody test as the outcome variable with different combinations of lineage, N-gene Ct, E-gene Ct and Age as the covariates. OR displayed for B.1.1.7 is relative to wildtype. OR displayed for N- and E-gene Ct is relative to a change in Ct of +5. OR displayed for age is relative to a change of +10 years in age.

Differences in cycle threshold values

Quantitative PCR Ct values for N and E gene targets were lower for the B.1.1.7 lineage relative to the wildtype (Table 1, Supplementary Figure 2). Mean N gene Ct value was 1.33 (0.60, 2.06) lower (P<0.001) and mean E gene Ct value was 0.90 (0.14, 1.67) lower (P=0.020). These values are indicative of a higher viral load in those infected with B.1.1.7 with a decrease in Ct of 1 corresponding to an approximate twofold increase in viral load [33]. This result matches earlier work that suggested the B.1.1.7 lineage has higher viral loads than other circulating lineages [34]. However, it is possible that both analyses have been influenced by the high growth rate of B.1.1.7 during its emergence relative to other lineages which could lead to a difference in the observed distribution generated by differences in the average time since infection [35]. Also, given that our sample workflow ensures that lower Ct values are more likely to receive a lineage designation, a lower intrinsic Ct value for B.1.1.7 could have led to an overestimated proportion of B.1.1.7 in the community.

View this table:

Table 1.

Results of Gaussian regression with either E-gene or N-gene Ct value as the observation and lineage as the explanatory variable.

Differences in antibody positivity

Antibody positivity six weeks after the initial swab test, assessed using a lateral flow immunoassay [36], was higher in those infected with B.1.1.7 lineage relative to those infected with the wildtype lineage (Figure 3, Supplementary Table 4). Antibody positivity was 83.4% (79.7%, 86.5%) in those previously infected with B.1.1.7 and 72.8% (63.0%, 80.9%) in those previously infected with wildtype (P=0.018). This difference was not explained by patterns in N-gene Ct value, E-gene Ct value, or age (Figure 3). For example, the odds of B.1.1.7-positive participants being sero-positive were 1.84 (1.08, 3.13) higher than those who were wildtype-positive, using multivariable logistic regression that included N-gene Ct value and age as covariates. (Model 6, Supplementary Table 5). These differences in the likelihood of testing positive for antibodies may be caused by the immune response itself or by other unmeasured confounders such as the time from infection to swabbing.

Variants of concern and variants under investigation

Though only a small number of B.1.351, B.1.525, A.23.1 and B.1.1.7+E484K were detected in January, because of the random sampling strategy used for REACT-1, we can estimate their prevalence with well-quantified uncertainty (Supplementary Table 6). The single detection of any lineage in round 8 corresponded to an estimated 812 (136, 4847) swab positive infections in England at any one time (Supplementary Table 6), suggesting that these lineages were already established in the community during January 2021. Additionally, none of the individuals infected with these lineages who answered the question reported that they had been abroad in the previous two weeks (Supplementary Table 7). However, not all of the participants who tested positive for a VOC or VUI answered the question about recent travel (one did not for B.1.525, one did not for A.23.1). Also, the sequences of some of the REACT-1 samples grouped very closely with other English isolates when compared to a representative global subsample of the lineage (e.g. ARCH-000045H2 for B.1.351 in Figure 4 and all isolates in Figure 5), further suggesting that significant local transmission was occurring at that time.

Figure 4.

Phylogenetic tree showing relation of B.1.351 lineages detected in REACT-1 to other B.1.351 sequences in the COG-UK database. Sequences are coloured by the location in which the sequence was isolated. REACT lineages are coloured red and have an ID beginning with the sequence “ARCH-”-next to them. (A) Shows the subgroup of the entire constructed tree that contains all REACT sequences, re-rooted to the COG-UK sequence SouthAfrica/KRISP-K006830/2020. (B-D) show a zoomed-in view of the subtrees shown by the 3 shaded regions. Note that adjacent sequences ARCH-000047A3 and ARCH-000052A7 are multiple readings from the same individual.

Figure 5.

Phylogenetic tree showing relation of A.23.1 lineages detected in REACT-1 to other A.23.1 sequences in the COG-UK database. REACT-1 sequences are coloured in red and have an ID beginning with the sequence “ARCH-” next to them. All other sequences are coloured by the location in which the sequence was isolated.

Geographically, both samples of A.23.1 were detected in the North West (Figure 2, Supplementary Table 2). Given this lineage was first detected in England in the North West [16], these positive samples from community surveillance suggest that the lineage continued to circulate locally with limited spread to other regions of England. In contrast, B.1.1.7+E484K was detected in London and the North West, despite originally being detected in the South West [16], indicating either transmission out of the South West or that the E484K mutation arose independently within the B.1.1.7 lineage. The four B.1.351 samples were detected in London (1), South East (1), and East of England (2), and the single B.1.525 was also detected in London (Figure 2), suggesting that the capital and surrounding region plays an important role in the importation of lineages (see below Self-reported history of recent travel).

None of these VOCs or VOIs were detected in rounds 9 and 10 suggesting a decrease in their relative proportions. Fitting a logistic regression model to whether a sample was a specific VOC, VUI, or B.1.1.7, there was no evidence for a difference in transmissibility between A.23.1, B.1.525, B.1.1.7+E484K, and B.1.1.7. However, for B.1.351, which had the most samples available (n=4), the growth rate was estimated to be 0.110 (0.339, 0.002) less than B.1.1.7 (P=0.02) (Supplementary Table 8). The converse is seen in publicly available sequence data form community testing [37] (Supplementary Figure 3), which, for the same period of time, found that the frequency of B.1.351 grew faster than B.1.1.7 (Supplementary Table 8, Supplementary Figure 4). This difference likely reflects biases in public data, such as increased testing of international travellers and surge testing [38] in areas where variants are detected. The relative lineage dynamics do not seem to be consistent across space and time. Compared to patterns in England, higher proportions of B.1.351 relative to B.1.1.7 have been seen in some regions of Europe [12,14,39] and in Africa [7].

Self-reported history of recent travel

Spatial patterns of observed VOCs and VOIs may be driven partly by geographical variation in the frequency with which people travel abroad. The overall proportion of participants reporting travel abroad in the past two weeks (Figure 6, Supplementary Table 9) in round 5 (September) was 1.63% (1.56%, 1.69%), but declined to 0.49% (0.46%, 0.53%) in round 8 (January), 0.11% (0.09%, 0.13%) in round 9 (February), and 0.10% (0.08%, 0.12%) in round 10 (March). London had the highest proportion and the South East the second highest for all rounds (Figure 6). We estimated that over 55% of the people returning from abroad to England during rounds 8, 9 and 10 were in London and the South East (Supplementary Table 9). Sub-regionally (Figure 6 and Supplementary Figure 5) we see that during round 5 (September) there was little spatial heterogeneity in the proportion of people who had been abroad two weeks prior, with similar proportions all across England. In contrast, during rounds 8, 9 and 10 there were high levels of heterogeneity with relatively higher proportions of travel among those living in central London and areas of Kent.

Figure 6.

Patterns of frequency of returning from abroad in the prior two weeks. (A) Proportion of participants who answered they had been abroad in the previous two weeks by lower tier local authority. Regions: NE =North East, NW = North West, YH =Yorkshire and The Humber, EM = East Midlands, WM =West Midlands, EE = East ofEngland, L = London, SE = South East, SW = South West. (B) Proportion of individuals who answered that they had been abroad in the previous two weeks by region and round. Dates: Round 5 =18 September - 5 October 2020, Round 6 =16 October - 2 November 2020, Round 7 = 13 November - 3 December 2020, Round 8 = 6 January - 22 January 2021, Round 9 = 4 February - 23 February 2021, Round 10 = 11 March - 30 March 2021.

Conclusion

We describe lineage dynamics for SARS-CoV-2 in England for the period January to March 2021, based on representative community samples. From January to March 2021, B.1.1.7 lineage continued to dominate the pandemic in England, further increasing in proportion over this time, spreading rapidly northwards and westwards. This may be explained by increased transmissibility over previously circulating lineages accompanied by a higher viral load.

Despite an increase in mortality observed for B.1.1.7 in other studies we find no evidence of a difference in the rate that infected individuals report the four classic COVID-19 symptoms (one or more of loss or change of sense of taste, loss or change of sense of smell, new persistent cough, fever) between B.1.1.7 and other circulating lineages.

Our results suggest other lineages were circulating in the community at lower levels at the beginning of the study period which coincided with the start of lockdown 3 in England. These were then out-competed by B.1.1.7 during the course of the study. In January, small numbers of B.1.351, B.1.525, A.23.1 and B.1.1.7+E484K were detected; although the numbers were small, due to the size and sampling method of the study, they likely indicate a substantial level of community transmission. In the later rounds in February and March none of these lineages were detected — the decline in proportion of B.1.351 relative to B.1.1.7 indicates that B.1.351 was, on average, less transmissible than B.1.1.7 over this period. The difference may have been partly a result of targeted public health interventions and reductions in foreign travel.

Our study has limitations. During round 8 a subsample of participants testing positive also undertook two additional swab tests. The sequencing results from these additional swabs indicates possible misallocation of samples to participants. Though not all of these samples contained sufficient viral copies for successful sequencing (high Ct) or not enough physical volume in the sample, we were able to sequence multiple swab tests for some participants. These extra sequences, when a lineage was determined, allowed augmentation of the data for round 8 for some of the participants whose first test was unable to be sequenced or have a lineage designated. Thirteen of the 175 participants who had multiple tests had discordant lineage designations (Supplementary Table 10). Four of these divergent lineage designations were not incongruent, for example B.1.351 and B.1, likely reflecting a lower quality second sequence. For these four cases, the more advanced lineage was selected (B.1.351 over B.1). The remaining nine lineages could not be determined definitively, and so have been removed from the main analysis. This points to some potential sample mix-ups caused by manual cherry picking in the diagnostic pipeline; however, these errors are only likely to affect the most prevalent lineages, specifically B.1.1.7 and B.1.177.

The success of B.1.1.7 against the prior wild type and more recent variants during the first three months of 2021 in England during lockdown does not necessarily mean that future imported variants such as B.1.617.2 (first identified in India) will not succeed [40]. The immune landscape [41] against SARS-CoV-2 in the UK is changing rapidly because of high uptake of effective vaccines across a broad range of age groups. On the other hand, as stringent interventions during lockdown are eased, the potential for person-to-person contact and overall average transmissibility of SARS-CoV-2 in England will increase.

Methods

Methods for the REACT-1 study have been described previously [27]. Since May 2020, there have been 10 rounds of data collection approximately every month with between 140,000 and 175,000 swab tests completed over a 2-3 week period by a random subset of the population of England aged 5 and over. From round 8 onwards, all positive tests with a low N-gene Ct value (less than 34 used initially, but criteria changed midway through round 9 to less than 32 due to high rate of sequencing failure in those with N-gene Ct>32 [approx 88%]) and a high enough volume were sent for genome sequencing (Public Health England Research Ethics Governance Group (reference: R&D NR0195)). Extracted RNA was amplified using the ARTIC protocol [43] with sequencing libraries prepared using CoronaHiT [44], and sequenced on the Illumina NextSeq 500 platform. Each set of 96 samples included one positive and one negative control. Raw sequencing data were analysed using the ARTIC bioinformatic pipeline [45] and uploaded to CLIMB [46] for further analysis.

From the genome sequences lineages are assigned using a machine learning based assignment algorithm, PangoLEARN [47] (database version 2021-04-28) with lineage assignment following the Pangolin nomenclature [9]. Not all obtained sequences were of a high enough quality for a lineage to be determined and so are not included in the analysis. Further, samples in which less than 50% of bases were covered were excluded from the analysis. A diagram showing how many positive samples were sequenced and how many had a lineage determined is shown in Supplementary Figure 1.

Following lineage allocation by the algorithm, each sequence was then also investigated individually, particularly for the presence of lineage defining SNPs. This allowed for lineages that were too low quality to call by PangoLEARN to be manually assigned. This occurred twice, once for a B.1 lineage call that exhibited six of 14 B.1.351 lineage defining mutations (adjusted to B.1.351), and once for a B.1 lineage call that exhibited 11 out of 16 B.1.525 lineage defining mutations (adjusted to B.1.525). In depth analysis of low coverage regions of both samples which fell below the normal minimum threshold of 10X coverage showed that all lineage defining mutations were present in at least one sequence read, further supporting these probable lineage designations. The thresholds for ‘probable’ lineage designations are defined by Public Health England [48]. B.1.1.7 lineages that also had the E484K SNP present were designated to a separate lineage (B.1.1.7+E484K).

Phylogenetic trees were constructed in order to investigate how detected VOCs and VUIs fit into the wider epidemic context. A subsample of sequences for each variant was selected from a curated database of COG-UK up to 12 April 2021. A representative subsample for each lineage was selected using Civet [49] with a collapse threshold of 2. Two hundred and twelve of 1583 B.1.351 sequences and 21 of 60 A.23.1 sequences were selected. For each variant the maximum-likelihood phylogenetic tree was constructed using a HKY model implemented using IQ-TREE [50].

95% confidence intervals in lineage proportions were calculated assuming a binomial distribution using the Wilson method [51], which is preferred when the number of positive outcomes is small [52]. Differences in multinomial proportions between rounds were assessed using a multinomial goodness-of-fit test implemented using the XNomial package in R [53].

Potential confounding effects of region and age group on B.1.1.7 proportion during round 8 were investigated using logistic regression with a binomial likelihood and logit link function. Lineage assignment of B.1.1.7 versus any other was the binary outcome variable and both region and age group were included as covariates. Similar analysis was not attempted on round 9 and round 10 due to the small number of non-B.1.1.7 lineages.

Estimates of the average true number of swab positive cases by lineage at any one time during rounds 8, 9 and 10 were calculated by multiplying the estimates of weighted prevalence for rounds 8, 9 and 10 [54], the proportion of each lineage for rounds 8, 9 and 10 (Supplementary Table 1 and Supplementary Table 2), and the population size of England and each region [55]

Relative differences in growth rates between two lineages were estimated by fitting a Bayesian logistic regression model to the binary lineage outcome. This was converted into an additive difference (□) in R through the equation □□ = □□ × □, with the assumption □ = 1 + □ × □ [56] where r is the growth rate of a lineage and □ is the mean generation time, assumed to be 6.29 days [57] for both lineages.

Smoothed spatial estimates of the relative proportion of two lineages were estimated using a Bayesian generalised-linear mixed-effects model implemented in the R package glmmfields [58]. We included 25 knots to describe the spatial processes and random spatial effects were assumed to follow a multivariate-t distribution. Priors of the model were chosen to be uninformative.

A subsample of positive participants in round 8 underwent a lateral flow immunoassay [36] approximately 6 weeks after their initial swab test. Differences in raw antibody positivity by lineages were assessed using logistic regression, with a binomial likelihood using a logit link function, and the result of the antibody test (positive/negative) as the binary outcome variable. Regression was performed using the subset of the data in which both the N-gene and E-gene had been detected. Further regression models were performed including different combinations of age, N-gene Ct value and E-gene Ct value as additional covariates. Further exploratory analyses were conducted with models including interaction terms between different combinations of variables and smoothed terms for some variables (not reported).

Data Availability

Assembled/consensus genomes are available from GISAID subject to minimum quality control criteria. Raw reads are available from European Nucleotide Archive (ENA). All genomes, phylogenetic trees, and basic metadata are available from the COG-UK consortium website (https://www.cogconsortium.uk). For confidentiality reasons, extended metadata are not publicly available, however some may be available upon request. Supplementary tables are available in the spreadsheet (link provided below)

https://docs.google.com/spreadsheets/d/1Fk-IDQFc6eG4lV2j8_dnYHk8zuG3Fb-Pfwitnxx82mk/edit?usp=sharing

Funding

The study was funded by the Department of Health and Social Care in England. Sequencing was provided through funding from COG-UK.

Competing interests

The authors have declared no competing interest.

Ethics

The COG-UK study protocol was approved by the Public Health England Research Ethics Governance Group (reference: R&D NR0195). This study was conducted as part of surveillance for COVID-19 infections under the auspices of Section 251 of the NHS Act 2006 and/or Regulation 3 of The Health Service (Control of Patient Information) Regulations 2002. The REACT-1 study received research ethics approval from the South Central-Berkshire BResearch Ethics Committee (IRAS ID: 283787).

Data availability

Materials & Correspondence

Correspondence and requests for materials should be addressed to Steven Riley and Paul Elliott, s.riley{at}imperial.ac.uk, p.elliott{at}imperial.ac.uk, School of Public Health, Imperial College London, Norfolk Place, London, W2 1PG

Additional information

Titles for Supplementary Tables 1 to 10 are included below with the tables themselves in this spreadsheet. Supplementary Figures 1 to 5 are included below. Full list of COG-UK author’s names and affiliations are available in this spreadsheet.

Supplementary Tables and Figures

Supplementary Table 1. Lineages detected in round 8, round 9 and round 10 of REACT-1.

Supplementary Table 1 is available in this spreadsheet

Supplementary Table 2. Regional distribution of positives for lineages in England for round 8, 9 and 10.

Supplementary Table 2 is available in this spreadsheet

Supplementary Table 3. Symptom status by lineage

Supplementary Table 3 is available in this spreadsheet

Supplementary Table 4. Antibody positivity 6 weeks after a positive swab test by lineage type.

Supplementary Table 4 is available in this spreadsheet

Supplementary Table 5. Multivariate logistic regression models to determine the effect of lineage on antibody positivity.

Supplementary Table 5 is available in this spreadsheet

Supplementary Table 6. Estimates of the average true number of swab positive cases by lineage at any one time during round 8, 9 and 10 in England and in each region of England.

Supplementary Table 6 is available in this spreadsheet

Supplementary Table 7. Number of participants self-reporting being abroad in the two weeks prior to taking their swab test by round and lineage type.

Supplementary Table 7 is available in this spreadsheet

Supplementary Table 8. Implied difference in growth rates of detected VOCs and VUIs. Supplementary Table 8 is available in this spreadsheet

Supplementary Table 9. Proportion of participants that reported being abroad two weeks prior to their swab test by region and by round, estimated average number of people who had been abroad in the previous two weeks by region and round, and the overall percentage of people returning from abroad that were in each region of England.

Supplementary Table 9 is available in this spreadsheet

Supplementary Table 10. Description of the participants who had discordant lineage designations and the overall lineage designation made.

Supplementary Table 10 is available in this spreadsheet

Supplementary Figure 1.

Diagram showing the number of positive samples, the number for which sequencing was attempted, and the number of lineages that were determined and therefore used in the analysis (Green) for rounds 8, 9 and 10 of the study.

Supplementary Figure 2.

Distribution of Ct values by lineage for (A) E-gene and (B) N-gene. Dotted line shows the N-gene Ct cutoff value of 37 that was used to define positivity.

Supplementary Figure 3.

Central 7 day rolling average of the frequency of B.1.1.7 (Black, right y-axis), B.1.351 (Green, left y-axis), B.1.525 (Red, left y-axis), A.23.1 (Purple, left y-axis) and B.1.1.7+E484K (Blue, left y-axis) in publicly available data.

Supplementary Figure 4.

Plot of the proportion of (A) B.1.1.7+E484K (B) B.1.351 (C) B.1.525 and (D) A.23.1 relative to B.1.1.7 lineage in publicly available data over the same time span as the REACT-1 rounds 8 to 10 lineage data. Error bars show the 95% confidence interval for each daily proportion calculation. Shaded region shows best fit Bayesian logistic regression model with 95% credible interval.

Supplementary Figure 5.

Proportion of participants who answered they had been abroad in the previous two weeks by lower tier local authority zoomed into the London region. Dates: Round 5 =18 September - 5 October 2020, Round 6 =16 October - 2 November 2020, Round 7 = 13 November - 3 December 2020, Round 8 = 6 January - 22 January 2021, Round 9 = 4 February - 23 February 2021, Round 10 = 11 March - 30 March 2021.

Acknowledgements

SR, CAD acknowledge support: Medical Research Council (MRC) Centre for Global Infectious Disease Analysis, National Institute for Health Research (NIHR) Health Protection Research Unit (HPRU), Wellcome Trust (200861/Z/16/Z, 200187/Z/15/Z), and Centres for Disease Control and Prevention (US, U01CK0005-01-02). GC is supported by an NIHR Professorship. HW acknowledges support from an NIHR Senior Investigator Award and the Wellcome Trust (205456/Z/16/Z). PE is Director of the MRC Centre for Environment and Health (MR/L01341X/1, MR/S019669/1). PE acknowledges support from Health Data Research UK (HDR UK); the NIHR Imperial Biomedical Research Centre; NIHR HPRUs in Chemical and Radiation Threats and Hazards, and Environmental Exposures and Health; the British Heart Foundation Centre for Research Excellence at Imperial College London (RE/18/4/34215); and the UK Dementia Research Institute at Imperial (MC_PC_17114). We thank The Huo Family Foundation for their support of our work on COVID-19. AJP, JOG gratefully acknowledge the support of the Biotechnology and Biological Sciences Research Council (BBSRC); their research was funded by the BBSRC Institute Strategic Programme Microbes in the Food Chain BB/R012504/1 and its constituent project BBS/E/F/000PR10352. We thank members of the COVID-19 Genomics UK (COG-UK) Consortium UK for their contributions to generating the genomic data used in this study. COG-UK is supported by funding from the MRC (part of UK Research & Innovation ([UKRI]), the NIHR and Genome Research Limited, operating as the Wellcome Sanger Institute.

We thank key collaborators on this work – Ipsos MORI: Kelly Beaver, Sam Clemens, GaryWelch, Nicholas Gilby, Kelly Ward and Kevin Pickering; Institute of Global Health Innovationat Imperial College: Gianluca Fontana, Didi Thompson and Lenny Naar; Molecular Diagnostic Unit, Imperial College London: Prof. Graham Taylor; Patient Experience Research Centre at Imperial College and the REACT Public Advisory Panel; NHS Digital for access to the NHS register; and the Department of Health and Social Care for logistic support.

Footnotes

↵# Full list of consortium names and affiliations is provided as a supporting document

References

1.↵
Wu Z, McGoogan JM. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72 314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323:1239–1242.
OpenUrl CrossRef PubMed
2.↵
Dimonte S, Babakir-Mina M, Hama-Soor T, Ali S. Genetic Variation and Evolution of the 2019 Novel Coronavirus. Public Health Genomics. 2021;24:54–66.
OpenUrl
3.↵
Zhao S, Lou J, Cao L, Zheng H, Chong MKC, Chen Z, et al. Quantifying the transmission advantage associated with N501Y substitution of SARS-CoV-2 in the United Kingdom: An early data-driven analysis. J Travel Med. 2021. doi:10.1093/jtm/taab011
OpenUrl CrossRef
4.↵
Wang P, Liu L, Iketani S, Luo Y, Guo Y, Wang M, et al. Increased Resistance of SARS-CoV-2 Variants B.1.351 and B.1.1.7 to Antibody Neutralization. bioRxiv. 2021. doi:10.1101/2021.01.25.428137
OpenUrl Abstract/FREE Full Text
5.↵
van Dorp L, Acman M, Richard D, Shaw LP, Ford CE, Ormond L, et al. Emergence of genomic diversity and recurrent mutations in SARS-CoV-2. Infect Genet Evol. 2020;83:104351.
OpenUrl CrossRef PubMed
6.↵
Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al. Transmission of SARS-CoV-2 Lineage B. 1.1. 7 in England: Insights from linking epidemiological and genetic data. medRxiv. 2021; 2020–2012.
7.↵
Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. bioRxiv. medRxiv; 2020. doi:10.1101/2020.12.21.20248640
OpenUrl Abstract/FREE Full Text
8.↵
Sabino EC, Buss LF, Carvalho MPS, Prete CA Jr., Crispim MAE, Fraiji NA, et al. Resurgence of COVID-19 in Manaus, Brazil, despite high seroprevalence. Lancet. 2021;397:452–455.
OpenUrl
9.↵
Rambaut A, Holmes EC, O’Toole Á, Hill V, McCrone JT, Ruis C, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020;5:1403–1407.
OpenUrl
10.↵
Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. 18 Dec 2020 [cited 7 May 2021]. Available: https://virological.org/t/563/
11.↵
Public Health England. PHE investigating a novel variant of COVID-19. In: GOV.UK [Internet]. 14 Dec 2020 [cited 5 Mar 2021]. Available: https://www.gov.uk/government/news/phe-investigating-a-novel-variant-of-covid-19
12.↵
Lineage B.1.1.7. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.1.7.html
13.↵
Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday J, et al. Estimated transmissibility and severity of novel SARS-CoV-2 Variant of Concern 202012/01 in England. MedRxiv. 2021; 2020–2012.
14.↵
Lineage B.1.351. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.351.html
15.↵
Jangra S, Ye C, Rathnasinghe R, Stadlbauer D, Krammer F, Simon V, et al. The E484K mutation in the SARS-CoV-2 spike protein reduces but does not abolish neutralizing activity of human convalescent and post-vaccination sera. medRxiv. 2021. doi:10.1101/2021.01.26.21250543
OpenUrl Abstract/FREE Full Text
16.↵
Public Health England. PHE statement on Variant of Concern and new Variant Under Investigation. In: GOV.UK [Internet]. 10 Feb 2021 [cited 5 Mar 2021]. Available: https://www.gov.uk/government/news/phe-statement-on-variant-of-concern-and-new-variant-under-investigation
17.↵
Bugembe DL, Phan MVT, Ssewanyana I, Semanda P, Nansumba H, Dhaala B, et al. A SARS-CoV-2 lineage A variant (A.23.1) with altered spike has emerged and is dominating the current Uganda epidemic. medRxiv; 2021. doi:10.1101/2021.02.08.21251393
OpenUrl Abstract/FREE Full Text
18.↵
Variants: distribution of cases data. [cited 7 May 2021]. Available: https://www.gov.uk/government/publications/covid-19-variants-genomically-confirmed-case-numbers/variants-distribution-of-cases-data
19.↵
Lineage A.23.1. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_A.23.1.html
20.↵
Lineage B.1.525. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.525.html
21.↵
COVID-19 Genomics UK (COG-UK) consortiumcontact@cogconsortium.uk. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe. 2020;1:e99–e100.
OpenUrl
22.↵
COVID-19 Genomics UK Consortium. 11 Jan 2021 [cited 5 Mar 2021]. Available: https://www.cogconsortium.uk/
23.↵
Cyranoski D. Alarming COVID variants show vital role of genomic surveillance. Nature. 2021;589:337–338.
OpenUrl CrossRef PubMed
24.↵
Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. 2017;1:33–46.
OpenUrl CrossRef PubMed
25.↵
Summary report: COG-UK geographic coverage of SARS-CoV-2 sample sequencing. Available: https://www.cogconsortium.uk/wp-content/uploads/2021/02/COG-UK-geo-coverage_2021-01-18_summary.pdf
26.↵
Summary report: COG-UK geographic coverage of SARS-CoV-2 sample sequencing. Available: https://www.cogconsortium.uk/wp-content/uploads/2021/03/COG-UK-geo-coverage_2021-02-22_summary.pdf
27.↵
Riley S, Atchison C, Ashby D, Donnelly CA, Barclay W, Cooke G, et al. REal-time Assessment of Community Transmission (REACT) of SARS-CoV-2 virus: Study protocol. Wellcome Open Research. 2020;5:200.
OpenUrl
28.↵
Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature. 2021. doi:10.1038/s41586-021-03470-x
OpenUrl CrossRef PubMed
29.↵
COVID-19 Response - Spring 2021 (Summary). [cited 24 Apr 2021]. Available: https://www.gov.uk/government/publications/covid-19-response-spring-2021/covid-19-response-spring-2021-summary
30.↵
Department for Education. New guidance for schools, colleges and early years. In: GOV.UK [Internet]. 4 Nov 2020 [cited 24 Apr 2021]. Available: https://www.gov.uk/government/news/new-guidance-for-schools-colleges-and-early-years
31.↵
Davies NG, Jarvis CI, CMMID COVID-19 Working Group, Edmunds WJ, Jewell NP, Diaz-Ordaz K, et al. Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature. 2021. doi:10.1038/s41586-021-03426-1
OpenUrl CrossRef PubMed
32.↵
Graham MS, Sudre CH, May A, Antonelli M, Murray B, Varsavsky T, et al. The effect of SARS-CoV-2 variant B.1.1.7 on symptomatology, re-infection and transmissibility. bioRxiv. medRxiv; 2021. doi:10.1101/2021.01.28.21250680
OpenUrl Abstract/FREE Full Text
33.↵
Yelin I, Aharony N, Tamar ES, Argoetti A, Messer E, Berenbaum D, et al. Evaluation of COVID-19 RT-qPCR Test in Multi sample Pools. Clin Infect Dis. 2020;71:2073–2078.
OpenUrl
34.↵
Golubchik T, Lythgoe KA, Hall M, Ferretti L, Fryer HR, MacIntyre-Cockett G, et al. Early analysis of a potential link between viral load and the N501Y mutation in the SARS-COV-2 spike protein. bioRxiv. medRxiv; 2021. doi:10.1101/2021.01.12.20249080
OpenUrl Abstract/FREE Full Text
35.↵
Hay JA, Kennedy-Shaffer L, Kanjilal S, Lennon NJ, Gabriel SB, Lipsitch M, et al. Estimating epidemiologic dynamics from cross-sectional viral load distributions. medRxiv. 2021. doi:10.1101/2020.10.08.20204222
OpenUrl Abstract/FREE Full Text
36.↵
Moshe M, Daunt A, Flower B, Simmons B, Brown JC, Frise R, et al. SARS-CoV-2 lateral flow assays for possible use in national covid-19 seroprevalence surveys (React 2): diagnostic accuracy study. BMJ. 2021;372:423.
37.↵
COG-UK Mutation Explorer. [cited 24 Mar 2021]. Available: http://sars2.cvr.gla.ac.uk/cog-uk/
38.↵
Surge testing for new coronavirus (COVID-19) variants. [cited 5 Mar 2021]. Available: https://www.gov.uk/guidance/surge-testing-for-new-coronavirus-covid-19-variants
39.↵
V-S-C, Update 14th. RAPID RISK ASSESSMENT. [cited 24 Mar 2021]. Available: https://www.ecdc.europa.eu/sites/default/files/documents/RRA-covid-19-14th-update-15-feb-2021.pdf
40.↵
Scientific Advisory Group for Emergencies. PHE: Investigaton of novel SARS-CoV-2 variants of concern (England) - Technical briefing 9, 22 April 2021. GOV.UK; 22 Apr 2021 [cited 7 May 2021]. Available: https://www.gov.uk/government/publications/phe-investigaton-of-novel-sars-cov-2-variants-of-concern-england-technical-briefing-9-22-april-2021
41.↵
Jiang CQ, Lessler J, Kim L, Kwok KO, Read JM, Wang S, et al. Cohort Profile: A study of influenza immunity in the urban and rural Guangzhou region of China: the Fluscape Study. Int J Epidemiol. 2017;46:e16.
OpenUrl CrossRef
42.
Tricco AC, Chit A, Soobiah C, Hallett D, Meier G, Chen MH, et al. Comparing influenza vaccine efficacy against mismatched and matched strains: a systematic review and meta-analysis. BMC Med. 2013;11:153.
OpenUrl CrossRef PubMed
43.↵
Quick J. nCoV-2019 sequencing protocol v3 (LoCost). 2020 [cited 4 May 2021]. Available: https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye
44.↵
Baker DJ, Aydin A, Le-Viet T, Kay GL, Rudder S, de Oliveira Martins L, et al. CoronaHiT: high-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021;13:21.
OpenUrl CrossRef
45.↵
A Nextflow pipeline for running the ARTIC network’s field bioinformatics tools. Github; Available: https://github.com/connor-lab/ncov2019-artic-nf
46.↵
Connor TR, Loman NJ, Thompson S, Smith A, Southgate J, Poplawski R, et al. CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community. Microb Genom. 2016;2:e000086.
OpenUrl
47.↵
Phylogenetic Assignment of Named Global Outbreak LINeages (PANGOLIN). Github; Available: https://github.com/cov-lineages/pangolin
48.↵
Standardised Variant Definitions: A repository containing the up-to-date lineage definitions for variants of concern (VOC) and variants of interest (VUI) as curated by Public Health England. Github; Available: https://github.com/phe-genomics/variant_definitions
49.↵
Cluster Investigation & Virus Epidemiology Tool (CIVET). Github; Available: https://github.com/COG-UK/civet
50.↵
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–274.
OpenUrl CrossRef PubMed
51.↵
Wilson EB. Probable Inference, the Law of Succession, and Statistical Inference. J Am Stat Assoc. 1927;22:209–212.
OpenUrl CrossRef Web of Science
52.↵
Brown LD, Tony Cai T, DasGupta A. Interval Estimation for a Binomial Proportion. SSO Schweiz Monatsschr Zahnheilkd. 2001;16:101–133.
OpenUrl
53.↵
XNomial: Exact Goodness-of-Fit Test for Multinomial Data with Fixed Probabilities. [cited 21 Mar 2021]. Available: https://cran.r-project.org/web/packages/XNomial/index.html
54.↵
Riley S, Wang H, Eales O, Haw D, Walters CE, Ainslie KEC, et al. REACT-1 round 9 final report: Continued but slowing decline of prevalence of SARS-CoV-2 during national lockdown in England in February 2021. medRxiv; 2021. doi:10.1101/2021.03.03.21252856
OpenUrl Abstract/FREE Full Text
55.↵
Park N. Population estimates for the UK, England and Wales, Scotland and Northern Ireland - Office for National Statistics. Office for National Statistics; 23 Jun 2020 [cited 6 Mar 2021]. Available: https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/bulletins/annualmidyearpopulationestimates/latest
56.↵
Wallinga J, Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proc Biol Sci. 2007;274:599–604.
OpenUrl CrossRef PubMed Web of Science
57.↵
Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and Transmission of COVID-19 in Shenzhen China: Analysis of 391 cases and 1,286 of their close contacts. MedRxiv. 2020. Available: https://www.medrxiv.org/content/medrxiv/early/2020/03/19/2020.03.03.20028423.full.pdf
58.↵
Anderson SC, Ward EJ. Black swans in space: modeling spatiotemporal processes with extremes. Ecology. 2019;100:e02403.
OpenUrl

View the discussion thread.

Posted May 14, 2021.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Infectious Diseases (except HIV/AIDS)

Subject Areas

All Articles

Addiction Medicine (336)
Allergy and Immunology (664)
Anesthesia (178)
Cardiovascular Medicine (2600)
Dentistry and Oral Medicine (314)
Dermatology (218)
Emergency Medicine (391)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (920)
Epidemiology (12135)
Forensic Medicine (10)
Gastroenterology (752)
Genetic and Genomic Medicine (4031)
Geriatric Medicine (380)
Health Economics (670)
Health Informatics (2605)
Health Policy (994)
Health Systems and Quality Improvement (966)
Hematology (359)
HIV/AIDS (832)
Infectious Diseases (except HIV/AIDS) (13616)
Intensive Care and Critical Care Medicine (785)
Medical Education (397)
Medical Ethics (109)
Nephrology (426)
Neurology (3797)
Nursing (208)
Nutrition (564)
Obstetrics and Gynecology (728)
Occupational and Environmental Health (689)
Oncology (1989)
Ophthalmology (576)
Orthopedics (235)
Otolaryngology (303)
Pain Medicine (249)
Palliative Medicine (72)
Pathology (470)
Pediatrics (1098)
Pharmacology and Therapeutics (456)
Primary Care Research (443)
Psychiatry and Clinical Psychology (3379)
Public and Global Health (6470)
Radiology and Imaging (1379)
Rehabilitation Medicine and Physical Therapy (801)
Respiratory Medicine (866)
Rheumatology (395)
Sexual and Reproductive Health (403)
Sports Medicine (337)
Surgery (438)
Toxicology (51)
Transplantation (185)
Urology (165)

[1] 1.↵
Wu Z, McGoogan JM. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72 314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323:1239–1242.
OpenUrl CrossRef PubMed

[2] 2.↵
Dimonte S, Babakir-Mina M, Hama-Soor T, Ali S. Genetic Variation and Evolution of the 2019 Novel Coronavirus. Public Health Genomics. 2021;24:54–66.
OpenUrl

[3] 3.↵
Zhao S, Lou J, Cao L, Zheng H, Chong MKC, Chen Z, et al. Quantifying the transmission advantage associated with N501Y substitution of SARS-CoV-2 in the United Kingdom: An early data-driven analysis. J Travel Med. 2021. doi:10.1093/jtm/taab011
OpenUrl CrossRef

[4] 4.↵
Wang P, Liu L, Iketani S, Luo Y, Guo Y, Wang M, et al. Increased Resistance of SARS-CoV-2 Variants B.1.351 and B.1.1.7 to Antibody Neutralization. bioRxiv. 2021. doi:10.1101/2021.01.25.428137
OpenUrl Abstract/FREE Full Text

[5] 5.↵
van Dorp L, Acman M, Richard D, Shaw LP, Ford CE, Ormond L, et al. Emergence of genomic diversity and recurrent mutations in SARS-CoV-2. Infect Genet Evol. 2020;83:104351.
OpenUrl CrossRef PubMed

[6] 6.↵
Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al. Transmission of SARS-CoV-2 Lineage B. 1.1. 7 in England: Insights from linking epidemiological and genetic data. medRxiv. 2021; 2020–2012.

[7] 7.↵
Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. bioRxiv. medRxiv; 2020. doi:10.1101/2020.12.21.20248640
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Sabino EC, Buss LF, Carvalho MPS, Prete CA Jr., Crispim MAE, Fraiji NA, et al. Resurgence of COVID-19 in Manaus, Brazil, despite high seroprevalence. Lancet. 2021;397:452–455.
OpenUrl

[9] 9.↵
Rambaut A, Holmes EC, O’Toole Á, Hill V, McCrone JT, Ruis C, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020;5:1403–1407.
OpenUrl

[10] 10.↵
Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. 18 Dec 2020 [cited 7 May 2021]. Available: https://virological.org/t/563/

[11] 11.↵
Public Health England. PHE investigating a novel variant of COVID-19. In: GOV.UK [Internet]. 14 Dec 2020 [cited 5 Mar 2021]. Available: https://www.gov.uk/government/news/phe-investigating-a-novel-variant-of-covid-19

[12] 12.↵
Lineage B.1.1.7. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.1.7.html

[13] 13.↵
Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday J, et al. Estimated transmissibility and severity of novel SARS-CoV-2 Variant of Concern 202012/01 in England. MedRxiv. 2021; 2020–2012.

[14] 14.↵
Lineage B.1.351. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.351.html

[15] 15.↵
Jangra S, Ye C, Rathnasinghe R, Stadlbauer D, Krammer F, Simon V, et al. The E484K mutation in the SARS-CoV-2 spike protein reduces but does not abolish neutralizing activity of human convalescent and post-vaccination sera. medRxiv. 2021. doi:10.1101/2021.01.26.21250543
OpenUrl Abstract/FREE Full Text

[16] 16.↵
Public Health England. PHE statement on Variant of Concern and new Variant Under Investigation. In: GOV.UK [Internet]. 10 Feb 2021 [cited 5 Mar 2021]. Available: https://www.gov.uk/government/news/phe-statement-on-variant-of-concern-and-new-variant-under-investigation

[17] 17.↵
Bugembe DL, Phan MVT, Ssewanyana I, Semanda P, Nansumba H, Dhaala B, et al. A SARS-CoV-2 lineage A variant (A.23.1) with altered spike has emerged and is dominating the current Uganda epidemic. medRxiv; 2021. doi:10.1101/2021.02.08.21251393
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Variants: distribution of cases data. [cited 7 May 2021]. Available: https://www.gov.uk/government/publications/covid-19-variants-genomically-confirmed-case-numbers/variants-distribution-of-cases-data

[19] 19.↵
Lineage A.23.1. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_A.23.1.html

[20] 20.↵
Lineage B.1.525. [cited 21 Mar 2021]. Available: https://cov-lineages.org/global_report_B.1.525.html

[21] 21.↵
COVID-19 Genomics UK (COG-UK) consortiumcontact@cogconsortium.uk. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe. 2020;1:e99–e100.
OpenUrl

[22] 22.↵
COVID-19 Genomics UK Consortium. 11 Jan 2021 [cited 5 Mar 2021]. Available: https://www.cogconsortium.uk/

[23] 23.↵
Cyranoski D. Alarming COVID variants show vital role of genomic surveillance. Nature. 2021;589:337–338.
OpenUrl CrossRef PubMed

[24] 24.↵
Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. 2017;1:33–46.
OpenUrl CrossRef PubMed

[25] 25.↵
Summary report: COG-UK geographic coverage of SARS-CoV-2 sample sequencing. Available: https://www.cogconsortium.uk/wp-content/uploads/2021/02/COG-UK-geo-coverage_2021-01-18_summary.pdf

[26] 26.↵
Summary report: COG-UK geographic coverage of SARS-CoV-2 sample sequencing. Available: https://www.cogconsortium.uk/wp-content/uploads/2021/03/COG-UK-geo-coverage_2021-02-22_summary.pdf

[27] 27.↵
Riley S, Atchison C, Ashby D, Donnelly CA, Barclay W, Cooke G, et al. REal-time Assessment of Community Transmission (REACT) of SARS-CoV-2 virus: Study protocol. Wellcome Open Research. 2020;5:200.
OpenUrl

[28] 28.↵
Volz E, Mishra S, Chand M, Barrett JC, Johnson R, Geidelberg L, et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature. 2021. doi:10.1038/s41586-021-03470-x
OpenUrl CrossRef PubMed

[29] 29.↵
COVID-19 Response - Spring 2021 (Summary). [cited 24 Apr 2021]. Available: https://www.gov.uk/government/publications/covid-19-response-spring-2021/covid-19-response-spring-2021-summary

[30] 30.↵
Department for Education. New guidance for schools, colleges and early years. In: GOV.UK [Internet]. 4 Nov 2020 [cited 24 Apr 2021]. Available: https://www.gov.uk/government/news/new-guidance-for-schools-colleges-and-early-years

[31] 31.↵
Davies NG, Jarvis CI, CMMID COVID-19 Working Group, Edmunds WJ, Jewell NP, Diaz-Ordaz K, et al. Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature. 2021. doi:10.1038/s41586-021-03426-1
OpenUrl CrossRef PubMed

[32] 32.↵
Graham MS, Sudre CH, May A, Antonelli M, Murray B, Varsavsky T, et al. The effect of SARS-CoV-2 variant B.1.1.7 on symptomatology, re-infection and transmissibility. bioRxiv. medRxiv; 2021. doi:10.1101/2021.01.28.21250680
OpenUrl Abstract/FREE Full Text

[33] 33.↵
Yelin I, Aharony N, Tamar ES, Argoetti A, Messer E, Berenbaum D, et al. Evaluation of COVID-19 RT-qPCR Test in Multi sample Pools. Clin Infect Dis. 2020;71:2073–2078.
OpenUrl

[34] 34.↵
Golubchik T, Lythgoe KA, Hall M, Ferretti L, Fryer HR, MacIntyre-Cockett G, et al. Early analysis of a potential link between viral load and the N501Y mutation in the SARS-COV-2 spike protein. bioRxiv. medRxiv; 2021. doi:10.1101/2021.01.12.20249080
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Hay JA, Kennedy-Shaffer L, Kanjilal S, Lennon NJ, Gabriel SB, Lipsitch M, et al. Estimating epidemiologic dynamics from cross-sectional viral load distributions. medRxiv. 2021. doi:10.1101/2020.10.08.20204222
OpenUrl Abstract/FREE Full Text

[36] 36.↵
Moshe M, Daunt A, Flower B, Simmons B, Brown JC, Frise R, et al. SARS-CoV-2 lateral flow assays for possible use in national covid-19 seroprevalence surveys (React 2): diagnostic accuracy study. BMJ. 2021;372:423.

[37] 37.↵
COG-UK Mutation Explorer. [cited 24 Mar 2021]. Available: http://sars2.cvr.gla.ac.uk/cog-uk/

[38] 38.↵
Surge testing for new coronavirus (COVID-19) variants. [cited 5 Mar 2021]. Available: https://www.gov.uk/guidance/surge-testing-for-new-coronavirus-covid-19-variants

[39] 39.↵
V-S-C, Update 14th. RAPID RISK ASSESSMENT. [cited 24 Mar 2021]. Available: https://www.ecdc.europa.eu/sites/default/files/documents/RRA-covid-19-14th-update-15-feb-2021.pdf

[40] 40.↵
Scientific Advisory Group for Emergencies. PHE: Investigaton of novel SARS-CoV-2 variants of concern (England) - Technical briefing 9, 22 April 2021. GOV.UK; 22 Apr 2021 [cited 7 May 2021]. Available: https://www.gov.uk/government/publications/phe-investigaton-of-novel-sars-cov-2-variants-of-concern-england-technical-briefing-9-22-april-2021

[41] 41.↵
Jiang CQ, Lessler J, Kim L, Kwok KO, Read JM, Wang S, et al. Cohort Profile: A study of influenza immunity in the urban and rural Guangzhou region of China: the Fluscape Study. Int J Epidemiol. 2017;46:e16.
OpenUrl CrossRef

[42] 42.
Tricco AC, Chit A, Soobiah C, Hallett D, Meier G, Chen MH, et al. Comparing influenza vaccine efficacy against mismatched and matched strains: a systematic review and meta-analysis. BMC Med. 2013;11:153.
OpenUrl CrossRef PubMed

[43] 43.↵
Quick J. nCoV-2019 sequencing protocol v3 (LoCost). 2020 [cited 4 May 2021]. Available: https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye

[44] 44.↵
Baker DJ, Aydin A, Le-Viet T, Kay GL, Rudder S, de Oliveira Martins L, et al. CoronaHiT: high-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021;13:21.
OpenUrl CrossRef

[45] 45.↵
A Nextflow pipeline for running the ARTIC network’s field bioinformatics tools. Github; Available: https://github.com/connor-lab/ncov2019-artic-nf

[46] 46.↵
Connor TR, Loman NJ, Thompson S, Smith A, Southgate J, Poplawski R, et al. CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community. Microb Genom. 2016;2:e000086.
OpenUrl

[47] 47.↵
Phylogenetic Assignment of Named Global Outbreak LINeages (PANGOLIN). Github; Available: https://github.com/cov-lineages/pangolin

[48] 48.↵
Standardised Variant Definitions: A repository containing the up-to-date lineage definitions for variants of concern (VOC) and variants of interest (VUI) as curated by Public Health England. Github; Available: https://github.com/phe-genomics/variant_definitions

[49] 49.↵
Cluster Investigation & Virus Epidemiology Tool (CIVET). Github; Available: https://github.com/COG-UK/civet

[50] 50.↵
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–274.
OpenUrl CrossRef PubMed

[51] 51.↵
Wilson EB. Probable Inference, the Law of Succession, and Statistical Inference. J Am Stat Assoc. 1927;22:209–212.
OpenUrl CrossRef Web of Science

[52] 52.↵
Brown LD, Tony Cai T, DasGupta A. Interval Estimation for a Binomial Proportion. SSO Schweiz Monatsschr Zahnheilkd. 2001;16:101–133.
OpenUrl

[53] 53.↵
XNomial: Exact Goodness-of-Fit Test for Multinomial Data with Fixed Probabilities. [cited 21 Mar 2021]. Available: https://cran.r-project.org/web/packages/XNomial/index.html

[54] 54.↵
Riley S, Wang H, Eales O, Haw D, Walters CE, Ainslie KEC, et al. REACT-1 round 9 final report: Continued but slowing decline of prevalence of SARS-CoV-2 during national lockdown in England in February 2021. medRxiv; 2021. doi:10.1101/2021.03.03.21252856
OpenUrl Abstract/FREE Full Text

[55] 55.↵
Park N. Population estimates for the UK, England and Wales, Scotland and Northern Ireland - Office for National Statistics. Office for National Statistics; 23 Jun 2020 [cited 6 Mar 2021]. Available: https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/bulletins/annualmidyearpopulationestimates/latest

[56] 56.↵
Wallinga J, Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proc Biol Sci. 2007;274:599–604.
OpenUrl CrossRef PubMed Web of Science

[57] 57.↵
Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and Transmission of COVID-19 in Shenzhen China: Analysis of 391 cases and 1,286 of their close contacts. MedRxiv. 2020. Available: https://www.medrxiv.org/content/medrxiv/early/2020/03/19/2020.03.03.20028423.full.pdf

[58] 58.↵
Anderson SC, Ward EJ. Black swans in space: modeling spatiotemporal processes with extremes. Ecology. 2019;100:e02403.
OpenUrl