Abstract
Recent months have seen surges of SARS-CoV-2 infection across the globe along with considerable viral evolution. Extensive mutations in the spike protein of variants B.1.1.7, B1.351, and P.1 have raised concerns that the efficacy of current vaccines and therapeutic monoclonal antibodies could be threatened. In vitro studies have shown that one mutation, E484K, plays a crucial role in the loss of neutralizing activity of some monoclonal antibodies as well as most convalescent and vaccinee sera against variant B.1.351. In fact, two vaccine trials have recently reported lower protective efficacy in South Africa, where B.1.351 is dominant. To survey for these novel variants in our patient population in New York City, PCR assays were designed to identify viruses with two signature mutations, E484K and N501Y. We observed a steady increase in the detection rate from late December to mid-February, with an alarming rise to 12.3% in the past two weeks. Whole genome sequencing further demonstrated that most of our E484K isolates (n=49/65) fell within a single lineage: NextStrain clade 20C or Pangolin lineage B.1.526. Patients with this novel variant came from diverse neighborhoods in the metropolitan area, and they were on average older and more frequently hospitalized. Phylogenetic analyses of sequences in the database further reveal that this B.1.526 variant is scattered in the Northeast of US, and its unique set of spike mutations may also pose an antigenic challenge for current interventions.
Manuscript Text
While evolution of SARS-CoV-2 was deemed to be slow at the beginning of the global pandemic (1), at least three major variants of concern have emerged over the past two months (2-4). These lineages are each characterized by numerous mutations in the spike protein, raising concerns that they may escape from therapeutic monoclonals and vaccine-induced antibodies. The hallmark mutation of B.1.1.7, the first SARS-CoV-2 variant of concern that emerged in the UK, is N501Y located in the receptor-binding domain (RBD) of spike (2). This variant is seemingly more transmissible and possibly more virulent (5-7). The other variants of concern, B.1.351 (first detected in South Africa) (3) and P.1 (first described in Brazilian travelers) (4), share the N501Y mutation with B.1.1.7 but contain an E484K substitution in RBD (3, 4). Epidemiological evidence suggests that P.1 emerged as part of a second surge in Manaus, Brazil despite a high pre-existing seroprevalence to SARS-CoV-2 in the population. Reinfections with P.1, as well as with another related Brazilian variant P.2 that also harbors E484K, have been documented (8, 9).
Our previous study on B.1.351 demonstrated that this variant is refractory to neutralization by a number of monoclonal antibodies directed to the top of RBD, including several that have received emergency use authorization (10). Moreover, this variant was markedly more resistant to neutralization by convalescent plasma and vaccinee sera. Importantly, these effects were largely mediated by the E484K mutation. These finding are worrisome in light of recent reports that two vaccine trials showed a substantial drop in efficacy in South Africa (11, 12). We therefore began an effort to survey our patient population at the Columbia University Irving Medical Center in New York City for B.1.351 and other E484K variants such as P.1 and P.2.
We first developed rapid PCR-based, single-nucleotide-polymorphism assays to search for N501Y and E484K mutations (see schematic in Fig. S1 and Methods in Supplement) in clinical samples known to be positive for SARS-CoV-2 and stored in the Columbia University Biobank, and patient information was extracted from the COVID-Care database (13). Between November 1, 2020 and February 15, 2021, a total of 60,539 nasopharyngeal swabs underwent clinical testing for SARS-CoV-2 at our medical center, with 4,358 positive samples identified. We screened 1,142 samples randomly chosen from this period for the two signature mutations. A total of 927 samples yielded a signal in our genotyping assays. We found that 83 (9.0%) were positive for E484K and 17 (1.8%) were positive for N501Y. Only one sample contained both mutations. The earliest case with E484K was collected in mid-November 2020. Subsequently, there was a substantial increase in E484K-positive cases over time (Fig. 1A), from 1.3% in early November to 5.3% by mid-January, and ultimately to 12.3% between February 8th and 15th. Viruses harboring N501Y also increased over time, from the earliest detection in mid-January to 2.6% of screened isolates by mid-February.
We then performed whole genome nanopore sequencing on samples flagged as potential N501Y- or E484K-harboring strains (n=65). We also sequenced samples negative for these signature mutations obtained during the same time period, all with Ct values below 35 (n=65). Sequencing results verified the E484K and N501Y substitutions in all samples identified by our screening PCR assays. Based on phylogenetic analyses including publicly available genomes (Fig. 1B), six cases with N501Y were identified as belonging to the B.1.1.7 lineage, two cases with E484K as P.2, and one sample as B.1.351, which harbored both N501Y and E484K based on our screening assay. However, quite unexpectedly, the large majority (n=49) of the remaining cases with E484K fell within a single lineage, B.1.526 (14). It is this novel variant that is surging, alarmingly, in our patient population over the past few weeks.
Nearly all of the newly identified B.1.526 variants have a set of common mutations in the spike protein: L5F, T95I, D253G, E484K, D614G, and A701V. Fig. 2A displays all of the spike mutations found in all variant viruses identified in the study, along with their phylogenetic relationship. Fig. 2B shows that D253G resides in the antigenic supersite within the N-terminal domain (15), which is a target for neutralizing antibodies (16), whereas the E484K is situated at the RBD interface with the cellular receptor ACE2. The A701V mutation near the furin cleavage site is also shared with variant B.1.351. The impact of the E484K mutation on antibody neutralization was assessed using 4 monoclonal antibodies with emergency use authorization, 10 convalescent plasma, and 10 vaccinee sera. As shown in Fig. S2, the neutralizing activity of REGN10987 against E484K pseudovirus is unaltered, but the activities of REGN10933, CB6, and LY-CoV555 are either impaired or abolished. Likewise, neutralizing activities of convalescent plasma or vaccinee sera are lower by 7.7-fold or 3.4-fold, respectively, against the E484K variant. These results signify an important antigenic drift in B.1.526 that could have clinical consequences, as have been noted for B.1.351 (10-12).
Patients with E484K variant viruses were comparable in gender, race and ethnicity to those with wildtype SARS-CoV-2, but on average were older (58.1 vs 52.4 years, p=0.049) and more likely to present to the ED or be admitted to the hospital (85.9% vs 70.8%, p=0.007, Table S1). There were no differences in the frequency of ICU admissions and length of stay between groups. The majority of patients with E484K variants were geographically concentrated in two distinct neighborhoods in the catchment area of our hospital system, but many others were found scattered throughout the metropolitan area without evidence for a single outbreak (Fig. S3). An early case of our novel B.1.526 variant (NP-3581 in Fig. 2A) was detected in November 2020 in a patient with advanced AIDS who had first presented in August with infection by a SARS-CoV-2 strain (NP-3005 in Fig. 2A) that initially lacked the L5F, D253G, and E484K mutations. This case is reminiscent of reported examples of extensive intra-host evolution due to persistent SARS-CoV-2 infection in immunocompromised individuals (17, 18).
We also investigated SARS-CoV-2 sequences in public databases and found ∼140 genomes highly related to the newly identified B.1.526 variant (14) (Fig. 1B). These were predominantly from samples collected in the Northeastern US, suggesting that E484K in the B.1.526 lineage is now widespread in the region, the original epicenter of COVID-19 in the US (19). This supports concerns that novel variants may spread in regions with a relatively high sero-prevalence. Moreover, it appears that the E484K mutation has emerged in at least 59 different lineages of SARS-CoV-2 (20), a real testament to convergent evolution. In conclusion, we identified B.1.526 as a local lineage of concern due to E484K in particular, which could threaten the efficacy of current antibody therapies and vaccines. This discovery also highlights the need for a concerted national surveillance program to track and contain the spread of novel SARS-CoV-2 variants.
Data Availability
All SARS-CoV-2 genomes generated as part of this study have been submitted to GISAID on 2/23/2021.
Acknowledgements
We gratefully acknowledge all the authors, the originating laboratories responsible for obtaining the specimens, and the submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which part of the presented research is based. Biospecimens utilized for this research were obtained from the Columbia University Biobank (CUB) with technical support from Viplan J. Mahadeva, Sebastian Fernando and Sylvia T. Parker-Jones. CUB is supported by the Irving Institute for Clinical and Translational Research, home to Columbia University’s Clinical and Translational Science Award (CTSA) funded through Grant Number UL1TR001873. In particular, we thank Muredach Reilly, Eldad Hod and the CUB COVID-19 Genomics Consortium (CCGC) for facilitating this effort. We are also grateful to Lihong Liu and Sho Iketani for technical support. This work was in part funded by NIH/NIDA grant U01 DA053949 (ACU, MKA) and by support from Andrew & Peggy Cherng, Samuel Yin, Barbara Picower and the JBP Foundation, Brii Biosciences, Roger & David Wu, and the Bill and Melinda Gates Foundation.