Importance of E484K and N501Y mutations in SARS-CoV-2 for genomic surveillance: rapid detection by restriction enzyme analysis ============================================================================================================================== * Rossana C Jaspe * Yoneira Sulbaran * Carmen L Loureiro * Pierina D’Angelo * Lieska Rodríguez * Domingo J Garzaro * Héctor R Rangel * Flor H Pujol ## Abstract **Introduction** Variants of Concern of SARS-CoV-2 (VOCs), the new coronavirus responsible for COVID-19, have emerged in several countries. Two mutations in the gene coding for the Spike protein of the viral genome are particularly important and associated with some of these variants: E484K and N501Y. Restriction enzyme analysis is proposed as a rapid method to detect these two mutations. **Methodology** A search on GISAID was performed in April 2021 to detect the frequency of these two mutations in the sequence available and their association with other lineages. A small amplicon from the Spike gene was digested with two enzymes: HpyAV, which allows detecting E484K mutation, and MseI, for detecting the N501Y one. **Results** The mutations E484K and N501Y, associated with VOCs, have emerged in several other lineages, particularly E484K. A 100% correlation was observed with sequencing results. **Conclusions** The proposed methodology, which allows screening a great number of samples, will probably help to provide more information on the prevalence and epidemiology of these mutations worldwide, to select the candidates for whole-genome sequencing. Keywords * COVID-19 * SARS-CoV-2 * Variants of Concern (VOC) * RFLP * Rapid screening * Mutations ## Introduction In March 2020, the COVID-19 pandemic was declared, caused by an emerging coronavirus, SARS-CoV-2. One year later, this infection has caused more than 100 million cases and more than 2 million deaths worldwide. This virus belongs to the family *Coronaviridae*, order *Nidovirales*. This viral order is unique among the RNA viruses since the viruses belonging to this order encode for an exonuclease, which enables proof-reading capacity to the replication machinery, limiting mutational events. However, the tremendous number of replication events that this virus has undergone, in addition to an elevated frequency of recombination, and to the probable action of host deaminases on the viral genome [1], has allowed the emergence of many point mutation and frequent deletions in the viral genome [2]. Different variants (groups of viruses sharing particular types of mutations) have emerged at the end of 2020. Some of these variants have been defined of Interest (VOI) or Concern (VOC) by WHO: the variant B1.1.7 (VOC), for which an increase in incidence after its emergence was observed in the UK, variant B.1.351 (VOC), with an increase in prevalence in South Africa, and variants B.1.1.28.1 or P.1 (VOC) and B.1.1.28.2 or P.2 (VOI), which are now predominating in Brazil, are examples of these variants (variants named according to Pango lineages classification ([https://cov-lineages.org/lineages.html](https://cov-lineages.org/lineages.html)) [3-5]. Some of the mutations harbored by these variants are of public health concern for two main reasons: mutation N501Y might lead to increased transmissibility of the viruses harboring this mutation, and mutation E484K, has been frequently associated with reinfection cases, and might reduce the neutralizing activity of antibodies produced by vaccination [6,7]. Genomic surveillance has been proposed for monitoring the introduction of SARS-CoV-2 Variants of Concern (VOCs) in different countries [8]. A rapid method for detecting those variants might be useful for screening large quantities of samples, in settings where sequencing facilities are limited. This would be very useful for rapid and massive screening of these variants. In this study, we propose a simple restriction enzyme digestion analysis for the detection of two key mutations in the SARS-CoV-2 Spike: E484K and N501Y. ## Methodology ### Analysis of sequences available at GISAID for E484K and N501Y mutations Sequences available at GISAID on April 24 were analyzed for the presence of E484K and N501Y, at [https://www.gisaid.org/phylodynamics/global/nextstrain/](https://www.gisaid.org/phylodynamics/global/nextstrain/) and [https://www.epicov.org/](https://www.epicov.org/). ### RT-PCR This study was approved by the Bioethical Committee of IVIC. We previously detected in Venezuela, the circulation of variants of the B.1.1.28.1 lineage (P1-like) and samples harboring the E484K mutation without the N501Y one, by sequencing part of the Spike gene in Venezuelan isolates (Jaspe, RC, in preparation). RNA from clinical samples positive by qRT-PCR (classified as wild-type, WT, or harboring E484K alone or with N501Y) was amplified with primers 76.1L (5’-CCAGATGATTTTACAGGCTGCG-3’) and 76.8R (5’-GTTGCTGGTGCATGTAGAAGTTC-3’) using SuperScript III One-Step RT-PCR System with Platinum Taq High Fidelity DNA Polymerase (Thermo Fisher Scientific), and the following PCR conditions: an incubation at 55°C for 30 min, followed by 94°C/3min and 40 cycles of 94°C/15 sec, 55°C/30 sec and 68°C/30 sec, with a final extension of 68°C for 7 min. Superscript II One-Step PCR System with Platinum Taq DNA Polymerase (Thermo Fisher Scientific), was also used successfully. ### Restriction analysis Five µl of the amplicon were digested with 1 unit of HpyAV or MseI for 1 hour at 37°C and then loaded in a 3% agarose gel electrophoresis for band visualization with Ethidium bromide. Restriction results were compared with the sequence obtained by sending PCR purified fragments to Macrogen Sequencing Service (Macrogen, Korea). ## Results A total of 1,224,815 sequences of SARS-CoV-2 were available on April 24 at GISAID. Figure 1 shows the frequency of sequences harboring the N501Y, E484K, or both mutations. However, this frequency does not reflect the true prevalence of these mutations worldwide, since a strong bias exists, which has been accentuating over the last months. There are strong differences in genomic sequencing capacities between developed and developing countries, and even between developed countries (Table 1). View this table: [Table 1.](http://medrxiv.org/content/early/2021/05/10/2021.05.04.21256650/T1) Table 1. Number of sequences harboring E484K or N501Y mutations in selected countries ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/05/10/2021.05.04.21256650/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/05/10/2021.05.04.21256650/F1) Figure 1. Frequency of sequences in GISAID harboring N501Y, E484K mutations or both. For example, UK is the country that has provided the highest number of sequences to GISAID, more than the USA, even if the total number of COVID-19 cases is 7.4 times lower than the ones in the USA. VOC of the lineage B.1.1.7, which lacks the E484K mutations, emerged in the UK and is also frequent in the USA. In contrast, the other two VOCs, which harbor E484K in addition to N501Y, emerged in Brazil and South Africa, countries with robust sequencing capacities but not comparable to the countries previously mentioned (Table 1). This is the main reason for the higher abundancy of N501Y mutation, compared to E484K one (Figure 1). Even if the frequency of the E484K mutation appears as relatively low in the sequences available at GISAID, this mutation has been found in some isolates from several lineages, more abundant than the ones where N501Y mutation has been described, in spite of the significant lower number of sequences available in GISAID, harboring this mutation (Table 2). View this table: [Table 2.](http://medrxiv.org/content/early/2021/05/10/2021.05.04.21256650/T2) Table 2. Lineages where E484K or N501Y mutations can be found Figure 2 shows the restriction pattern of Wild Type (WT) samples and isolates harboring mutations E484K and N501Y (M). Digestion with HpyAV of the 293 bp amplicon of the Spike partial genomic region yielded two bands of 173 and 120 bp for samples with E484, while the digestion site is abolished with K484. Digestion with MseI yielded a fragment of 182 and 111 for samples with N501, and a fragment of 173, 111 and 9 bp (nor observed in the gel) for samples with Y501 (Figure 2A). The 9 bp difference of the product between the WT and Y501 mutated samples was easily differentiated in 3% agarose gels (Figure 2B). NuSieve agarose or Polyacrylamide gel electrophoresis can also be used for more discrimination, particularly of bands of 173 and 182 bp. However, this was not necessary in our hands. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/05/10/2021.05.04.21256650/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/05/10/2021.05.04.21256650/F2) Figure 2. Restriction analysis of amplicons with E484K or N501Y mutations. A. Sequence of the amplified product showing the restriction sites which discriminate Wild-type (WT) or mutant (Mut or M) viruses. B. Agarose gel electrophoresis of digested PCR-amplified products. PCR-amplified products digested with the enzyme shown were run with molecular weight markers (1Kb plus DNA ladder): smaller bands are signaled (100, 200 and 300 bp). A total of 40 samples (12 WT, 27 with both mutations and 1 with only E484K) were analyzed for their restriction pattern and compared to the presence or not of the two mutations in their sequence. A 100% correlation was observed in the detection of the mutations between the two methods (data not shown). ## Discussion SARS-CoV-2 variants are emerging and spreading rapidly in several parts of the world [3]. Mutations E484K and N501Y have been recurrently associated with many of the VOCs and may appear in other variants already unknown. Indeed, a survey of sequences available at GIASID showed that these mutations, particularly E484K, have been found in sequences for many lineages. As stated before, the E484K mutation has been found in cases of reinfection and may represent a mutation that emerged to escape the presence of neutralizing antibodies [9]. The frequency of variants or mutations cannot be estimated with the sequences available at GISAID, since huge disparities exist between the sequencing capacities among countries. The presence of any of these two mutations does not necessarily represent that a VOC is circulating in a specific country. Their presence does not necessarily mean that this isolate will gain the phenotypic advantages provided by these mutations in VOCs, as the increased transmissibility observed for the VOCs and provided by N501Y. Many other mutations present in these VOCs (point mutations and deletions) might be contributing to the enhanced transmissibility of these VOCs [10]. However, the specific detection of these two mutations, which play a key role in determining the phenotype of the isolate, might be of particular interest. Thus a rapid method for identifying those mutations should be very useful, particularly in settings where massive whole genome sequencing is not available. Rapid methodologies might be used for the rapid screening of several samples. The whole protocol can be run in a day. ## Conclusions The proposed methodology allows analyzing a great number of samples to select the ones that may harbor mutations of concern, before proceeding to whole genome sequencing. On the other hand, once the presence of the variants is confirmed by whole genome sequencing, this method can be used for the rapid estimation of their prevalence in different geographical regions. ## Data Availability All data is available. ## Acknowledgements This study was supported by Ministerio del Poder Popular de Ciencia, Tecnología e Innovación of Venezuela. * Received May 4, 2021. * Revision received May 4, 2021. * Accepted May 9, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Pujol FH, Zambrano JL, Jaspe R, Loureiro CL, Vizzi E, Liprandi F, Rangel HR (2020) Biología y evolución del coronavirus causante de la COVID-19. Rev Soc Venezol Microbiol 40: 63–70. 2. 2.Phan T (2020) Genetic diversity and evolution of SARS-CoV-2. Infect Genet Evol 81:104260. doi: 10.1016/j.meegid.2020.104260. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.meegid.2020.104260&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F05%2F10%2F2021.05.04.21256650.atom) 3. 3.Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday JD, Pearson CAB, Russell TW, Tully DC, Washburne AD, Wenseleers T, Gimma A, Waites W, Wong KLM, van Zandvoort K, Silverman JD, CMMID COVID-19 Working Group, COVID-19 Genomics UK (COG-UK) Consortium, Diaz-Ordaz K, Keogh R, Eggo RM, Funk S, Jit M, Atkins KE, Edmunds WJ (2021) Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science 3055: 1–16. Available: [http://www.ncbi.nlm.nih.gov/pubmed/33658326](http://www.ncbi.nlm.nih.gov/pubmed/33658326). 4. 4.Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Msomi N, Mlisana K, von Gottberg A, Walaza S, Allam M, Ismail Mohale T, Glass AJ, Engelbrecht A, Van Zyl G, Preiser W, Petruccione F, Sigal A, Hardie D, Marais G, Hsiao NY, Korsman S, Davies MA, Tyers L, Mudau I, York D, Maslo C, Goedhals D, Abrahams S, Laguda-Akingba O, Alisoltani-Dehkordi A, Godzik A, Wibmer CK, Sewell BT, Lourenço J, Alcantara LCJ, Kosakovsky Pond SL, Weaver S, Martin D, Lesse lls RJ, Bhiman JN, Williamson C, de Oliveira T (2021) Emergence of a SARS-CoV-2 variant of concern with mutations in spike glycoprotein. Nature. Available: [http://www.ncbi.nlm.nih.gov/pubmed/33690265](http://www.ncbi.nlm.nih.gov/pubmed/33690265). 5. 5.Faria NR, Mellan TA, Whittaker C, Claro IM, Candido DDS, Mishra S, Crispim MAE, Sales FCS, Hawryluk I, McCrone JT, Hulswit RJG, Franco LAM, Ramundo MS, de Jesus JG, Andrade PS, Coletti TM, Ferreira GM, Silva CAM, Manuli ER, Pereira RHM, Peixoto PS, Kraemer MUG, Gaburo N Jr., Camilo CDC, Hoeltgebaum H, Souza WM, Rocha EC, de Souza LM, de Pinho MC, Araujo LJT, Malta FSV, de Lima AB, Silva JDP, Zauli DAG, Ferreira ACS, Schnekenberg RP, Laydon DJ, Walker PGT, Schlüter HM, Dos Santos ALP, Vidal MS, Del Caro VS, Filho RMF, Dos Santos HM, Aguiar RS, Proença-Modena JL, Nelson B, Hay JA, Monod M, Miscouridou X, Coupland H, Sonabend R, Vollmer M, Gandy A, Prete CA Jr., Nascimento VH, Suchard MA, Bowden TA, Pond SLK, Wu CH, Ratmann O, Ferguson NM, Dye C, Loman NJ, Lemey P, Rambaut A, Fraiji NA,Carvalho MDPSS, Pybus OG, Flaxman S, Bhatt S, Sabino EC (2021) Genomics and epidemiology of a novel SARS-CoV-2 lineage in Manaus, Brazil. Science. doi: 10.1126/science.abh2644. [Abstract](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjE3OiJzY2llbmNlLmFiaDI2NDR2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA1LzEwLzIwMjEuMDUuMDQuMjEyNTY2NTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 6. 6.Altmann DM, Boyton RJ, Beale R (2021) Immunity to SARS-CoV-2 variants of concern. Science 371: 1103–1104. Available: [http://www.ncbi.nlm.nih.gov/pubmed/33707254](http://www.ncbi.nlm.nih.gov/pubmed/33707254). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNzEvNjUzNC8xMTAzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDUvMTAvMjAyMS4wNS4wNC4yMTI1NjY1MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 7. 7.Davies NG, Jarvis CI; CMMID COVID-19 Working Group, Edmunds WJ, Jewell NP, Diaz-Ordaz K, Keogh RH (2021) Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature. doi: 10.1038/s41586-021-03426-1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03426-1&link_type=DOI) 8. 8.World Health Organization (2020) Epidemiological Update Coronavirus disease (COVID-19). 2019: 1–11. Available: [https://www.paho.org/pt/documentos/atualizacao-epidemiologica-covid-19-doenca-causada-pelo-novo-coronavirus-9-novembro-2020](https://www.paho.org/pt/documentos/atualizacao-epidemiologica-covid-19-doenca-causada-pelo-novo-coronavirus-9-novembro-2020). 9. 9.Hoffmann M, Arora P, Groß R, Seidel A, Hörnich BF, Hahn AS, Krüger N, Graichen L, Hofmann-Winkler H, Kempf A, Winkler MS, Schulz S, Jäck HM, Jahrsdörfer B, Schrezenmeier H, Müller M, Kleger A, Münch J, Pöhlmann S (2021) SARS-CoV-2 variants B.1.351 and P.1 escape from neutralizing antibodies. Cell. S0092-8674(21)00367-6. doi: 10.1016/j.cell.2021.03.036. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2021.03.036&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33794143&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F05%2F10%2F2021.05.04.21256650.atom) 10. 10.Khan A, Zia T, Suleman M, Khan T, Ali SS, Abbasi AA, Mohammad A, Wei DQ (2021) Higher infectivity of the SARS-CoV-2 new variants is associated with K417N/T, E484K, and N501Y mutants: An insight from structural data. J Cell Physiol. doi: 10.1002/jcp.30367. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jcp.30367&link_type=DOI)