Importation, circulation, and emergence of variants of SARS-CoV-2 in the South Indian State of Karnataka ======================================================================================================== * Chitra Pattabiraman * Pramada Prasad * Anson K George * Darshan Sreenivas * Risha Rasheed * Nakka Vijay Kiran Reddy * Anita Desai * Ravi Vasanthapuram ## ABSTRACT As the pandemic of COVID-19 caused by the coronavirus SARS-CoV-2 continues, the selection of genomic variants which can influence how the pandemic progresses is of growing concern. Of particular concern, are those variants that carry mutations/amino acid changes conferring higher transmission, more severe disease, re-infection, and immune escape. These can broadly be classified as Variants of Concern (VOCs). VOCs have been reported from several parts of the world- UK (lineage B.1.1.7), South Africa (lineage B.1.351) and, Brazil (lineage P.1/B.1.1.28). The conditions that contribute to the emergence of VOCs are not well understood. International travel remains an important means of spread. To track importation, spread, and the emergence of variants locally; we sequenced whole genomes of SARS-CoV-2 from international travellers (n=75) entering Karnataka, a state in South India, between Dec 22, 2020- Jan 31, 2021, and from positive cases in the city of Bengaluru (n=108), between Nov 22, 2020- Jan 22, 2021. The resulting 176 SARS-CoV-2 genomes could be classified into 34 lineages, that were either imported (73/176) or circulating (103/176) in this time period. The lineage B.1.1.7 (a.k.a the UK variant) was the major lineage imported into the state (24/73, 32.9%), followed by B.1.36 (20/73, 27.4%) and B.1 (14/73, 19.2%). We identified B.1.36 (45/103; 43.7%), B.1 (26/103; 25.2%), B.1.1.74 (5/103; 4.9%) and B.1.468 (4/103; 3.9%) as the major variants circulating in Bengaluru city. A distinct clade within the B.1.36 lineage was associated with a local outbreak. Analysis of the complete genomes predicted multiple amino acid replacements in the Spike protein. In total, we identified nine amino acid changes (singly or in pairs) in the Receptor Binding Domain of the Spike protein. Of these, the amino acid replacement N440K was found in 37/65 (56.92%) sequences in the B.1.36 lineage. The E484K amino acid change which is present in both VOCs, B.1.351 and P.1/B.1.1.28, was found in a single circulating virus in the B.1.36 lineage. This study highlights the introduction of VOCs by travel and the local circulation of viruses with amino acid replacements in the Spike protein. These were spread across lineages, suggesting that multiple paths can lead to the emergence of VOCs, this, in turn, highlights the need to sequence and limit outbreaks of SARS-CoV-2 locally. Our data support the use of concentrated and continued genomic surveillance of SARS-CoV-2 to direct public health measures, suggest revisions to vaccines, and serve as an early warning system to prepare for a surge in COVID-19 cases. ## INTRODUCTION The COVID-19 pandemic caused by the coronavirus SARS-CoV-2 has claimed millions of lives and has affected people living in all parts of the globe1. The evolution of the virus did not initially alarm public health specialists or those involved in vaccine development 2. However, the emergence of variants with distinct biological properties which include one or more mutations that confer higher infectivity, increased transmission, severe disease, re-infection, and immune escape are a cause for concern3–9. Such variants may influence the trend of the pandemic and are therefore broadly known as Variants of Concern (VOCs)3–8. In India, the COVID-19 pandemic began with the importation of the virus in January 202010. It is only after 11 million cases and over 150k deaths that the numbers declined, signalling the end of the first wave of SARS-CoV-2 in the country1,10,11. As with other countries in the world, India too started vaccination campaigns in January 2021, at about the same time that reports of VOCs were communicated from the UK, Brazil, and South Africa3,4,6,11. The primary concern is that they may herald the second wave of SARS-CoV-2 in the county and/or undermine the vaccination drive. Genomic studies in India have shown that several lineages of SARS-CoV-2 have been introduced, have spread, and fallen below the limit of detection since January 202012,13,22,14–21. We have previously performed detailed genomic epidemiology of SARS-CoV-2 in the South Indian state of Karnataka, with a population of 64.1 million (Census 2014)22. We found multiple introductions of SARS-CoV-2 into the state and at least seven distinct lineages were already circulating in the state by May 2020. Detailed analysis of the contact network of COVID-19 cases to look at transmission within the state emphasized the role of symptomatic individuals in spreading the virus23. These data have contributed to our understanding of how the virus enters, spreads, and evolves in a population. In the genomic epidemiology study, no particular lineages were associated with disease severity22. Studies of sequences from India juxtaposed with sequences from all over the world, suggest that mutations associated with immune escape and re-infection are already circulating in the population2,24–26. Multiple lineages of SARS-CoV-2 have been reported from across the world and in India 12,13,15–17,19–22,27. There are two ancestral lineages of SARS-CoV-2 in the PANGO classification system, A and B28. While viruses of both lineages are circulating across the world, viruses of lineage B are more widespread and prominent in number. The viruses responsible for the catastrophic outbreak in Italy, in early 2020, with an amino acid change in the spike protein D614G and were classified into lineage B.128. This lineage is now the dominant lineage across the world. Several studies have now shown that viruses in this lineage transmit better, with increased infectivity in cell culture29–32. Viruses of the lineage B.1 have acquired several other amino acid replacements in the Receptor Binding Domain of the Spike protein – specifically in the lineages which have been designated as VOCs, namely -B.1.1.7 (N501Y), B.1.351 (N501Y, E484K, K417T) and P.1 from the lineage B.1.1.28 (N501Y, E484K, K417T). Some of these amino acid replacements either singly or in combination have been shown to influence transmission of the virus, interfere with neutralization of the virus, and are associated with an increase in the number of hospitalizations2,5,7,8. The spread of these lineages, therefore, has global implications5,33. Early data suggests that some variants may escape neutralization by both therapeutic antibodies and antibodies induced by previous infection and vaccination 8,9,34,35. This has implications for the efficacy of Spike sequence-based vaccines and suggests that re-infection is possible7,36. Rapid sharing of genomic information enabled the global community to pick-up cases of VOCs and implement relevant public health measures3,4,6. A concentrated, ongoing, local approach to genomic surveillance is critical for the identification of variants and establishing epidemiological links with the trend of the outbreak5,7,12,22. This has also proved critical for local outbreak management and informed policy decisions across the world5,7,37,38. It is in this context that we conducted genomic surveillance of COVID-19 positive international travellers to the south Indian state of Karnataka between Dec 22, 2020-Jan 31, 2021 (n=75). We also performed sequencing of SARS-CoV-2 (n=108), collected between Nov 22, 2020-Jan 22, 2021) in Bengaluru city (Bengaluru Urban District) to identify and track locally circulating variants and potential VOCs. ## METHODS ### Samples for Sequencing The Department of Neurovirology, at the National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, is an ICMR (Indian Council of Medical Research) approved COVID-19 diagnostic centre. Further, the Government of Karnataka and the Government of India designated our lab as a nodal centre for genomic sequencing. This study was granted a waiver by the Institutional Ethics Committee of NIMHANS in light of the public health emergency. Nasopharyngeal and oropharyngeal swabs collected from International Travellers (n=75, Dec 22, 2020 – Jan 31, 2021), samples from COVID-19 cases in Bengaluru city (n=108, Nov 22, 2020-Jan 22, 2021), and from a local outbreak (Feb 2021) were included in the study. Of the 42 samples collected from the local outbreak, 14 were suitable for sequencing (RT PCR positive, Ct value < 30) and were analysed further. ### RNA Extraction and RT-PCR Nucleic acid extraction was performed with automated magnetic bead-based extraction method, using the Chemagic Viral DNA/RNA special H96 kit (PerkinElmer, CMG-1033-S) following manufacturer’s instruction. SARS-CoV-2 detection was done using ICMR approved diagnostic kits. A total of 197 RT PCR positive samples fulfilling the following criteria – i. Ct values less than 30 in the case of international travellers (n=75), and local outbreak (n=14) or ii. Ct value less than 25 for local cases (n=108), were taken for whole genome sequencing. ### Whole Genome Sequencing of SARS-CoV-2 Whole genome sequencing was performed using the amplicon sequencing approach described in the ARTIC Network protocol using the V3 primer set39. The resulting amplicons from 12-24 samples were barcoded using the native barcoding kits (NBD104/114, Oxford Nanopore Technology (ONT)) and sequencing libraries were prepared using the ligation sequencing kit (SQK-LSK109, ONT). The barcoded library was loaded on to FLO-MIN-106 flow cells and sequenced on the MinION (ONT). An average of 0.12 million (median) sequencing reads were acquired per sample with a median coverage of 1737x (Supplementary Table1). ### Analysis of sequencing data and data sharing Analysis of sequencing reads was performed as described previously22. Briefly, sequences were basecalled and demultiplexed using guppy (v3.6). Amplicon sequencing primers were removed from the reads by trimming 25bp at the ends and using BBDuk (v38.37). Reference mapping based assembly of the genomes was performed using Minimap2 ver 2.17 using NC_045512 as the reference. A consensus genome was generated with a coverage cut-off of 10x and the 0% majority rule. This was then edited, and aligned to the reference for annotation. Of the 183 samples from international travellers and local cases, 176 (73/75 imported, 103/108 circulating) genomes could be used for the determination of lineage using the PANGO web application (Pangolin v2.2.2 lineages version 2021-02-12) 28. Of the 176 genomes,162 were complete (>92% at 1X and >85% at 10X) and were deposited into the GISAID Database40, accession numbers are provided in Supplementary Table 2. Complete sequences (162) were analysed for SNPs and amino acid replacements with reference MN908947.3 (Wuhan-Hu-1) using the CoV-Glue Web Application41. ### Phylogenetic analysis A total of 168 genomes, including the 162 described above, and an additional 6 complete genomes from a local outbreak, were used for phylogenetic analysis with the reference NC_045512 as an outgroup. Multiple sequence alignment was performed using MUSCLE and a maximum likelihood tree was constructed using iqtree42,43. The GTR+F+I+G4 substitution model was found to be the best-fit model (of the 88 models tested) using the Bayesian Information Criterion. The consensus tree was constructed from 1000 bootstraps and bootstrap values over 70 were interpreted. ## RESULTS We sequenced SARS-CoV-2 genomes from 197 SARS-CoV-2 positive individuals, including international travellers (n=75), local cases (n=108), and a local outbreak (n=14). Lineage classification using the PANGO scheme was possible for 176 genomes which were either imported (73/75) or circulating (103/108) (Fig 1 A, B), and for all 14 genomes from the local outbreak (Supplementary Table 3). The genomic surveillance for the local outbreak was carried out to identify the lineage/lineages responsible for the outbreak (Fig 1C). ![Fig 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/20/2021.03.17.21253810/F1.medium.gif) [Fig 1](http://medrxiv.org/content/early/2021/03/20/2021.03.17.21253810/F1) Fig 1 Distribution of SARS-CoV-2 Lineages in the state of Karnataka (Nov 22, 2020- Jan 31, 2021). A. SARS-CoV-2 lineages imported by international travel into Karnataka (n= 73) B. SARS-CoV-2 lineages circulating in Bengaluru city (n= 103). Colours represent different lineages. Lineages with greater > 4 sequences are labelled. C. Maximum Likelihood Phylogenetic tree of 168 complete SARS-CoV-2-genomes from Karnataka, rooted by the reference genome (NC_045512). The scale (length of branches) is in substitutions per nucleotide site. Predominant lineages are coloured. Sequences from a local outbreak of SARS-CoV-2 are coloured in magenta. Numbers on the nodes indicate bootstrap support values (in %), values above 70 are shown. A total of 34 lineages were detected from the 176 genomes in this study. A complete list of lineages and their frequencies is provided in Supplementary Table 4. Briefly, genomes from imported and circulating viruses belong to both A (3/176) and B (173/176) lineages. Within A, two (2/103) circulating genomes were classified into A.23.1. Of the 173 genomes in lineage B, two genomes were classified into lineage B (2/173), the rest were derived from B.1(130/173) or B.1.1(41/173). The genomes from imported cases grouped into 16 distinct lineages (Fig 1A, Supplementary Table 4) including B.1.1.7 (24/73, 32.9%), B.1.36 (20/73, 27.4%) and B.1 (14/73, 19.2%). The first introduction of B.1.1.7 was noted in the last week of December 2020, and by January 31, 2021, this lineage made up 32.9% (24/73) of all imported cases (Fig 1 A, Supplementary Table 4). Circulating genomes grouped into 24 distinct lineages, dominated by the lineages B.1.36 (45/103; 43.7%), B.1 (26/103; 25.2%), B.1.1.74 (5/103; 4.9%) and B.1.468 (4/103; 3.9%) (Fig. 1 B, Supplementary Table 4). Only a single sequence of B.1.1.7 was detected during the study period as part of this surveillance effort in a non-traveller. Sequences from the lineage B.1.36 and derived lineages (70/176) grouped into a distinct phylogenetic clade together with sequences belonging to lineage B.1.468 (6/176) (Fig 1C). Genomic investigation of an outbreak of SARS-CoV-2 in the city of Bengaluru in early Feb 2021, revealed that 14/14 sequences from the outbreak could be classified into lineage B.1.36. Complete genome sequences could be recovered from 6/14 cases. All six viruses grouped into a clade within the largely B.1.36+B.1.468 clade (Fig 1C). Of the 176 genomes from travellers and in circulation, for which lineage classification was possible,162 complete genomes (with coverage > 92% at 1X and > 85% at 10X) were used for the analysis of SNPs and amino acid replacements. A total of 968 SNPs (Supplementary Table 5) and 529 amino acid replacements (Supplementary Table 6) were identified. Of these amino acid replacements 61 were in the Spike protein of circulating viruses, and 32 in Spike protein of imported viruses (Supplementary Table 7,8). The B.1.36 lineage had 226 amino acid replacements, 31 of these were in the Spike protein. Although only the D614G and N440K were present in an appreciable number (> 50%) of sequences (Supplementary Table 9). We carried out further analysis of the amino acid replacements in the RBD domain of the spike protein (Fig 2A, Supplementary Tables 7,8) and mapped them on the Maximum-Likelihood tree (Fig 2B). We identified mutations leading to nine amino acid replacements in the RBD (Fig 2A). Of these, five (S477N, E484K, E484Q, S494L, S494P) were found in viruses circulating in Bengaluru, and the amino acid replacement V483A was from an imported case. The N501Y change was confined to the B.1.1.7 lineage. The N440K replacement was seen in 45/76 (59.2%) sequences in the B.1.36+B.1.468 clade (Fig 2B) and 37/65 sequences in B.1.36 lineage. Of the six sequences from a cluster of cases (Outbreak), only a single sequence carried the mutation resulting in the N440K change (Fig 2B, Supplementary Table 10). A single branch of the B.1.36+B.1.468 clade (n=4, 3 of which were imported) had an additional amino acid replacement F490S in the RBD (Fig 2B). The mutations in the RBD were seen across the phylogenetic tree and clades (Fig 2B). ![Fig 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/03/20/2021.03.17.21253810/F2.medium.gif) [Fig 2](http://medrxiv.org/content/early/2021/03/20/2021.03.17.21253810/F2) Fig 2 Amino acid replacements in the Receptor Binding Domain (RBD) of SARS-CoV-2. A. Frequency of amino acid replacements in the RBD (amino acids- 387- 516 in the Spike protein) is shown as a bar graph. Frequencies are plotted against the amino acid replacement. Orange and blue represent the circulating and imported genomes respectively. B. Maximum Likelihood phylogenetic tree highlighting branches with the indicated amino acid substitutions.s ## DISCUSSION In this study, we found 34 lineages of SARS-CoV-2 circulating and imported into Bengaluru city in Karnataka, India, between Nov 22, 2020 – Jan 31, 2021. We aimed to detect the introduction of the global VOCs (lineages B.1.1.7, B.1.351, P.1/B.1.1.28), as well as genotype the variants of SARS-CoV-2, circulating since our last study, which highlighted the introduction and spread of seven lineages of SARS-CoV-2 in Karnataka, between March-May 202022. We found no evidence suggesting that the B.1.1.7 lineage was present in Karnataka before late-Dec 2020. We first detected the B.1.1.7 variant in Karnataka, in an international traveller from a sample collected on Dec 22, 2020 (Supplementary Table S3). The first and only case of non-travel related B.1.1.7, in our study, was detected in the middle of Jan 2021 in an individual who was in contact with an international traveller(Supplementary Table 3). These data together suggest that B.1.1.7 in Karnataka was limited to travel-associated cases and was not in the community during the study period. At the end of the study period, the B.1.1.7 lineage was detected in 32.9% of all imported cases (Supplementary Table 4). We did not detect the variants P.1/B.1.1.28 or B.1.351 reported from Brazil and South Africa respectively in this study. We found that B.1.36 and B.1 lineages dominated in both the imported (20/73; 27.4%,14/73, 19.2%) and circulating viruses (45/103; 43.7%, 26/103; 25.2%) in our study (Supplementary Table 4). B.1.36 was first reported from Saudi Arabia in Feb 2020 (Supplementary Table 11) and has now been reported from many parts of the world including India. In our earlier work in Karnataka, we detected only two samples (2/91, 2.2%) clustering into this lineage in the middle of May 2020, which were then classified under the parent lineage B.1. Of the 176 sequences in the present study, 65 sequences were classified into B.1.36 (36.9%) and five were classified as B.1.36 derived lineages (2.8%) (Supplementary Table 4). The B.1.36 lineage was both imported by international travel (20/73) and circulating (45/103) in Bengaluru city (Supplementary Table 4). The lineage is characterized by the following amino acid replacements-nsp12-P323L(95.38%), S-D614G (93.85%), S-N440K (56.92%), ORF 3a-Q57H (90.77%), ORF 3a-E261*(81.54%), nsp3-T183I (81.54%), nsp16-L126F(80%), N-S2P (72.31%), ORF 8-S97I (72.31%) (Supplementary Table 9). The immune escape associated amino acid change, N440K has been reported from the states of Andhra Pradesh, Maharashtra, Telangana, and Karnataka, and is also associated with reinfection24,36,44. This change was found in 37/65 (56.92%) of the sequences clustering to B.1.36 (Supplementary Table 9). An outbreak of SARS-CoV-2 occurred in Bengaluru in early Feb 2021, raising concerns about the spread of variants, the threat of a second wave, and reduction in the efficacy of vaccines. This outbreak in a college where students were returning from different states within India was driven by related viruses belonging to the B.1.36 lineage (Fig 1C, Supplementary Table 3). Only one of the six sequences from the outbreak cluster had the mutation resulting the N440K replacement in the Spike protein (Fig 2C, Supplementary Table 10). This supports the idea that mutations in gene encoding the Spike protein may arise sporadically/multiple times in different clades. Apart from the introduction and spread of known VOCs, the emergence of variants locally is also a cause for concern. Early in the pandemic, a single mutation in the gene encoding the Spike protein of SARS-CoV-2 resulting in a D614G amino acid change was identified to increase infectivity and transmission2,29,32. Viruses with this amino acid replacement dominate across the globe31,45. Mutations in the gene encoding the Spike protein are of particular concern due to the role of this protein and its Receptor Binding Domain (RBD) in viral binding and entry46. Some of these mutations have been shown to increase infectivity, affinity to the ACE-2 receptor or affect neutralization by antibodies i*n vitro*. Viral genomes with these mutations were already circulating viruses by mid-2020 2,25,26,44,47,48. In the sequences from this study, nine amino acids replacements were noted in the RBD domain of the Spike protein (Fig 2B, Supplementary 7-8). They occurred singly or in pairs (N440K+F490S) (Fig 2). All nine amino acid changes, namely N440K, S477N, V483A, E484K/Q, F490S, S494L/P, N501Y are associated with immune escape24,25. Viruses with some of these amino acid changes were already known to be circulating in other parts of India 16,17,24. Mutations in the gene encoding Spike protein that do not map to the RBD have also been described; particularly near the polybasic cleavage site at the S1/S2 boundary of the Spike protein. Towards the end of the year 2020, multiple lineages with amino acid replacements at position 677 were noted49. Four viruses in our study have mutations resulting in amino acid changes at this position (Q677H (n=3), Q677P (n=1)) (Supplementary Table 6). It is to be noted that in this study we have only included samples with Ct values less than 25 for surveillance of circulating SARS-CoV-2 genomes and Ct values less than 30 for sequencing of international travel-related cases. We have also sequenced only a fraction of cases in a limited geographical area. This may therefore present an incomplete view of circulating viruses and inflate the ones that are more readily sequenced. Also, as we have used the amplicon sequencing approach, not all regions of all lineages are well covered by sequencing reads. Others have also noted homoplasy in SARS-CoV-2, this highlights the need to be cautious while interpreting the phylogenetic relationships between SARS-CoV-2 sequences, especially in the context of outbreaks50. In summary, our data highlight an increase in the frequency of the lineage B.1.36 in Bengaluru Urban, in Karnataka, and importation events indicate an underappreciated global burden (Fig 1, Supplementary Table 4). Whether this increase is because of epidemiological linkages such as increased travel, continued local transmission chains or super-spreader events remains to be determined. It is beyond the scope of this work to examine whether the lineage, contributing mutations, and amino acid changes impact transmission/infectivity of the virus. Our data emphasize that a consolidated and local approach to genomic surveillance which includes sequencing of SARS-CoV-2 from travellers, circulating variants, and outbreaks, in a continuous manner is necessary to detect VOCs. Rapid identification of such variants can aid in preparing the healthcare system for a surge in cases, suggest revisions to vaccines and diagnostic tests, inform the international community, and guide public health measures. ## Supporting information Supplementary Table 1 [[supplements/253810_file03.xls]](pending:yes) Supplementary Table 2 [[supplements/253810_file04.xls]](pending:yes) Supplementary Table 3 [[supplements/253810_file05.xls]](pending:yes) Supplementary Table 4 [[supplements/253810_file06.xls]](pending:yes) Supplementary Table 5 [[supplements/253810_file07.xls]](pending:yes) Supplementary Table 6 [[supplements/253810_file08.xls]](pending:yes) Supplementary Table 7 [[supplements/253810_file09.xls]](pending:yes) Supplementary Table 8 [[supplements/253810_file10.xls]](pending:yes) Supplementary Table 9 [[supplements/253810_file11.xls]](pending:yes) Supplementary Table 10 [[supplements/253810_file12.xls]](pending:yes) Supplementary Table 11 [[supplements/253810_file13.xls]](pending:yes) ## Data Availability All sequence data have been deposited in the GISAID database. All other data, including the accession numbers for the sequences, are included in Supplementary Information. [https://www.epicov.org/](https://www.epicov.org/) ## SUPPORTING INFORMATION Supplementary Table 1-Summary of sequencing results Supplementary Table 2 – GISAID Accession ID for sequences Supplementary Table 3 – Details of sequenced samples Supplementary Table 4 – PANGO lineage assignments for SARS-CoV-2 genomes Supplementary Table 5 – Position and frequency of single nucleotide polymorphisms Supplementary Table 6 – Position and frequency of amino acid replacements Supplementary Table 7-Amino Acid Replacement in Spike Protein (Imported) Supplementary Table 8 – Amino Acid Replacement in Spike Protein (Circulating) Supplementary Table 9 – Amino acid Replacements in lineage B.1.36 Supplementary Table 10-Position of amino acid replacements in the Spike protein of sequences from a local outbreak Supplementary Table 11 – Acknowledgement for sequences from GISAID ## FUNDING This work was supported by core funds of NIMHANS to the Department of Neurovirology, funds from the Government of Karnataka for genomic surveillance of SARS-CoV-2 and the DBT/Wellcome Trust India Alliance Fellowship IA/E/15/1/502336 awarded to Chitra P. ## AUTHOR DECLARATION The authors declare that they do not have any financial or non-financial relationships that could present a conflict of interest. ## ACKNOWLEDGEMENTS This work would not have been possible without the support of the Government of Karnataka, State Surveillance team for COVID-19, in particular Ms. Prameela Dinesh, Directorate of Health and Family Welfare Services, Government of Karnataka. We would like to thank all the labs and Primary Health Care centres that collected samples for testing and genomic surveillance. We would like to thank the COVID testing lab in NIMHANS. We would also like to acknowledge Prof. Sudhir Krishna’s laboratory at the National Centre for Biological Sciences (NCBS) for support with reagents, NCBS for access to their computational resources for carrying out our analysis, and Dr. Farhat Habib for custom scripts used in data analysis. We gratefully acknowledge the contributions of all the laboratories that have submitted their sequences to GISAID, in particular laboratories across India that have been involved in sequencing efforts. * Received March 17, 2021. * Revision received March 17, 2021. * Accepted March 20, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## REFERENCES 1. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 2020; 20. DOI:10.1016/S1473-3099(20)30120-1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(20)30120-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32087114&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 2. Li Q, Wu J, Nie J, et al. The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity. Cell 2020; 182. DOI:10.1016/j.cell.2020.07.012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.07.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32730807&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 3. Rambaut A, Loman N, Pybus O, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. VirologicalOrg 2020. 4. Faria NR, Mellan TA, Whittaker C, et al. Genomics and epidemiology of a novel SARS-CoV-2 lineage in Manaus, Brazil. medRxiv 2021; : 2021.02.26.21252554. 5. Volz E, Mishra S, Chand M, et al. Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from linking epidemiological and genetic data. medRxiv 2021. 6. Tegally H, Wilkinson E, Giovanetti M, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv. 2020. DOI:10.1101/2020.12.21.20248640. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4xMi4yMS4yMDI0ODY0MHYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDMvMjAvMjAyMS4wMy4xNy4yMTI1MzgxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 7. Sabino EC, Buss LF, Carvalho MPS, et al. Resurgence of COVID-19 in Manaus, Brazil, despite high seroprevalence. Lancet. 2021; 397. DOI:10.1016/S0140-6736(21)00183-5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(21)00183-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33515491&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 8. Cele S, Gazy I, Jackson L, et al. Escape of SARS-CoV-2 501Y.V2 variants from neutralization by convalescent plasma. medRxiv 2021. 9. Wang P, Nair MS, Liu L, et al. Antibody Resistance of SARS-CoV-2 Variants B.1.351 and B.1.1.7. Nature 2021. DOI:10.1038/s41586-021-03398-2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03398-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33684923&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 10. ICMR, Team CE& DM, Team CL, Team V. Laboratory surveillance for SARS-CoV-2 in India: Performance of testing & amp; descriptive epidemiology of detected COVID-19, January 22 - April 30, 2020. Indian J Med Res 2020; 151: 424. 11. Coronavirus outbreak in Karnataka. [https://www.covid19india.org/state/KA](https://www.covid19india.org/state/KA) (accessed March 11, 2021). 12. Agrawal A, Rakshit P, Kumar P, et al. Integrated genomic view of SARS-CoV-2 in India. Wellcome Open Res 2020; 5. DOI:10.12688/wellcomeopenres.16119.1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.12688/wellcomeopenres.16119.1&link_type=DOI) 13. Srivastava S, Banu S, Singh P, Sowpati DT, Mishra RK. SARS-CoV-2 genomics: An Indian perspective on sequencing viral variants. J. Biosci. 2021; 46. DOI:10.1007/s12038-021-00145-7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12038-021-00145-7&link_type=DOI) 14. Banu S, Jolly B, Mukherjee P, et al. A Distinct Phylogenetic Cluster of Indian Severe Acute Respiratory Syndrome Coronavirus 2 Isolates. Open Forum Infect Dis 2020; 7. DOI:10.1093/ofid/ofaa434. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ofid/ofaa434&link_type=DOI) 15. Maitra A, Sarkar MC, Raheja H, et al. Mutations in SARS-CoV-2 viral RNA identified in Eastern India: Possible implications for the ongoing outbreak in India and impact on viral structure and host susceptibility. J Biosci 2020; 45. DOI:10.1007/s12038-020-00046-1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12038-020-00046-1&link_type=DOI) 16. Rani PR, Imran M, Lakshmi JV, et al. Insights from Genomes and Genetic Epidemiology of SARS-CoV-2 isolates from the state of Andhra Pradesh. bioRxiv 2021; :2021.01.22.427775. 17. Gupta A, Sabarinathan R, Bala P, et al. Mutational landscape and dominant lineages in the SARS-CoV-2 infections in the state of Telangana, India. medRxiv. 2020. DOI:10.1101/2020.08.24.20180810. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4wOC4yNC4yMDE4MDgxMHYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDMvMjAvMjAyMS4wMy4xNy4yMTI1MzgxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 18. Yadav P, Potdar V, Choudhary M, et al. Full-genome sequences of the first two SARS-CoV-2 viruses from India. Indian J Med Res 2020; : . 19. Radhakrishnan C, Divakar MK, Jain A, et al. Initial insights into the genetic epidemiology of SARS-CoV-2 isolates from Kerala suggest local spread from limited introductions. bioRxiv. 2020. DOI:10.1101/2020.09.09.289892. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wOS4wOS4yODk4OTJ2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzIwLzIwMjEuMDMuMTcuMjEyNTM4MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 20. Jain A, Rophina M, Mahajan S, et al. Analysis of the potential impact of genomic variants in global SARS-CoV-2 genomes on molecular diagnostic assays. Int J Infect Dis 2021; 102. DOI:10.1016/j.ijid.2020.10.086. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2020.10.086&link_type=DOI) 21. Joshi M, Puvar A, Kumar D, et al. Genomic variations in SARS-CoV-2 genomes from Gujarat: Underlying role of variants in disease epidemiology. bioRxiv. 2020. DOI:10.1101/2020.07.10.197095. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wNy4xMC4xOTcwOTV2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzIwLzIwMjEuMDMuMTcuMjEyNTM4MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 22. Pattabiraman C, Habib F, Harsha PK, et al. Genomic epidemiology reveals multiple introductions and spread of SARS-CoV-2 in the Indian state of Karnataka. PLoS One 2020; 15. DOI:10.1371/journal.pone.0243412. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0243412&link_type=DOI) 23. Kumar N, Shahul Hameed SK, Babu GR, et al. Descriptive epidemiology of SARS-CoV-2 infection in Karnataka state, South India: Transmission dynamics of symptomatic vs. asymptomatic infections. EClinicalMedicine 2021. DOI:10.1016/j.eclinm.2020.100717. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.eclinm.2020.100717&link_type=DOI) 24. Jolly B, Rophina M, Shamnath A, et al. Genetic epidemiology of variants associated with immune escape from global SARS-CoV-2 genomes. bioRxiv. 2020. DOI:10.1101/2020.12.24.424332. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4xMi4yNC40MjQzMzJ2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzIwLzIwMjEuMDMuMTcuMjEyNTM4MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 25. Greaney AJ, Starr TN, Gilchuk P, et al. Complete Mapping of Mutations to the SARS-CoV-2 Spike Receptor-Binding Domain that Escape Antibody Recognition. Cell Host Microbe 2020; published online Jan 13. DOI:10.1016/j.chom.2020.11.007. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chom.2020.11.007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33259788&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 26. Starr TN, Greaney AJ, Hilton SK, et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell 2020. DOI:10.1016/j.cell.2020.08.012. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.08.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32841599&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 27. Bhoyar RC, Jain A, Sehgal P, et al. High throughput detection and genetic epidemiology of SARS-CoV-2 using COVIDSeq next generation sequencing. bioRxiv. 2020. DOI:10.1101/2020.08.10.242677. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wOC4xMC4yNDI2Nzd2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzIwLzIwMjEuMDMuMTcuMjEyNTM4MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 28. Rambaut A, Holmes EC, O’Toole Á, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 2020; 5. DOI:10.1038/s41564-020-0770-5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41564-020-0770-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32669681&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 29. Plante JA, Liu Y, Liu J, et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature 2020. DOI:10.1038/s41586-020-2895-3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2895-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33106671&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 30. Jackson CB, Zhang L, Farzan M, Choe H. Functional importance of the D614G mutation in the SARS-CoV-2 spike protein. Biochem Biophys Res Commun 2020. DOI:10.1016/j.bbrc.2020.11.026. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.bbrc.2020.11.026&link_type=DOI) 31. Korber B, Fischer WM, Gnanakaran S, et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell 2020; 182. DOI:10.1016/j.cell.2020.06.043. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.06.043&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32697968&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 32. Volz E, Hill V, McCrone JT, et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell 2021; 184. DOI:10.1016/j.cell.2020.11.020. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.11.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33275900&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 33. Áine O’Toole, Verity Hill, Oliver G. Pybus Alexander Watts,, Isaac I. Bogoch,, Kamran Khan Jane P. Messina, The COVID-Genomics UK (COG-UK) consortium, Network for Genomic Surveillance in South Africa (NGS-SA), Brazil-UK CADDE Genomic Network, Houriiyah MUGK. Tracking the international spread of SARS-CoV-2 lineages B.1.1.7 and B.1.351/501Y-V2. VirologicalOrg 2021. 34. Hoffmann M, Arora P, Groß R, et al. SARS-CoV-2 variants B.1.351 and B.1.1.248: Escape from therapeutic antibodies and antibodies induced by infection and vaccination. bioRxiv 2021. 35. Wibmer CK, Ayres F, Hermanus T, et al. SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. bioRxiv Prepr Serv Biol 2021. DOI:10.1101/2021.01.18.427166. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMS4wMS4xOC40MjcxNjZ2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAzLzIwLzIwMjEuMDMuMTcuMjEyNTM4MTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 36. Gupta V, Bhoyar RC, Jain A, et al. Asymptomatic Reinfection in 2 Healthcare Workers From India With Genetically Distinct Severe Acute Respiratory Syndrome Coronavirus 2. Clin Infect Dis 2020. DOI:10.1093/cid/ciaa1451. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cid/ciaa1451&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32964927&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 37. Page AJ, Mather AE, Le-Viet T, et al. Large scale sequencing of SARS-CoV-2 genomes from one region allows detailed epidemiology and enables local outbreak management. medRxiv. 2020. DOI:10.1101/2020.09.28.20201475. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4wOS4yOC4yMDIwMTQ3NXYyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDMvMjAvMjAyMS4wMy4xNy4yMTI1MzgxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 38. Tegally H, Wilkinson E, Lessells RJ, et al. Sixteen novel lineages of SARS-CoV-2 in South Africa. Nat Med 2021. DOI:10.1038/s41591-021-01255-3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-021-01255-3&link_type=DOI) 39. Quick J. Josh Quick 2020. nCoV-2019 sequencing protocol v3 (LoCost). Protocols.io. [https://protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye](https://protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye). 40. GISAID. [https://www.gisaid.org/](https://www.gisaid.org/). 41. Singer J, Gifford R, Cotten M, Robertson D. CoV-GLUE: A Web Application for Tracking SARS-CoV-2 Genomic Variation. 2020; published online June 18. DOI:10.20944/PREPRINTS202006.0225.V1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.20944/PREPRINTS202006.0225.V1&link_type=DOI) 42. Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015. DOI:10.1093/molbev/msu300. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msu300&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25371430&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 43. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004; 32: 1792–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkh340&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15034147&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000220487200025&link_type=ISI) 44. Weisblum Y, Schmidt F, Zhang F, et al. Escape from neutralizing antibodies 1 by SARS-CoV-2 spike protein variants. Elife 2020; 9. DOI:10.7554/eLife.61312. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.61312&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33112236&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 45. Biswas Majumder PP. Analysis of RNA sequences of 3636 SARS-CoV-2 collected from 55 countries reveals selective sweep of one virus type. Indian J Med Res 2020; 151: 450. [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.4103/ijmr.IJMR_1125_20&link_type=DOI) 46. Shang J, Ye G, Shi K, et al. Structural basis of receptor recognition by SARS-CoV-2. Nature 2020; 581. DOI:10.1038/s41586-020-2179-y. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2179-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32225175&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F03%2F20%2F2021.03.17.21253810.atom) 47. Baum A, Fulton BO, Wloga E, et al. Antibody cocktail to SARS-CoV-2 spike protein prevents rapid mutational escape seen with individual antibodies. Science (80-) 2020; 369. DOI:10.1126/science.abd0831. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNjkvNjUwNi8xMDE0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDMvMjAvMjAyMS4wMy4xNy4yMTI1MzgxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 48. Starr TN, Greaney AJ, Addetia A, et al. Prospective mapping of viral mutations that escape antibodies used to treat COVID-19. Science (80-) 2021. DOI:10.1126/science.abf9302. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzEvNjUzMS84NTAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wMy8yMC8yMDIxLjAzLjE3LjIxMjUzODEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 49. Hodcroft EB, Domman DB, Oguntuyo K, et al. Emergence in late 2020 of multiple lineages of SARS-CoV-2 Spike protein variants affecting amino acid position 677. medRxiv 2021. 50. Turakhia Y, de Maio N, Thornlow B, et al. Stability of SARS-CoV-2 phylogenies. PLoS Genet 2020; 16. DOI:10.1371/journal.pgen.1009175. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1009175&link_type=DOI)