Spread of SARS-CoV-2 Delta variant infections bearing the S:E484Q and S:T95I mutations in July and August 2021 in France ======================================================================================================================== * Laura Verdurme * Gonché Danesh * Sabine Trombert-Paolantoni * Mircea T. Sofonea * Valérie Noel * Vincent Foulongne * Brigitte Montès * Edouard Tuaillon * Stéphanie Haim-Boukobza * Bénédicte Roquebert * Samuel Alizon ## Abstract Analysing 92,598 variant screening tests performed on SARS-CoV-2 positive samples collected in France between 1 July and 31 August 2021 shows an increase of Kappa-like infections. Full genome sequencing reveals that these correspond to Delta variants bearing the S:E484Q mutation. Most of these sequences belong to a phylogenetic cluster and also bear the S:T95I mutation. Further monitoring is needed to determine if this trend is driven by undocumented superspreading events or an early signal of future viral evolutionary dynamics. Keywords * COVID-19 * virus * variant * RT-PCR * epidemiology * statistical modelling * phylogenetics The emergence of SARS-CoV-2 variants shows that virus evolution can threaten our ability to control epidemics [1,2,3] and lead to increased infection virulence [4]. Recently, the variant of concern Delta, which has a high replication rate and immune evasion properties [5], has caused epidemic rebounds in several countries including France, where it was shown to have a 52 to 110% transmission advantage over the Alpha variant [6]. Ongoing surveillance protocols involve variant-specific screening tests that target key mutations and full genome sequencing. In particular, the extension of full vaccination coverage with 2 doses, which now reaches on average 62% in France and 59% in the European Union [7], can be expected to modify the virus fitness landscape and favour the spread of escape mutations [8]. ## Tests and study population We analysed 92,598 reverse transcriptase polymerase chain reaction (RT-PCR) variant-specific screening tests performed on samples collected in France by the CERBA network of medical analysis laboratories in patients from 5 to 80 years old between 1 July and 31 August 2021 (Table 1). 12.1% of these tests were performed using the VirSNiP SARS-CoV-2 Spike, TIB MOLBIOL assay, Berlin; 50.1% using the Variant Detect (TM) SARS-CoV-2 RT-PCR, PerkinElmer assay, USA, and 37.8% using the ID (TM) SARS-CoV-2/VOC evolution Pentaplex, ID Solution, France. The three tests screen for the presence of the E484K, the E484Q, and the L452R mutations in the Spike protein. Infections caused by the Delta variant are expected to be associated with L452R alone. The metadata associated with the variant-specific RT-PCR data is shown in Table 1. View this table: [Table 1:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/T1) Table 1: Description of the population studied with the N= 92,598 variant-specific screening tests performed between 1 July and 31 August 2021. CI: confidence interval. To improve the specificity of the analysis, all the screening tests bearing the mutation S:E484Q and with low-enough Ct values, as well as a subset of other tests, were sequenced using the CovidSeq(tm) amplicon-based NGS assay according to supplier recommendations (Illumina technology). The bioinformatic workflow was executed using Nanoware(tm) software (Life and Soft®). Briefly, the fastQ files generated by NGS were prefiltered by seqktk v1.3 to eliminate bad quality bases. Human reads were then discarded using bwa v.0.7.17. Reads assemblage (megahit v1.2.9) and alignment (blast v2.9.0) were performed. A variant calling (freebayes v1.3.1) and annotation (snpEff v5.0) was carried out and a consensus sequence was constructed (ivar v1.3.1). Overall, we analysed 364 original full genome sequences. ## Variant-screening test results The majority of the screening tests (67,560, i.e. 73.0%) were consistent with an infection by a Delta variant (i.e. L452R+/E484Q-/E484K-). Fewer samples were consistent with the Alpha variant (3,179, i.e. 3.4%, L452R-/E484Q-/E484K-), the Beta/Gamma/Eta variants (364, i.e. 0.4%, L452R-/E484Q-/E484K+), and the Kappa variant (49, i.e. 0.1%, L452R+/E484Q+/E484K-). The number of tests analysed varied across French regions (Supplementary Figure 1). ![Figure S1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F3.medium.gif) [Figure S1:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F3) Figure S1: Number of tests analysed per French region between 1 July and 31 August 2021 (A) and ratio of these tests consistent with a Kappa infection (B). To analyse the temporal trend associated with the frequency of a given variant, we used the methods described in [6] and performed a multinomial regression using the Delta-like variant as a reference (Table 2). The most striking effect we detected is that the few screening tests consistent with the Kappa variant appear to be strongly increasing with time in some French regions. The proportion of infections consistent with an Alpha variant was significantly decreasing over time in most French regions, which is consistent with the rapid spread of the Delta variant [6]. View this table: [Table 2:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/T2) Table 2: Factors associated with the detection of potential SARS-CoV-2 variants, as assessed by relative risk ratios (RRR) using a multinomial log-linear model, France, 1 July-31 August 2021 (n=92,598). In the model, screening tests consistent with a Delta variant are the reference for the RRR. See [6] for details about the methods used. NS: non-signficant ## Doubling times and transmission advantage in the Parie area Focusing on the tests consistent with a Kappa infection, the majority (33/49, i.e. 67%) originated from the Ile-de-France region (the Paris area) and the incidence globally increased from July to August 2021 (Supplementary Figure S2). Following earlier studies [9,10], we assessed the change in relative frequencies of Kappa-consistent infections compared to Delta-consistent infections in the Ile-de-France region during August 2021. The results show that the former had a transmission advantage over the latter of 45% (95% confidence interval: 31 – 60%) (Figure 1). Using an earlier starting date in July had little effect on the estimations (Figure S3). ![Figure S2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F4.medium.gif) [Figure S2:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F4) Figure S2: Daily number of tests consistent with a Kappa infection. Zeros are ignored. The lines show the output of a linear model and the shaded area the 95% confidence interval. ![Figure S3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F5.medium.gif) [Figure S3:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F5) Figure S3: Effect of using an earlier date for the temporal interval on the estimates of the transmission advantage of the Kappa variant over the Delta variant. The starting date has little effect but the transmission advantage cannot be estimated on the whole time period (i.e. starting on July 1st), most likely because of stochastic variations during the month of July. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F1) Figure 1: Estimated transmission advantage of the tests consistent with the Kappa variant with respect to that consistent with the Delta variant in Ile-de-France, 1-31 August 2021. The triangles indicate the generalised linear model (GLM)-fitted values and the line the output of the logistic growth model estimation. The shaded area represents the 95 %CI. The title indicates the estimated transmission advantage. ## Sequencing and phylogenetic analysis Full genome sequencing and analysis through the nextclade pipeline [11] revealed that all the samples (35/35, 100%) bearing the S:E484Q mutation and identified as potential Kappa were in fact Delta variants bearing the E484Q mutation. We then aligned the sequences using mafft [12], anonymised nucleotide position 23012 and performed a phylogenetic analysis using PhyML [13], using SMS to infer the substitution model [14]. The selected model was GTR+I+G. Strikingly, the majority of these Delta+E484Q sequences (28/35, 80%) clustered into the phylogeny into a clade (Figure 2). Most of these sequences from this clade (29/35, 83%) originated from individuals residing in the Ile-de-France region and the information was missing for 2 sequences. In addition to S:E484Q, the phylogenetic cluster is associated with the following substitutions that are not found in half of the other Delta sequences: ORF1a:T403I, ORF1a:D1167N, ORF1b:A1219S, and S:T95I. The latter is present in some variants of interest, such as Iota, and has been reported in vaccine breakthrough infections [15]. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F2) Figure 2: Phylogeny inferred from 364 Delta-positive sequences from France. The reference sequence from Wuhan (China) was used to root the phylogeny and align the sequences. Blue tips indicate sequences bearing the S:T95I mutations, red tips sequences with the S:E484Q mutation, and purple tips sequences with both mutations. Finally, none of the individuals with Delta+E484Q reported travelling outside France 14 days before the sampling. As indicated above, nucleotide position 23,012, which is associated with the E484Q mutation, was ignored when performing the phylogenetic inference, meaning that the clustering originates from other mutations. Finally, based on the metadata analysis, there was no reported link between the 30 individuals from the phylogenetic cluster of interest (e.g. non were sequenced in the context of a transmission cluster). Note that two sequences in the cluster neither have the S:E484Q nor the S:T95I mutation. One possibility is the uncertainty in the phylogenetic reconstruction. Another could be the occurrence of genetic events such as recombination between two Delta lineages. ## Discussion We report an increase in infections caused by Delta variants bearing the E484Q mutation in the Paris area. The significant transmission advantage we estimate for this mutation is consistent with it being associated with immune evasion [8] and with the high vaccination coverage in France [7]. Note that other lineages bearing a mutation in position S:484 were found to spread earlier in 2021 [16] but their transmission advantage was more limited (approximately 17% versus 35 to 63% here) and the reference strain was the Alpha variant, whereas is less contagious that the Delta variant [6] and has less immune evasion properties [5]. Even if we detect a significant increasing trend, the Delta+E484Q samples represent a very small proportion of the tests performed. Therefore, the transmission advantage could be driven by an undocumented superspreading event in the area of Paris in August, that would leave traces in virus genomes [17]. This could be consistent with the fact that the number of sequences from Delta+E484Q in the GISAID database is currently very low (935 of the 937,570 Delta sequences in GISAID [18]). Furthermore, combining these sequences with 8 Delta sequences per month (April to September) from Italy, Spain, France, UK, Belgium, and Germany did not identify a clear link with the phylogenetic cluster of interest (Supplementary Figure S4). *In vitro* studies using viral culture assays could be instrumental to understand this phenomenon. SARS-CoV-2 prevalence remains high and the beginning of the school term is likely to increase virus spread. Therefore, the alternative, more worrying, interpretation of these results, which is an adaptive advantage of a combination of the S:E484Q and S:T95I mutations, warrants close monitoring of these genomic patterns. ![Figure S4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/16/2021.09.13.21263371/F6.medium.gif) [Figure S4:](http://medrxiv.org/content/early/2021/09/16/2021.09.13.21263371/F6) Figure S4: Phylogeny showing sequences from the European countries (in yellow) and the 364 original sequences from the study. Sequences from the study bearing the S:E484Q mutation are in red, the others are in black. GISAID accession numbers will be provided upon publications. ## Data Availability Data will be made available after the peer-review process. ## Conflict of interest ‘None declared’. ## Ethical statement This study has been approved by the Institutional Review Board of the CHU of Montpellier and is registered at [ClinicalTrials.gov](http://ClinicalTrials.gov) with identifier [NCT04738331](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04738331&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom). ## Acknowledgements The authors thank Gilles Roullin, Eric Hedbaut and Virginie Dubois from CERBA for their technical assistance, and ETE team from CNRS, IRD, and University of Montpellier for discussion, and the EMERGEN consortium. This project was supported by the Occitanie region and the ANR (PHYEPI grant). * Received September 13, 2021. * Revision received September 13, 2021. * Accepted September 16, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. [1].Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday JD, et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science. 2021;eabg3055. 2. [2].Faria NR, Mellan TA, Whittaker C, Claro IM, Candido D da S, Mishra S, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 2021;372(6544):815–21. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzIvNjU0NC84MTUiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wOS8xNi8yMDIxLjA5LjEzLjIxMjYzMzcxLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 3. [3].Abdool Karim SS, de Oliveira T. New SARS-CoV-2 Variants — Clinical, Public Health, and Vaccine Implications. New England Journal of Medicine. 2021;384(19):1866–8. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom) 4. [4].Alizon S, Sofonea MT. SARS-CoV-2 virulence evolution: avirulence theory, immunity, and trade-offs. Journal of Evolutionary Biology. 2021, in press, DOI 10.1111/jeb.13896 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/jeb.13896&link_type=DOI) 5. [5].Mlcochova P, Kemp S, Dhar MS, Papa G, Meng B, Ferreira IATM, et al. SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion. Nature. 2021; in press, DOI: 10.1038/s41586-021-03944-y. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03944-y&link_type=DOI) 6. [6].Alizon S, Haim-Boukobza S, Foulongne V, Verdurme L, Trombert-Paolantoni S, Lecorche E, et al. Rapid spread of the SARS-CoV-2 Delta variant in some French regions, June 2021. Eurosurveillance. 2021;26(28):2100573. 7. [7]. Hannah Ritchie, Edouard Mathieu LR-G Cameron Appel, Charlie Giattino, Esteban Ortiz-Ospina, Joe Hasell, Bobbie Macdonald, Diana Beltekian, Roser M. Coronavirus pandemic (COVID-19). Our World in Data. 2020; [https://ourworldindata.org/coronavirus](https://ourworldindata.org/coronavirus) Accessed September 9, 2021 8. [8].Starr TN, Greaney AJ, Addetia A, Hannon WW, Choudhary MC, Dingens AS, et al. Prospective mapping of viral mutations that escape antibodies used to treat COVID-19. Science. 2021; in press DOI: 10.1126/science.abf9302 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzEvNjUzMS84NTAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wOS8xNi8yMDIxLjA5LjEzLjIxMjYzMzcxLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 9. [9].Chevin L-M. On measuring selection in experimental evolution. Biology Letters. 2011;7(2):210–3. 10. [10].Haim-Boukobza S, Roquebert B, Trombert-Paolantoni S, Lecorche E, Verdurme L, Foulongne V, et al. Detection of Rapid SARS-CoV-2 Variant Spread, France, January 26--February 16, 2021. Emerging Infectious Diseases. 2021;27(5):1496–9. 11. [11].Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018;34(23):4121–3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10/gdkbqx&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom) 12. [12].Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Briefings in Bioinformatics. 2019;20(4):1160–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bib/bbx108&link_type=DOI) 13. [13].Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003;52(5):696–704. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/10635150390235520&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14530136&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000185732500010&link_type=ISI) 14. [14].Lefort V, Longueville J-E, Gascuel O. SMS: smart model selection in PhyML. Molecular biology and evolution 2017: 34:2422–2424. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msx149&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28472384&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom) 15. [15].Hacisuleyman E, Hale C, Saito Y, Blachere NE, Bergh M, Conlon EG, et al. Vaccine Breakthrough Infections with SARS-CoV-2 Variants. New England Journal of Medicine. 2021; in press DOI: 10.1056/NEJMoa2105000 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2105000&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33882219&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F16%2F2021.09.13.21263371.atom) 16. [16].Roquebert B, Trombert-Paolantoni S, Haim-Boukobza S, Lecorche E, Verdurme L, Foulongne V, et al. The SARS-CoV-2 B.1.351 lineage (VOC β) is outgrowing the B.1.1.7 lineage (VOC α) in some French regions in April 2021. Eurosurveillance. 2021 Jun 10;26(23):2100447. 17. [17].Alizon S. Superspreading genomes. Science. 2021 Feb 5;371(6529):574–5. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzEvNjUyOS81NzQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wOS8xNi8yMDIxLjA5LjEzLjIxMjYzMzcxLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 18. [18].Latif A. A., Mullen J L., Alkuzweny M, Tsueng G, Cano M, Haag E, Zhou J, Zeller M, Hufbauer E, Matteson N, Wu C, Andersen K G, Su A I, Gangavarapu K, Hughes L D, and the Center for Viral Systems Biology. [http://outbreak.info](http://outbreak.info), [https://outbreak.info/compare-lineages](https://outbreak.info/compare-lineages), Accessed 9 September 2021.