Abstract
Analysing 92,598 variant screening tests performed on SARS-CoV-2 positive samples collected in France between 1 July and 31 August 2021 shows an increase of Kappa-like infections. Full genome sequencing reveals that these correspond to Delta variants bearing the S:E484Q mutation. Most of these sequences belong to a phylogenetic cluster and also bear the S:T95I mutation. Further monitoring is needed to determine if this trend is driven by undocumented superspreading events or an early signal of future viral evolutionary dynamics.
The emergence of SARS-CoV-2 variants shows that virus evolution can threaten our ability to control epidemics [1,2,3] and lead to increased infection virulence [4]. Recently, the variant of concern Delta, which has a high replication rate and immune evasion properties [5], has caused epidemic rebounds in several countries including France, where it was shown to have a 52 to 110% transmission advantage over the Alpha variant [6]. Ongoing surveillance protocols involve variant-specific screening tests that target key mutations and full genome sequencing. In particular, the extension of full vaccination coverage with 2 doses, which now reaches on average 62% in France and 59% in the European Union [7], can be expected to modify the virus fitness landscape and favour the spread of escape mutations [8].
Tests and study population
We analysed 92,598 reverse transcriptase polymerase chain reaction (RT-PCR) variant-specific screening tests performed on samples collected in France by the CERBA network of medical analysis laboratories in patients from 5 to 80 years old between 1 July and 31 August 2021 (Table 1). 12.1% of these tests were performed using the VirSNiP SARS-CoV-2 Spike, TIB MOLBIOL assay, Berlin; 50.1% using the Variant Detect (TM) SARS-CoV-2 RT-PCR, PerkinElmer assay, USA, and 37.8% using the ID (TM) SARS-CoV-2/VOC evolution Pentaplex, ID Solution, France. The three tests screen for the presence of the E484K, the E484Q, and the L452R mutations in the Spike protein. Infections caused by the Delta variant are expected to be associated with L452R alone. The metadata associated with the variant-specific RT-PCR data is shown in Table 1.
To improve the specificity of the analysis, all the screening tests bearing the mutation S:E484Q and with low-enough Ct values, as well as a subset of other tests, were sequenced using the CovidSeq(tm) amplicon-based NGS assay according to supplier recommendations (Illumina technology). The bioinformatic workflow was executed using Nanoware(tm) software (Life and Soft®). Briefly, the fastQ files generated by NGS were prefiltered by seqktk v1.3 to eliminate bad quality bases. Human reads were then discarded using bwa v.0.7.17. Reads assemblage (megahit v1.2.9) and alignment (blast v2.9.0) were performed. A variant calling (freebayes v1.3.1) and annotation (snpEff v5.0) was carried out and a consensus sequence was constructed (ivar v1.3.1). Overall, we analysed 364 original full genome sequences.
Variant-screening test results
The majority of the screening tests (67,560, i.e. 73.0%) were consistent with an infection by a Delta variant (i.e. L452R+/E484Q-/E484K-). Fewer samples were consistent with the Alpha variant (3,179, i.e. 3.4%, L452R-/E484Q-/E484K-), the Beta/Gamma/Eta variants (364, i.e. 0.4%, L452R-/E484Q-/E484K+), and the Kappa variant (49, i.e. 0.1%, L452R+/E484Q+/E484K-). The number of tests analysed varied across French regions (Supplementary Figure 1).
To analyse the temporal trend associated with the frequency of a given variant, we used the methods described in [6] and performed a multinomial regression using the Delta-like variant as a reference (Table 2). The most striking effect we detected is that the few screening tests consistent with the Kappa variant appear to be strongly increasing with time in some French regions. The proportion of infections consistent with an Alpha variant was significantly decreasing over time in most French regions, which is consistent with the rapid spread of the Delta variant [6].
Doubling times and transmission advantage in the Parie area
Focusing on the tests consistent with a Kappa infection, the majority (33/49, i.e. 67%) originated from the Ile-de-France region (the Paris area) and the incidence globally increased from July to August 2021 (Supplementary Figure S2). Following earlier studies [9,10], we assessed the change in relative frequencies of Kappa-consistent infections compared to Delta-consistent infections in the Ile-de-France region during August 2021. The results show that the former had a transmission advantage over the latter of 45% (95% confidence interval: 31 – 60%) (Figure 1). Using an earlier starting date in July had little effect on the estimations (Figure S3).
Sequencing and phylogenetic analysis
Full genome sequencing and analysis through the nextclade pipeline [11] revealed that all the samples (35/35, 100%) bearing the S:E484Q mutation and identified as potential Kappa were in fact Delta variants bearing the E484Q mutation.
We then aligned the sequences using mafft [12], anonymised nucleotide position 23012 and performed a phylogenetic analysis using PhyML [13], using SMS to infer the substitution model [14]. The selected model was GTR+I+G. Strikingly, the majority of these Delta+E484Q sequences (28/35, 80%) clustered into the phylogeny into a clade (Figure 2). Most of these sequences from this clade (29/35, 83%) originated from individuals residing in the Ile-de-France region and the information was missing for 2 sequences. In addition to S:E484Q, the phylogenetic cluster is associated with the following substitutions that are not found in half of the other Delta sequences: ORF1a:T403I, ORF1a:D1167N, ORF1b:A1219S, and S:T95I. The latter is present in some variants of interest, such as Iota, and has been reported in vaccine breakthrough infections [15].
Finally, none of the individuals with Delta+E484Q reported travelling outside France 14 days before the sampling. As indicated above, nucleotide position 23,012, which is associated with the E484Q mutation, was ignored when performing the phylogenetic inference, meaning that the clustering originates from other mutations. Finally, based on the metadata analysis, there was no reported link between the 30 individuals from the phylogenetic cluster of interest (e.g. non were sequenced in the context of a transmission cluster). Note that two sequences in the cluster neither have the S:E484Q nor the S:T95I mutation. One possibility is the uncertainty in the phylogenetic reconstruction. Another could be the occurrence of genetic events such as recombination between two Delta lineages.
Discussion
We report an increase in infections caused by Delta variants bearing the E484Q mutation in the Paris area. The significant transmission advantage we estimate for this mutation is consistent with it being associated with immune evasion [8] and with the high vaccination coverage in France [7]. Note that other lineages bearing a mutation in position S:484 were found to spread earlier in 2021 [16] but their transmission advantage was more limited (approximately 17% versus 35 to 63% here) and the reference strain was the Alpha variant, whereas is less contagious that the Delta variant [6] and has less immune evasion properties [5].
Even if we detect a significant increasing trend, the Delta+E484Q samples represent a very small proportion of the tests performed. Therefore, the transmission advantage could be driven by an undocumented superspreading event in the area of Paris in August, that would leave traces in virus genomes [17]. This could be consistent with the fact that the number of sequences from Delta+E484Q in the GISAID database is currently very low (935 of the 937,570 Delta sequences in GISAID [18]). Furthermore, combining these sequences with 8 Delta sequences per month (April to September) from Italy, Spain, France, UK, Belgium, and Germany did not identify a clear link with the phylogenetic cluster of interest (Supplementary Figure S4). In vitro studies using viral culture assays could be instrumental to understand this phenomenon. SARS-CoV-2 prevalence remains high and the beginning of the school term is likely to increase virus spread. Therefore, the alternative, more worrying, interpretation of these results, which is an adaptive advantage of a combination of the S:E484Q and S:T95I mutations, warrants close monitoring of these genomic patterns.
Data Availability
Data will be made available after the peer-review process.
Conflict of interest
‘None declared’.
Ethical statement
This study has been approved by the Institutional Review Board of the CHU of Montpellier and is registered at ClinicalTrials.gov with identifier NCT04738331.
Acknowledgements
The authors thank Gilles Roullin, Eric Hedbaut and Virginie Dubois from CERBA for their technical assistance, and ETE team from CNRS, IRD, and University of Montpellier for discussion, and the EMERGEN consortium. This project was supported by the Occitanie region and the ANR (PHYEPI grant).