Abstract
Following its emergence in late 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a global pandemic resulting in unprecedented efforts to reduce transmission and develop therapies and vaccines (WHO Emergency Committee, 2020; Zhu et al., 2020). Rapidly generated viral genome sequences have allowed the spread of the virus to be tracked via phylogenetic analysis (Worobey et al., 2020; Hadfield et al., 2018; Pybus et al., 2020). While the virus spread globally in early 2020 before borders closed, intercontinental travel has since been greatly reduced, allowing continent-specific variants to emerge. However, within Europe travel resumed in the summer of 2020, and the impact of this travel on the epidemic is not well understood. Here we report on a novel SARS-CoV-2 variant, 20E (EU1), that emerged in Spain in early summer, and subsequently spread to multiple locations in Europe. We find no evidence of increased transmissibility of this variant, but instead demonstrate how rising incidence in Spain, resumption of travel across Europe, and lack of effective screening and containment may explain the variant’s success. Despite travel restrictions and quarantine requirements, we estimate 20E (EU1) was introduced hundreds of times to countries across Europe by summertime travellers, likely undermining local efforts to keep SARS-CoV-2 cases low. Our results demonstrate how a variant can rapidly become dominant even in absence of a substantial transmission advantage in favorable epidemiological settings. Genomic surveillance is critical to understanding how travel can impact SARS-CoV-2 transmission, and thus for informing future containment strategies as travel resumes.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the first pandemic where the spread of a viral pathogen has been globally tracked in near real-time using phylogenetic analysis of viral genome sequences (Worobey et al., 2020; Hadfield et al., 2018; Pybus et al., 2020). SARS-CoV-2 genomes continue to be generated at a rate far greater than for any other pathogen and more than 500,000 full genomes are available on GISAID as of February 2020 (Shu and McCauley, 2017).
In addition to tracking the viral spread, these genome sequences have been used to monitor mutations which might change the transmission, pathogenesis, or anti-genic properties of the virus. One mutation in particular, D614G in the spike protein, has received much attention. This variant (Nextstrain clade 20A) seeded large outbreaks in Europe in early 2020 and subsequently dominated the outbreaks in the Americas, thereby largely replacing previously circulating lineages. This rapid rise led to the suggestion that this variant is more transmissible, which has since been corroborated by phylogenetic (Korber et al., 2020; Volz et al., 2020) and experimental evidence (Plante et al., 2020; Yurkovetskiy et al., 2020).
Following the global dissemination of SARS-CoV-2 in early 2020 (Worobey et al., 2020), intercontinental travel dropped dramatically. Within Europe, however, travel and in particular holiday travel resumed in summer (though at lower levels than in previous years) with largely uncharacterized effects on the pandemic. Here we report on a novel SARS-CoV-2 variant 20E (EU1) (S:A222V) that emerged in early summer 2020, presumably in Spain, and subsequently spread to multiple locations in Europe. Over the summer, it rose in frequency in parallel in multiple countries. As we report here, this variant, 20E (EU1), and a second variant 20A.EU2 with mutation S477N in the spike protein accounted for the majority of sequences in Europe in the autumn of 2020.
Competing Interest Statement
DV is a consultant for Vir Biotechnology Inc. DC is an employee of Vir Biotechnology and may hold shares in Vir Biotechnology. The Veesler laboratory has received an unrelated sponsored research agreement from Vir Biotechnology Inc. AH is a co-founder of Kido Dynamics, DM is employed by Kido Dynamics. The other authors declare no competing interests.
Funding Statement
We would also like to thank the Swiss Federal Office of Public Health (FOPH) for providing access to their data. This work was supported by the Swiss National Science Foundation (SNSF) through grant numbers 31CA30_196046 (to RAN, EBH, CLA), 31CA30_196267 (to TS), European Union's Horizon 2020 research and innovation programme - project EpiPose (No 101003688) (MLR, CLA), core funding by the University of Basel and ETH Zürich, the National Institute of General Medical Sciences (R01GM120553 to DV), the National Institute of Allergy and Infectious Diseases (DP1AI158186 and HHSN272201700059C to DV), a Pew Biomedical Scholars Award (DV), an Investigators in the Pathogenesis of Infectious Disease Awards from the Burroughs Wellcome Fund (DV and JDB), a Fast Grants (DV), and NIAID grants R01AI141707 (JDB) and F30AI149928 (KHDC). SeqCOVID-SPAIN is funded by the Instituto de Salud Carlos III project COV20/00140, Spanish National Research Council and ERC StG 638553 to IC and BFU2017-89594R from MICIN to FGC. JDB is an Investigator of the Howard Hughes Medical Institute.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This is an analysis of publicly available data and no IRB approval is needed.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Updates from Version 2: This update includes all new sequence data available through 30 Nov 2020, and analyses, descriptions, and figures have been updated accordingly. We have included a more detailed model based on regional SARS-CoV-2 incidence in Spain and regional roaming mobile phone data to provinces in Spain from countries in Europe, combined with the epidemic changes in the home countries, to predict EU1 frequency over time. Additionally we include further lentiviral lab work, including antibody binding. We have updated Re estimates and included Re estimates for Europe as well as Switzerland, and included recorded data on number of cases and age of cases in Switzerland and Germany which were suspected to have been acquired in Spain over the summer. Updates from Version 1: This update includes all new sequence data available until 10 Nov 2020, and analyses, descriptions, and figures have been updated accordingly. Additionally, we include details of lentiviral lab work investigating the S:A222V mutation, and a simple model illustrating how travel volume and SARS-CoV-2 prevalence can largely explain the spread of 20A.EU1 observed across Europe over the summer. None of the updates have changed the overall conclusions of the paper.
Data Availability
All sequence data is available on GISAID (gisaid.org), and the code for the analyses performed is available on Github and linked in the manuscript text.