Genomic Epidemiology of SARS-CoV-2 in Seychelles, 2020-2021 =========================================================== * John Mwita Morobe * Brigitte Pool * Lina Marie * Dwayne Didon * Arnold W. Lambisia * Timothy Makori * Khadija Said Mohammed * Zaydah R. de Laurent * Leonard Ndwiga * Maureen W. Mburu * Edidah Moraa * Nickson Murunga * Jennifer Musyoki * Jedida Mwacharo * Lydia Nyamako * Debra Riako * Pariken Ephnatus * Faith Gambo * Josephine Naimani * Joyce Namulondo * Susan Zimba Tembo * Edwin Ogendi * Thierno Balde * Fred Athanasius Dratibi * Yahaya Ali Ahmed * Nicksy Gumede * Rachel A. Achilla * Peter K. Borus * Dorcas Wanjohi * Sofonias K. Tessema * Joseph Mwangangi * Philip Bejon * D. James Nokes * Lynette Isabella Ochola-Oyier * George Githinji * Leon Biscornet * Charles N. Agoti ## ABSTRACT Seychelles, an archipelago of 155 islands in the Indian Ocean, had confirmed 24,788 cases of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the 31st December 2021. The first SARS-CoV-2 cases in Seychelles were reported on the 14th of March 2020, but cases remained low until January 2021, when a surge of SARS-CoV-2 cases was observed on the islands. Here, we investigated the potential drivers of the surge by genomic analysis 1,056 SARS-CoV-2 positive samples collected in Seychelles between 14th March 2020 and 31st December 2021. The Seychelles genomes were classified into 32 Pango lineages, 1,042 of which fell within four variants of concern i.e., Alpha, Beta, Delta and Omicron. Sporadic cases of SARS-CoV-2 detected in Seychelles in 2020 were mainly of lineage B.1 (European origin) but this lineage was rapidly replaced by Beta variant starting January 2021, and which was also subsequently replaced by the Delta variant in May 2021 that dominated till November 2021 when Omicron cases were identified. Using ancestral state reconstruction approach, we estimated at least 78 independent SARS-CoV-2 introduction events into Seychelles during the study period. Majority of viral introductions into Seychelles occurred in 2021, despite substantial COVID-19 restrictions in place during this period. We conclude that the surge of SARS-CoV-2 cases in Seychelles in January 2021 was primarily due to the introduction of more transmissible SARS-CoV-2 variants into the islands. Key words * Seychelles * SARS-CoV-2 * Variants of concern ## BACKGROUND Seychelles, an archipelago of 155 islands in the Indian Ocean with a population size of approximately 99,2021, confirmed its first cases of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the aetiological agent of coronavirus disease 2019 (COVID-19), on the 14th March 2020. This was shortly after the World Health Organisation (WHO) declared COVID-19 a global pandemic on the 11th March 2020. However, the number of COVID-19 cases in Seychelles remained low (average of 1 case/day) until January 2021 when a surge was observed in the country (average of 72 cases/day). As of 31st December 2021, Seychelles had reported 24,788 laboratory-confirmed COVID-19 cases, >98% of which were recorded in 20212. The surge of the number of COVID-19 cases in Seychelles in 2021 could be due to 2 major factors: (a) the relaxation of government COVID-19 stringency measures and (b) the arrival of more transmissible SARS-CoV-2 variants on the islands. Our analysis looked at these factors in an attempt to improve understanding of the COVID-19 transmission dynamics in Seychelles. The first round of COVID-19 countermeasures by the Seychelles government to curb introductions and spread of the virus were announced in March 2020. These measures included: a 14-day quarantine for people returning from countries with significant COVID-19 community transmission on the 16th March 2020, closure of day care centres and learning institutions, and ban on international arrivals and foreign travel by Seychellois citizens, except for medical emergencies beginning 23rd March 2020, a 21-day nationwide lockdown, tracing, isolation and monitoring of all persons who had close contact with COVID-19 patients for 14 days beginning 6th April 2020, closure of all shops except those that sell food items, groceries, pharmaceutical products beginning 6th April 2020, workplace closures and restriction of outdoor movement except for essential services on 9th April. On the 4th of May 2020, the Seychelles government eased some of the COVID-19 restrictions, including opening of all day care and learning institutions, opening of all shops, and lifting of the ban on restrictions of movement of people. In June 2020, the Seychelles government lifted the ban on international travel and allowed visitors (international tourists) from countries categorised as low-risk countries, however with a requirement to show a COVID-19 negative certificate (RT-PCR test). Despite this removal of many of the Government restrictions, the number of SARS-CoV-2 cases in Seychelles throughout 2020 remained low. Towards the end of 2020 and in 2021, in widely different geographical locations, five SARS-CoV-2 variants of concern (VOC) - Alpha, Beta, Gamma, Delta and Omicron, five variants of variants of interest (VOI) – Eta, Kappa, Iota, Epsilon and Theta and over 10 variants under monitoring (VUM) - emerged that appeared to be considerably more transmissible and with potential to escape pre-existing immunity3,4 Notably, the surge of COVID-19 cases in Seychelles in early 2021 coincided with their emergence. Further, in the last quarter of 2021, the Omicron SARS-CoV-2 variants of concern (VOCs) was detected globally4. The emergence and spread of these variants of concern globally, has exacerbated the effect of the COVID-19 pandemic5. The objective of this study was to describe the genomic epidemiology of SARS-CoV-2 in Seychelles and in particular lineages coinciding with the surge of cases that began in January 2021, with the aim of improving the understanding on the introduction and transmission of SARS-CoV-2 in Seychelles. ## MATERIALS AND METHODS ### Ethical statement The SARS-CoV-2 positive samples were sequenced at the Kenya Medical Research Institute (KEMRI) - Wellcome Trust Research Programme (KWTRP) as part of a regional collaborative COVID-19 public health rapid response initiative overseen by WHO-AFRO and Africa-CDC. KWTRP Kilifi is one of the 12 designated WHO-AFRO/Africa-CDC regional reference laboratories for SARS-CoV-2 genomic surveillance in Africa. The whole genome sequencing study protocol was reviewed and approved by the Scientific and Ethics Review Committee (SERU) at KEMRI, (SERU #4035). Individual patient consent requirement was waivered by the committee as the sequenced samples were part of the public health emergency response. ### Study site and samples A total of 1,298 SARS-CoV-2 real-time polymerase chain reaction (qRT-PCR)-confirmed nasopharyngeal and oropharyngeal (NP/OP) positive swab samples collected between 14th March 2020 and 31st December 2021 were targeted for whole genome sequencing. The samples received for sequencing at KWTRP for whole genome sequencing were selected considering cycle threshold (Ct) value cut off <30. The monthly temporal distribution of samples selected for whole genome sequencing is shown in **Supplementary Figure 1**. ### Laboratory procedures #### RNA extraction, cDNA synthesis and amplification The NP/OP swab samples on arrival at KWTRP laboratories were re-extracted using the QIAamp Viral RNA Mini Kit (Qiagen, Manchester, UK) following the manufacturer’s instructions, starting volume 140 µl, and elution volume of 60 μl. The RNA was then re-assayed to confirm SARS-CoV-2 genetic material using one of three commercial kits namely, Da An Gene Co. Ltd.’s Detection Kit (targeting N gene or ORF1ab), SD Biosensor’s Standard M Real Time Detection Kit (targeting E gene and ORF1ab) or KH Medical’s RADI COVID-19 Detection Kit (targeting RdRp and S genes) while following manufacturer’s instructions. Samples with Ct values <33 were selected for cDNA synthesis. Extracted RNA was reverse transcribed using the LunaScript® RT SuperMix Kit (New England Biolabs, US). For each of the selected sample, 2 μl of the LunaScript® RT SuperMix was added to 8 μl of RNA template, incubated at 25°C for 2 minutes, 55°C for 10 minutes, held at 95°C for 1 minute and placed on ice for 1 minute. The generated viral cDNA was amplified using the Q5® Hot Start High-Fidelity 2X Master Mix (NEB, US) along with ARTIC nCoV-2019 version 3 primers (primer pools A and B), as documented previously6. The thermocycling conditions involved a touchdown PCR with the following conditions: heat activation at 98°C for 30 seconds, followed by 40 amplification cycles (i.e., 25 cycles of 98°C for 15 seconds and 65°C for 5 minutes, and 15 cycles of 62.5°C for 5 minutes and 98°C for 15 seconds and one cycle at 62.5°C for 5 minutes), final extension at 62.5°C for 5 minutes, followed by a final hold at 4°C. To overcome amplicon dropouts in regions 3, 9, 17, 26, 64, 66, 67, 68, 74 88, 91 and 92 of the genomes6, primer pairs for the aforementioned regions were constituted in an additional pool, named herein pool C. After the multiplex PCR, an agarose gel electrophoresis step was included to exclude samples with no visible bands from further processing. #### Oxford Nanopore library preparation and sequencing For each sample, the PCR products of primer pools A, B and C were combined to make a total of 23μl (all of pool A (10μl), pool B (10μl) and pool C(3µl)) and cleaned using 1x AMPure XP beads (Beckman Coulter, US), followed by two ethanol (80%) washes. The pellet was resuspended in 20 μl nuclease-free water and 1 μl of the eluted sample was quantified using the Qubit dsDNA HS Assay Kit (ThermoFisher Scientific, US). End-prep reaction was performed according to the ARTIC nCoV-2019 sequencing protocol v3 (LoCost) with 200 fmol (50ng) of amplicons and the NEBNext Ultra II End repair/dA-tailing Kit (NEB, US) and incubated at 20°C for 5 minutes and 65°C for 5 minutes. From this, 1μl of DNA was used for barcode ligation using Native Barcoding Expansion 96 (Oxford Nanopore Technology, UK) and NEBNext Ultra II Ligation Module reagents (NEB, US). Incubation was performed at 20°C for 20 minutes and at 65°C for 10 minutes. This step was eventually modified to employ NEBNext Blunt/TA Ligase Master Mix (NEB, US) using the same barcodes and incubation conditions. The barcoded samples were pooled together. The pooled and barcoded DNA samples were cleaned using 0.4X AMPure XP beads followed by two ethanol (80%) washes and eluted in a 1/14 of the original volume of nuclease-free water. Adapter ligation was performed using 50-100 ng of the barcoded amplicon pool, NEBNext Quick Ligation Module reagents (NEB, US) and Adaptor Mix II (ONT, UK), and incubated at room temperature for 20 minutes. Final clean-up was performed using 1X AMPure XP beads and 125 μl of Short Fragment Buffer (ONT, UK). The library was eluted in 15 μl Elution Buffer (ONT, UK). The final library was normalised to 15-70 ng, loaded on a SpotON R9 flow cell and sequenced on a MinION Mk1B or GridION device6. #### SARS-CoV-2 genome consensus assembly The data generated via the MinION and GridION devices were processed using the ARTIC bioinformatic protocol ([https://artic.network/ncov-2019/ncov2019-bioinformatics-sop.html](https://artic.network/ncov-2019/ncov2019-bioinformatics-sop.html)). In brief, raw FAST5 files were base called and demultiplexed using ONT’s Guppy (version 4.0.5) in high accuracy mode using a minimum Q score of 7. FASTQ reads between 300 bps and 750 bps were filtered using the guppyplex module. The consensus sequences were generated by aligning base called reads against the SARS-CoV-2 reference genome (GenBank accession [MN908947.3](http://medrxiv.org/lookup/external-ref?link_type=GEN&access_num=MN908947.3&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom)) using MiniMap27. All positions with a genome coverage of less than 20 reads were masked with Ns. The consensus sequences were then polished using Nanopolish toolkit (version 0.13.3) using the raw signals. #### Lineage and VOC assignment Additional quality control, clade assignment and mutation profiles were obtained using the NextClade tool v1.13.28 using a SARS-CoV-2 reference genome (accession [NC\_045512](http://medrxiv.org/lookup/external-ref?link_type=GEN&access_num=NC_045512&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom)). All consensus sequences with a genome coverage >70% were classified using the PANGO lineage assignment tool (Pangolin version 3.1.20 and PangoLearn v.02/02/2022)9. #### Global comparison sequences Seychelles sequences were analysed against a backdrop of globally representative SARS-CoV-2 lineages. To ensure global representation of sequences, we downloaded all the sequences (n= 8,916,634) from the Global Initiative on Sharing All Influenza Data (GISAID) database collected before 31st December and used an in-house R script to randomly select a sub-sample of 5,179 genomes while considering Pango lineage (lineages detected in Seychelles only), continent and date of collection. These global random genomes were collected from 150 countries and territories between 2nd May 2020 and 31st December 2021. #### Phylogenetic reconstruction The retrieved global sequence dataset, the sequences from Seychelles were aligned using Nextalign version 1.4.1. against the reference SARS-CoV-2 genome (accession [NC\_045512](http://medrxiv.org/lookup/external-ref?link_type=GEN&access_num=NC_045512&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom)). A maximum likelihood (ML) phylogeny was inferred using IQTREE version 2.1.3 ([http://www.iqtree.org/](http://www.iqtree.org/)). The software initiates tree reconstruction after assessment and selection of the best model of nucleotide substitution for the alignment. TreeTime version 0.8.1 was used to transform the ML tree topology into a time calibrated phylogenetic tree. The resulting trees were visualized using the Bioconductor ggTree version 2.2.4 package10 in R11. #### Estimation of virus importations and exportations into Seychelles The global ML tree topology was used to estimate the number of viral transmission events between Seychelles and the rest of the world as described previously12. TreeTime was used to transform the ML tree topology into a dated phylogenetic tree. Outlier sequences (n=208) were identified by TreeTime and excluded during this process. A mugration model was fitted using the time-scaled phylogenetic tree, mapping the location status of the genomes from Seychelles at both the tips and internal nodes. Using the date and location annotated tree topology, we counted the number of transitions between Seychelles and the rest of the world and plotted this using ggplot2 version 3.3.313. #### Statistical analysis All statistical analyses were performed using R version 4.1.011. ## RESULTS ### Sequenced COVID-19 cases in Seychelles The rise of COVID-19 cases in 2021 was preceded by a period of relaxed countermeasures to curb the spread of the virus **(Figure1 A and B)**. Of 1,298 SARS-CoV-2 positive samples received at KWTRP for genome sequencing, near complete genomes (>70% genome coverage) were recovered from 1,056 samples, and these were used in the subsequent lineage and phylogenetic analysis **(Supplementary Figure 2)**. A summary of the demographic details for the samples successfully sequenced and those that failed are provided in **Table 1**. View this table: [Table 1](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/T1) Table 1 Demographic characteristics of SARS-CoV-2 positive samples received for sequencing at KWTRP. Sample were collected between 14th March 2020 and 31st December 2021 (n = 1,298). ### SARS-CoV-2 lineages circulating in Seychelles The recovered 1,056 genomes were classified into 32 distinct Pango lineages, 28 of which occurred within VOC, VOI or VUM: Alpha VOC (n=1), Beta VOC (n=1), Delta VOC (n=21) and Omicron VOC (n=3) and Kappa VOI (n=1) and B.1.640.2 VUM (n=1) **(Table 2)**. A total of four non-VOC/non-VOI/non-VUM lineages were detected among the sequenced infections in Seychelles: B.1 (n=9), B.1.1 (n=1), B.1.1.50 (n=1) and lineage B.1.1.277 (n=1). Lineage B.1 (predominantly detected in Europe) was the first lineage to be detected in Seychelles, in a sequenced sample from June 2020, followed by lineage B.1.1.277 (predominantly detected in Europe) in October 2020, B.1.1.50 (Predominantly detected in Israel and Palestine) in January 2021 followed by B.1.1 in May 2021. The non-VOC/VOI lineages were replaced by the Beta VOC (B.1.351) in February 2021, and which was later subsequently replaced by the Delta VOC in May 2021 that dominated until November 2021 when Omicron cases were first identified **(Figure 1C and Figure 2)**. View this table: [Table 2.](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/T2) Table 2. Description of SARS-CoV-2 lineages observed in Seychelles, their global history and VOC/VOI status ![Figure 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F1.medium.gif) [Figure 1](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F1) Figure 1 **A** Seychelles government intervention levels as measured by the Oxford stringency index 29. **B** An epidemic curve for Seychelles derived from the daily positive case numbers obtained from [https://ourworldindata.org/coronavirus/country/seychelles](https://ourworldindata.org/coronavirus/country/seychelles). **C** Monthly temporal pattern of SARS-CoV-2 lineages and variants in Seychelles among the 1,056 samples sequenced from COVID-19 positive cases from the Seychelles (25th June 2020, to 31st December 2021). **D** Monthly temporal distribution of Delta VOC lineages among samples sequenced from COVID-19 positive cases from the Seychelles (25th June 2020, to 31st December 2021). **E** Monthly temporal distribution of Omicron VOC lineages among samples sequenced from COVID-19 positive cases from the Seychelles (25th June 2020, to 31st December 2021). ![Figure 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F2.medium.gif) [Figure 2](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F2) Figure 2 SARS-CoV-2 Pango lineages in the sequenced 1,056 Seychelles samples and timing of detections (circle size scaled by number of daily detections). Detection of Beta VOC in February 2021 coincided with the start of a surge of COVID-19 cases, with a further sharp surge observed in May 2021 coinciding with detection of Delta VOC in May 2021 **(Figure 1B and C)**. Since the emergence of Delta VOC in Seychelles in May 2021, a total of 21 Delta VOC lineages co-circulated with varying frequency. Lineage AY.122 (n= 742) was the most prevalent, with detections until the end of the surveillance period covered in this analysis **(Figure 1D and Figure 2)**. Other common Delta lineages were AY.43 (n=33) and B.1.617.2 (n=13). We observed the start of another surge in November 2021 due to Omicron (lineages BA.1 (n=126), BA.1.1 (n=18) and BA.2 (n=1) **(Figure 1B and Supplementary Figure 1A)** which peaked in mid-January 2022 (not shown). ### SARS-CoV-2 diversity in Seychelles Genetic distance-resolved phylogeny of the Seychelles genomes, including global reference sequences (n=5,179) revealed that most of the Seychelles sequences were interspersed as clusters (>2 sequences) or singletons across the phylogenetic trees suggesting multiple viral introductions into Seychelles **(Figure 3)**. In the VOC/non-VOC-specific phylogenies, Delta VOC (n=863) grouped into 14 clusters (>2 sequences) clusters and 12 singletons on the global phylogenetic tree pointing to separate introductions of the Delta VOC into the country whereas Omicron VOC (n=145) clustered into 10 clusters and 11 singletons (**Figures 3D and E**). Seychelles’ Beta (n=29) and Alpha (n=5) viruses clustered closely amongst themselves suggesting few introductions that led to onward transmission in Seychelles **(Figures 3B and C)**. Seychelles B.1 viruses sampled in 2020 were dispersed on the global phylogeny as singletons most likely pointing to multiple introductions into Seychelles during the initial phase of the pandemic **(Figure 3A)**. Sequences from different locations (i.e., districts) in Seychelles clustered closely or together on the phylogenetic tree, a feature suggesting rapid spread of the virus within the country over a short period of time (not shown). ![Figure 3](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F3.medium.gif) [Figure 3](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F3) Figure 3 Genetic distance-resolved lineage-specific phylogenetic trees for Omicron, Alpha, Beta, Delta VOC, and Non-VOC. Seychelles genomes are indicated with colored tip labels. **A** Phylogeny of Delta VOC that combined 863 Seychelles sequences and 2,676 global sequences. **B** Phylogeny of Omicron VOC that combined 145 Seychelles sequences and 1,195 global sequences **C** Phylogeny of Alpha VOC that combined 5 Seychelles sequences and 246 global sequences. **D** Phylogeny of Beta VOC that combined 29 Seychelles sequences and 187 global sequences. **E** Phylogeny of Non-VOC that combined 14 Seychelles sequences and 875 global sequences. SARS-CoV-2 variant analysis comparing sequences from Seychelles’ sequences to the Wuhan reference sequence (NC_045512.2) detected a total of 703 amino acid mutations across different gene regions; however, focusing on the frequent occurring mutations only, we identified a total of 27 amino acid mutations and two deletions that had a prevalence of >50% in the sequenced cases **(Supplementary Figure 3A and B)**. The most prevalent amino acid mutation was D614G (A23403G) (98.9%) occurring in the spike glycoprotein followed by P314L (C14408T) (93.3%) in the open reading frame 1b (ORF1b) **(Supplementary Figure 3A)**. ### Export and import of SARS-CoV-2 lineages in Seychelles Ancestral location state reconstruction of the dated global phylogeny **(Figure 4A)** was used to infer the number of viral importations and exportations. In total, between the 25th June 2020 and the 31st December 2021, we inferred at least 78 importations into Seychelles with 28 (35%) of the introductions coming from Europe, 21 separate introductions from Africa, 15 separate introductions from Asia, 6 introductions from North America, 5 separate introductions from Oceania and 3 introductions from South America. **(Figure 4B)**. Of the 78 detected viral imports into Seychelles, 66 occurred between January and December 2021 after the rise in COVID-19 cases was experienced in the Seychelles (**Figure 4C)**. From the analysis, we also inferred 32 export events from Seychelles to the rest of the world, mainly Asia (n=10) Europe (n=8) and Africa (n=6). ![Figure 4](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F4.medium.gif) [Figure 4](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F4) Figure 4 **A**. Time-resolved global phylogeny that combined 1,056 Seychelles sequences (coloured tip labels) and 5,179 global reference sequences. **B** The number of viral imports and exports into and out of Seychelles. **C** Cumulative number of viral imports and export over time into Seychelles. ## DISCUSSION We estimated at least 78 independent introductions of SARS-CoV-2 into Seychelles between 25th June 2020 to 31st December 2021, with the importations likely originating from all the continents in the world. Notably, the surge of COVID-19 cases from January to December 2021 was characterised by detections of VOCs in the country. The Beta variant of concern was the dominant strain in circulation from February to April 2021 and probably responsible for the surge in cases in January 2021 (highest daily number of infections at 231, on the 4th March 2021), but the introduction and rapid spread of the Delta variant seen from May to December 2021 caused a further surge (highest daily number of infections at 484, on the 6th May 2021). The low number of cases detected for non-VOCs (B.1, B.1.1, B.1.1.50 and B.1.1.277) lineages in 2020, could be explained by the fact that this insular population experienced very low importations due to the COVID-19 restriction countermeasures that were in place in 2020 but also that the non-VOCs were not highly transmissible. We ascertain that the introduction of the VOCs with high transmissibility was probably the cause of the surge of cases in Seychelles in 2021. However, these assertions should be taken with caution since extensive genomic surveillance in the country began in early 2021. To note, the detection of VOCs in the country preceded period of relaxed COVID-19 restriction countermeasures, including allowing tourists into the country and the resumption of in person school classes. This may have led to increased viral importations, as detected in our dataset, and to spread in the country. The detection of Delta VOCs in May 2021 coincided with a period of relaxed COVID-19 restriction countermeasures (Oxford stringency index < 50) and the surge of cases during this time may have been due to multiple introductions and rapid spread of the virus in the country. By December, a total of 21 different Delta lineages were detected in Seychelles. The first Omicron detection in Seychelles was on the 29th of November 2021. By 31st December 2021, three Omicron lineages had been detected in Seychelles including BA.2 which has been noted to be rapidly growing across the world14–16 Unsurprisingly, the majority of the Seychelles sequences harboured important spike mutations e.g., D614G mutation occurring in 98.2% of all the sequenced cases. The D614G amino acid change has been associated with stronger interaction between the virus and the angiotensin-converting enzyme 2 (ACE2) affinity, leading to higher infectivity and transmissibility17,18, Other important mutations included : i. Beta VOC; K417N, E484K and N501Y occurring in the spike receptor binding domain (RBD), which have been associated with reduced sensitivity to convalescent and post-vaccination sera19; ii. Delta VOC, L452R, P681R and D950N mutations, which have been linked to reduced sensitivity to neutralizing antibodies and higher transmissibility20; iii. Alpha VOC; N501Y and P681H mutations and iv. Omicron VOC: Q498R and N501Y occurring in the RBD have been linked to ACE2 binding affinity21. These RBD mutations coupled with four amino acid substitutions (i.e., A67V, T95I, and L212) and three deletion (67-70, 142-144 and 211) and an insertion (EPE at position 214) in the N-terminal domain (NTD) are linked to reduced sensitivity of Omicron to reduced sensitivity to convalescent and post-vaccination sera22,23. A cluster of three mutations occurring at S1-S2 furin cleavage site (H655Y, N679K and P681H) have been associated with increased transmissibility24. Our phylogenetic analysis showed that Seychelles sequences virus diversity was nested within the global virus diversity (i.e., Seychelles sequences clustered with sequences sampled from different countries, suggestive of global spread of SARS-CoV-2 lineages). The close association between the viruses and those from other countries reflects global transmission of the virus as a result of global migration, increased connectivity, and social mixing. Further, focusing on the VOCs, we observed patterns of SARS-CoV-2 viral diversity inside Seychelles; phylogenetic clusters consisted of viruses which were derived from different geographic locations and formed a deep hierarchical structure, indicating an extensive and persistent nation-wide transmission of the virus. Our findings are consistent with findings from Island countries such as Comoros, Reunion and New Zealand, where these countries were able to contain the first pandemic wave starting in March 2020, due COVID-19 strict countermeasures such as ban on international arrivals, which may have led to limited viral introductions into the islands, or perhaps viral introductions during the early phase of the pandemic did not results to community transmission due to government countermeasure on COVID-19 such as countrywide lockdown or self-isolation of the entire population25,26. Surge of COVID-19 cases in Islandic populations appears to be majorly driven by introductions of VOCs. For example, Comoros seems to have experienced its first surge of COVID-19 cases after introduction of Beta VOCs into the population in January 202127, similar to the period when Seychelles saw its first surge also due Beta VOC. This study had some limitations. First, our import/export inferences can be influenced by sampling biases and the low rate of sequencing in Seychelles. Therefore, the true number of international introductions are likely significantly higher than what is reported here. Second, incomplete metadata for some samples limited the scope of our analysis, for example, lack of location of samples collected disallowed us to investigate the transmission pathway of viruses within the country. Third, SARS-CoV-2 sequences from Seychelles are only available from a very small fraction of the number of confirmed cases into the country. These data reinforce the importance of genomic surveillance in Seychelles as a tool for monitoring and providing real-time information on spread of emerging SARS-CoV-2 variants in the population with important implications for public health and immunization strategies. Surge of COVID-19 cases due to VOCs during a period of heightened COVID-19 countermeasures raises questions on the optimal timing of the introduction of public health interventions. When the interventions are introduced after a surge has started, it is often too late, and the control strategies should focus on local transmission to understand characteristics and origins of locally circulating SARS-CoV-2 diversity to prevent further spread28. Moreover, studies on genomic surveillance would also be useful in investigating vaccine effectiveness against circulating variants which appear to have a high turnover. ## Supporting information SupplementaryFigure1 [[supplements/272503_file03.pdf]](pending:yes) SupplementaryFigure2 [[supplements/272503_file04.pdf]](pending:yes) SupplementaryFigure3 [[supplements/272503_file05.pdf]](pending:yes) Appendix1 [[supplements/272503_file06.pdf]](pending:yes) ## Data Availability All data generated and analysis script for this manuscript will be available from the Virus Epidemiology and Control, Kenya Medical Research Institute (KEMRI) Wellcome Trust Research Programme data server, [https://doi.org/10.7910/DVN/AYT2UA](https://doi.org/10.7910/DVN/AYT2UA). [https://doi.org/10.7910/DVN/AYT2UA](https://doi.org/10.7910/DVN/AYT2UA) ## Grant Information This works was supported by the National Institute for Health Research (NIHR) (project references 17/63/82 and 16/136/33) using UK aid from the UK Government to support global health research, The UK Foreign, Commonwealth and Development Office and Wellcome Trust (grant# 220985/Z/20/Z) The views expressed in this publication are those of the author (s) and not necessarily those of NIHR or the Department of Health and Social Care, Foreign Commonwealth and Development Office. This work was also supported by the Seychelles Public Health Laboratory (Public Health Authority), Africa-CDC, WHO-Afro, WHO-Seychelles, ASLM, and WHO-Kenya offices. ## Data availability All data generated and analysis script for this manuscript will be available from the Virus Epidemiology and Control, Kenya Medical Research Institute (KEMRI)– Wellcome Trust Research Programme data server, [https://doi.org/10.7910/DVN/AYT2UA](https://doi.org/10.7910/DVN/AYT2UA). Whole genome sequences are available from GISAID database accession number EPI\_ISL\_4880527 - EPI\_ISL\_4880796, EPI\_ISL\_5942854 - EPI\_ISL\_6705093, EPI\_ISL\_8424404 - EPI\_ISL\_8424612 and EPI\_ISL\_11060102 - EPI\_ISL\_11060407. ## Conflicts of Interest The authors have declared no competing interest. ## SUPPLEMENTARY FIGURES ![Supplementary Figure 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F5.medium.gif) [Supplementary Figure 1](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F5) Supplementary Figure 1 **A** The monthly temporal distribution of SARS-CoV-2 cases between March 2020 and 31st December 2021 in Seychelles vs SARS-CoV-2 positives samples received at KEMRI-WT for whole genome sequencing. ![Supplementary Figure 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F6.medium.gif) [Supplementary Figure 2](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F6) Supplementary Figure 2 Flow chart describing processing of SARS-CoV-2 samples received at KEMRI-WT for whole genome sequencing. ![Supplementary Figure 3](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/03/29/2022.03.18.22272503/F7.medium.gif) [Supplementary Figure 3](http://medrxiv.org/content/early/2022/03/29/2022.03.18.22272503/F7) Supplementary Figure 3 **A** Prevalent amino acid mutation among the 1,056 genomes from Seychelles. **B** Prevalent amino acid deletions among the 1,056 genomes from Seychelles. ## Acknowledgements We thank the Seychelles Ministry of Health, Public Health Laboratory, Africa-CDC, WHO-Afro, WHO-Kenya Office, WHO-Seychelles Office for facilitating sharing of SARS-CoV-2 samples that were sequenced. Further, we thanks all the laboratories that have shared SARS-CoV-2 sequence data in GISAID that we included as comparison data in our analysis (see list in appendix 1). ## Footnotes * The list of authors and affiliations have been updated. The first paragraph in the abstract has been rephrased. * Received March 18, 2022. * Revision received March 28, 2022. * Accepted March 29, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## REFERENCES 1. 1.National Bureau of Statistics S. Population and Vital Statistics. Mid 2021 Population Estimates. Published August 31, 2021. Accessed March 29, 2022. [https://www.nbs.gov.sc/downloads/data-acquisition-census/population-and-vital-statistics/2021](https://www.nbs.gov.sc/downloads/data-acquisition-census/population-and-vital-statistics/2021) 2. 2.WHO. WHO Coronavirus (COVID-19) Dashboard. 3. 3.Centres for Disease Control and Prevention. SARS-CoV-2 Variant Classifications and Definitions. 4. 4.WHO. Tracking SARS-CoV-2 variants. Published 2021. Accessed August 27, 2021. [https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/](https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/) 5. 5.Tao K, Tzou PL, Nouhin J, et al. The biological and clinical significance of emerging SARS-CoV-2 variants. Nature Reviews Genetics. 2021;22(12):757–773. doi:10.1038/s41576-021-00408-x [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41576-021-00408-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34535792&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 6. 6.Tyson JR, James P, Stoddart D, et al. Improvements to the ARTIC multiplex PCR method for SARS-CoV-2 genome sequencing using nanopore. bioRxiv. Published online September 4, 2020:2020.09.04.283077. doi:10.1101/2020.09.04.283077 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMC4wOS4wNC4yODMwNzd2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI5LzIwMjIuMDMuMTguMjIyNzI1MDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 7. 7.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–3100. doi:10.1093/bioinformatics/bty191 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty191&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29750242&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 8. 8.Hadfield J, Megill C, Bell SM, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018;34(23):4121–4123. doi:10.1093/bioinformatics/bty407 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty407&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29790939&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 9. 9.O’Toole Á, Scher E, Underwood A, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evolution. 2021;7(2):veab064. doi:10.1093/ve/veab064 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/veab064&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34527285&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 10. 10.Yu G, Smith DK, Zhu H, Guan Y, Lam TTY. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2017;8(1):28–36. doi:[https://doi.org/10.1111/2041-210X.12628](https://doi.org/10.1111/2041-210X.12628) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/2041-210x.12628&link_type=DOI) 11. 11.R Core Team. R: A language and environment for statistical computing. Published online 2013. Accessed November 9, 2021. URL [http://www.R-project.org/](http://www.R-project.org/) 12. 12.Eduan W, Marta G, Houriiyah T, et al. A year of genomic surveillance reveals how the SARS-CoV-2 pandemic unfolded in Africa. Science (1979). 2021;374(6566):423–431. doi:10.1126/science.abj4336 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/science.abj4336&link_type=DOI) 13. 13. Hadley Wickham. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag; 2016. 14. 14.Yamasoba D, Kimura I, Nasser H, et al. Virological characteristics of SARS-CoV-2 BA.2 variant. bioRxiv. Published online January 1, 2022:2022.02.14.480335. doi:10.1101/2022.02.14.480335 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMi4wMi4xNC40ODAzMzV2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI5LzIwMjIuMDMuMTguMjIyNzI1MDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 15. 15.Lyngse FP, Kirkeby CT, Denwood M, et al. Transmission of SARS-CoV-2 Omicron VOC subvariants BA.1 and BA.2: Evidence from Danish Households. medRxiv. Published online January 1, 2022:2022.01.28.22270044. doi:10.1101/2022.01.28.22270044 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMi4wMS4yOC4yMjI3MDA0NHYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjkvMjAyMi4wMy4xOC4yMjI3MjUwMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 16. 16.Iketani S, Liu L, Guo Y, et al. Antibody Evasion Properties of SARS-CoV-2 Omicron Sublineages. bioRxiv. Published online January 1, 2022:2022.02.07.479306. doi:10.1101/2022.02.07.479306 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMi4wMi4wNy40NzkzMDZ2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI5LzIwMjIuMDMuMTguMjIyNzI1MDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 17. 17.Zhang L, Jackson CB, Mou H, et al. SARS-CoV-2 spike-protein D614G mutation increases virion spike density and infectivity. Nature Communications. 2020;11(1):6013. doi:10.1038/s41467-020-19808-4 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-19808-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33243994&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 18. 18.Ozono S, Zhang Y, Ode H, et al. SARS-CoV-2 D614G spike mutation increases entry efficiency with enhanced ACE2-binding affinity. Nature Communications. 2021;12(1):848. doi:10.1038/s41467-021-21118-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-021-21118-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33558493&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 19. 19.McCormick KD, Jacobs JL, Mellors JW. The emerging plasticity of SARS-CoV-2. Science (1979). 2021;371(6536):1306LP–1308. doi:10.1126/science.abg4493 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/science.abg4493&link_type=DOI) 20. 20.Motozono C, Toyoda M, Zahradnik J, et al. SARS-CoV-2 spike L452R variant evades cellular immunity and increases infectivity. Cell Host & Microbe. Published online 2021. doi:[https://doi.org/10.1016/j.chom.2021.06.006](https://doi.org/10.1016/j.chom.2021.06.006) 21. 21.Zahradník J, Marciano S, Shemesh M, et al. SARS-CoV-2 variant prediction and antiviral drug design are enabled by RBD in vitro evolution. Nature Microbiology. 2021;6(9):1188–1198. doi:10.1038/s41564-021-00954-4 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41564-021-00954-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34400835&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F03%2F29%2F2022.03.18.22272503.atom) 22. 22.Iketani S, Liu L, Guo Y, et al. Antibody Evasion Properties of SARS-CoV-2 Omicron Sublineages. bioRxiv. Published online January 1, 2022:2022.02.07.479306. doi:10.1101/2022.02.07.479306 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMi4wMi4wNy40NzkzMDZ2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI5LzIwMjIuMDMuMTguMjIyNzI1MDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 23. 23.Cele S, Jackson L, Khoury DS, et al. Omicron extensively but incompletely escapes Pfizer BNT162b2 neutralization. Nature. 2022;602(7898):654–656. doi:10.1038/s41586-021-04387-1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-04387-1&link_type=DOI) 24. 24.Gong SY, Chatterjee D, Richard J, et al. Contribution of single mutations to selected SARS-CoV-2 emerging variants Spike antigenicity. bioRxiv. Published online January 1, 2021:2021.08.04.455140. doi:10.1101/2021.08.04.455140 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMS4wOC4wNC40NTUxNDB2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzAzLzI5LzIwMjIuMDMuMTguMjIyNzI1MDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 25. 25.Geoghegan JL, Ren X, Storey M, et al. Genomic epidemiology reveals transmission patterns and dynamics of SARS-CoV-2 in Aotearoa New Zealand. Nature Communications. 2020;11(1):6351. doi:10.1038/s41467-020-20235-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-20235-8&link_type=DOI) 26. 26.Wilkinson DA, Lebarbenchon C, Atyame C, et al. Genomic insights into early SARS-CoV-2 strains isolated in Reunion Island. medRxiv. Published online January 1, 2021:2021.01.21.21249623. doi:10.1101/2021.01.21.21249623 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMS4wMS4yMS4yMTI0OTYyM3YxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjkvMjAyMi4wMy4xOC4yMjI3MjUwMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 27. 27. N. Agoti C, Githinji G, S. Mohammed K, et al. Detection of SARS-CoV-2 variant 501Y.V2 in Comoros Islands in January 2021 [version 1; peer review: 1 approved]. Wellcome Open Research. 2021;6(192). doi:10.12688/wellcomeopenres.16889.1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.12688/wellcomeopenres.16889.1&link_type=DOI) 28. 28.Agoti CN, Ochola-Oyier LI, Mohammed KS, et al. Transmission networks of SARS-CoV-2 in coastal Kenya during the first two waves: a retrospective genomic study. medRxiv. Published online January 1, 2021:2021.07.01.21259583. doi:10.1101/2021.07.01.21259583 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMS4wNy4wMS4yMTI1OTU4M3YyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDMvMjkvMjAyMi4wMy4xOC4yMjI3MjUwMy5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 29. 29. Hannah Ritchie, Esteban Ortiz-Ospina, Diana Beltekian, Edouard Mathieu, Joe Hasell, Bobbie Macdonald, Charlie Giattino, Cameron Appel LRG and MR. Coronavirus Pandemic (COVID-19). Our World in Data. Published 2020. Accessed March 1, 2022. [https://ourworldindata.org/coronavirus](https://ourworldindata.org/coronavirus)