Emergence and spread of two SARS-CoV-2 variants of interest in Nigeria ====================================================================== * Idowu B. Olawoye * Paul E. Oluniyi * Judith U. Oguzie * Jessica N. Uwanibe * Adeyemi T. Kayode * Testimony J. Olumade * Fehintola V. Ajogbasile * Edyth Parker * Philomena E. Eromon * Priscilla Abechi * Tope Sobajo * Chinedu Ugwu * George Uwem * Femi Ayoade * Kazeem Akano * Nicholas Oyejide * Iyanuoluwa Fred-Akintunwa * Kemi Adedotun-Sulaiman * Farida Brimmo * Benjamin Adegboyega * Courage Philip * Ayomide Adeleke * Grace C. Chukwu * Ahmed I. Muhammad * Oludayo O. Ope-ewe * Shobi Otitoola * Olusola A. Ogunsanya * Mudasiru F. Saibu * Ayotunde E. Sijuwola * Grace O. Ezekiel * Oluwagboadurami G. John * Julie O. Akin-John * Oluwasemilogo O. Akinlo * Olanrewaju O. Fayemi * Testimony O. Ipaye * Deborah C. Nwodo * Abolade E. Omoniyi * Iyobosa B. Omwanghe * Christabel A. Terkuma * Johnson Okolie * Olubukola Ayo-Ale * Odia Ikponmwosa * Ebo Benevolence * Okonofua Naregose * Akhilomen Patience * Osiemi Blessing * Airende Micheal * Agbukor Jacqueline * Aiyepada John * Paulson Ebhodaghe * Omiunu Racheal * Esumeh Rita * Giwa Rosemary * Ehikhametalor Solomon * Ekanem Anieno * Yerumoh Edna * Aire Chris * Adomeh Donatus * Ephraim Ogbaini * Mirabeau Y. Tatfeng * Hannah E. Omunakwe * Mienye Bob-Manuel * Rahaman Ahmed * Chika Onwuamah * Joseph Shaibu * Azuka Okwuraiwe * Anthony E. Atage * Andrew Bock-Oruma * Funmi Daramola * Akinwumi Fajola * Nsikak-Abasi Ntia * Julie J. Ekpo * Anietie Moses * Worbianueri B. Moore-Igwe * Ibrahim F. Yusuf * Enoch O. Fakayode * Monilade Akinola * Ibrahim Kida * Bamidele S. Oderinde * Zara Wudiri * Olufemi O. Adeyemi * Olusola Akanbi * Anthony Ahumibe * Afolabi Akinpelu * Oyeronke Ayansola * Olajumoke Babatunde * Adesuyi Omoare * Chimaobi Chukwu * Nwando Mba * Ewean C. Omoruyi * Johnson A. Adeniji * Moses O. Adewunmi * Oluseyi Olayinka * Olisa Olasunkanmi * Olatunji Akande * Ifeanyi Nwafor * Matthew Ekeh * Erim Ndoma * Richard Ewah * Rosemary Duruihuoma * Augustine Abu * Elizabeth Odeh * Venatious Onyia * Kingsley C. Ojide * Sylvanus Okoro * Daniel Igwe * Kamran Khan * Anthony N. Ajayi * Ebhodaghe Ngozi Ugwu * Collins N. Ugwu * Kingsley Ukwuaja * Emeka O. Ogah * Chukwuyem Abejegah * Nelson Adedosu * Olufemi Ayodeji * Rafiu O. Isamotu * Galadima Gadzama * Brittany Petros * Katherine J. Siddle * Stephen Schaffner * George Akpede * Cyril Oshomah Erameh * Marycelin Baba * Femi Oladiji * Rosemary Audu * Nnaemeka Ndodo * Adeola Fowotade * Sylvanus Okogbenin * Peter Okokhere * Danny Park * Bronwyn Mcannis * Ifedayo Adetifa * Chikwe Ihekweazu * Babatunde L. Salako * Oyewale Tomori * Anise N. Happi * Onikepe A. Folarin * Kristian G. Andersen * Pardis C. Sabeti * Christian T. Happi ## Abstract Identifying the dissemination patterns and impacts of a virus of economic or health importance during a pandemic is crucial, as it informs the public on policies for containment in order to reduce the spread of the virus. In this study, we integrated genomic and travel data to investigate the emergence and spread of the B.1.1.318 and B.1.525 variants of interest in Nigeria and the wider Africa region. By integrating travel data and phylogeographic reconstructions, we find that these two variants that arose during the second wave emerged from within Africa, with the B.1.525 from Nigeria, and then spread to other parts of the world. Our results show how regional connectivity in downsampled regions like Africa can often influence virus transmissions between neighbouring countries. Our findings demonstrate the power of genomic analysis when combined with mobility and epidemiological data to identify the drivers of transmission in the region, generating actionable information for public health decision makers in the region. ## Introduction With a population of over 200 million, Nigeria is the most populous country in Africa1. Since the first report of SARS-CoV-2 in Nigeria on the 27th of February 2020, the cumulative number of confirmed COVID-19 cases in Nigeria has risen to more than 265 000 as of mid-September 20222. However, this burden is extremely low relative to SARS-Cov-2 infections in the rest of the world. Adjusting for population size, there have been 129 cases per 100 000 people in Nigeria as of mid-September, compared to more than 34 700 and 28 700 in the UK and USA, and 2340 and 711 in Indonesia and Pakistan, respectively. Concurrently, Nigeria has also only had a little over 3000 reported deaths (2 per 100 000), compared to the UK and USA, which had 303 and 315 cumulative deaths per 100 000 people2. This relatively low but heterogeneous incidence in Nigeria and across the wider African region has been the cause of persistent speculation, including on the putative central role of case underascertainment3,4,5. The underlying cause remains understudied and is likely multifactorial, with many other factors speculated to contribute, including skewed age structures of populations, more restricted human mobility in certain regions, host genetics, environmental factors, and potential pre-existing population immunity to related viruses6,7. Before the emergence of globally sweeping variants such as Delta and Omicron, it was also unclear if the specific genetic diversity circulating in the African region may have contributed to the heterogeneous incidence and mortality observed8,9,10. It is therefore important to characterise the genomic epidemiology of SARS-CoV-2 in Nigeria over the course of the pandemic, to improve our understanding of what variants emerged to dominate the different epidemic waves. It is also important to improve our understanding of the drivers of transmission in the region, as this remains understudied in Africa and findings from other regions may not be generalizable9,11. Nigeria is highly connected to its neighbouring countries and the wider region, potentially acting as a dominant source of transmission via the high volume of movement across land as well as air borders12. Several SARS-CoV-2 variants, which have a constellation of mutations that are biologically significant to the virus, emerged during the pandemic9,13,14. The B.1.525 (Eta) and B.1.1.318 variants of interest (VOIs) dominantly co-circulated with the Alpha variant during the second wave in Nigeria from December 2020 to March 20212. B.1.525 and B.1.1.318 were suggested to have emerged in Nigeria based on epidemiological reports from travellers in countries such as the UK, India, Mauritius, Canada, and Brazil15,16,17,18,19. Notably, B.1.525 and B.1.1.318 both showed a significant increase in the infectivity of human and monkey cell lines experimentally, raising concerns of intrinsic increased transmissability20. Both of these lineages also share the well-characterised E484K substitution in the Spike protein receptor-binding domain (RBD), which effectively reduces antibody neutralisation21. In this study, we examine the genomic epidemiology of SARS-CoV-2 in Nigeria from March 2020 to September 2021, across the first three epidemic waves. In particular, we investigate the timing and origin of the emergence of the B.1.525 and B.1.1.318 VOIs. In phylogeographic reconstructions, we characterized the source-sink profiles of these best-sampled VOIs to better understand the bidirectional transmission dynamics of SARS-CoV-2 in Nigeria and the wider region. We compare our findings from our genomic data to integrated travel and epidemiological data to explore the role of Nigeria’s regional and intercontinental connectivity in the estimated import and export dynamics. ## Results ### Lineage dynamics of SARS-CoV-2 in Nigeria over the first three epidemic waves We generated a total of 1577 genomes from samples obtained between March 2020 and September 2021, across the first three waves of the pandemic (Fig. 1A and B). We collected samples from 25 of the 36 states and the Federal Capital Territory of Nigeria (Fig. 1C). The northern states were highly undersampled relative to the southern states, with sampling highly uneven across time (Fig. 1A and B). ![Fig. 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F1.medium.gif) [Fig. 1:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F1) Fig. 1: Genomic epidemiology of SARS-CoV-2 in Nigeria.**A)** Reported daily news Epicurve of SARS-CoV-2 cases in Nigeria (blue bars) in Nigeria and genomes produced assembled during the first three waves (red line) with the y-axis transformed to a log-scale. **B)** Time-varying sampling fraction of genome produced in this study per new cases**C)** Geographic distribution of sequences generated in the current study. The map of Nigeria shows the number of genomes from states across the country as region colour and marker size display lineages per state. **D)** Lineage frequency profile for the study period. The frequency of PANGO lineages across the country over a period of 80 weeks. Lineages that are not VOC or VOIs, or that do not appear more than three times over the course of the pandemic, are grouped a “others”. **E)** Human mobility data from Google for retail shops, pharmacies, parks, public transport, workplaces, and residential activity places from January 2020 to November 2021, as compared to the pre-pandemic baseline (January to February, 2021). To investigate the lineage dynamics across the different outbreak waves, we used the pangolin nomenclature tool to assign all sequences to corresponding lineages. We detected more than 35 lineages across the first three waves (Fig. 1D). We found that a number of ancestral lineages circulated during the first wave, including A, B, B.1, and B.1.1 (Fig. 1D). The onset of community transmission in Nigeria was initially delayed, with the first wave initiated two months after the first case detection in March 2020. A nationwide lockdown and ban on local and international air travel were enforced, with the first wave ending in August 2020 after four months (Fig. 1B). During the lockdown, we found that there was up to a 50% decline in activity at workplaces, retail, supermarkets, and public transport stations, and up to a 25% increase in residential activity (Fig. 1E). After this strictly adhered-to lockdown, Nigeria opened its airspace to international traffic in October 2020. This was shortly followed by an increase in new cases in November 2020, marking the start of the second wave (Fig. 1B), which was characterised by B.1.1.7, B.1.525, B.1.1.318 (VOC and VOI), and other variants such as L.3 and B.1.1.487 (Fig. 1D). Our secondary analysis focused on the lineages (B.1.525 and B.1.1.318) that contributed to the surge during the second wave and have also been shown to possess genotypic traits for increased transmissibility and also increased virulence in human and animal cell lines, such as the E484K spike protein RBD mutation. We excluded B.1.1.7 from our analysis as it has already been well studied22,23, and it was not the dominant variant in Nigeria during the second wave, unlike in other places where it was reported, such as the UK and the USA24,25,26. Moreso, the second wave began after travel restrictions were lifted and human mobility was back to normal, thus the need to investigate the emergence of B.1.1.318 and B.1.525 in Nigeria. The third wave was initiated in June 2021, with the Delta variant (B.1.617.2) and its sub-lineages sweeping to dominance in sampling (Fig. 1B and D). In the secondary analysis described in this work, we also sought to identify the timings of emergence and number of introductions of these VOI. ### Regional connectivity drove the introduction of B.1.1.318 into Nigeria Using Bayesian phylogeographic reconstructions, we investigated the timing and origin of the B.1.1.318 emergence to better understand Nigeria’s connectivity in the global SARS-CoV-2 transmission networks (see Methods). B.1.1.318 was first detected in Nigeria in Lagos State in December 2020. The lineage was detected in multiple countries within and outside Africa, notably resulting in a large outbreak in Mauritius19. In our phylogeographic reconstruction, we found that B.1.1.318 emerged in Africa (root state posterior support = 0.95-6) in early August [mean tMRCA = 5 August, 95% HPD 25 June to 20 September] across two replicates (Fig. 2A). We estimated that B.1.1.318 was introduced to Nigeria on at least 53 independent occasions [mean introductions, 95% HPD 50-59] beginning in November 2020, after travel restrictions were lifted in October (Fig. 2B). We found that the majority of introductions originated from other African nations (Fig. 2B), indicating Nigeria’s strong connectivity to the region. However, the number and origin of introductions and exports estimated from genomic data is sample dependent, with bidirectional transmission to and from undersampled regions obscured by uneven sampling globally. To overcome this, we quantified air travel patterns to and from Nigeria over time to better understand which countries were most likely to act as transmission sources or sinks based on their connectivity to Nigeria. ![Fig. 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F2.medium.gif) [Fig. 2:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F2) Fig. 2: **A)** Time-resolved B.1.1.318 phylogeny. Branches are coloured by region-level geographic state reconstruction. Internal nodes annotated with black points represent posterior support > 0.75. **B)** Mean number of introductions (Markov jumps) of B1.1.318 into Nigeria from all source regions, binned by month. The mean number of introductions overall (inset). **C)** Volume of monthly air passengers inbound to Nigeria across 2020-2021 by source region (bar, left axis) and fold increase in travel volume compared to May 2020 baseline. **D)** Estimated Introduction Intensity Index for Nigeria from 2020-2021. **E)** Percentage of inbound air travel to Nigeria by source region across 2020-2021. See Supplementary Figure 2 for country-level data. Before travel restrictions were lifted in October 2020, we found that the low levels of incoming air travel volume predominantly originated from other African nations, followed by the Middle East (driven by the UAE) and Europe (driven by the UK) (Fig. 2C-D, SFigure 2). This suggests that the introduction risk was predominantly driven by regional connectivity during the period when travel restrictions were in place. We sought to investigate this further by quantifying the introduction risk for all countries connected to Nigeria by air travel. We estimated the introduction intensity index (III), which accounts for the number of cases in the source countries as well as the travel volume to Nigeria (see Methods). Overall, we found that the introduction risk from air travel was very low during the period of travel restrictions (May 2020 to October 2020), as incoming travellers originated largely from other African nations with low incidence (Fig. 2F). We found that the number of incoming air passengers peaked in December 2020 - January 2021 (Fig. 2C-D). We also observed that the number of inbound passengers from Europe, the Middle East, and Northern America increased relative to other African nations after restrictions were lifted, resulting in comparable volumes from Africa, Europe, and the Middle East (Fig. 2C-D). The 30-fold increase in incoming passengers by December 2020 was reflected in the corresponding peak in the III, attributable to increased introduction risk from Europe and North America from November 2020 into early 2021 (Fig. 2F). The surge was predominantly driven by the high-incidence second waves of the UK and USA, respectively (SFigure 1). We found that the III peak but not the estimated sources of risk were consistent with the number and origin of introductions of B.1.1.318 in our phylogeographic reconstructions (Fig. 2B). We estimated the highest introduction risk for Europe after travel restrictions were lifted, whereas the majority of introductions estimated from genomic data originated in other African countries. The III from other African nations was relatively low, despite comparable incoming air travel volume. This is likely both a factor of the relatively low incidence in most African countries during this period (Fig. 2F, SFigure 3) as well as the fact that our travel data does not account for land-based travel, which will expectedly severely underestimate the introduction risk from surrounding countries. The III from the African region was largely driven by South Africa, which was experiencing a peaked incidence from its second wave (SFigure 3). In our III analyses, we found negligible introduction risk from outside of Europe and North America. ![Fig. 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F3.medium.gif) [Fig. 3:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F3) Fig. 3: **A)** Time-resolved B.1.525 phylogeny. Branches are coloured by region-level geographic state reconstruction. Internal nodes annotated with black points represent posterior support > 0.75. **B)** Mean number of exports (Markov jumps) of B.1.525 from Nigeria to destination regions, binned by month. Mean number of exports overall (inset). **C)** Volume of monthly air passengers outbound from Nigeria across 2020 - 2021 by destination region (bar, left axis) and fold increase in travel volume compared to baseline (May 2020). **D)** Percentage of outbound air travel from Nigeria by destination region across 2020-2021. See Supplementary Figure 1 for country-level data. **E)** Mean number of introductions (Markov jumps) of B1.525 into Nigeria from all origin regions, binned by month. The mean number of introductions overall (inset). **F)** Estimated Exportation Intensity Index (EII) for Nigeria from 2020-2021. ### The Eta variant of interest (B.1.525) likely emerged in Nigeria The B.1.525 (Eta) lineage drove the second wave from January to March 2021. B.1.525 was initially reported to have originated in Nigeria or the UK16,17,18. We investigate the timing, origin, and transmission dynamics of B.1.525 with Bayesian phylogeographic reconstructions (see Methods). We estimated that the B.1.525 lineage emerged in Nigeria (root state posterior support = 0.998) in late July [mean tMRCA 23 July, 95% HPD 7 June to 4 September], during the final weeks of Nigeria’s first wave (Fig. 3A). This suggests that this lineage circulated cryptically for several months before its first detection in Nigeria on 12 December 2020, likely owing to low-incidence-associated sparse sampling (Fig. 1A). We sought to better understand Nigeria’s connectivity in global SARS-CoV-2 transmission networks by identifying the most likely destinations of B.1.525’s exportation, as the lineage 1) emerged in Nigeria and 2) is Nigeria’s most sampled lineage, excluding the Delta variant sublineages. To investigate the spread of B.1.525 from Nigeria, we reconstructed the timing and pattern of geographic transitions out of and into Nigeria across the full posterior of our Bayesian phylogeographic reconstructions. We estimated that B.1.525 was exported from Nigeria a lower bound of 295 times [mean Markov jumps, 95% HPD 259-335] (Fig. 3B). The majority of sampled exports were destined for Europe, followed by the rest of Africa and Northern America from December 2020 to March 2021 (encompassing Nigeria’s second wave) (Fig. 3B). During this period, we also found that B.1.525 was re-introduced from Europe a lower bound of 20 [95% HPD 6-37] times (Fig. 3E). These estimated source-sink dynamics support Nigeria’s strong connectivity, particularly to Europe, but will underestimate exports to undersampled regions such as neighbouring African countries. To mitigate the effect of global sampling biases on genomic estimates of transmission dynamics, we again analysed changes in air travel to and from Nigeria over time to understand the sources and seeds of Nigeria’s bidirectional transmission dynamics. African nations dominated the reduced level of outbound travel during the period of travel restrictions (May to October 2020) (Fig. 3C-D), suggesting that regional connectivity drove Nigeria’s export risk during this period. We integrated the air travel data with Nigeria’s epidemic incidence data to quantify an exportation intensity index (EII)27. The EII quantifies the temporal trend in the daily estimated number of viral exports from Nigeria to sink countries (see Methods). Given the data available, Nigeria’s estimated export intensity was low overall compared to countries with high air traffic data to Nigeria, as the country’s SARS-CoV-2 incidence remained comparatively low on a global-scale for the entire period under investigation (Fig. 3F). Nigeria’s first wave had relatively low incidence, peaking at about 4000 cases in a week in late June (week 18) (SFigure 4B). Combined with reduced travel volumes, the period from May 2020 to September 2020 was characterised by negligible export intensity (Fig. 3F). Outbound air travel recovered gradually over 2020, peaking in December 2020 - January 2021 (Fig. 3C-D). We found that this peak corresponded with the peak of the larger second wave in Nigeria and therefore also the EII, as well as the distribution of exports of B.1.1.525 from Nigeria (Fig. 3B, F). The proportion of outbound air travel destined for countries in Europe, the Middle East, and North America increased relative to travel destined for African countries after travel restrictions were lifted (Fig. 3C-D). We estimated the highest EII for Europe (driven predominantly by travel to the UK) and Northern America (driven by travel to the USA) from December 2020 to February 2021 (SFigure 4). The export intensity to Europe is consistent with the high number of estimated exports in the genomic data, though North America was disproportionately underestimated in the genomic data (Fig. 3B). Overall, air travel volume destined for Europe was only moderately higher (10%) than Africa, North America, and the Middle East from December (after first detection and at the start of the second wave) to April 2021 (end of the second wave) (Fig. 3C-D). However, Europe had 4-fold more sampled introductions of B.1.525 than African nations or the USA. Notably, the comparatively high EII for the Middle East region (representing 20% of outbound volume overall) was not represented in our phylogeographic reconstructions (Fig. 3B). This highlights how unevenly distributed surveillance capacity can lead to an underestimation of transmission events (SFigure 3A). ## Discussion In this study, we combined genomic, travel and epidemiological data to characterise the emergence and bidirectional transmission of two focal variants of interest in Nigeria. The focal VOIs, B.1.525 and B.1.1.318, were suggested to have emerged in Nigeria before spreading globally, resulting in large-scale outbreaks in Brazil and Mauritius15,19. In our phylogeographic reconstructions, we found that the B.1.1.318 lineage most likely emerged in the African region in early August 2020, with its dominance in Nigeria driven by multiple introductions during the second wave from November 2020 onwards. Our import risk index underestimates the importance of regional connectivity in driving these introduction estimates, as they likely result from shorter distance connectivity to surrounding countries, which we could not collect data on. We also found that the Eta variant (B.1.525) emerged in Nigeria in late July, with high levels of export to Europe reflected in the genomic and estimated export index. The true extent of bidirectional transmission with other African nations will be severely underestimated in phylogeographic reconstructions, as Africa remains disproportionately undersampled despite heroic efforts28. Our findings should be interpreted in the light of our own sampling fraction, which was 0.026%. We attempted to mitigate these global surveillance biases by supplementing genomic estimates with sampling-independent metrics like the introduction and export intensity indices. However, these metrics are dependent on epidemiological incidence and will be underestimated if there is large-scale underascertainment in source countries, as previously shown for African (and other) nations based on excess mortality data29,30,31. At an unprecedented speed (72 hours from receiving the sample), ACEGID generated the first whole-genome sequence of the virus in Africa in March 2020. This catalysed and built confidence in SARS-CoV-2 genomic sequencing on the African continent. This immense collaborative effort, involving seventeen partner laboratories in Nigeria, helped to characterise the pandemic in near real-time during its first three waves in the country. As we have learnt the hard way in recent years, novel variants of interest or concern can emerge from anywhere around the world, notably including undersampled regions such as Africa13,14. In a highly connected world, there is a proactive need to adopt early warning systems for pandemic preemption and response in Africa, such as SENTINEL32. This would enable equality in global genomic surveillance, especially in countries with fragile public health systems, in order to effectively detect and curb emerging outbreaks before they spread. ## Methods ### Sample preparation RNA was extracted from nasopharyngeal swabs in viral transport media and saliva/sputum in PBS using the QiAmp viral RNA mini kit (Qiagen) according to the manufacturer’s instructions and MagMax pathogen RNA/DNA kit (Applied Biosystems) using a Kingfisher Flex purification system (ThermoFisher Scientific) according to the manufacturer’s instructions. ### Probe-based SARS-CoV-2 real-time RT-PCR RT-qPCR screening of suspected samples was carried out targeting N, ORF1ab and RdRP genes of the virus using commercially available kits: genesig® (Primerdesign Ltd, UK), DaAnGene (Daan Gene Co., Ltd, China), Liferiver (Shanghai ZJ Bio-Tech Co., Ltd, China), Genefinder (OSANG Healthcare Co., Ltd, Korea), and Sansure (Sansure Biotech Inc., China). ### SARS-CoV-2 whole-genome sequencing Samples with moderate to high viral load detected with RT-qPCR (Ct value < 30) were selected for sequencing, which were samples collected from genomic surveillance and hospitalised individuals spatially and temporally. The sampling gap was the inability to sequence samples from every state during all different waves of the pandemic in Nigeria. The Illumina COVIDseq protocol was used for sequencing preparation ([https://www.illumina.com/products/by-type/clinical-research-products/covidseq-assay.html](https://www.illumina.com/products/by-type/clinical-research-products/covidseq-assay.html)). RNA was converted to cDNA, followed by tiling amplification (400 bp) with Artic V3 primer pools, which covers the whole genome of the virus. Nextera DNA flex libraries were made from the amplicons. Libraries were pooled and sequenced on the Illumina MiSeq, NextSeq 2000, and NovaSeq 6000 platforms at the African Center of Excellence for Genomics of Infectious Diseases (ACEGID), Redeemer’s University, Ede, Nigeria. ### Genome Assembly We used the viral-ngs pipeline v2.1.19 ([https://github.com/broadinstitute/viral-ngs](https://github.com/broadinstitute/viral-ngs)) for demultiplexing, quality control, and genome assembly, consistent with prior work33,34,35. The demux_plus applet of viral-ngs was used for demultiplexing basecalls to BAM files. Genome assembly was carried out using the assemble_refbased workflow on the viral-ngs pipeline which maps each unmapped BAM file to the SARS-CoV-2 reference genome (NC_045512.2) to generate coverage plots and FASTA files of successfully assembled genomes. ### Lineage/clade assignment We used Phylogenetic Assignment of Named Global Outbreak LINeages (PANGOLIN)36 to identify the lineages of SARS-CoV-2 circulating within Nigeria and to identify the lineages circulating in specific states of Nigeria. Nextclade ([https://clades.nextstrain.org/](https://clades.nextstrain.org/)) was used to assign the sequences to globally circulating viral clades and to investigate mutations in the genome that could affect primer amplification during PCR. ### Maximum likelihood phylogenetic analysis 1,577 genomes with sequence length ≥ 20,930 (covering 70% or more of the reference - NC_045512.2) were assembled and aligned using MAFFT37. We used IQTREE38 and the GTR (generalised time-reversible) model with a bootstrap value of 1000 to construct the phylogenetic tree. Treetime39 was used for quantifying the molecular clock and identifying ancestral phylogeny in the augur pipeline of Nextstrain40. ### Bayesian phylogenetic analysis All B.1.1.318 (n = 3,858 with major locations: Europe, USA, and Africa accounting for 94%) and B.1.525 (n = 8,278 with major locations: Europe, USA, and Africa accounting for 86%) sequences were downloaded from GISAID41 on 18 August 2021. Sequences with > 5% ambiguous nucleotides, a length < 95%, or incomplete dates were discarded. Sequences were aligned to the reference (NC_045512.2) using minimap2, with the 5’ and 3’ UTRs and known problematic sites (GitHub - W-L/ProblematicSites_SARS-CoV2 3) masked42. Both lineage-specific datasets were downsampled for representativeness and computational tractability with a phylogenetic-informed downsampling scheme. A maximum-likelihood phylogeny was reconstructed for each lineage with automatic model selection in IQTREE43. The phylogeny was downsampled by root-to-tip tree-traversal, with all internal nodes subject to two rules: 1) if 95% of the leaves subtended by the internal node represented a single country, the earliest representative was retained alongside a random sequence; 2) if leaves from the same location were separated by a zero-branch length, the earliest representative was retained alongside a random sequence. All countries with more than the median number of sequences across countries were then downsampled randomly to the median, this yielded a total number of 1,118 and 1,746 genomes for B.1.1.318 and B.1.525 respectively. All Nigerian sequences were retained (n = 73 for B.1.1.318; n = 256 for B.1.525). We reconstructed Bayesian time-scaled phylogenies for each lineage using BEAST 1.10.544. For both lineages, the time-scaled phylogenies were reconstructed under an HKY substitution model with a gamma-distributed rate for variation among sites45, a relaxed molecular clock with a log-normal prior46, and an exponential growth coalescent tree prior47. For each dataset, we combined two independent Markov Chain Monte Carlo (MCMC) chains of 200 million states, run with the BEAGLE package48 to improve run time. Parameters and trees were sampled every 20,000 steps, with the first 20% of steps discarded as burn-in. Convergence and mixing of the MCMC chains were assessed in Tracer v1.7, to ensure the effective sample size (ESS) of all estimated parameters was > 20049. We performed an asymmetric discrete trait analysis using BEAST version 1.10.5 to reconstruct the location-transition history across an empirical distribution of 4000 time-calibrated trees (sampled from each of the posterior tree distributions estimated above). We aggregated the country of sampling on a regional level for computational tractability. We used Bayesian stochastic search variable selection (BSSVS) to infer non-zero migration rates and identify the statistically supported transition routes into and out of Nigeria by a Bayes factor test50. In addition to the discrete trait analysis, we used a Markov jump counting approach to estimate the timing and origin of geographic transitions into Nigeria to account for uncertainty in phylogeographic reconstruction associated with sparse sampling and low sequence variability51. We used the TreeMarkovJumpHistoryAnalyzer from the pre-release version of BEAST 1.10.5 to extract the Markov jumps from posterior tree distributions52. We used TreeAnnotator 1.10 to construct Maximum clade credibility (MCC) trees for all datasets. Trees were visualised using baltic ([https://github.com/evogytis/baltic](https://github.com/evogytis/baltic)). ### Air traffic and human movement data In order to associate international air travel and local human movement with variant movement across borders, we used air traffic data from the International Air Transportation Association (IATA) obtained from BlueDot ([https://bluedot.global/](https://bluedot.global/)) to quantify the volume of international travellers to and from Nigeria. The dataset included the number of travellers by origin and destination country, aggregated by month, across all international airports in Nigeria. Data was obtained for the period May 2020 to April 2021, encompassing the emergence and spread of B.1.525 and B.1.1.318. We also curated human movement data within Nigeria from Google Mobility ([https://www.google.com/covid19/mobility/](https://www.google.com/covid19/mobility/)) from January 2020 to November 2021, which reflects a baseline before lockdown (January - March 2020) and the study’s period of interest. We used R to display movement patterns after grouping human movement into six divisions: retail & recreation, grocery & pharmacy, parks, transport stations, workplaces, and residential areas. ### Estimated introduction and exportation intensity index From genomic data, estimates of the number and origin/destination of bidirectional transmissions with Nigeria are dependent on the sample’s representativeness of the population and are therefore limited by global and local sampling biases. To limit these sampling biases, we supplemented our phylogeographic analyses by estimating the introduction (III) and export intensity index (EII) according to Du Plesis et al27. The III estimates the daily risk of introductions into Nigeria from each country as a product of the number of asymptomatically infected individuals in each source country on that day (estimated from the time series of deaths) and their likelihood of traveling to Nigeria (based on the volume of inbound air travel from the source country). The EII is calculated similarly, based on the number of asymptomatically infected individuals in Nigeria and their likelihood of travel to each destination by air. Notably, we could not obtain data for land-based travel. Connectivity to regional and neighbouring countries and the associated introduction and export intensity are therefore likely severely underestimated. A time series of reported deaths for each country were collected from the outbreak.info R package53. As air travel data was aggregated by month, we conservatively assumed that travel was uniform across all days of the month. We quantified the III and EII for the period May 2020 to April 2021, encompassing the emergence and spread of B.1.525 and B.1.1.318. ## Data Availability All data produced are availale online at: [https://github.com/acegid/SARS-CoV-2\_Manucript\_Supplemental\_Data](https://github.com/acegid/SARS-CoV-2_Manucript_Supplemental_Data) [https://github.com/acegid/SARS-CoV-2\_Manucript\_Supplemental\_Data](https://github.com/acegid/SARS-CoV-2_Manucript_Supplemental_Data) ## Supplementary figures ![SFigure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F4.medium.gif) [SFigure 1:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F4) SFigure 1: COVID-19 new cases per week as seen in Nigeria and countries with highest flight traffic in and out of Nigeria from the four major connected regions with the y-axis transformed to a log scale. ![SFigure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F5.medium.gif) [SFigure 2:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F5) SFigure 2: Percentage of incoming travel from the four major connected regions by country as per Figure 3 ![SFigure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F6.medium.gif) [SFigure 3:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F6) SFigure 3: Population adjusted rolling average of new daily cases for countries with the highest incoming air travel as per Figure S2. ![SFigure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/25/2022.09.24.22280269/F7.medium.gif) [SFigure 4:](http://medrxiv.org/content/early/2022/09/25/2022.09.24.22280269/F7) SFigure 4: Percentage of outgoing travel to the four major connected regions by country as per Figure 4 ## Acknowledgements This work is made possible by the support provided to ACEGID by a cohort of generous donors through TED’s Audacious Project, including the ELMA Foundation, MacKenzie Scott, the Skoll Foundation, and Open Philanthropy. This work was also partly supported by grants from the National Institute of Allergy and Infectious Diseases ([https://www.niaid.nih.gov](https://www.niaid.nih.gov)), NIH-H3Africa ([https://h3africa.org](https://h3africa.org)) (U01HG007480 and U54HG007480), the World Bank (projects ACE-019 and ACE-IMPACT), the Rockefeller Foundation (Grant #2021 HTH), the Africa CDC through the African Society of Laboratory Medicine [ASLM] (Grant #INV018978), the Wellcome Trust (Project 216619/Z/19/Z), the WARN-ID/CREID grant # U01AI151812, and the Science for Africa Foundation. We want to especially thank the Redeemer’s University Management, the State Government of Osun and NCDC for their support during the course of the COVID-19 pandemic. * Received September 24, 2022. * Revision received September 24, 2022. * Accepted September 25, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.World Bank, World Development Indicators (2022). Available at: [http://wdi.worldbank.org/tables](http://wdi.worldbank.org/tables) (Accessed: 29th April 2022). 2. 2.Tsueng G, Mullen J, Alkuzweny M, Cano M, Benjamin HR, Emily O, Curators L, Alaa A, Zhou X, Qian Z, Andersen KG. Outbreak. info Research Library: A standardized, searchable platform to discover and explore COVID-19 resources and data. BioRxiv. 2022 Feb 1. 3. 3.Nordling L. The pandemic appears to have spared Africa so far. Scientists are struggling to explain why. Science (80-). 2020 Aug 11; 4. 4.Kreier F. Morgue data hint at COVID’s true toll in Africa. Nature. 2022 Mar 23. 5. 5.Tessema SK, Nkengasong JN. Understanding COVID-19 in Africa. Nature Reviews Immunology. 2021 aug;21(8):469–70. 6. 6.Gilbert, M., Pullano, G., Pinotti, F., Valdano, E., Poletto, C., Boëlle, P.Y., d’Ortenzio, E., Yazdanpanah, Y., Eholie, S.P., Altmann, M. and Gutierrez, B., 2020. Preparedness and vulnerability of African countries against importations of COVID-19: a modelling study. The Lancet, 395(10227), pp. 871–877. 7. 7.Borrega, R., Nelson, D.K., Koval, A.P., Bond, N.G., Heinrich, M.L., Rowland, M.M., Lathigra, R., Bush, D.J., Aimukanova, I., Phinney, W.N. and Koval, S.A., 2021. Cross-reactive antibodies to SARS-CoV-2 and MERS-CoV in pre-COVID-19 blood samples from Sierra Leoneans. Viruses, 13(11), p. 2325. 8. 8.Bugembe DL, Phan MV, Ssewanyana I, Semanda P, Nansumba H, Dhaala B, Nabadda S, O’Toole ÁN, Rambaut A, Kaleebu P, Cotten M. Emergence and spread of a SARS-CoV-2 lineage A variant (A. 23.1) with altered spike protein in Uganda. Nature microbiology. 2021 aug;6(8):1094–101. 9. 9.Butera Y, Mukantwari E, Artesi M, O’Toole ÁN, Hill V, Rooke S, Hong SL, Dellicour S, Majyambere O, Bontems S, Boujemla B. Genomic sequencing of SARS-CoV-2 in Rwanda reveals the importance of incoming travelers on lineage diversity. Nature communications. 2021 Sep 29;12(1):1–1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-021-23846-x&link_type=DOI) 10. 10.Wilkinson E, Giovanetti M, Tegally H, San JE, Lessells R, Cuadros D, Martin DP, Rasmussen DA, Zekri AR, Sangare AK, Ouedraogo AS. A year of genomic surveillance reveals how the SARS-CoV-2 pandemic unfolded in Africa. Science. 2021 Oct 22;374(6566):423–31. 11. 11.Parker E, Anderson C, Zeller M, Tibi A, Havens JL, Laroche G, Benlarbi M, Ariana A, Robles-Sikisaka R, Latif AA, Watts A. Regional connectivity drove bidirectional transmission of SARS-CoV-2 in the Middle East during travel restrictions. Nature communications. 2022 Aug 15;13(1):1–4. 12. 12.Bogoch II, Creatore MI, Cetron MS, Brownstein JS, Pesik N, Miniota J, Tam T, Hu W, Nicolucci A, Ahmed S, Yoon JW. Assessment of the potential for international dissemination of Ebola virus via commercial air travel during the 2014 west African outbreak. The Lancet. 2015 Jan 3;385(9962):29–35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(14)61828-6&link_type=DOI) 13. 13.Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, Doolabh D, Pillay S, San EJ, Msomi N, Mlisana K. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature. 2021 apr;592(7854):438–43. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 14. 14.Viana, R., Moyo, S., Amoako, D.G., Tegally, H., Scheepers, C., Althaus, C.L., Anyaneji, U.J., Bester, P.A., Boni, M.F., Chand, M. and Choga, W.T., 2022. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature, 603(7902), pp. 679–686. 15. 15.Pereira, F. et al. Genomic surveillance activities unveil the introduction of the SARS-CoV-2 B.1.525 variant of interest in Brazil: Case report. Journal of Medical Virology, 93, 9, 5523–5526 (2021). 16. 16.Dresch, M. Nigerian Covid-19 variant under investigation with 38 new cases found. [https://www.mirror.co.uk/news/uk-news/breaking-nigerian-covid-19-variant-23511168](https://www.mirror.co.uk/news/uk-news/breaking-nigerian-covid-19-variant-23511168) (2021) (Accessed 24th March, 2022). 17. 17.Vadlapatla, S. B.1.525 Variant Linked to Nigeria now Found in Telangana - [https://timesofindia.indiatimes.com/city/hyderabad/b-1-525-variant-linked-to-nigeria-now-found-in-telangana/articleshow/83133727.cms](https://timesofindia.indiatimes.com/city/hyderabad/b-1-525-variant-linked-to-nigeria-now-found-in-telangana/articleshow/83133727.cms). (2021) (Accessed 24th March, 2022). 18. 18.Takeuchi C. COVID-19 in B.C.: New variant from Nigeria detected, ski awareness campaign launched, and more. [https://www.straight.com/covid-19-pandemic/february-12-coronavirus-update-bc-vancouver-new-variant-from-nigeria-detected-ski-awareness-campaign-and-moreTakeuchi](https://www.straight.com/covid-19-pandemic/february-12-coronavirus-update-bc-vancouver-new-variant-from-nigeria-detected-ski-awareness-campaign-and-moreTakeuchi). (2021) (Accessed 24th March, 2022). 19. 19.Tegally, H. et al. A novel and expanding SARS-CoV-2 Variant, B. 1.1. 318, dominates infections in Mauritius. medRxiv [https://doi.org/10.1101/2021.06.16.21259017](https://doi.org/10.1101/2021.06.16.21259017) (2021). 20. 20.Zhang L, Cui Z, Li Q, Wang B, Yu Y, Wu J, Nie J, Ding R, Wang H, Zhang Y, Liu S. Ten emerging SARS-CoV-2 spike variants exhibit variable infectivity, animal tropism, and antibody neutralization. Communications biology. 2021 Oct 13;4(1):1–. 21. 21.Weisblum Y, Schmidt F, Zhang F, DaSilva J, Poston D, Lorenzi JC, Muecksch F, Rutkowska M, Hoffmann HH, Michailidis E, Gaebler C. Escape from neutralizing antibodies by SARS-CoV-2 spike protein variants. Elife. 2020 Oct 28;9:e61312. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.61312&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 22. 22.Galloway SE, Paul P, MacCannell DR, Johansson MA, Brooks JT, MacNeil A, Slayton RB, Tong S, Silk BJ, Armstrong GL, Biggerstaff M. Emergence of SARS-CoV-2 b. 1.1. 7 lineage—united states, december 29, 2020–january 12, 2021. Morbidity and Mortality Weekly Report. 2021 Jan 1;70(3):95. 23. 23.Emary KR, Golubchik T, Aley PK, Ariani CV, Angus B, Bibi S, Blane B, Bonsall D, Cicconi P, Charlton S, Clutterbuck EA. Efficacy of ChAdOx1 nCoV-19 (AZD1222) vaccine against SARS-CoV-2 variant of concern 202012/01 (B. 1.1. 7): an exploratory analysis of a randomised controlled trial. The Lancet. 2021 Apr 10;397(10282):1351–62. 24. 24.Chaillon, A. & Smith, D. M. Phylogenetic analyses of SARS-CoV-2 B.1.1.7 lineage suggest a single origin followed by multiple exportation events versus convergent evolution. Clinical Infectious Diseases 73, 12, 2314–2317 (2021). 25. 25.Washington, N. L. et al. (2021). Emergence and rapid transmission of SARS-CoV-2 B.1.1.7 in the United States. Cell 184, 10, 2587–2594. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2021.03.052&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33861950&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 26. 26.Davies, N. G. et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B. 1.1. 7 in England. Science 372, 6538, eabg3055 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjE3OiIzNzIvNjUzOC9lYWJnMzA1NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIyLzA5LzI1LzIwMjIuMDkuMjQuMjIyODAyNjkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 27. 27.du Plessis, L. et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science (New York, N.Y.), 371, 6530, 708–712 (2021). 28. 28.Tegally H, San JE, Cotten M, Tegomoh B, Mboowa G, Martin DP, Baxter C, Moir M, Lambisia A, Diallo A, Amoako DG. The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance. medRxiv. 2022 Jan 1. 29. 29.Kleynhans J, Tempia S, Wolter N, von Gottberg A, Bhiman JN, Buys A, Moyes J, McMorrow ML, Kahn K, Gómez-Olivé FX, Tollman S. SARS-CoV-2 Seroprevalence in a rural and urban household cohort during first and second waves of infections, South Africa, July 2020–March 2021. Emerging infectious diseases. 2021 ec;27(12):3020. 30. 30.Wang H, Paulson KR, Pease SA, Watson S, Comfort H, Zheng P, Aravkin AY, Bisignano C, Barber RM, Alam T, Fuller JE. Estimating excess mortality due to the COVID-19 pandemic: a systematic analysis of COVID-19-related mortality, 2020–21. The Lancet. 2022 Apr 16;399(10334):1513–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(21)02796-3&link_type=DOI) 31. 31.Cohen C, Kleynhans J, von Gottberg A, McMorrow ML, Wolter N, Bhiman JN, Moyes J, du Plessis M, Carrim M, Buys A, Martinson NA. SARS-CoV-2 incidence, transmission, and reinfection in a rural and an urban setting: results of the PHIRST-C cohort study, South Africa, 2020–21. The Lancet Infectious Diseases. 2022 Mar 14. 32. 32.Botti-Lodovico, Y. et al. The Origins and Future of Sentinel: An Early-Warning System for Pandemic Preemption and Response. Viruses 13, 8, 1605 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/v13081605&link_type=DOI) 33. 33.Siddle, K. J. et al. Genomic Analysis of Lassa Virus during an Increase in Cases in Nigeria in 2018. The New England Journal of Medicine 379, 18, 1745–1753 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa1804498&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 34. 34.Ajogbasile, F. V. et al. Real-time metagenomic analysis of undiagnosed fever cases unveils a yellow fever outbreak in Edo State, Nigeria. Scientific Reports 10, 1, 1–6 (2020). 35. 35.Oluniyi, P. First African SARS-CoV-2 genome sequence from Nigerian COVID-19 case. Virological [https://virological.org/t/first-african-sars-cov-2-genome-sequence-from-nigerian-covid-19-case/421](https://virological.org/t/first-african-sars-cov-2-genome-sequence-from-nigerian-covid-19-case/421) (2020). 36. 36.Rambaut, A. et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol [https://doi.org/10.1038/s41564-020-0770-5](https://doi.org/10.1038/s41564-020-0770-5) (2020). 37. 37.Katoh, K., Misawa, K., Kuma, K., & Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research, 30, 14, 3059–3066 (2002). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkf436&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12136088&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000177154300016&link_type=ISI) 38. 38.Minh, B.Q., Schmidt, H.A., Chernomor, O., Schrempf, D., Woodhams, M.D., von Haeseler, A. & Lanfear, R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msaa015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32556291&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 39. 39.Sagulenko, P., Puller, V. & Neher, R.A. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 4, 1, vex042 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/vex042&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29340210&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 40. 40.Hadfield, J., Megill, C., Bell, S. M., Huddleston, J., Potter, B., Callender, C., Sagulenko, P., Bedford, T. & Neher, R. A. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics (Oxford, England), 34, 23, 4121–4123 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10/gdkbqx&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 41. 41.Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Global Challenges, 1, 33–46 (2017). 42. 42.Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics, 34, 18, 3094–3100 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/bty191&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29750242&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 43. 43.Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K., Von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature methods, 14, 587–589 (2017). 44. 44.Suchard, M. A., Lemey, P., Baele, G., Ayres, D. L., Drummond, A. J. & Rambaut, A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus evolution, 4, 1, vey016 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/vey016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29942656&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 45. 45.Hasegawa, M., Kishino, H. & Yano, T. A.. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. Journal of Molecular Evolution, 22, 2, 160–174 (1985). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02101694&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=3934395&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1985AWB1600007&link_type=ISI) 46. 46.Drummond, A. J., Ho, S. Y. W., Phillips, M. J. & Rambaut, A. Relaxed phylogenetics and dating with confidence. PLoS Biology, 4, 5, e88 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pbio.0040088&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16683862&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 47. 47.Drummond, A. J., Rambaut, A., Shapiro, B. E. T. H. & Pybus, O. G. Bayesian coalescent inference of past population dynamics from molecular sequences. Molecular Biology and Evolution, 22, 5, 1185–1192 (2005). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msi103&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15703244&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000228700200004&link_type=ISI) 48. 48.Ayres, D. L. et al. BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics. Systematic Biology, 61, 1, 170–173 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/sysbio/syr100&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21963610&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 49. 49.Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Systematic Biology, 67(5), 901 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/sysbio/syy032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29718447&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F25%2F2022.09.24.22280269.atom) 50. 50.Lemey, P., Rambaut, A., Drummond, A. J. & Suchard, M. A. Bayesian phylogeography finds its roots. PLoS Computational Biology, 5, 9, e1000520 (2009). 51. 51.Worobey, M. et al. The emergence of SARS-CoV-2 in Europe and North America. Science, 370, 6516, 564–570 (2020). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzAvNjUxNi81NjQiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wOS8yNS8yMDIyLjA5LjI0LjIyMjgwMjY5LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 52. 52.Lemey, P. et al. Untangling introductions and persistence in COVID-19 resurgence in Europe. Nature, 595, 7869, 713–717 (2021). 53. 53.Gangavarapu K, Latif AA, Mullen JL, Alkuzweny M, Hufbauer E, Tsueng G, Haag E, Zeller M, Aceves CM, Zaiets K, Cano M. Outbreak. info genomic reports: scalable and dynamic surveillance of SARS-CoV-2 variants and mutations. Research square. 2022 Jan 1.