Intrahost evolution leading to distinct lineages in the upper and lower respiratory tracts during SARS-CoV-2 prolonged infection ================================================================================================================================ * Majdouline El Moussaoui * Sebastien Bontems * Cecile Meex * Marie-Pierre Hayette * Marie Lejeune * Samuel L. Hong * Simon Dellicour * Michel Moutschen * Nadine Cambisano * Nathalie Renotte * Vincent Bours * Gilles Darcis * Maria Artesi * Keith Durkin ## Abstract Accumulating evidence points to persistent severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections in immunocompromised individuals as a source of genetically divergent, novel lineages, generally characterised by increased transmissibility and immune escape. While intrahost evolutionary dynamics of the virus in chronically infected patients have been previously reported, existing knowledge is primarily based on samples obtained from the nasopharyngeal compartment. In this study, we investigate the intrahost evolution and genetic diversity that accumulated during a prolonged SARS-CoV-2 infection with the Omicron sublineage BF.7, estimated to have persisted for over one year in an immunosuppressed patient. Based on the sequencing of eight viral genomes collected from the patient at six time points, we identified 86 intrahost single-nucleotide variants (iSNVs), two indels, and a 362 bp deletion. Our analysis revealed distinct viral genotypes in the nasopharyngeal (NP), endotracheal aspirate (ETA), and bronchoalveolar (BAL) samples. Notably, while significant divergence was observed between NP and BAL samples, most of the iSNVs found in ETA samples were also detected in NP or BAL samples. This suggests that NP samples may not offer a comprehensive representation of the overall intrahost viral diversity. Nonsynonymous mutations were most frequent in the spike and envelope genes, along with loss-of-function mutations in ORF8, generated by a frameshift mutation and a large deletion detected in the BAL and NP samples, respectively. Using long-range PCR on SARS-CoV-2 samples sequenced as part of routine surveillance, we validated that similar deletions causing ORF8 loss of function can be carried by SARS-CoV-2 during acute infection. Our findings not only demonstrate that the Omicron sublineage BF.7 can further diverge from its already exceptionally mutated state but also highlight that patients chronically infected with SARS-CoV-2 can develop genetically specific viral populations across distinct anatomical compartments. This provides novel insights into the intricate nature of viral diversity and evolution dynamics in persistent infections. Keywords * SARS-CoV-2 * COVID-19 * within-host evolution * genetic compartmentalization * intrahost evolution * chronic infection * prolonged infection * immunosuppression * ORF8 ## Introduction The SARS-CoV-2 pandemic has been characterised by the repeated emergence of divergent lineages with a significant growth advantage over contemporary variants [1,2]. These variants of concern (VOCs) generally exhibit enhanced transmissibility and the ability to evade pre existing acquired immunity, posing significant challenges to the containment of SARS-CoV-2 spread. The World Health Organization (WHO) has designated five VOCs of SARS-CoV-2, namely, the Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), Delta (B.1.617.2), and lastly Omicron (B.1.1.529) in November 2021 [2]. Compared with previous variants, Omicron encompassed a substantially larger number of unique mutations in the spike gene, contributing to antigenic distance and probably occurring at least partly in the context of a chronic infection. The emergence of Omicron raised immediate global concerns, driven by its rapid global spread attributed to its remarkable ability to evade immune responses induced by prior vaccination or infections, increased transmission rate, and high tropism for nasal epithelial cells [3–5]. Subsequently, several antigenically drifted sublineages of Omicron (e.g., BA.4/5, BA.2.75.2, BA.4.6, BQ.1.1, and XBB) emerged and supplanted prior subvariants [6]. Compared to the BA.2 lineage, BA.4 and BA.5 have demonstrated increased escape from antibody neutralization, attributed to mutations at antigenically potent RBD positions, notably L452R and F486V [7]. Although a large fraction of the mutations are localised in the Spike’s receptor-binding domain (RBD) or N-terminal domain (NTD), accumulating evidence suggests that mutations impacting viral genes outside of spike play critical roles in the fitness of Omicron sublineages [8]. More recently, the second half of 2023 saw the emergence of the novel lineage BA.2.86, an Omicron sublineage, which had accumulated over thirty mutations in the spike protein compared to its BA.2 progenitor. Although BA.2.86 initially showed modest growth, the acquisition of additional mutations led to the daughter lineage JN.1, which has subsequently become the dominant lineage worldwide [9]. Newly emergent VOCs generally perch atop a long ancestral phylogenetic branch, rooted in a historic lineage absent from the current variant landscape [6]. While SARS-CoV-spillover/spillback to and from an animal host could potentially generate similar long phylogenetic branches [10], recent evidence suggests that immunocompromised individuals with persistent infections are the main source of these divergent lineages [2,11]. Since the early stages of the pandemic, long-term replication has been documented in chronically infected immunocompromised individuals, accelerating intrahost viral evolution and leading to the acquisition of amino acid changes that often overlap with those identified in VOCs [12–14]. Therefore, monitoring of SARS-CoV-2 mutations and understanding intra-host viral evolution in immunocompromised individuals is a priority. In this report, we characterised the intrahost genetic diversity and evolution of SARS-CoV-2 in an immunocompromised patient with B-cell depletion due to diffuse large B-cell lymphoma (DLBCL), showing persistent SARS-CoV-2 Omicron BF.7 replication, suspected to have spanned over a year. In contrast to most chronic SARS-CoV-2 infections reported in the literature, we sequenced SARS-CoV-2 from different specimen sites collected on the same day, including nasopharyngeal swab (NP), endotracheal aspirate (ETA), and bronchoalveolar lavage (BAL) samples. The resulting sequences revealed a deep divergence in the viral lineages at opposite ends of the respiratory tract, with a mixture of viral populations in the ETA sample. These findings highlight that NP samples only capture a subset of the diversity present in chronic infection and that a negative test does not rule out the presence of a SARS-CoV-2 reservoir in immunocompromised individuals. ## Results ### The identification of a persistent SARS-CoV-2 infection in a immunosuppressed host The patient, in their sixties, suffered from follicular lymphoma (Grade 2, clinical stage IV), which subsequently progressed into diffuse large B-cell lymphoma (DLBCL). During the end of 2021, they underwent treatment with four cycles of rituximab with cyclophosphamide, doxorubicin, vincristine and prednisone (R-CHOP), followed by two cycles of rituximab alone, resulting in a complete remission of the disease. Since February 2022, the patient has been receiving bimonthly infusions of rituximab as maintenance therapy. In addition, the patient’s medical history was also notable for a first mild episode of COVID-19 in January 2022, which resolved over a few days. In April, reverse-transcriptase polymerase chain reaction (RT-PCR) for SARS-CoV-2 on NP was negative. Of note, the patient had received three doses of vaccine (two doses of ChAdOx1-S followed by one of BNT162b2) between March and October 2021, resulting in detectable anti-trimeric spike protein-specific IgG antibodies in June 2021. In July 2022, they were admitted for fever, cough, shortness of breath, haemoptysis, diarrhoea, and generalised weakness, evolving over a period of four days. Chest computed tomography (CT) showed diffuse and bilateral ground-glass opacities with moderate involvement (20 to 25%) taking consolidated appearance in the middle lobe, as well as a small bilateral pleural effusion. SARS-CoV-2 PCR performed on an NP swab was positive (RdRp Gene, cycle threshold [Ct] = 35), and the patient was diagnosed with COVID-19 pneumopathy. They were treated with a five-day course of Molnupiravir and required transient oxygen supplementation, but eventually evolved favourably and were discharged home seven days after admission. The maintenance therapy with Rituximab was also discontinued. During the subsequent months, the patient reported intermittent fatigue, low-grade fever, cough, blood-tinged sputum, and dyspnea. Chest CT revealed diffuse ground-glass opacities with new upper left and middle lobe consolidations suggestive of organising pneumonia (OP). After exclusion of infectious alternative diagnoses including notably a negative RT-PCR for SARS-CoV-2 on NP sample, treatment with methylprednisolone (8 mg/day) was initiated in September 2022, resulting in partial resolution of the symptomatology over a period of three weeks. However, shortly after discontinuing corticosteroids, symptoms recurred, prompting their reintroduction by the end of October 2022. Between December 2022 and April 2023, the clinical condition was marked by systematic recurrence of respiratory symptoms following each attempt of corticosteroid discontinuation. Of note, RT-PCR for SARS-CoV-2 performed on BAL sample remained negative in April 2023. In May 2023, the patient was hospitalised for fever, cough, and moderate respiratory failure that required oxygen supplementation. Chest CT showed a marked increase in diffuse and bilateral ground-glass opacities with severe lung involvement (75%). SARS-CoV-2 RT-PCR on NP swab was positive, confirming the diagnosis of COVID-19 pneumopathy. The patient was treated with dexamethasone and a seven-day course of antibiotic therapy due to a high suspicion of bacterial co-infection. Following an initial clinical amelioration, permitting discharge home with oxygen supplementation and oral corticosteroids, they were readmitted in June due to a subsequent exacerbation of respiratory insufficiency. This progression necessitated an eight-day stay in the intensive care unit (ICU) for the administration of high-flow nasal cannula therapy. After multiple rounds of antibiotic therapy for bacterial superinfections, intravenous immunoglobulin, and high-dose corticosteroid pulses, they gradually improved and were able to return home after approximately three months of hospitalisation. The persistence of the infection during the period was confirmed by repeated positive RT-PCR tests on NP, AET, and BAL samples, with Ct values ranging from 16.1 to 26.4, between May 2023 and September 2023 (126 days). Finally, in October 2023, the RT-PCR test yielded a negative result. The clinical course of the infection is summarised in Figure 1, while the evolution of imaging is displayed in Supplementary Figure S1. ![Supplementary Figure S1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F5.medium.gif) [Supplementary Figure S1.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F5) Supplementary Figure S1. Evolution of the lesions visualized with chest CT scan over a period of one year. 1, Contrast-enhanced axial chest CT images performed during the first hospitalization showed bilateral and diffuse ground-glass opacities, consistent with moderate COVID-19 pneumonia (involvement of 20 to 25% of lung volume), and bilateral pleural effusions; 2, Axial chest CT images performed three months later showed a decrease in the extent and density of bilateral ground-glass opacities, which persist in peripheral and subpleural areas, and resolution of pleural effusions, indicating a positive response to treatment; 3, Axial chest CT images showed a decrease in density and a slight reduction in the extent of bilateral ground-glass opacities; 4, Contrast-enhanced axial chest CT images performed during the second hospitalization showed extensive ground-glass opacities with significant densification in both lungs involving more than 75% of lung volume, and the appearance of a 14 mm thick right pleural effusion; 5, Axial chest CT images performed during the third hospitalization showed multiple bilateral ground-glass opacities, with areas showing hyperdense opacities and condensation with a retractile appearance, accompanied by traction bronchiectasis; 6, Axial chest CT images showed a slight intensification of bilateral ground-glass opacities, regression of consolidations in the upper lobes with increased bronchiectasis features in the lower lobes, alongside near-complete regression of pleural effusions, suggesting a discordant evolution of interstitial involvement with potential lower lobar infection; 7, Axial chest CT images showed a worsening of bilateral panlobar pneumonia with increased consolidation volume and size of confluence foci, alongside the emergence of a mild bilateral pleural effusion; 8, Axial chest CT images performed after the third hospitalization showed a decrease in bilateral ground-glass opacities and a reduction of consolidative opacities in the lower lobes, right upper lobe, and lingula, with less prominent bronchiectasis features. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F1) Figure 1. Timeline of the clinical history and disease course. The figure illustrates the timeline progression of the patient’s COVID-19 infection along with relevant clinical events. The top section displays the SARS-CoV-2 RT-PCR results and Ct values in NP, ETA, and BAL samples over time. Positive test results are indicated in red, while negative tests are represented in blue. The lower section provides a comprehensive overview of the patient’s clinical history including the history of diffuse large B-cell lymphoma (DLBCL) and its treatment, three hospitalizations (July 2022, May 2023, and from June 2023 to September 2023, including an eight-day ICU stay in June 2023) related to COVID-19, and the therapeutic interventions administered to the patient. Abbreviations: BAL = broncho-alveolar lavage, Ct = cycle threshold, DLBCL = diffuse large B-cell lymphoma, ETA = endotracheal aspirate, HFNC = High-flow nasal cannula therapy, ICU = intensive care unit, NP= nasopharyngeal swab, R-C = Rituximab with cyclophosphamide, doxorubicin, vincristine and prednisone (R-CHOP), RTX = Rituximab. ### Genomic sequencing points to chronic infection lasting over a year In total, we sequenced eight samples from the patient collected over an 85-day period, between June and September 2023. These samples were found to be derived from the almost extinct Omicron BA.5 sublineage BF.7 (Fig. 2A). This sublineage saw its Belgian peak in October 2022, accounting for 17.4% of sequenced samples (Fig. 2B). The sublineage declined in the following months, with the last Belgian example sequenced in February 2023. The patient’s consensus genomes were placed onto the existing SARS-CoV-2 phylogenetic tree using the UShER tool [15]. They sat at the end of a long phylogenetic branch, separated by over 25 single nucleotide variants (SNVs) from their closest relatives, which were collected towards the end of 2022 (Fig. 2C). ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F2) Figure 2. Dynamics of BF.7 sublineage and phylogenetic tree. (A) Temporal distribution of BF.7 sublineage (BA.5.2.1.7) samples from GISAID. (B) Proportion of public Belgian genomes from GISAID belonging to BF.7 sublineage sampled between May 2022 and May 2023. The number of samples that were sequenced each month is indicated above. (C) Maximum clade credibility (MCC) tree resulting from a time-scaled phylogenetic inference carried out to display the evolutionary relationships among the eight SARS-CoV-2 BF.7 sequences from the patient (NP for nasopharyngeal, ETA for endotracheal aspirate, and BAL for bronchoalveolar lavage) and the most closely related genomes identified in the UShER global phylogenetic tree. Belgian genomes are marked in orange, and nodes with a posterior probability above 0.7 and 0.9 are coloured in grey and blue, respectively. The patient tested positive for SARS-CoV-2 in January 2022 with a subsequent negative test in April. At this time, BF.7 was observed at very low frequencies worldwide (Fig. 2A) and was not detected in Belgium until May 2022 (Fig. 2B), making this first infection an unlikely time point for the establishment of a persistent infection. The patient again tested positive in July 2022, and at this time, BF.7 was the fifth most common lineage observed in Liège, representing 6.3% of samples sequenced during the week they tested positive (Supplementary Fig. S2). The patient had negative tests in August 2022 and in April 2023, but again tested positive in May 2023 and was consistently positive in subsequent tests (Fig. 1). Unfortunately, only samples from June 2023 forward were sequenced, and as a consequence, it is impossible to draw a definitive conclusion. Nevertheless, based on the viral lineage observed in the samples taken from June 2023 on, it would appear that the infection acquired in July 2022 never cleared (despite intervening negative PCR tests) and suggests the patient was chronically infected for over a year. ![Supplementary Figure S2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F6.medium.gif) [Supplementary Figure S2.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F6) Supplementary Figure S2. Lineages observed in Liège, Belgium during the week of the patients 1st hospitalisation for COVID-19 pneumopathy, July 2022. ### Distinct viral populations in the upper and lower respiratory tracts While the presence of a long branch leading to the patient samples is expected in a prolonged infection, the relative placement of the consensus genomes was initially somewhat surprising. Notably, the six NP samples formed a cohesive cluster, whereas the BAL and ETA samples occupied a distinct branch that showed early divergence from the NP samples, indicating a long period of independent evolution. Across all samples, we identified eighty-six unique intrahost single-nucleotide variants (iSNVs; Fig. 3A), two indels and a 362 bp deletion (MN908947.3:27818-28180) distributed in accessory protein-coding genes ORF7a and ORF8 (Fig. 3B, Supplementary Fig. S4, Table S1). All iSNVs had an allele frequency of 18% or more in at least one sample. Interestingly, substantial disparities were observed in genetic variants found in upper and lower respiratory tract samples. In the first NP sample taken in June, 2023, we observed 29 iSNVs as well as the 362 bp deletion, all with an allele frequency of 87% or higher (Fig. 3A). In contrast, in the BAL sample collected on the same date, 54 iSNVs and two indels were found, with frequencies varying between 19 and 99% (Fig 3A). Remarkably, only four of these genetic variations were found in both samples (Fig. 3B), explaining the significant divergence observed in the branches of the phylogenetic tree (Fig. 1C). Regarding the ETA sample, we observed 68 iSNVs with allele frequencies ranging from 9 to 97%. Most of iSNVs found in ETA samples were also identified in NP or BAL samples. Only two iSNVs were unique to the ETA sample (Fig 3B). In comparison to the BAL and ETA samples, iSNVs identified in NP samples generally showed high allele frequencies, with variants appearing and disappearing relatively abruptly between samples (Fig. 3A). Over an 85-day period, we observed the emergence of six iSNVs and one reversion in the subsequent five NP samples. Notably, in the last sample, only three of the new iSNVs and the reversion persisted, as three had been lost in the intervening time (Fig. 3A). ![Supplementary Figure S3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F7.medium.gif) [Supplementary Figure S3.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F7) Supplementary Figure S3. Long range PCR and nanopore sequencing of the 362bp deletion impacting ORF7b and ORF8 observed in the NP sample and a subset of the reads in the ETA sample. ![Supplementary Figure S4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F8.medium.gif) [Supplementary Figure S4.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F8) Supplementary Figure S4. Mutation spectra of iSNVs in BAL, NP and all iSNVs, as well as SNVs in ancestral BF.7 lineage. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F3) Figure 3. Evolution of patient-specific variants overtime according to different anatomical compartments. (A) Frequency of iSNV across the upper and lower respiratory tracts over time, day 0 corresponds to the first samples sequenced, taken June 2023. The top panel shows iSNV at the 5’ end of the virus; the bottom shows the 3’ end. Genomic position corresponding to ancestral Wuhan-1 (MN908947.3) is shown on the bottom, while amino acid changes are indicated on top. Colour intensity reflects allele frequency. Abbreviations: NP = nasopharyngeal, BAL = bronchoalveolar lavage, ETA = endotracheal aspirate. (B) Patient-specific mutations from upper and lower respiratory tracts are mapped to the SARS-CoV-2 genome, displaying viral proteins. The graph shows the frequency of alleles for variants found in NP, BAL, and ETA samples collected at six different time points. The height of the line represents the allele frequency. The highest allele frequency observed is used for NP samples at the six time points combined. Variants that are exclusively found in BAL and ETA samples are represented in blue, those found only in NP and ETA are shown in orange, and ETA-exclusive variants are shown in purple. Variants that are present in all the samples are represented in black. iSNVs are represented by circles, indels by polygons, and deletions by shaded rectangles. ### iSNVs mutational patterns The iSNV mutational landscape was dominated by C-U transitions (Fig. 4A), which is in line with previous reports [16–18]. Despite treatment with Molnupiravir [19] the frequency of G-to-A iSNV was similar to the G-to-A SNVs seen in ancestral BF.7 (Supplementary Fig. S4). When we compared the mutation patterns between samples collected at different anatomical sites, we did not observe a marked difference between BAL and NP samples (Fig. 4A). Additionally, the frequency of synonymous and nonsynonymous mutations in iSNVs was similar in BAL and NP samples (Fig. 4B). Considering all iSNVs, we further examined the proportion and number per kilobase of synonymous versus nonsynonymous changes in SARS-CoV-2 proteins impacted by at least three iSNVs. The spike and envelope proteins showed the highest frequency of nonsynonymous changes (Fig. 4C-D). We then investigated NP and BAL-specific iSNVs for the three SARS-CoV-2 proteins with the highest number of variants. We observed a similar pattern for spike and envelope in both BAL and NP samples. For NSP3 in the NP samples there was no difference in the frequency or the number of synonymous and nonsynonymous changes, while in the BAL sample there was a modest difference (Fig. 4E-F). Deep mutational scanning of the complete XBB.1.5 spike reported by Dadonaite and colleagues [20] suggests that key changes in the BAL spike include the F375S and E619A, both of which are predicted to increase ACE2 binding [20]. Conversely, the substitution T523I in NP samples was predicted to enhance ACE2 binding, while the substitution V976A may contribute to immune escape, although at the expense of ACE2 binding [20] (Supplementary Table S2). View this table: [Supplementary Table S2.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/T1) Supplementary Table S2. Estimated impact of spike amino acid changes, values taken from Dadonaite *et al*. (2023)19. GISAID search was performed 2024-01-18, Low coverage genomes were excluded. Abbreviation: BAL = bronchoalveolar, NP = nasopharyngeal ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F4) Figure 4. Intrahost variant impacts. (A) Mutation spectra for all iSNVs observed; (B) Frequency of synonymous versus nonsynonymous mutations in BAL iSNV, NP iSNV and all iSNV combined; (C) Frequency of synonymous versus nonsynonymous mutations specifically in viral proteins with three or more iSNVs; (D) Quantification of iSNV per kb in viral proteins with three or more iSNV; (E) Frequency of synonymous versus nonsynonymous mutations in BAL and NP for three viral proteins (NPS3, spike and envelope); (F) Frequency of synonymous versus nonsynonymous mutations in BAL and NP for three viral proteins (NPS3, spike and envelope) with a presentation of iSNVs per kilobase. ### ORF8 knockouts were also observed in acute infections The frameshift identified in the BAL sample and the large deletion detected in the NP sample resulted in loss of function (LOF) mutations within ORF8. During the routine genomic surveillance of SARS-CoV-2, we have noticed large putative deletions in the ORF7a/b/8 region (discerned through a lack of coverage in the region and, in some instances, via reads spanning the breakpoint). To confirm that such deletions can be found in acute infections, we performed long range PCR on two routine samples suspected to have these deletions. The results revealed an XBB.1.5 sublineage carrying a 306 pb deletion within ORF8 and a CH.1.1 sublineage carrying an 872 bp deletion, which removes ORF7a, ORF7b and ORF8 (Supplementary Fig. S5). We used UShER to place these genomes on the SARS-CoV-2 phylogenetic tree and found that most of the adjacent genomes had either a gap or Ns in the same region (Supplementary Table S6), strongly indicating the presence of the respective deletions. ![Supplementary Figure S5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F9.medium.gif) [Supplementary Figure S5.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F9) Supplementary Figure S5. Long-range PCR across to ORF7a/b/8 sequenced with nanopore. Two examples of large deletions in circulating lineages (CH.1.1 and XBB.1.5) were observed during routine SARS-CoV-2 genomic surveillance at the University of Liège (Belgium) in 2023. The deletion seen in the NP sample is shown below these. The last two panels show the reads from the BAL separated based on whether they carry the ORF8 frameshift or the wild-type sequence. ![Supplementary Figure S6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/04/2024.03.03.24303157/F10.medium.gif) [Supplementary Figure S6.](http://medrxiv.org/content/early/2024/03/04/2024.03.03.24303157/F10) Supplementary Figure S6. Neighbouring samples in the global phylogenetic tree generated by UShER for the CH.1.1 sample with ORF7a/b/8 deletion and the XBB.1.5. sample with ORF8 deletion Both are indicated by an arrow. ## Discussion The case reported above documents a persistent SARS-CoV-2 infection in a patient undergoing B-cell depletion therapy with anti-CD20 therapy for diffuse large B-cell lymphoma. The chronic infection was confirmed through migratory lung infiltrates and positive RT-PCR tests from samples collected in the upper and lower respiratory tracts. Determining the exact duration of chronic infection in this patient is a challenge, as sequencing of samples only began in June 2023, and the patient intermittently tested negative for SARS-CoV-2. Nonetheless, we could confirm via sequencing that the infection endured for more than three months (between June and September 2023) and, based on the archaic lineage observed in the viral sequences, it is likely that the infection was acquired in June 2022. These findings suggest a prolonged infection lasting over a year and highlight the difficulty in diagnosing a low-grade SARS-CoV-2 viral infection in an immunocompromised patient. One of the strengths of this study is the examination of multiple samples from different anatomical compartments, and sampling at various time points, providing a broader understanding of the infection. A chronic infection could potentially be missed if data were only based on samples collected from the same site and over shorter periods. The extended reconstitution period of the B-cell population after rituximab treatment (six to twelve months), leading to significantly impaired autologous B-cell function stands out as one of the most influential risk factors for prolonged SARS-CoV-2 infection and viral intra-host evolution among immunocompromised individuals [21,22]. Consequently, this population represents a crucial target for genetic surveillance of viral evolution in immunocompromised patients. In this study, we obtained sequence data from eight positive specimens, covering six time points, including six nasopharyngeal samples, one endotracheal, and one bronchoalveolar sample. The mutational pattern showed predominance of C-U transitions, which is in line with the pattern of substitutions commonly observed in SARS-CoV-2 [16,23]. While the patient received Molnupiravir before sample collection, there was no characteristic excess of G-to-A mutations observed in the iSNV [19], suggesting this treatment did not have a major impact on the evolution of the viral population in the patient. Furthermore, we observed a higher proportion of non-synonymous changes relative to synonymous changes in the spike and envelope genes, with no discernible difference between NP and BAL specimens. The interpretation of amino acid changes is commonly based on their deviation from the Wuhan-Hu-1 reference sequence. However, following ∼4 years of selection and drift the lineages that now dominate differ from the reference genome at a number of positions across the viral genome, especially in regions coding for viral proteins that are targeted by the host adaptive immune system. Like Omicron lineages BA.2 and BA.5, the BF.7 sublineage carries the amino acid change S375F in the spike RBD. While the S375F mutation was still present in NP samples, the sequence obtained from the BAL sample reverted to the wild-type sequence. Recently deep mutational scanning using the XBB.1.5 and BA.2 spikes by Dadonaite and colleagues in 2023 [20] showed that a number of amino acid changes at position 375 (including the reversion to the reference) can increase ACE2 binding and highlights that the impact of an amino acid change is often context dependent. Among the remaining 11 amino acid changes observed in spike, E619A (BAL) and T523I (NP) are predicted to increase ACE2 binding, while V976A (NP) facilitates immune escape but reduces ACE2 binding (Supplementary Table S2). S:P330S (BAL) is predicted to cause a modest increase in ACE2 binding and has been observed in other chronic infections involving immunosuppressed individuals [14,20,24]. All the spike amino acid changes, except for the revertant at position 375, are currently observed at a low frequency in GISAID (Supplementary Table 2). However, if we use persistent infections involving pre-VOC lineages as a guide for Omicron persistent infections, it is reasonable to assume that some of the amino acid changes observed in the current case may appear in future lineages that rise to prominence, especially if these lineages are derived from a BA.5/4 ancestor. The spike protein plays a central role in cellular entry and represents a primary target for the adaptive immune system. Consequently, this protein has been subject to extensive scrutiny relative to the other 14 open reading frames of SARS-CoV-2. However, there is clear evidence that genetic variations outside the spike protein play a role in the pathogenicity and fitness of SARS-CoV-2. Recent work using a recombinant SARS-CoV-2 consisting of the spike of Omicron (BA.1) and the backbone of an ancestral isolate showed higher virulence than BA.1, with mutations in nsp6 identified as the main driver of this difference in phenotype [25]. It has also been noted that BA.5 outcompeted BA.4 despite having identical spike proteins [26]. Among the amino acid changes observed outside the spike in the current case, a number have previously been associated with chronic infections. Notably, E:T30I has been observed in multiple chronic infections [26,27] but is rarely seen in circulating isolates (0.01% of GISAID genomes in January 2024) and appears to have a negative impact on the virus’s ability to transmit to new hosts [27,28]. Likewise, H125Y in the M protein, has also been identified in multiple chronic infections [24] and has been observed to arise in cell culture settings [29]. It is infrequent in sequenced genomes (0.1% in GISAID as of January 2024), yet computational analyses suggest a fitness advantage for the virus harbouring this change [24,28]. Unlike the previous amino acid changes, NSP3:K977Q has previously been observed in a VOC (Gamma, P.1) and has been detected in the context of chronic infections as well as within cryptic lineages sampled from wastewater [30–32]. This amino acid change enhances the activity of the SARS-CoV-2 papain-like protease, potentially modulating the host’s innate immune response [33]. Finally while NSP5:K90R has not been implicated in chronic infections, of the amino acid changes observed in the patient it was the one with the highest impact on fitness based on the work of Bloom et al [28] (Supplementary Table 1) and like NSP3:K977Q it may play a role in modulating the innate immune response [34]. These findings underscore the diverse impacts of non-spike protein changes on viral transmission dynamics and host immune system modulation. Beyond single amino acid changes, sequencing of NP and BAL samples revealed a large deletion and a frameshift mutation in the ORF8 region, resulting in a complete loss of function of the viral accessory protein. Notably, ORF8 is recognized as one of the most hypervariable regions in the SARS-CoV-2 genome and presents the highest mutation density among non-structural proteins [28]. Since the early stages of the COVID-19 pandemic, various large deletions affecting ORF8 have been documented, with some found to reduce COVID-19 severity [35–39]. The mutational landscape of ORF8 over the past ∼four years of the pandemic suggests a strong selective pressure driving rapid evolution, leading to ORF8 knockouts across multiple SARS-CoV-2 lineages [35,36,39,40]. Several studies have reported evidence of relaxed purifying selection acting on ORF8, along with other accessory proteins [28], with ORF8 knockout clusters exhibiting a growth rate advantage [41]. Recent research by Kim and colleagues has proposed a biological mechanism underlying positive selection for ORF8 knockouts [42]. They demonstrated that ORF8 can downregulate virion spike protein levels, thereby facilitating immune evasion by reducing the presence of spike on the surface of infected cells [42]. Conversely, the absence of ORF8 increases spike protein levels on the virion, potentially enhancing its ability to infect host cells. This dynamic may confer a net benefit depending on the context. In immunocompromised individuals, viral strains lacking ORF8 could have “the best of both worlds” by exhibiting heightened infectivity without eliciting an immune response against infected cells due to reduced Spike protein exposure on the cell membrane. Moreover, viral strains lacking a complete ORF8 have been reported in immunocompromised patients with chronic SARS-CoV-2 infections, underscoring the importance of detecting ORF8 knockouts in SARS-CoV-2 genomic surveillance efforts [43]. Our phylogenetic analysis revealed that NP samples formed a cohesive cluster, while BAL and ETA samples occupied a branch that diverged from NP samples early on, indicating a prolonged period of independent evolution. Notably, in the initial samples collected on July 14, 2023, the BAL sample carried twenty-four iSNVs (allele frequency >70%) not seen in the NP sample. The NP sample carried twenty-six patient-specific variants (allele frequency >85%) that were not observed in the BAL sample. For reference, the original VOC Alpha had only twenty-three nucleotide changes compared to its ancestral lineage [44]. Although significant divergence was observed between variants detected in NP and BAL samples, most of the iSNVs found in the ETA sample were also identified in NP or BAL samples. This divergence aligns with documented viral diversity in different anatomical sites during acute SARS-CoV-2 infection, particularly when saliva and nasopharyngeal swab samples are tested concurrently [45]. Furthermore, our findings resonate with previous evidence of genetic compartmentalization in SARS-CoV-2 between pulmonary and extra-pulmonary tissues, as observed in post-mortem samples from immunocompromised patients and unvaccinated patients, both infected with SARS-CoV-2 in the first year of the pandemic [46,47]. The results presented above add to an accumulating body of evidence suggesting that in a small number of cases SARS-CoV-2 can form a reservoir in different anatomical compartments [48–50] and that in many cases this reservoir is not always detectable via conventional SARS-CoV-2 testing [48]. This highlights that further investigation on how these tissue reservoirs contribute to SARS-CoV-2 persistence is needed. These observations also have implications for the care of immunocompromised individuals as a negative SARS-CoV-2 test is not a guarantee that an infection has cleared. Alternative specimen types, such as stool should be considered [50] and any SARS-CoV-2 positive samples from immunocompromised patients should be sequenced to distinguish between acute and long-term SARS-CoV-2 infections. ## Materials and Methods ### SARS-CoV-2 genomic surveillance at the CHU of Liège Since the beginning of the pandemic the Department of Human genetics in collaboration with the Department of Clinical Microbiology of the CHU of Liège (Belgium) has been following the evolution of SARS-CoV-2 lineages circulating the region [51–53]. We have been a member of the National Genomic Surveillance Platform for SARS-CoV-2 in Belgium since its inception in 2021 [54]. The samples used in the current study were initially collected and sequences as a part of the Belgian SARS-CoV-2 genomic surveillance effort. ### Real-time PCR and RNA extraction In most cases SARS-CoV-2 was detected in patient samples via the cobas 6800 platform (Roche) using the cobas SARS-CoV-2 real-time PCR assay which detects the ORF1ab and E genes. Alternatively the Alinity SARS-CoV-2 assay (Abbott) or Genexpert SARS-CoV-2 assay (Cepheid) were used. Positive RNA was extracted from the nasopharyngeal swabs or bronchoalveolar lavage (300 µl) with the Maxwell 48 device and the Maxwell RSC Viral TNA kit (Promega) following the manufacturer’s instructions. ### SARS-CoV-2 whole genome sequencing Samples with a cycle threshold (Ct) ≤ 30 on the cobas system were used for sequencing. Reverse Transcription utilised 3.3 µl of eluted RNA, 1.2 µl of SuperScript IV VILOTM Master Mix and 1.5 µl of H2O for a total volume of 6 µl. The mix was incubated at 25°C for 10 min, 50°C for 10 minutes and 85°C for 5 minutes. PCR was carried out using Q5® High-Fidelity DNA Polymerase (NEB) and the ARTIC v5.2.0 1,200 bp amplicon primer pools ([https://github.com/quick-lab/SARS-CoV-2/tree/main/1200/v5.2.0_1200](https://github.com/quick-lab/SARS-CoV-2/tree/main/1200/v5.2.0_1200)) PCR conditions were set up according to the recommendations of the ARTIC Network sequencing protocol ([https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bp2l6n26rgqe/v3](https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bp2l6n26rgqe/v3)) The resultant PCR products were indexed via the Nanopore Rapid Barcoding Kit 96 V14 (SQK-RBK114.96) in the same manner described in Freed et al. 2020 [55]. Sequencing was carried out on a GridION using 10.4.1 flow cells, with Super-accurate basecalling, carried out via Guppy 6.5.7. ### Long range PCR and sequencing In the samples carrying a deletion in the ORF7/8 region we carried out long range PCR with the primers 27\_LEFT (5’-TGGATCACCGGTGGAATTGCTA-3’) and 28_RIGHT (5’-GTTTGGCCTTGTTGTTGTTGGC-3’) from Freed et al. 2020 [55]. Reverse Transcription and PCR conditions were the same as those used for ARTIC whole genome amplicon sequencing protocol. For these PCR products to retain the full-length amplicon we used the Native Barcoding Kit 96 V14 (SQK-NBD114.96) which ligates barcodes to the ends of the amplicon. Sequencing was also carried out on a GridION using 10.4.1 flow cells, with Super-accurate basecalling carried out via Guppy 6.5.7. ### Variant calling and consensus genome generation Consensus genomes were generated via the ARTIC-network field bioinformatics pipeline ([https://github.com/artic-network/fieldbioinformatics](https://github.com/artic-network/fieldbioinformatics)). Consensus sequences were only generated for regions with coverage of ≥30. Low frequency variants were called with LoFreq [56]. Variants were also called with Clair3 [57]. VCFs and BAMs were checked in IGV [58]. Consensus sequence for the long range PCR products were generated via bcftools consensus command ([https://samtools.github.io/bcftools/bcftools.html](https://samtools.github.io/bcftools/bcftools.html)) and manually incorporated to artic-network field bioinformatics pipeline consensus. ### Phylogenetics Consensus genomes were placed on the existing phylogenetic tree via UShER using the phylogenetic tree generated with genomes from GISAID, GeneBank, COG-UK and CNCB. In the case of the BF.7 samples, we used the genomes identified by UShER to infer a time-scaled phylogenetic tree using the software package BEAST v1.10.5 [59] with the following settings: a skygrid demographic model, a relaxed molecular clock with a normal (8E-4,1E-4) prior distribution, and a HKY+G4+I substitution model [60,61]. Three independent Markov Chain Monte Carlo runs were performed, each running for 10^6 iterations and sampling every 10^4 states. Convergence and mixing of all parameters were assessed using Tracer v1.7 [62] and a maximum clade credibility tree was obtained using TreeAnnotator v.1.10.5 [59] after combining all results using LogCombiner v.1.10.5 [59] and discarding the first 10% of samples as burn-in. ## Data availability The consensus SARS-CoV-2 genomes and raw reads are available on GISAID ([www.gisaid.org](http://www.gisaid.org)) under accession numbers EPI\_ISL\_17960749, EPI\_ISL\_18515316, EPI\_ISL\_17960747, EPI\_ISL\_18059076, EPI\_ISL\_18059074, EPI\_ISL\_18059075, EPI\_ISL\_18136392, EPI\_ISL_18274346. All GISAID genomes used in the analysis can be found via EPI_SET ID: EPI_SET_240229zf, doi:10.55876/gis8.240229zf. Raw reads for each sample have also been deposited on the European Nucleotide Archive (ENA) under the accession number PRJEB72450. ## Supplementary data Sup\_Table\_1\_intra\_host_variants.xlsx is available online. ## Ethics statement Methods of collection, testing of biological specimens and protected health information used in this study were approved by the University Hospital of Liège Ethics Review Board, Reference number: 2020-139, informed consent was received from the patient. ## Supporting information Supplementary Table 1 [[supplements/303157_file02.xlsx]](pending:yes) ## Funding This work was supported in part by the Région Wallone project WALGEMED (convention no. 1710180) and the FNRS (H.C.008.20). Sequencing was done as a part of the National Genomic Surveillance Platform for SARS-CoV-2 in Belgium. SD acknowledges support from the *Fonds National de la Recherche Scientifique* (F.R.S.-FNRS, Belgium; grant n°F.4515.22), from the Research Foundation — Flanders (*Fonds voor Wetenschappelijk Onderzoek — Vlaanderen*, FWO, Belgium; grant n°G098321N), and from the European Union Horizon 2020 projects MOOD (grant agreement n°874850) and LEAPS (grant agreement n°101094685). ## Conflict of interest We declare no conflicts of interest. ## Acknowledgements Figure 1 and Supplementary Figure S1 were performed with [www.biorender.com](http://www.biorender.com). * Received March 3, 2024. * Revision received March 3, 2024. * Accepted March 4, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.Obermeyer F, Jankowiak M, Barkas N, Schaffner SF, Pyle JD, Yurkovetskiy L, et al. Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness. Science. 2022;376: 1327–1332. doi:10.1126/science.abm1208 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/science.abm1208&link_type=DOI) 2. 2.Markov PV, Ghafari M, Beer M, Lythgoe K, Simmonds P, Stilianakis NI, et al. The evolution of SARS-CoV-2. Nat Rev Microbiol. 2023;21: 361–379. doi:10.1038/s41579-023-00878-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41579-023-00878-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37020110&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 3. 3.Viana R, Moyo S, Amoako DG, Tegally H, Scheepers C, Althaus CL, et al. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature. 2022 [cited 11 Jan 2022]. doi:10.1038/d41586-021-03832-5 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/d41586-021-03832-5&link_type=DOI) 4. 4.Hui KPY, Ho JCW, Cheung M-C, Ng K-C, Ching RHH, Lai K-L, et al. SARS-CoV-2 Omicron variant replication in human bronchus and lung ex vivo. Nature. 2022;603: 715–720. doi:10.1038/s41586-022-04479-6 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-022-04479-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35104836&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 5. 5.Fischer C, Maponga TG, Yadouleton A, Abílio N, Aboce E, Adewumi P, et al. Gradual emergence followed by exponential spread of the SARS-CoV-2 Omicron variant in Africa. Science. 2022; eadd8737. doi:10.1126/science.add8737 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1126/science.add8737&link_type=DOI) 6. 6.Roemer C, Sheward DJ, Hisner R, Gueli F, Sakaguchi H, Frohberg N, et al. SARS-CoV-2 evolution in the Omicron era. Nat Microbiol. 2023. doi:10.1038/s41564-023-01504-w [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41564-023-01504-w&link_type=DOI) 7. 7.Pather S, Madhi SA, Cowling BJ, Moss P, Kamil JP, Ciesek S, et al. SARS-CoV-2 Omicron variants: burden of disease, impact on vaccine effectiveness and need for variant-adapted vaccines. Front Immunol. 2023;14: 1130539. doi:10.3389/fimmu.2023.1130539 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2023.1130539&link_type=DOI) 8. 8.Jung C, Kmiec D, Koepke L, Zech F, Jacob T, Sparrer KMJ, et al. Omicron: What Makes the Latest SARS-CoV-2 Variant of Concern So Concerning? J Virol. 2022;96: e0207721. doi:10.1128/jvi.02077-21 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1128/jvi.02077-21&link_type=DOI) 9. 9.Kaku Y, Okumura K, Padilla-Blanco M, Kosugi Y, Uriu K, Hinay AA Jr, et al. Virological characteristics of the SARS-CoV-2 JN.1 variant. Lancet Infect Dis. 2024. doi:10.1016/S1473-3099(23)00813-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(23)00813-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38184005&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 10. 10.Feng A, Bevins S, Chandler J, DeLiberto TJ, Ghai R, Lantz K, et al. Transmission of SARS-CoV-2 in free-ranging white-tailed deer in the United States. Nat Commun. 2023;14: 4078. doi:10.1038/s41467-023-39782-x [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-023-39782-x&link_type=DOI) 11. 11.Carabelli AM, Peacock TP, Thorne LG, Harvey WT, Hughes J, COVID-19 Genomics UK Consortium, et al. SARS-CoV-2 variant biology: immune escape, transmission and fitness. Nat Rev Microbiol. 2023. doi:10.1038/s41579-022-00841-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41579-022-00841-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36653446&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 12. 12.Choi B, Choudhary MC, Regan J, Sparks JA, Padera RF, Qiu X, et al. Persistence and Evolution of SARS-CoV-2 in an Immunocompromised Host. N Engl J Med. 2020;383: 2291–2293. doi:10.1056/NEJMc2031364 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMc2031364&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33176080&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 13. 13.Avanzato VA, Matson MJ, Seifert SN, Pryce R, Williamson BN, Anzick SL, et al. Case Study: Prolonged Infectious SARS-CoV-2 Shedding from an Asymptomatic Immunocompromised Individual with Cancer. Cell. 2020;183: 1901–1912.e9. doi:10.1016/j.cell.2020.10.049 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.10.049&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33248470&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 14. 14.Kemp SA, Collier DA, Datir RP, Ferreira IATM, Gayed S, Jahun A, et al. SARS-CoV-2 evolution during treatment of chronic infection. Nature. 2021;592: 277–282. doi:10.1038/s41586-021-03291-y [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03291-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33545711&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 15. 15.Turakhia Y, Thornlow B, Hinrichs AS, De Maio N, Gozashti L, Lanfear R, et al. Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nat Genet. 2021;53: 809–816. doi:10.1038/s41588-021-00862-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00862-7&link_type=DOI) 16. 16.Tonkin-Hill G, Martincorena I, Amato R, Lawson ARJ, Gerstung M, Johnston I, et al. Patterns of within-host genetic diversity in SARS-CoV-2. Elife. 2021;10. doi:10.7554/eLife.66857 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.66857&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34387545&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 17. 17.Chaguza C, Hahn AM, Petrone ME, Zhou S, Ferguson D, Breban MI, et al. Accelerated SARS-CoV-2 intrahost evolution leading to distinct genotypes during chronic infection. Cell Rep Med. 2023;4: 100943. doi:10.1016/j.xcrm.2023.100943 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.xcrm.2023.100943&link_type=DOI) 18. 18.Yi H, Wang J, Wang J, Lu Y, Zhang Y, Peng R, et al. The Emergence and Spread of Novel SARS-CoV-2 Variants. Front Public Health. 2021;9: 696664. doi:10.3389/fpubh.2021.696664 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fpubh.2021.696664&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34409009&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 19. 19.Sanderson T, Hisner R, Donovan-Banfield I’ah, Hartman H, Løchen A, Peacock TP, et al. A molnupiravir-associated mutational signature in global SARS-CoV-2 genomes. Nature. 2023. doi:10.1038/s41586-023-06649-6 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-023-06649-6&link_type=DOI) 20. 20.Dadonaite B, Brown J, McMahon TE, Farrell AG, Asarnow D, Stewart C, et al. Full-spike deep mutational scanning helps predict the evolutionary success of SARS-CoV-2 clades. bioRxiv. 2023. p. 2023.11.13.566961. doi:10.1101/2023.11.13.566961 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMy4xMS4xMy41NjY5NjF2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzAzLzA0LzIwMjQuMDMuMDMuMjQzMDMxNTcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 21. 21.Raglow Z, Surie D, Chappell JD, Zhu Y, Martin ET, Kwon JH, et al. SARS-CoV-2 shedding and evolution in immunocompromised hosts during the Omicron period: a multicenter prospective analysis. medRxiv. 2023. doi:10.1101/2023.08.22.23294416 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMy4wOC4yMi4yMzI5NDQxNnYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMDQvMjAyNC4wMy4wMy4yNDMwMzE1Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 22. 22.Van der Moeren N, Selhorst P, Ha M, Heireman L, Van Gaal P-J, Breems D, et al. Viral Evolution and Immunology of SARS-CoV-2 in a Persistent Infection after Treatment with Rituximab. Viruses. 2022;14. doi:10.3390/v14040752 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/v14040752&link_type=DOI) 23. 23.Popa A, Genger J-W, Nicholson MD, Penz T, Schmid D, Aberle SW, et al. Genomic epidemiology of superspreading events in Austria reveals mutational dynamics and transmission properties of SARS-CoV-2. Sci Transl Med. 2020;12. doi:10.1126/scitranslmed.abe2555 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTE6InNjaXRyYW5zbWVkIjtzOjU6InJlc2lkIjtzOjE1OiIxMi81NzMvZWFiZTI1NTUiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wMy8wNC8yMDI0LjAzLjAzLjI0MzAzMTU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 24. 24.Wilkinson SAJ, Richter A, Casey A, Osman H, Mirza JD, Stockton J, et al. Recurrent SARS-CoV-2 mutations in immunodeficient patients. Virus Evol. 2022;8: veac050. doi:10.1093/ve/veac050 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/veac050&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9384748&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 25. 25.Chen D-Y, Chin CV, Kenney D, Tavares AH, Khan N, Conway HL, et al. Spike and nsp6 are key determinants of SARS-CoV-2 Omicron BA.1 attenuation. Nature. 2023;615: 143–150. doi:10.1038/s41586-023-05697-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-023-05697-2&link_type=DOI) 26. 26.Parsons RJ, Acharya P. Evolution of the SARS-CoV-2 Omicron spike. Cell Rep. 2023;42: 113444. doi:10.1016/j.celrep.2023.113444 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.celrep.2023.113444&link_type=DOI) 27. 27.Harari S, Miller D, Fleishon S, Burstein D, Stern A. Using big sequencing data to identify chronic SARS-Coronavirus-2 infections. Nat Commun. 2024;15: 648. doi:10.1038/s41467-024-44803-4 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-024-44803-4&link_type=DOI) 28. 28.Bloom JD, Neher RA. Fitness effects of mutations to SARS-CoV-2 proteins. Virus Evol. 2023;9: vead055. doi:10.1093/ve/vead055 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/vead055&link_type=DOI) 29. 29.Sonnleitner ST, Sonnleitner S, Hinterbichler E, Halbfurter H, Kopecky DBC, Koblmüller S, et al. The mutational dynamics of the SARS-CoV-2 virus in serial passages in vitro. Virol Sin. 2022;37: 198–207. doi:10.1016/j.virs.2022.01.029 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.virs.2022.01.029&link_type=DOI) 30. 30.Faria NR, Mellan TA, Whittaker C, Claro IM, Candido D da S, Mishra S, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 2021;372: 815–821. doi:10.1126/science.abh2644 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzIvNjU0NC84MTUiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wMy8wNC8yMDI0LjAzLjAzLjI0MzAzMTU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 31. 31.Nabieva E, Komissarov AB, Klink GV, Zaitsev SV, Sergeeva M, Fadeev AV, et al. A highly divergent SARS-CoV-2 lineage B.1.1 sample in a patient with long-term COVID-19. medRxiv. 2023. p. 2023.09.14.23295379. doi:10.1101/2023.09.14.23295379 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMy4wOS4xNC4yMzI5NTM3OXYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMDQvMjAyNC4wMy4wMy4yNDMwMzE1Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 32. 32.Gregory DA, Trujillo M, Rushford C, Flury A, Kannoly S, San KM, et al. Genetic diversity and evolutionary convergence of cryptic SARS-CoV-2 lineages detected via wastewater sequencing. PLoS Pathog. 2022;18: e1010636. doi:10.1371/journal.ppat.1010636 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.ppat.1010636&link_type=DOI) 33. 33.Patchett S, Lv Z, Rut W, Békés M, Drag M, Olsen SK, et al. A molecular sensor determines the ubiquitin substrate specificity of SARS-CoV-2 papain-like protease. Cell Rep. 2021;36: 109754. doi:10.1016/j.celrep.2021.109754 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.celrep.2021.109754&link_type=DOI) 34. 34.Fung S-Y, Siu K-L, Lin H, Yeung ML, Jin D-Y. SARS-CoV-2 main protease suppresses type I interferon production by preventing nuclear translocation of phosphorylated IRF3. Int J Biol Sci. 2021;17: 1547–1554. doi:10.7150/ijbs.59943 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7150/ijbs.59943&link_type=DOI) 35. 35.Young BE, Fong S-W, Chan Y-H, Mak T-M, Ang LW, Anderson DE, et al. Effects of a major deletion in the SARS-CoV-2 genome on the severity of infection and the inflammatory response: an observational cohort study. Lancet. 2020;396: 603–611. doi:10.1016/S0140-6736(20)31757-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)31757-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32822564&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 36. 36.Su YCF, Anderson DE, Young BE, Linster M, Zhu F, Jayakumar J, et al. Discovery and Genomic Characterization of a 382-Nucleotide Deletion in ORF7b and ORF8 during the Early Evolution of SARS-CoV-2. MBio. 2020;11. doi:10.1128/mBio.01610-20 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoibWJpbyI7czo1OiJyZXNpZCI7czoxNDoiMTEvNC9lMDE2MTAtMjAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wMy8wNC8yMDI0LjAzLjAzLjI0MzAzMTU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 37. 37.Tang Z, Yu P, Guo Q, Chen M, Lei Y, Zhou L, et al. Clinical characteristics and host immunity responses of SARS-CoV-2 Omicron variant BA.2 with deletion of ORF7a, ORF7b and ORF8. Virol J. 2023;20: 106. doi:10.1186/s12985-023-02066-3 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12985-023-02066-3&link_type=DOI) 38. 38.Zinzula L. Lost in deletion: The enigmatic ORF8 protein of SARS-CoV-2. Biochem Biophys Res Commun. 2021;538: 116–124. doi:10.1016/j.bbrc.2020.10.045 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.bbrc.2020.10.045&link_type=DOI) 39. 39.Mazur-Panasiuk N, Rabalski L, Gromowski T, Nowicki G, Kowalski M, Wydmanski W, et al. Expansion of a SARS-CoV-2 Delta variant with an 872 nt deletion encompassing ORF7a, ORF7b and ORF8, Poland, July to August 2021. Euro Surveill. 2021;26. doi:10.2807/1560-7917.ES.2021.26.39.2100902 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2807/1560-7917.ES.2021.26.39.2100902&link_type=DOI) 40. 40.Gong Y-N, Tsao K-C, Hsiao M-J, Huang C-G, Huang P-N, Huang P-W, et al. SARS-CoV-2 genomic surveillance in Taiwan revealed novel ORF8-deletion mutant and clade possibly associated with infections in Middle East. Emerg Microbes Infect. 2020;9: 1457–1466. doi:10.1080/22221751.2020.1782271 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/22221751.2020.1782271&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32543353&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 41. 41.Wagner C, Kistler KE, Perchetti GA, Baker N, Frisbie LA, Torres LM, et al. Positive selection underlies repeated knockout of ORF8 in SARS-CoV-2 evolution. bioRxiv. 2023. doi:10.1101/2023.09.21.23295927 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1101/2023.09.21.23295927&link_type=DOI) 42. 42.Kim I-J, Lee Y-H, Khalid MM, Chen IP, Zhang Y, Ott M, et al. SARS-CoV-2 protein ORF8 limits expression levels of Spike antigen and facilitates immune evasion of infected host cells. J Biol Chem. 2023;299: 104955. doi:10.1016/j.jbc.2023.104955 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbc.2023.104955&link_type=DOI) 43. 43.Guilbaud R, Franco Yusti A-M, Leducq V, Zafilaza K, Bridier-Nahmias A, Todesco E, et al. Higher levels of SARS-CoV-2 genetic variation in immunocompromised patients: a retrospective case-control study. J Infect Dis. 2023. doi:10.1093/infdis/jiad499 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/infdis/jiad499&link_type=DOI) 44. 44.Hill V, Du Plessis L, Peacock TP, Aggarwal D, Colquhoun R, Carabelli AM, et al. The origins and molecular evolution of SARS-CoV-2 lineage B.1.1.7 in the UK. Virus Evol. 2022;8: veac080. doi:10.1093/ve/veac080 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/veac080&link_type=DOI) 45. 45.Farjo M, Koelle K, Martin MA, Gibson LL, Walden KKO, Rendon G, et al. Within-host evolutionary dynamics and tissue compartmentalization during acute SARS-CoV-2 infection. J Virol. 2024; e0161823. doi:10.1128/jvi.01618-23 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1128/jvi.01618-23&link_type=DOI) 46. 46.Van Cleemput J, van Snippenberg W, Lambrechts L, Dendooven A, D’Onofrio V, Couck L, et al. Organ-specific genome diversity of replication-competent SARS-CoV-2. Nat Commun. 2021;12: 6612. doi:10.1038/s41467-021-26884-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-021-26884-7&link_type=DOI) 47. 47.Stein SR, Ramelli SC, Grazioli A, Chung J-Y, Singh M, Yinda CK, et al. SARS-CoV-2 infection and persistence in the human body and brain at autopsy. Nature. 2022;612: 758–763. doi:10.1038/s41586-022-05542-y [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-022-05542-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36517603&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 48. 48.Bussani R, Zentilin L, Correa R, Colliva A, Silvestri F, Zacchigna S, et al. Persistent SARS-CoV-2 infection in patients seemingly recovered from COVID-19. J Pathol. 2023;259: 254–263. doi:10.1002/path.6035 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/path.6035&link_type=DOI) 49. 49.Proal AD, VanElzakker MB, Aleman S, Bach K, Boribong BP, Buggert M, et al. SARS-CoV-2 reservoir in post-acute sequelae of COVID-19 (PASC). Nat Immunol. 2023;24: 1616–1627. doi:10.1038/s41590-023-01601-2 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41590-023-01601-2&link_type=DOI) 50. 50.Machkovech HM, Hahn AM, Wang JG, Grubaugh ND, Halfmann PJ, Johnson MC, et al. Persistent SARS-CoV-2 infection: significance and implications. Lancet Infect Dis. 2024;. doi:10.1016/S1473-3099(23)00815-0 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(23)00815-0&link_type=DOI) 51. 51.Bollen N, Artesi M, Durkin K, Hong SL, Potter B, Boujemla B, et al. Exploiting genomic surveillance to map the spatio-temporal dispersal of SARS-CoV-2 spike mutations in Belgium across 2020. Sci Rep. 2021;11: 18580. doi:10.1038/s41598-021-97667-9 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41598-021-97667-9&link_type=DOI) 52. 52.Dellicour S, Durkin K, Hong SL, Vanmechelen B, Martí-Carreras J, Gill MS, et al. A Phylodynamic Workflow to Rapidly Gain Insights into the Dispersal History and Dynamics of SARS-CoV-2 Lineages. Mol Biol Evol. 2020. doi:10.1093/molbev/msaa284 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msaa284&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33316043&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 53. 53.Artesi M, Bontems S, Göbbels P, Franckh M, Maes P, Boreux R, et al. A recurrent mutation at position 26,340 of SARS-CoV-2 is associated with failure of the E-gene qRT-PCR utilized in a commercial dual-target diagnostic assay. J Clin Microbiol. 2020. doi:10.1128/JCM.01598-20 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamNtIjtzOjU6InJlc2lkIjtzOjE1OiI1OC8xMC9lMDE1OTgtMjAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNC8wMy8wNC8yMDI0LjAzLjAzLjI0MzAzMTU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 54. 54.Cuypers L, Dellicour S, Hong SL, Potter BI, Verhasselt B, Vereecke N, et al. Two Years of Genomic Surveillance in Belgium during the SARS-CoV-2 Pandemic to Attain Country-Wide Coverage and Monitor the Introduction and Spread of Emerging Variants. Viruses. 2022;14. doi:10.3390/v14102301 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3390/v14102301&link_type=DOI) 55. 55.Freed NE, Vlková M, Faisal MB, Silander OK. Rapid and inexpensive whole-genome sequencing of SARS-CoV-2 using 1200 bp tiled amplicons and Oxford Nanopore Rapid Barcoding. Biol Methods Protoc. 2020;5: bpaa014. doi:10.1093/biomethods/bpaa014 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biomethods/bpaa014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33029559&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 56. 56.Wilm A, Aw PPK, Bertrand D, Yeo GHT, Ong SH, Wong CH, et al. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. 2012;40: 11189–11201. doi:10.1093/nar/gks918 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gks918&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23066108&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000313414800010&link_type=ISI) 57. 57.Zheng Z, Li S, Su J, Leung AW-S, Lam T-W, Luo R. Symphonizing pileup and full-alignment for deep learning-based long-read variant calling. Nat Comput Sci. 2022;2: 797–803. doi:10.1038/s43588-022-00387-x [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s43588-022-00387-x&link_type=DOI) 58. 58.Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013;14: 178–192. Available: doi:10.1093/bib/bbs017 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bib/bbs017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22517427&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 59. 59.Suchard MA, Lemey P, Baele G, Ayres DL, Drummond AJ, Rambaut A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018;4: vey016. doi:10.1093/ve/vey016 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ve/vey016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29942656&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) 60. 60.Hasegawa M, Kishino H, Yano T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol. 1985;22: 160–174. doi:10.1007/BF02101694 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02101694&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=3934395&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1985AWB1600007&link_type=ISI) 61. 61.Yang Z. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J Mol Evol. 1994;39: 306–314. doi:10.1007/BF00160154 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF00160154&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7932792&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1994PB53900008&link_type=ISI) 62. 62.Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst Biol. 2018;67: 901–904. doi:10.1093/sysbio/syy032 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/sysbio/syy032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29718447&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F04%2F2024.03.03.24303157.atom)