Abstract
Background The origin of divergent SARS-CoV-2 spike sequences found in wastewater, but not in clinical surveillance, remains unclear. These “cryptic” wastewater sequences have harbored many of the same mutations that later emerged in Omicron lineages. We first detected a cryptic lineage in municipal wastewater in Wisconsin in January 2022. Named the “Wisconsin Lineage”, we sought to determine this virus’s geographic origin and characterize its persistence and evolution over time.
Methods We systematically sampled maintenance holes to trace the Wisconsin Lineage’s origin. We sequenced spike RBD domains, and where possible, whole viral genomes, to characterize the evolution of this lineage over the 13 consecutive months that it was detectable.
Findings The persistence of the Wisconsin Lineage signal allowed us to trace it from a central waste-water plant to a single facility, with a high concentration of viral RNA. The viral sequences contained a combination of fixed nucleotide substitutions characteristic of Pango lineage B.1.234, which circulated in Wisconsin at low levels from October 2020 to February 2021, while mutations in the spike gene resembled those subsequently found in Omicron variants.
Interpretation We propose that prolonged detection of the Wisconsin Lineage in wastewater represents persistent shedding of SARS-CoV-2 from an infected individual, with ongoing within-host viral evolution leading to an ancestral B.1.234 virus accumulating “Omicron-like” mutations.
Funding The Rockefeller Foundation, Wisconsin Department of Health Services, Centers for Disease Control and Prevention (CDC), National Institute on Drug Abuse (NIDA), and the Center for Research on Influenza Pathogenesis and Transmission.
Evidence before this study To identify other studies characterizing unusual wastewater-specific SARS-CoV-2 lineages, we conducted a PubMed search using the keywords “cryptic SARS-CoV-2 lineages,” “novel SARS-CoV-2 lineages,” and “wastewater” on May 9, 2023. From the 18 papers retrieved, only two reported detecting wastewater-specific cryptic lineages. These lineages were identified by investigators from this research team in wastewater from California, Missouri, and New York City, but none could be definitively traced to a specific origin.
A third study in Nevada identified a unique recombinant variant (designated Pango lineage XL) in wastewater, which was also discovered in two clinical samples from the same community. However, it remained unclear whether the clinical samples were collected from the same individual(s) responsible for the virus detected in the wastewater. To our knowledge, no prior study has successfully traced cryptic SARS-CoV-2 wastewater lineages back to a specific location. How and where evolutionarily advanced cryptic lineages are introduced into wastewater has remained uncertain.
Added value of this study This study documents the presence and likely source of a novel and highly divergent cryptic SARS-CoV-2 lineage detected in Wisconsin wastewater for 13 months. In contrast to previously reported cryptic lineages, we successfully traced the Wisconsin Lineage to a single facility serving about 30 people. The exceptionally high viral RNA loads at the source account for the detection of these sequences in wastewater collected at the downstream municipal wastewater treatment facility.
Implications of all the available evidence Many SARS-CoV-2 lineages have been found exclusively in wastewater. The high number of unusual mutations found in these wastewater-specific cryptic sequences raises the possibility that they originate from individual prolonged shedders or even non-human sources. The Wisconsin Lineage’s persistence in wastewater, single-facility origin, and heavily mutated Omicron-like genotype support the hypothesis that cryptic wastewater lineages arise from persistently infected humans. As these divergent sequences contain amino acid changes that eventually emerge in circulating viruses, increased global monitoring of such lineages in wastewater could help anticipate future circulating mutations and/or variants of concern.
Competing Interest Statement
YK has received unrelated funding support from Daiichi Sankyo Pharmaceutical, Toyama Chemical, Tauns Laboratories, Shionogi, Otsuka Pharmaceutical, KM Biologics, Kyoritsu Seiyaku, Shinya Corporation and Fuji Rebio.
Funding Statement
This study was made possible by the generous support of the Rockefeller Foundation's Regional Accelerators for Genomics Surveillance (DHO/TCF), Wisconsin Department of Health Services Epidemiology and Laboratory Capacity funds (www.dhs.wisconsin.gov, 144 AAJ8216) to DHO, CDC contract 75D30121C11060 (DHO/TCF), Wisconsin Department of Health Services ELC Wastewater Surveillance funds (www.dhs.wisconsin.gov, 130:AAI8627) to the UW-Madison Wisconsin State Laboratory of Hygiene (WSLH), and NIDA contract 1U01DA053893-01 (MJ).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Disclaimer The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.
Figures 3 and 4 have been updated to include only GenBank sequences. Minor update to Figure 1A. Otherwise the manuscript is the same.
Data Availability
Sequencing data are available in NCBI SRA and Genbank. Additional data is available from https://go.wisc.edu/4134pl. All sequences used for the phylogenetic inferences shown in Figure 4 were obtained from GenBank and can be accessed using the accession numbers available on the GitHub repository accompanying this manuscript (https://github.com/tcflab/wisconsin_cryptic_lineages).