Abstract
Australia experienced widespread COVID-19 outbreaks from infection with the SARS-CoV-2 Delta variant between June 2021 and February 2022. Whole-genome sequencing of virus from an early case revealed a sub-consensus level of sequencing reads supporting a 17-nucleotide frameshift-inducing deletion in ORF7a that truncated the peptide sequence. The variant rapidly became represented at the consensus level (Delta-ORF7aΔ17del) in most of the outbreak cases in Australia. Retrospective analysis of ORF7a deletions in all GISAID clade GK Delta genomes showed that of 4,018,216 genomes, 134,751 (∼3.35%) possessed a deletion in ORF7a, with the ORF7aΔ17del mutation by far the most common. Approximately 99.05% of Delta-ORF7aΔ17del genomes on GISAID originated from the Australian Delta outbreak, and comprised 87% of genomes in the outbreak. In vitro comparison of lineages in cell culture showed a significantly greater proportion of cells were infected with Delta-ORF7aΔ17del than with a contemporaneous Delta variant without ORF7aΔ17del (Delta-ORF7aintact), and the proportion was also measurably higher than an early SARS-CoV-2 strain (A.2.2). These results showed that Delta-ORF7aΔ17del potentially has a slight growth advantage compared to Delta-ORF7aintact. Delta-ORF7aΔ17del viruses still produced ORF7a protein, but significantly less than A.2.2, in a different cellular distribution with a more diffuse expression throughout the cytoplasm of infected cells. These data suggest that the proliferation of Delta-ORF7aΔ17del genomes during the Australian Delta outbreak was likely not a result of an intrinsic benefit of the ORF7aΔ17del mutation, but rather a chance founder effect. Nonetheless, the abundance of different ORF7a deletions in genomes worldwide suggests these have some benefit to virus transmission.
IMPORTANCE Deletions in the ORF7a region of SARS-CoV-2 have been noted since early in the COVID-19 pandemic, but are generally reported as transient mutations that are quickly lost in the population. Consequently, ORF7a deletions are considered disadvantageous to the virus through possible loss-of-function effects. In constrast to these earlier reports, we present the first report of a SARS-CoV-2 variant with an ORF7a deletion that dominated for the entirety of a protracted outbreak, and found no associated fitness disadvantage or advantage in cell culture. The relatively common rise and fall of ORF7a deletion variants over time likely represent chance founder events followed by proliferation until a more fit variant(s) is introduced to the population. Our global clade-level survey of ORF7a deletions will be a useful resource for future studies into this gene region.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
We thank the Royal College of Pathologists of Australasia Quality Assurance Programs (RCPAQAP), Medical Research Future Fund (MRFF) and the National Health and Medical Research Council (NHMRC) for financial support (COVID-19 Genomics Grant MRF9200006).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All consensus genomes analysed in this study were downloaded from GISAID. The full list of accessions and associated metadata is available via http://gisaid.org/EPI_SET_220809bs or https://doi.org/10.55876/gis8.220809bs, and the list of accessions for genomes in our filtered data set upon which our conclusions are based is within Table S1. The custom program designed for the purposes of analysis in this study is freely available from https://github.com/charlesfoster/deletion_detector or https://doi.org/10.5281/zenodo.7049120. All other data are contained within the manuscript and its supplementary files.