Abstract
Prolonged infection of SARS-CoV-2 represents a challenge to the development of effective public health policies to control the COVID-19 pandemic. The reason why some people have persistent infection and how the virus survives for so long are still not fully understood. For this reason, we aimed to investigate the intra-host evolution of SARS-CoV-2 during persistent infection. Thirty-three patients who remained RT-PCR positive in the nasopharynx for at least 16 days were included in this study. Complete SARS-CoV-2 sequences were obtained for each patient at two time points. Phylogenetic, populational, and computational analysis of viral sequences confirmed persistent infection with evidence for a transmission cluster in health care professionals that shared the same workplace. A high number of missense variants targeting crucial structural and non-structural proteins such as Spike and Helicase was found. Interestingly, longitudinal acquisition of substitutions in Spike protein mapped many SARS-CoV-2 predicted T cell epitopes. Furthermore, the mutational profiles observed were suggestive of RNA editing enzyme activities, indicating innate immune mechanisms of the host cell. Viral quasispecies analysis corroborates persistent infection mainly by increasing richness and nucleotide diversity over time. Altogether, our findings highlight a dynamic and complex landscape of host and pathogen interaction during persistent infection suggesting that the host’s innate immunity shapes the increase of intra-host diversity with possible implications for therapeutic strategies and public health decisions during the COVID-19 pandemic.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
National Commission of Ethics in Research 30161620.0.0000.5257 National Commission of Ethics in Research 30127020.0.0000.0068
Funding Statement
This work was developed in the frameworks of Corona-omica-RJ (FAPERJ = E-26/210.179/2020) and Rede Corona-omica BR MCTI/FINEP (FINEP = 01.20.0029.000462/20, CNPq = 404096/2020-4). The study was also supported by FAPERJ E-26/010.002434/2019 and E-26/210.178/2020 for A.T, and E-26/010.002278/2019 for C.M.V. A.T.R.V. is supported by Conselho Nacional de Desenvolvimento Cientifico e Tecnologico - CNPq (303170/2017-4) and FAPERJ (E-26/202.903/20). R.S.F.J is a recipient of a graduate fellowship from CNPq. C.C.C is supported by FAPERJ (E-26/202.791/2019).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The present study was approved by the National Commission of Ethics in Research (protocol numbers 30161620.0.0000.5257 and 30127020.0.0000.0068). Written informed consent was obtained from all participants.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
NGS data generated in our study is publicly available in SRA-NCBI (www.ncbi.nlm.nih.gov/sra), Bioproject accession PRJNA675840. Genome sequences are also deposited in Gisaid (www.gisaid.org) and the access identifiers are listed in Table S1.