PT - JOURNAL ARTICLE AU - Franco-Muñoz, Carlos AU - Álvarez-Díaz, Diego A. AU - Laiton-Donato, Katherine AU - Wiesner, Magdalena AU - Escandón, Patricia AU - Usme-Ciro, José A. AU - Franco-Sierra, Nicolás D. AU - Flórez-Sánchez, Astrid C. AU - Gómez-Rangel, Sergio AU - Rodríguez-Calderon, Luz D. AU - Barbosa-Ramirez, Juliana AU - Ospitia-Baez, Erika AU - Walteros, Diana M. AU - Ospina-Martinez, Martha L. AU - Mercado-Reyes, Marcela TI - Substitutions in Spike and Nucleocapsid proteins of SARS-CoV-2 circulating in South America AID - 10.1101/2020.06.02.20120782 DP - 2020 Jan 01 TA - medRxiv PG - 2020.06.02.20120782 4099 - http://medrxiv.org/content/early/2020/06/16/2020.06.02.20120782.short 4100 - http://medrxiv.org/content/early/2020/06/16/2020.06.02.20120782.full AB - SARS-CoV-2 is a new member of the genus Betacoronavirus, responsible for the COVID-19 pandemic. The virus crossed the species barrier and established in the human population taking advantage of the spike protein high affinity for the ACE receptor to infect the lower respiratory tract. The Nucleocapsid (N) and Spike (S) are highly immunogenic structural proteins and most commercial COVID-19 diagnostic assays target these proteins. In an unpredictable epidemic, it is essential to know about their genetic variability. The objective of this study was to describe the substitution frequency of the S and N proteins of SARS-CoV-2 in South America. A total of 504 amino acid and nucleotide sequences of the S and N proteins of SARS-CoV-2 from seven South American countries (Argentina, Brazil, Chile, Ecuador, Peru, Uruguay, and Colombia), reported as of June 3, and corresponding to samples collected between March and April 2020, were compared through substitution matrices using the Muscle algorithm in MEGA X.Forty-three sequences from 13 Colombian departments were obtained in this study using the Oxford Nanopore and Illumina MiSeq technologies, following the amplicon-based ARTIC network protocol. The substitutions D614G in S and R203K/G204R in N were the most frequent in South America, observed in 83% and 34% of the sequences respectively. Strikingly, genomes with the conserved position D614 were almost completely replaced by genomes with the G614 substitution between March to April, 2020. A similar replacement pattern was observed with R203K/G204R although more marked in Chile, Argentina and Brazil, suggesting similar introduction history and/or control strategies of SARS-CoV-2 in these countries. It is necessary to continue with the genomic surveillance of S and N proteins during the SARS-CoV-2 pandemic as this information can be useful for developing vaccines, therapeutics and diagnostic tests.HighlightsThe spike and nucleocapsid proteins of SARS-CoV-2 circulating in Colombia and South-American countries have similar patterns of non-synonymous substitutionsSubstitutions D614G in Spike and R203K-G204R in Nucleocapsid are the most frequent in Colombia and South-American countriesThe identification of genetic variability of SARS-CoV-2, is useful for vaccines, diagnostic test, and therapeutic designs.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was funded by the National Institute of Health, Bogota, ColombiaAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This work was developed according to the national law 9/1979, decrees 786/1990 and 2323/2006, which establishes that the Instituto Nacional de Salud (INS) from Colombia is the reference lab and health authority of the national network of laboratories and in cases of public health emergency or those in which scientific research for public health purposes as required. The INS is authorized to use the biological material for research purposes, without informed consent, which includes the anonymous disclosure of results. This study was performed following the ethical standards of the Declaration of Helsinki 1964 and its later amendments. The information used for this study comes from secondary sources of data that were previously anonymized and do protect patient data.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe authors confirm that the data supporting the findings of this study are available within the article [and/or] its supplementary material. Raw data were generated at Instituto Nacional de Salud - Unidad de Seguenciacion Analisis Genomico The data that support the findings of this study are available on request from the corresponding author upon reasonable request. https://www.gisaid.org/