PT - JOURNAL ARTICLE AU - Chan, Felicia AU - Ataide, Ricardo AU - Richards, Jack S. AU - Narh, Charles A. TI - Contrasting epidemiology and population genetics of COVID-19 infections defined with 74 polymorphic loci in SARS-CoV-2 genomes sampled globally AID - 10.1101/2021.04.25.21255897 DP - 2021 Jan 01 TA - medRxiv PG - 2021.04.25.21255897 4099 - http://medrxiv.org/content/early/2021/04/26/2021.04.25.21255897.short 4100 - http://medrxiv.org/content/early/2021/04/26/2021.04.25.21255897.full AB - SARS-CoV-2, the coronavirus causing COVID-19, has infected and killed several millions of people worldwide. Since the first COVID-19 outbreak in December 2019, SARS-CoV-2 has evolved with a few genetic variants associated with higher infectivity. We aimed to identify polymorphic loci in SARS-CoV-2 that can be used to define and monitor the viral epidemiology and population genetics in different geographical regions. Between December 2019 and September 2020, we sampled 5,959 SARS-CoV-2 genomes. More than 80% of the genomes sampled in Africa, Asia, Europe, North America, Oceania and South America were reportedly isolated from clinical infections in older patients, ≥ 20 years. We used the first indexed genome (NC_045512.2) as a reference and constructed multilocus genotypes (MLGs) for each sampled genome based on amino acids detected at 74 polymorphic loci located in ORF1ab, ORF3a, ORF8, matrix (M), nucleocapsid (N) and spike (S) genes. Eight of the 74 loci were informative in estimating the risk of carrying infections with mutant alleles among different age groups, gender and geographical regions. Four mutant alleles - ORF1ab L4715, S G614, and N K203 and R204 reached 90% prevalence globally, coinciding with peaks in transmission but not COVID-19 severity, from March to August 2020. During this period, the MLG genetic diversity was moderate in Asia, Oceania and North America; in contrast to Africa, Europe and South America, where lower genetic diversity and absence of linkage disequilibrium indicated clonal SARS-CoV-2 transmission. Despite close relatedness to Asian MLGs, MLGs in the global population were genetically differentiated by geographic region, suggesting structure in SARS-CoV-2 populations. Our findings demonstrate the utility of the 74 loci as a genetic tool to study and monitor SARS-CoV-2 transmission dynamics and evolution, which can inform future control interventions.Competing Interest StatementThe authors have declared no competing interest.Funding StatementFunding This work was partly supported by the National Health and Medical Research Council (NHMRC) of Australia [APP1161076 to JSR] and the British Society for Antimicrobial Chemotherapy [BSAC-COVID-64 to CAN & JSR]. Burnet Institute received funding from the NHMRC Independent Research Institutes Infrastructure Support Scheme, and the Victorian State Government Operational Infrastructure Support Scheme. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Publicly available data was utilised for this study hence no IRB approval was sought.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll relevant data are contained in the manuscript and the supplementary data