PT - JOURNAL ARTICLE AU - Corcoran, Derek AU - Urban, Mark C. AU - Wegrzyn, Jill AU - Merow, Cory TI - Virus evolution affected early COVID-19 spread AID - 10.1101/2020.09.29.20202416 DP - 2020 Jan 01 TA - medRxiv PG - 2020.09.29.20202416 4099 - http://medrxiv.org/content/early/2020/09/30/2020.09.29.20202416.short 4100 - http://medrxiv.org/content/early/2020/09/30/2020.09.29.20202416.full AB - As the SARS-Cov-2 virus spreads around the world afflicting millions of people, it has undergone divergent genetic mutations. Although most of these mutations are expected to be inconsequential, some mutations in the spike protein structure have been hypothesized to affect the critical stage at which the virus invades human cells, which could affect transmission probability and disease expression. If true, then we expect an increased growth rate of reported COVID-19 cases in regions dominated by viruses with these altered proteins. We modeled early global infection dynamics based on clade assignment along with other demographic and meteorological factors previously found to be important. Clade, but not variant D614G which has been associated with increased viral load, enhanced our ability to describe early COVID-19 growth dynamics. Including clade identity in models significantly improved predictions over earlier work based only on weather and demographic variables. In particular, higher proportions of clade 19A and 19B were negatively correlated with COVID-19 growth rate, whereas higher proportions of 20A and 20C were positively correlated with growth rate. A strong interaction between the prevalence of clade 20C and relative humidity suggests that the impact of clade identity might be more important when coupled with certain weather conditions. In particular, 20C an 20A generate the highest growth rates when coupled with low humidity. Projections based on data through April 2020 suggest that, without intervention, COVID-19 has the potential to grow more quickly in regions dominated by the 20A and 20C clades, including most of South and North America.Competing Interest StatementThe authors have declared no competing interest.Funding StatementNational Science Foundation awards HDR-1394790 supported CM and DEB-1555876 and the Center of Biological Risk supported MCU. The University of Connecticut's Vice Provost of Research supported DC, MCU, and CM. We thank the Center for Systems Science and Engineering at Johns Hopkins University for providing publicly available COVID-19 data.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesTo be added to a github repository once the paper is accepted