Global-scale analysis and longitudinal assessment of COVID-19 incidence in the first six months =============================================================================================== * Sujoy Ghosh * Saikat Sinha Roy ## Abstract Studies examining factors responsible for COVID-19 incidence have mostly focused at the national or sub-national level. Here we undertake an analysis of COVID-19 cases at the global scale to identify key factors associated with disease incidence. A regression modeling framework was used to identify key variables associated with COVID-19 incidence, and to assess longitudinal trends in reported incidence at the country-level. New COVID-19 case dynamics in response to lockdowns was characterized via cluster analysis. Eleven variables were found to be independently associated with COVID-19 infections (p<1e-05) and a 4-variable model adequately explained global variations in COVID-19 cases (p<0.01). COVID-19 case trajectories for most countries followed the log-logistic curve. Six predominant country clusters summarized the differences in individual country’s response to lockdowns. Globally, economic and meteorological factors are important determinants of COVID-19 incidence. Analysis of longitudinal trends and lockdown effects on COVID-19 caseloads further highlights important nuances in country-specific responses to the pandemic. These findings on the first six months of the pandemic has important implications for additional phases of the disease currently underway in many countries. ## Introduction It has been almost 16 months since the first reports of the current COVID-19 pandemic was reported from mainland China in late 2019(1). First manifesting as an acute respiratory illness from infection with a zoonotically derived novel coronavirus SARS-CoV-2, COVID-19 rapidly spread to over 200 countries worldwide, and currently infects over 72 million people with devastating impacts on public health and global economic activity(2). COVID-19is also associated with significant morbidity and mortality, resulting in over1.6 million reported deaths worldwide (as of Dec 15, 2020). Large data collection efforts are underway for understanding the etiology of COVID-19, both for identifying biological predictors of susceptibility and outcome, as well as understanding the disease mechanisms impacted by viral infection. Insights into the epidemiological landscape of disease transmission, and effects of public policy interventions are equally important to help authorities respond more effectively to the epidemic and identify the most vulnerable communities. Along these lines, some important findings have already been reported with respect to the effects of contact tracing and travel restrictions on COVID-19 spread(3, 4), as well as the evolving epidemiology and transmission dynamics of disease(4, 5). Together, research into the biology, pathophysiology and epidemiology of SARS-CoV2 and COVID-19 disease has exploded in the recent scientific literature resulting in a large amount of new information. However, important gaps in our understanding with respect to the nature and scope of interactions between the virus and its environments still remain(6). The global response to COVID-19 has evolved over its time course, with several countries imposing, and often re-imposing restrictions (lockdowns) in response to infection burden and economic pressure. The impact from early contributing factors has also possibly changed over the course of the pandemic, with secondary factors (e.g. lockdown sequelae) becoming more important later. In this paper, we have restricted our examination to the association of potential early factors of COVID-19 transmission, and also investigated the global response to the pandemic over the first six months ending June 20, 2020. A large body of pre-existing literature makes it clear that virus transmission is the result of an interaction among several factors, including host behavior and defense mechanisms, virus infectivity, population density and environmental determinants(7). Previous studies on respiratory disorders have also emphasized the prevalent role of meteorological parameters on virus transmission and infectivity (8, 9). On the other hand, a retrospective analysis of government responses to epidemics and pandemics over the last century suggests that governments vary considerably in their adoption of non-pharmaceutical interventions such as quarantine, social distancing and contact tracing to stem the tide of public health burdens (10). Continuing along the lines of these prior reports, we have investigated the possible roles that selected, pre-existing demographic, health, geographic and economic factors may have played in determining the burden of COVID-19 infection globally, by examining their impact in countries with reliable COVID-19 infection data. Additionally, we have characterized country-specific time-courses of confirmed COVID-19caseload trajectories, and explored the diversity of infection trajectories via optimal model fits. Lastly, we have explored patterns in the differential influence of government-imposed lockdowns on the trajectories of new cases observed in those countries. While the first part of our investigation focuses on global factors influencing SARS-Cov-2 transmission, the later investigations seek to characterize country-level variations, both in the time course of COVID-19 transmission and in viral caseload dynamics surrounding the lockdown periods. ## Methods ### Data collection Data for COVID-19 confirmed cases was obtained from [https://ourworldindata.org/coronavirus-source-data](https://ourworldindata.org/coronavirus-source-data), which is updated daily and based on data on confirmed cases and deaths from Johns Hopkins University. Data on additional demographic, geographic health or economic variables were downloaded from a variety of sources listed in **Table 1**. For each variable, we utilized the latest possible data with the most country coverage in our analysis. Variables were categorized as belonging to Demographic, Geographic, Health or Economic domains. View this table: [Table 1:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/T1) Table 1: Sources for demographic, geographic, health and economic data utilized in the current analysis ### Statistical Analysis We assessed longitudinal trends in the rise in COVID-19 cases in each country by considering them as growth curves and fitting the number of confirmed COVID-19 infections (expressed as a fraction of daily cases to the maximum number of cases) using linear (quadratic) and nonlinear (exponential, logistic, log-logistic, and Gompertz) regression models. The modeling equations are given as below: #### Logistic ![Graphic][1], where α, β, γ, and δ are 4 estimable parameters representing the maximum asymptote (α), minimum asymptote (β), S-curve inflection point (γ) and Hill coefficient (δ), respectively.. #### Log-logistic ![Graphic][2], where all four parameters have the same meaning as for logistic regression. #### Gompertz *f*(*x*) = *β* + (*α* − *β*)*exp* (−*exp*(*γ*(*x* − *δ*))), where β is the lower asymptote, α is the upper asymptote, γ is the growth-rate coefficient and δ is the time at inflection. #### Exponential *f*(*x*) = *α* + (*β* − *α*)*exp*(−*x* | *γ*), a 3-parameter model where α is the lower asymptote, β is the upper asymptote and γ is the steepness of the decay curve #### Quadratic *f*(*x*) = *α* + *β*1*x* + *β*2*x*2, where α is the value of f(x) at x=0, and β 1 and β 2 are the polynomial regression coefficients. A 4-parameter model was found to be optimum for logistic, log-logistic, and Gompertz fitted data. For each country, non-nested models were compared using the AIC criterion, with the model with the lowest AIC being selected for that country. These analyses were conducted via the *drc*(12), *aomisc (*[https://rdrr.io/github/OnofriAndreaPG/aomisc/](https://rdrr.io/github/OnofriAndreaPG/aomisc/)*)* or *tidyverse* packages in R. The *drc* and *aomisc* packages were used for their advantage of employing self-starter functions for calculating initial values for nonlinear regression models, based on numerical optimization algorithms (13). As we generated models on all countries simultaneously, it was considered judicious to use the data-guided self-starter functions in these packages rather than having the user guess the initial parameters for each model for each country separately. To identify the effect of the ‘lockdown’ period on new COVID-19 case trajectories in a country-specific manner, we obtained data on lockdown dates from [https://auravision.ai/COVID-19-lockdown-tracker/](https://auravision.ai/COVID-19-lockdown-tracker/), as well as internet-based reports from individual searches (**Supplementary Table 1**), considering data until June 30, 2020. Countries that had either not imposed, or imposed but not withdrawn their lockdown by June 30 were excluded from the analysis (e.g. PER, BLR, NPL, etc.), resulting in a final list of 106 countries with documented lockdown start and end dates. For countries with multiple lockdown dates (e.g. USA, China), the most common value (mode) of the lockdown start and end dates was taken to be representative for that country. The beginning and end of lockdown period was then overlaid on plots showing the number of daily new confirmed COVID-19 cases versus time. Countries were characterized on a custom five-point metric including percent changes in daily COVID-19 cases at the beginning, end and during lockdown, as well at early (5 days) and later (14 days) timepoints post-lockdown. Bivariate linear regression analysis was conducted by examining the association of each demographic, geographic, health or economic variable (independent variable) to the total number of confirmed COVID-19 cases (dependent variable, log10 transformed). A subset of the independent variables was log transformed. Regression modeling was performed via the *tidyverse* package in R ([www.tidyverse.org](http://www.tidyverse.org)). In addition to the bivariate analysis, we carried out variable subset selection in order to identify a parsimonious set of predictors forCOVID-19 incidence. Models including all variables were first compared and optimal sub-models, containing a combination of selected variables, were identified based on the Akaike Information Criterion (AIC). These analyses were conducted using the ‘lmSubsets’ package in R(11), based on newly developed theoretical strategies for the ‘all-subset regression’ problem. The variables selected in the optimized models were then included in a multivariable linear regression model to assess their relative contributions to COVID-19 cases. ## Results ### Longitudinal trends in COVID-19 associations by country We analyzed the temporal patterns of increases in confirmed COVID-19 cases at the country level. For each country, daily COVID-19 case data was obtained from the day of the first reported infection (Day 1 in the plots) until June 10, 2020. As the number of total COVID-19 cases varied widely between countries, we expressed the country-level increases in COVID-19 infections on a particular day as a proportion of the total number of cases on that day to the maximum number of cases observed for that country (June 10 data), essentially scaling the data between 0-1 for each country. Out of a total of 210 countries with available data, 38 countries with a maximum COVID-19 case load of less than 100 were excluded from the analysis. We further excluded Benin (BEN) because of an anomaly in its cumulative daily reported COVID-19 data which increased and then decreased over time. This resulted in a final list of 171 countries for longitudinal analysis of confirmed COVID-19 case patterns. For each country, the trajectory of total COVID-19 cases over time was examined via regression analysis, including both linear and non-linear regression models. The fits obtained with the various models were then compared using the AIC criterion and the model with the lowest AIC was selected as optimal for that country (**Supplementary Table 2**). The longitudinal trends results show that the selected model fits the data for individual country well. From the 5 models considered, the COVID-19 trajectory for the majority of countries was best explained by the log-logistic model (70 countries), followed by logistic (44 countries) and Gompertz models (41 countries), whereas fewer countries were optimally explained by the quadratic (9 countries) and exponential models (6 countries). **Figure 1** shows representative countries with optimal fits from the 5 modeling approaches (optimal model fits for all countries shown in **Supplementary Figure 1)**. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/06/2021.03.31.21254739/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/F1) Figure 1: Association of selected variables with total COVID-19 cases in May 2020. Each plot show the change in total COVID-19 cases per million population (expressed in log10 units) on the y-axis and the relevant variables on the x-axis. The line of best fit is shown along with its equation, the coefficient of determination (R2) and the associated significance of the regression analysis. Some selected countries with very high or very low COVID-19 cases are annotated by their ISO codes ### Association of individual variables to COVID-19 cases Linear regression modeling of the logarithm of confirmed total COVID-19 reported cases for each country (normalized to per million population) against selected demographic, geographic, economic and health related indicators identified several variables as being significantly associated to the COVID-19 cases. To test the robustness of these findings, we analyzed data representing total COVID-19 infections at 3 different time points approximately 1 month apart (April 10, May 11 and June 10).**Table 2** shows the results of bivariate analysis for all 24 variables tested across the 3 time points. A total of 11 variables including employments in the agriculture, service and industrial sectors, percent population residing in urban areas, ages between 15-64 years and over 65 years, number of visitors, and temperatures in the months of Jan-Apr were found to be significant across all 3 time points tested (p<1e-05), with the coefficient of determination ranging from 0.2-0.49 for these regressors (May 11 data). Regression plots of the top 6 most significantly associated variables are shown for the May 11 data in **Figure 2**(plots for all 24 variables available in **Supplementary Figure 2**). ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/06/2021.03.31.21254739/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/F2) Figure 2: Multivariable regression analysis of variables associated with COVID-19 cases. (a) results from all-subsets regression analysis to identify the best sub-model with a smaller list of variables, based on minimization of the AIC. Selected variables are highlighted in red (in addition to the intercept). The y-axis refers to the size (number of variables) in each sub-model, and x-axis lists all the variables tested. (b) change in AIC scores depending on the number of variables. View this table: [Table 2:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/T2) Table 2: Results of bivariate regression analysis of demographic, geographic, health and economic determinants of COVID-19 incidence, tested across 3 time points. ### Multivariable regression modeling of COVID-19 association We used multivariable linear regression to identify a parsimonious subset of variables that can jointly explain the variation in the number of confirmed COVID-19 cases across countries. An all-subsets regression analysis was undertaken using variables with p<0.01 in their respective bivariate analyses (15 variables), resulting in a series of sub-modelsconsisting of different subsets of the variables included in the analysis. Data from 131 countries was finally available for modeling, after removing missing data. From these models, the model with the lowest AIC score was selected to be the most parsimonious. This analysis identified a model with 5 variables (percent urban population, percent employed in agriculture, population density, percent population aged between 15-64 yrs,and temperature in March) as the most parsimonious (p<0.05) with respect to the global incidence of confirmed COVID-19 cases (p<0.01) for May 11 data (adj r2=0.68) (**Figure 3a**,**b**). In this figure, a total of 15 submodels were generated containing between 2-16 regressors (including intercept). Variables consistently appearing in sub-models of multiple sizes are weighted more than less frequently appearing variables. Notably, the population age related variable was not significant after adjusting for other variables **(Table 3)**. View this table: [Table 3:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/T3) Table 3: Statistical summary of multivariable regression analysis. Statistical estimates of top variables after subset selection and minimization of the AIC are reported ![Fig 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/06/2021.03.31.21254739/F3.medium.gif) [Fig 3:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/F3) Fig 3: Analysis of the time-course of increase in COVID-19 total cases by country, using different growth-curve models. For each plot, the actual number of COVID-19 cases are shown as open circles and the fitted curve is shown in red. The y-axis refers to the proportion of daily total cases to the maximum total cases recorded in the time interval studied (0-1 scaling), and the x-axis refers to the time-course in days. The best growth-curve model for each country was determined by minimization of the AIC. Two exemplar countries for each model-type are shown with model names listed at the top. Countries are indicated by their ISO codes. ### Effect of lockdown on new COVID-19 cases As the majority of the countries adopted some measure of restriction (lockdown) to reduce the incidence of COVID-19 infection, and also removed such restriction (partially or entirely) after a certain periodof time, we were interested in determining the patterns by which the daily new cases of COVID-19 infections were affected due to the lockdown. Countries which had imposed and relaxed lockdowns by June 10, 2020 were considered, whereas data on total COVID-19 cases were considered until June 30, 2020 to identify post-lockdown trends.We considered 5 different criteria to characterize a country’s response to the lockdown - (a) percent change in the number of daily cases at the beginning and end of lockdown, (b) the presence of a peak in the number of daily cases within the lockdown period, (c) percent change in the number of daily cases 5 days after lifting of lockdown (early post-lockdown effects), (d) percent change in the number of daily cases 14 days after lifting of lockdown (later post-lockdown effects), and (e) percent change between day 5 and day 14 post-lockdown. The percent change values were then thresholded as follows: For (a), a >20% change was indicated as 1, a <-20% change was indicated as -1 and a change between -20% to 20% was indicated as 0. For (c, d, e), changes >10% were indicated by 1, changes <-10% were indicated by -1 and changes between -10% to 10% were indicated by 0 (**Supplementary Table 3**). The thresholded dataset was then subjected to Euclidean hierarchical clustering and the results visualized on a dendrogram (**Fig 4**). The dendrogram was colored by cutting the tree at 6 branches. Representative plots for each of the six major clusters are shown alongside the dendrogram (lockdown plots for all countries are shown in **Supplementary Figure 3**). ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/06/2021.03.31.21254739/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2021/04/06/2021.03.31.21254739/F4) Figure 4: Characterization of new COVID-19 cases at the beginning and close of lockdowns. Countries were characterized on a five-point heuristic based on new COVID-19 cases prior to, during, at the end of, and 5-days and 14-days post lockdown, and subjected to hierarchical clustering. Dendrogram and associated heatmap shows six major clusters. Time-courses of new COVID-19 cases are shown for an exemplar country from each cluster, with the lockdown start and end days indicated by the two vertical bars in each plot. Heatmap is color-coded by the assigned values of the five-point criteria (−1=skyblue, 0=ivory, 1=coral). ## Discussion Despite a large body of research, uncertainties remain regarding the importance of environmental factors and their roles in COVID-19 transmission(14). On the one hand, epidemiological and laboratory studies have identified ambient temperature to be a critical factor in the survival and transmission of other coronaviruses such as MERS-CoV and SARS-Cov-1 (15), and climate components including temperature, rainfall and wind speed have been postulated as biological catalysts for human-COVID-19 interactions in independent studies from several locations worldwide(9, 16, 17). However, results obtained from these studies have not conclusively resolved whether weather condition plays a key role in SARS-CoV-2 transmission (18). More specifically, results are conflicting regarding the association between COVID-19 infection and the effect of temperature(19). For example, Shi et al. (20) reported a slightly lower epidemic intensity of COVID-19 in the Jan-Feb 2020 timeframe in China following higher temperature days with a relative risk of 0.96 (95% CI: 0.93, 0.99). In contrast, Xie and Zhu(21) reported mean temperatures in Jan-Feb 2020 to have a positive linear relationship with COVID-19 cases in China with a 3°C threshold, and no further reduction of COVID-19 case countsat warmer temperatures.Another study (22) involving COVID-19 cases in Japan observed a positive association between low February temperatures and increased risk of COVID-19 infection. A study between March-April 2020 in Spain reported a daily incidence rate reduction by 7.5% for every 1°C increase in maximum temperature (18), whereas another study conducted between Feb-March 2020 in the Iberian peninsula failed to find a relationship between temperature and daily incidence of COVID-19(23). Several factors can contribute to the observed discrepancies, including differences in outcome measures (counts of confirmed cases, new cases, or total cases or cumulative incidence rate), or weak correlations between temperature and COVID-19 propagation (19). These conflicting reports highlight the need for further investigations of COVID-19’s weather dependency in different regions or countries or cities to refine our current understanding about its transmission. Compared to these published reports that focus on limited geographic regions, we examined global trends between confirmed COVID-19 cases and temperatures, by considering country-specific confirmed COVID-19 cases against their recorded monthly temperatures. In order to maximize temperature information for as many countries as possible, we relied on the most recent data available (2016). Our results from both bivariate and multivariable analysis generally agree with a negative association of confirmed COVID-19 cases with temperatures, especially in the months of March and April. In addition to the effects of temperature, our analysis of global COVID-19 incidence also indicates a significant negative correlation with markers of increased economic activity (e.g. percent urban population, employment in industrial and service sectors), possibly reflecting the consequences of increased congregation and socialization in the population (24). This finding agrees with similar associations observed during the spread of other viral outbreaks with economic booms and trade expansions (25), such as an increased incidence of influenza associated with increases in employment (26). These findings feed into the larger observation of the relationships between economic activity and population healthmediated byincreased interactions between populations not otherwise exposed to each other’s disease ecology (e.g. business and leisure visitors), and also dense permanent settlements around areas of high industrialization. Historically, both of these relationships have been found to negatively impact health of the populations exposed (27). Overall, our analysis supports this trend. Multivariable regression modeling with variable subset selection further affirmed that a mixture of economic, demographic and geographic variables was adequate for explaining the variation in total COVID cases at a global level. The analysis of time course trajectories of COVID-19 incidence showed important differences among the countries examined. While the log-logistic and logistic models were adequate in modeling the COVID-19 trajectories for the majority of countries, there were nations whose SARS-Cov-2incidence patterns were better modeled by exponential or quadratic fits. Such country-specific differences are probably the result of a combination of factors including natural elements (e.g. meteorology), socioeconomic regulators (e.g. urbanization), as well as governmental interventions (e.g. quarantines). Finally, we investigated the viral spread trajectories in additional detail by overlaying information on government-induced restrictions (“lockdown”) on the time course curves and estimating their effects on new COVID-19 caseincidence. In the absence of vaccines or effective pharmaceuticals, the majority of governments necessarily adopted some policy interventions to mitigate the spread of the disease. Generally, two fundamental mitigation strategies have been advanced for COVID-19, one focused on reducing, if not necessarily preventing, the virus spread (majority of Western countries), and the other enforcing more drastic measures to suppress and contain virus spread (e.g. China, Singapore, South Korea). However, as COVID-19 assumed pandemic proportions, mitigating strategies by necessity had to become more stringent in order to flatten the curve of virus transmission. Contact suppression through lockdown and enhanced social distancing measures emerged as the foremost administrative defense strategy in countering COVID-19 spread in almost all countries as a way to reduce mortality, preserve health-care service capacity, and buy time to develop a pandemic control responsesystem post-lockdown.However, socioeconomic pressures also necessitate that lockdowns be relaxed or lifted, if temporarily, to prevent economic collapse. How such imposition and lifting of mandatory lockdowns affects COVID-19 caseloads is important for understanding the effectiveness of large-scale quarantine efforts. Effectively administered lockdowns are expected to successfully reduce the virus reproduction number, but premature emergence from a lockdown may lead to epidemic rebounding in still susceptible populations(28). Our analysis of the country-level COVID-19 incidence around lockdown imposition and relaxation periods, displayed a wide divergence in viral incidence patterns. Hierarchical clustering allowed us to classify the responses into six main clusters depending on how the COVID-19 case numbers fluctuated before, during, and immediately after lockdowns. We found that countries such as Australia used lockdowns effectively to bring down the viral case-load to near zero levels well within the lockdown period, and kept it low post-lockdown, whereas another cluster represented by France for example, achieved near zero case-loads only as the lockdown was lifted. In contrast, countries such as India continued to see a steady rise in case numbers during, as well as after lockdown relaxation, probably due to premature timing of lockdown initiation. The timing of a lockdown relative to the stage of the pandemic appears to be an important factor in SARS-CoV-2 transmission patterns, as also reported elsewhere (29). Overall, our results provide empirical data on a global level that are consistent with some of the published modeling assumptions regarding the effects of lockdown on virus spread (30), and should prove useful for future policymaking. We should also note that our analysis was restricted only to the first lockdown imposed by each country until June 20, 2020, and some countries have gone into multiple waves of lockdowns since. Our analysis contributes to the existing literature in three aspects. First, we examine key economic, meteorological, geographic and health determinants of the spread of COVID-19 at a global level. Second, our study models the trajectory of SARS-CoV-2 spread at a country level and specifies important differences in the time-course of virus transmission around the world. Third, our study contributes to the assessments of public health measures such as lockdowns aiming at reducing virus transmissions and mortality, and also demonstrates salient differences in countries’ experiences with new virus infections around such restrictions. These findings assume importance especially in the context of second or later waves of COVID 19 infection across many countries. ## Supporting information Supplementary Tables and Figures [[supplements/254739_file02.pdf]](pending:yes) ## Data Availability All data referred to in the manuscript is publicly available. It can also be obtained by sending an email to sujoy.ghosh@duke-nus.edu.sg ## Author Bio Dr. Sujoy Ghosh is Associate Professor of Computational Biology at Duke-NUS Medical School, Singapore. His primary research interests include genetic epidemiology of cardio-metabolic disesases and biostatistical analysis of high-dimensional datasets. Dr. Saikat Sinha Roy is a Professor of Economics at Jadavpur University, Kolkata, India. His primary research interests include Applied Trade and Development, Open Economy Macroeconomics, Trade Modelling, and Applied Economics and Applied Econometric Modelling. ## Acknowledgment The study was funded by the National Medical Research Council and Ministry of Health, Singapore. * Received March 31, 2021. * Revision received March 31, 2021. * Accepted April 6, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/) ## References 1. 1.Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020 Feb 15;395(10223):497–506. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)30183-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 2. 2.Kumar A, Misra S, Verma V, Vishwakarma RK, Kamal VK, Nath M, et al. Global impact of environmental temperature and BCG vaccination coverage on the transmissibility and fatality rate of COVID-19. PLoS One. 2020;15(10):e0240710. 3. 3.Sun K, Viboud C. Impact of contact tracing on SARS-CoV-2 transmission. Lancet Infect Dis. 2020 Aug;20(8):876–7. 4. 4.Zhang J, Litvinova M, Liang Y, Wang Y, Wang W, Zhao S, et al. Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science. 2020 Jun 26;368(6498):1481–6. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzNjgvNjQ5OC8xNDgxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDQvMDYvMjAyMS4wMy4zMS4yMTI1NDczOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 5. 5.Sun K, Chen J, Viboud C. Early epidemiological analysis of the coronavirus disease 2019 outbreak based on crowdsourced data: a population-level observational study. Lancet Digit Health. 2020 Apr;2(4):e201–e8. 6. 6.Hollins D, Kiorpes AL. Evaluating the industrial hygiene, toxicology, and public health aspects of COVID-19. Toxicol Ind Health. 2020 Sep;36(9):605–6. 7. 7.Cory JS. Insect virus transmission: different routes to persistence. Curr Opin Insect Sci. 2015 Apr;8:130–5. 8. 8.Dalziel BD, Kissler S, Gog JR, Viboud C, Bjornstad ON, Metcalf CJE, et al. Urbanization and humidity shape the intensity of influenza epidemics in U.S. cities. Science. 2018 Oct 5;362(6410):75–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjExOiIzNjIvNjQxMC83NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA0LzA2LzIwMjEuMDMuMzEuMjEyNTQ3MzkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 9. 9.Pica N, Bouvier NM. Environmental factors affecting the transmission of respiratory viruses. Curr Opin Virol. 2012 Feb;2(1):90–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.coviro.2011.12.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22440971&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 10. 10.Tognotti E. Lessons from the history of quarantine, from plague to influenza A. Emerg Infect Dis. 2013 Feb;19(2):254–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid1902.120312&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23343512&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 11. 11.Hofmann M, Gatu, C., Kontoghiorghes, EJ., Colubi, A., Zeileis, A. lmSubsets:Exact Variable-Subset Selection in Linear Regression for R. Journal of Statistical Software. 2020;93(3):1–21. 12. 12.Szabelska A, Siatkowski M, Goszczurna T, Zyprych-Walczak J. Comparison of growth models in package R. Nauka Przyroda Technologie. 2010;4:50. 13. 13.Nocedal J, Wright SD. Numerical Optimization. 2 ed: Springer-Verlag New York; 2006. 14. 14.Neher RA, Dyrdak R, Druelle V, Hodcroft EB, Albert J. Potential impact of seasonal forcing on a SARS-CoV-2 pandemic. Swiss Med Wkly. 2020 Mar 9;150:w20224. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4414/smw.2020.20224&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32176808&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 15. 15.Casanova LM, Jeon S, Rutala WA, Weber DJ, Sobsey MD. Effects of air temperature and relative humidity on coronavirus survival on surfaces. Appl Environ Microbiol. 2010 May;76(9):2712–7. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYWVtIjtzOjU6InJlc2lkIjtzOjk6Ijc2LzkvMjcxMiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA0LzA2LzIwMjEuMDMuMzEuMjEyNTQ3MzkuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 16. 16.Sajadi MM, Habibzadeh P, Vintzileos A, Shokouhi S, Miralles-Wilhelm F, Amoroso A. Temperature, Humidity, and Latitude Analysis to Estimate Potential Spread and Seasonality of Coronavirus Disease 2019 (COVID-19). JAMA Netw Open. 2020 Jun 1;3(6):e2011834. 17. 17.Pani SK, Lin NH, RavindraBabu S. Association of COVID-19 pandemic with meteorological parameters over Singapore. Sci Total Environ. 2020 Oct 20;740:140112. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.scitotenv.2020.140112&link_type=DOI) 18. 18.Tobias A, Molina T. Is temperature reducing the transmission of COVID-19 ? Environ Res. 2020 Jul;186:109553. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.envres.2020.109553&link_type=DOI) 19. 19.Yuan S, Jiang SC, Li ZL. Do Humidity and Temperature Impact the Spread of the Novel Coronavirus? Front Public Health. 2020;8:240. 20. 20.Shi P, Dong Y, Yan H, Zhao C, Li X, Liu W, et al. Impact of temperature on the dynamics of the COVID-19 outbreak in China. Sci Total Environ. 2020 Aug 1;728:138890. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.scitotenv.2020.138890&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32339844&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 21. 21.Xie J, Zhu Y. Association between ambient temperature and COVID-19 infection in 122 cities from China. Sci Total Environ. 2020 Jul 1;724:138201. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 22. 22.Ujiie M, Tsuzuki S, Ohmagari N. Effect of temperature on the infectivity of COVID-19. Int J Infect Dis. 2020 Jun;95:301–3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2020.04.068&link_type=DOI) 23. 23.Briz-Redon A, Serrano-Aroca A. A spatio-temporal analysis for exploring the effect of temperature on COVID-19 early evolution in Spain. Sci Total Environ. 2020 Aug 1;728:138811. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=doi.org/10.1016/j.scitotenv.2020.138811&link_type=DOI) 24. 24.Yang F, Heemsbergen L, Fordyce R. Comparative analysis of China’s Health Code, Australia’s COVIDSafe and New Zealand’s COVID Tracer Surveillance Apps: a new corona of public health governmentality? Media International Australia. 2020;():1329878×20968277. 25. 25.Adda J. Economic activity and the spread of viral diseases: evidence from high frequency data. The Quarterly Journal of Economics. 2016;131(2):891–941. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/qje/qjw005&link_type=DOI) 26. 26.Markowitz S, Nesson E, Robinson JJ. The effects of employment on influenza rates. Econ Hum Biol. 2019 Aug;34:286–95. 27. 27.Szreter S. Industrialization and health. Br Med Bull. 2004;69:75–86. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/bmb/ldh005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15226198&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222402100007&link_type=ISI) 28. 28.Leung K, Wu JT, Liu D, Leung GM. First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment. Lancet. 2020 Apr 25;395(10233):1382–93. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)30746-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32277878&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F04%2F06%2F2021.03.31.21254739.atom) 29. 29.Siqueira C, Freitas YNL, Cancela MC, Carvalho M, Oliveras-Fabregas A, de Souza DLB. The effect of lockdown on the outcomes of COVID-19 in Spain: An ecological study. PLoS One. 2020;15(7):e0236779. 30. 30.Dickens BL, Koo JR, Lim JT, Park M, Quaye S, Sun H, et al. Modelling lockdown and exit strategies for COVID-19 in Singapore. The Lancet Regional Health Western Pacific. 2020;1:100004. [1]: /embed/inline-graphic-1.gif [2]: /embed/inline-graphic-2.gif