Disparities in Case Frequency and Mortality of Coronavirus Disease 2019 (COVID-19) Among Various States in the United States ============================================================================================================================ * Rohit S. Loomba * Gaurav Aggarwal * Saurabh Aggarwal * Saul Flores * Enrique G. Villarreal * Juan S. Farias * Carl J. Lavie ## Abstract **Objective** To utilize publicly reported, state-level data to identify factors associated with the frequency of cases, tests, and mortality in the US. **Materials & Methods** Retrospective study using publicly reported data collected included the number of COVID-19 cases, tests, and mortality from March 14th through April 30th, 2020. Publicly available state-level data was collected which included: demographics comorbidities, state characteristics and environmental factors. Univariate and multivariate regression analyses were performed to identify the significantly associated factors with percent mortality, case and testing frequency. All analyses were state-level analyses and not patient-level analyses. **Results** A total of 1,090,500 COVID-19 cases were reported during the study period. The calculated case and testing frequency were 3,332 and 19,193 per 1,000,000 patients. There were 63,642 deaths during this period which resulted in a mortality of 5.8%. Factors including to but not limited to population density (beta coefficient 7.5, p< 0.01), transportation volume (beta coefficient 0.1, p< 0.01), tourism index (beta coefficient -0.1, p=0.02) and older age (beta coefficient 0.2, p=0.01) are associated with case frequency and percent mortality. **Conclusions** There were wide variations in testing and case frequencies of COVID-19 among different states in the US. States with higher population density had a higher case and testing rate. States with larger population of elderly and higher tourism had a higher mortality. **Key Messages** There were wide variations in testing and case frequencies of COVID-19 among different states in the US. States with higher population density had a higher case and testing rate. States with larger population of elderly and higher tourism had a higher mortality. Keywords * COVID-19 * coronavirus * climate * risk factors * epidemiology ## Introduction The coronavirus disease 2019 (COVID-19) has spread worldwide after its onset in Wuhan, China, reaching over three million cases1. In the United States (US), over seven million tests have been performed with over one million tests resulting positive. This disease has resulted in over 70,000 deaths in the US and a case fatality rate at approximately 6.2%. Recent reports have shown differences in case and fatality rates between nearby localities and thought to be likely due to differences in demographic and socioeconomic factors2. Furthermore, differences in climate, access to healthcare and adherence to social distancing have also been hypothesized to affect these rates3–5. However, factors mediating the differences in case frequency and testing frequency between the states in the US have not been formally assessed. The aim of this study is to use publicly available, state-level data to determine the factors associated with COVID-19 percent mortality and case and testing frequency. ## Materials and Methods This study utilized publicly available, deidentified, state-level data and so no institutional review board approval was required or sought. The study was performed in accordance with Declaration of Helsinki. ### Variable Identification and Data Collection Absolute counts for COVID-19 cases, tests, and mortality were obtained from Worldometer6. Data were collected from March 13th, 2020 through April 30th, 2020. These dates were selected as March 13th was the first day when these data were publicly reported, and April 30th was the last full day prior to data collection. The number of cases and tests were then normalized to the specific state’s population to develop a frequency per 1,000,000 population. Any reference in this manuscript to “case frequency” or “testing frequency” refers to the normalized values in this manner, unless explicitly stated otherwise. Next, data were collected at a state-wide level to help characterize the population, environment, and infrastructure in the state. The data sources are listed in Supplementary file 1. The selected variables were identified by literature review of the factors that impact the frequency and severity of viral illnesses, including COVID-19. The following variables were collected: age, gender, underinsured population, ethnicity, influenza vaccination status, population density (persons per square mile), urban air quality rank (lower number signifying better air quality), drinking water quality rank (lower number signifying better drinking water quality), ultraviolet index, precipitation (in inches), temperature (in degree Fahrenheit), census average household income (in US$), per capita spending on healthcare (in US$) and high school graduation rate. To capture comorbid conditions, the prevalence of obesity, prevalence of smoking, prevalence of cocaine abuse, prevalence of marijuana abuse, alcohol consumption (gallons per person per year), prevalence of asthma, prevalence of diabetes, prevalence of chronic obstructive pulmonary disease, prevalence of myocardial infarction, prevalence of coronary artery disease, prevalence of hypertension, prevalence of hyperlipidemia, and presence of inactivity were collected. To capture immunosuppressed states, the annual incidence of new cancer and HIV cases per 100,000 population were collected. Finally, the social distancing score by global positioning satellite data, public transportation volume, number of incarcerated inmates, number of nursing home residents, and tourism rank (lower number implies more tourists) were collected. Of these, ultraviolet index, temperature, and precipitation were averages for March and April of 2020, whereas the remainder of data were collected from the most recent iteration for each state. Much of the data was collected from government sources, such as the Centers for Disease Control and the complete list of sources is provided The collected data represents state-level and not patient-level data. The endpoints were divided amongst the authors and collected by everyone. The data for each endpoint was then verified by another author who did not primarily collect the data. Finally, values in the top and bottom 10th percentile were identified and verified by a third author. ### Statistical Analyses As the data was collected for each state and intended for state-level analyses, the absolute number of COVID-19 cases, tests, and mortality were converted to a frequency using the state population. The frequencies for all the endpoints were calculated per 1,000,000 population. The case frequencies were then used as the dependent variables in a series of single-independent variable linear regressions to determine the univariate association between case frequency and the other variables previously defined and served as the univariate analyses. Next, a stepwise multivariate regression was conducted with p-value of 0.05 or less required for inclusion into the final model. Of the resulting models, the one with the highest R-squared value was selected as the final model. The same process was repeated for testing frequency and percent mortality as the dependent variable. Collinearity analyses were run with all multivariate regressions. All statistical analyses were done using the user-coded, syntax-based interface of SPSS Version 23.0. A p-value of 0.05 or less was considered was considered statistically significant. The use of the word significant throughout the manuscript refers to “statistically significant” unless explicitly specified otherwise. All statistical analyses were done at the state-level with state-level data. Analyses were not conducted at a patient-level with patient-level data. The subjects here were the 50 states. The age, gender, and comorbidity prevalence are not based on patient-specific data but rather the state prevalence. ## Results ### COVID-19 Cases, Testing, and Mortality A total of 1,090,500 COVID-19 cases were reported in the study period. This resulted in a case frequency of approximately 3,332 per 1,000,000 patients (3.3%). In the same period, 6,299,143 tests were done, resulting in a testing frequency 19,193 per 1,000,000 patients out of which 17.3% were reported positive. There were 63,642 deaths during this period which resulted in a mortality of 5.8%. Figures 1 and 2 demonstrate the case frequency and the percent mortality by state. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/07/30/2020.07.28.20163931/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2020/07/30/2020.07.28.20163931/F1) Figure 1: Cases per million in the United States divides by state ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/07/30/2020.07.28.20163931/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2020/07/30/2020.07.28.20163931/F2) Figure 2: Percent mortality in the United States divided by state ### COVID-19 Case Frequency, Univariate Analyses The following factors were associated with greater case frequency on univariate linear regression analyses: female gender (beta-coefficient 1,095.8, p=0.03), higher population density (beta-coefficient 8.9, p< 0.01), lower ultraviolet index (beta coefficient -825.3, p=0.03), lower prevalence of obesity (beta coefficient -248.3, p=0.03), lower prevalence of uninsured (beta coefficient -298.6, p=0.01), higher frequency of other race (beta coefficient 38,638.7, p=0.01), lower prevalence of current smokers (beta coefficient -294.5, =0.02), higher per capita health care spending (beta coefficient 1.0, p< 0.01), higher public transportation volume (beta coefficient 0.1, p< 0.01), number of residents in nursing home facilities (beta coefficient 0.1, p< 0.01), and lower number for tourism ranking and thus more tourists (beta coefficient -79.8, p=0.01) (Table 1 shows full univariate data and Table 2 shows univariate results). View this table: [Table 1:](http://medrxiv.org/content/early/2020/07/30/2020.07.28.20163931/T1) Table 1: Univariate analyses for factors associated with COVID-19 illness in the United States View this table: [Table 2:](http://medrxiv.org/content/early/2020/07/30/2020.07.28.20163931/T2) Table 2: Associations between demographic factors, comorbidities, and environmental factors on case numbers, test numbers and percent mortality in univariate analysis View this table: [Table 3:](http://medrxiv.org/content/early/2020/07/30/2020.07.28.20163931/T3) Table 3: Associations between demographic factors, comorbidities, and environmental factors on case numbers, test numbers and percent mortality in multivariate analysis ### COVID-19 Case Frequency, Multivariate Analyses The following factors were associated with greater case frequency on multivariable analyses: higher population density (beta coefficient 7.5, p< 0.01) and increased public transportation volume (beta coefficient 0.1, p< 0.01). The R-square for this model was 0.78. Collinearity analyses did not demonstrate any significant collinearity (Table 2 shows multivariate analyses). ### COVID-19 Testing Frequency, Univariate Analyses The following factors were associated with greater testing frequency on univariate linear regression analyses: higher population density (beta coefficient 19.2, p< 0.01), lower prevalence of uninsured (beta coefficient -900.9, p=0.02), higher per capita health care spending (beta coefficient 3.68, p<0.01), and higher public transportation volume (beta coefficient 0.1, p=0.01). ### COVID-19 Testing Frequency, Multivariate Analyses The following factors were associated greater testing frequency on multivariable analyses: higher population density (beta coefficient 19.9, p< 0.01). The R-square for this model was 0.27. Collinearity analyses did not demonstrate any significant collinearity. ### COVID-19 Percent Mortality, Univariate Analyses The following factors were associated with greater percent mortality on univariate linear regression analyses: older age in years (beta coefficient 0.2, p=0.01), population density (beta coefficient 0.1, p=0.02), higher prevalence of asthma (beta coefficient 0.4, p=0.04), lower prevalence of uninsured (beta coefficient -0.1, p=0.02), higher prevalence of cocaine use (beta coefficient 1.0, p=0.01), higher public transportation volume (beta coefficient 0.1, p=0.02), higher number of residents in nursing home facilities (beta coefficient 0.1, p=0.01), and lower number for tourism ranking and thus more tourists (beta coefficient -0.1, p=0.02) ### COVID-19 Percent Mortality, Multivariate Analyses The following factors were associated with greater percent mortality using multivariable analyses: median age in years (beta coefficient 0.2, p=0.01) and tourism ranking (beta coefficient-0.1, p=0.02). The R-square for this model was 0.29. Collinearity analyses did not demonstrate significant collinearity. ### Power Analyses For multivariate regression analyses for case frequency, for which there is a relatively adequate effect size, and two predictors in the model, 31 subjects would be required to achieve 80% power. With 50 states and thus 50 subjects in these analyses, the multivariate analyses for case frequency are adequately powered. For multivariable regression analyses for testing frequency, for which there is a relatively low effect size, and one predictor in the model, 385 subjects would be required to achieve 80% power. With 50 states and thus 50 subjects in these analyses, the multivariable analyses for testing frequency are not adequately powered. For multivariable regression analyses for percent mortality, for which there is relatively low effect size, and two predictors in the model, 478 patients would be required to achieve 80% power. With 50 states and thus 50 subjects in these analyses, the multivariable analyses for percent mortality are not adequately powered. ## Discussion In these analyses, wide variations in case frequency, testing frequency, and percent mortality were observed. These results also identified factors, such as population density, transportation volume, tourism index and older age to be some of the factors affecting the above outcomes. To the authors’ knowledge, this is the first study to incorporate large number of variables to study the differences in COVID-19 transmission among different states in the US. Several laboratory and clinical risk factors have been postulated to predispose patients to symptomatic infection with COVID-19 and related mortality7–9. However, non-clinical factors that affect such outcomes are unknown. After multivariate analysis in this study, a higher population density was found to be associated with higher case and testing frequency. The results also identified increased public transportation to be associated with higher case frequency. Finally, the results identified older age and increased tourism independently related to higher mortality. Since these analyses were based on state-level data, the power for multivariate regression was reduced compared to if the analyses were completed with patient-level data. Hence, the findings of the univariate analyses deserve attention as well here. We found direct relationships between case frequency in a state and female gender, underinsured status, average household income and per capita healthcare spending and inverse relationship with obesity, smoking and UV light exposure. Public transportation was significantly associated with frequency of cases, testing and mortality rates on univariate analysis. However, on multivariate analysis, only frequency of cases remained significantly associated with public transportation. Tourism ranking was also a significant predictor of all 3 endpoints on univariate analysis and remained significantly associated with mortality even after multivariate regression. There is limited data on the nature of healthcare disparities during the COVID-19 pandemic. The cumulative COVID-19 incidence has been reported to be significantly variable among jurisdictions, ranging from 20.6 per 100,000 cases in Minnesota to 915.3 per 100,000 cases in New York City10. The timing of introduction of COVID-19 in the state and the extent of mitigation measures may mediate some of this variation. The age of patients has been shown to be a significant predictor of COVID-19 infection and worse outcomes9,11. The race and gender of patients have also been reported to be associated with a higher case frequency and worse outcomes in patients with COVID-1912,13. It continues to be shown that population density is associated with an increase in transmission and infection for the high risk population14,15. The results of this study showed that the public transport volume was also linked to a higher case rate. Prior studies have demonstrated similar findings with influenza-like illnesses and that its use increases the individual’s risk for acquiring an acute respiratory infection16,17. A simulation model indicated that the high level of subway usage in New York can influence disease spread in an influenza epidemic and that between 4-5% of total infections would occur on subways18. This information is particularly important as recommendations to maintaining strict disinfecting guidelines for public transport along with shelter-in-place whenever possible are established19,20. The results of this study identified that higher tourism is associated with increased mortality. A possible explanation of this identified phenomenon has been published and thought to be related to an influx of infected patients presenting late in the disease course21,22. Furthermore, prior studies have shown that air transportation accelerates viral spreading mainly related to high passenger traffic and risk of surface contamination in airports23,24. Similar findings have been reported in trains and other types of commercial vessels 25,26. Hence, the widespread implementation of travel restrictions, social distancing and lockdowns have become the main preventative interventions to decrease viral spreading during this pandemic. However, in this analysis, social distancing score was not associated with COVID-19 case frequency or mortality. The result from prior studies showed that social distancing is an effective measure at decreasing viral spreading when comprised of quarantines, school closure and workplace distancing27,28. However, our current analysis showed a lack of association between social distancing and COVID-19 case frequency, which may also represent a limitation of the process of measuring the social distancing score. This is particularly important as some countries like Sweden have encountered higher COVID-19 case frequencies after adopting more lenient social distancing measures29. This analysis also showed a lack of impact of climate-related factors on case and testing frequency. These findings require further validation as conflicting reports have been published5,30,31. Surprisingly, in our analysis states with higher number of uninsured patients were found to have lower mortality which could possibly be related to underreporting in such populations as both case numbers and testing numbers were also lower in such states. Finally, most of the comorbidities analyzed were not found to be independently associated with the case mortality in this analysis. This may suggest the complex interplay between demographics, environmental factors and diseases processes32,33. The findings from this analysis also highlight some of the potential limitations of state-level data rather than patient-level data, as previous studies have found a few comorbidities to be associated with increases in case frequency and percent mortality. These analyses offer some early assessment of the factors that may be mediating COVID-19 case frequency, testing frequency, and percent mortality. These analyses present associations using state-level data and not patient-level data. While these analyses offer novel data regarding case frequency, testing frequency, and percent mortality in the US, these analyses are not without their limitations. First, all the study data was captured from publicly available sources which only had data until 2018. The use of state-level data reduced the power of analyses, as we used for the multivariate regression models the number of states as the subjects. Although the data collection carries a risk of bias, this was minimized by utilizing multiple investigators for accuracy of data captured. Due to data unavailability other important outcomes such as case positivity rate could not be assessed. The ecologic design of the paper and use of various data sources with varying methods are other limitations of our study. As the pandemic continues some of the variables selected for this study may be subject to change, particularly as state resources may become overwhelmed. Finally, lack of granularity to county or city level data further limits our interpretations. With these limitations in mind, it is important to frame the intentions of this study appropriately. These analyses are by no means intended to be definitive data but are intended to be exploratory data to help identify variables that should be accounted for in larger, multicenter studies that utilize patient-level data. Factors such as the environmental and local infrastructural characteristics appear to modulate the case frequency and percent mortality and thus could be beneficial to capture in future studies. The data from those variables may assist with the understanding of viral spreading and the pandemic evolution. For instance, the identification of the association between higher tourism volume with higher case frequency and percent mortality may help implement faster travel restrictions for future pandemics. Similarly, the association between public transportation volume and its association with increased case frequency and percent mortality may assist in developing future public response. ## Conclusion This observational analysis of publicly reported state-level data identified factors associated with increased case frequency, testing frequency, and percent mortality for COVID-19. These data can guide future study design and develop risk prediction models. ## Data Availability Deidentified individual participant data will be made available at https://figshare.com/articles/dataset/Case\_Numbers\_and\_Mortality\_in\_US\_Data\_Extraction\_xlsx/12730919 [https://figshare.com/articles/dataset/Case\_Numbers\_and\_Mortality\_in\_US\_Data\_Extraction\_xlsx/12730919](https://figshare.com/articles/dataset/Case\_Numbers\_and\_Mortality\_in\_US_Data_Extraction_xlsx/12730919) ## Disclosure None ## In Memoriam We dedicate this paper to the memory of Luis Carlos Gamboa Chavez (1974-2020), devoted father, son, and friend. ## Figure and table legends **Supplementary File 1**. Data sources ## Acknowledgement None * Received July 28, 2020. * Revision received July 28, 2020. * Accepted July 30, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/) ## References 1. 1.World Health Organization. Coronavirus Disease 2019 (COVID-19) Pandemic. 2020; [https://www.who.int/emergencies/diseases/novel-coronavirus-2019](https://www.who.int/emergencies/diseases/novel-coronavirus-2019). Accessed May 3, 2020. 2. 2.Wadhera RK, Wadhera P, Gaba P, et al. Variation in COVID-19 Hospitalizations and Deaths Across New York City Boroughs. JAMA. 2020. 3. 3.Gómez-Ríos D, Ramirez-Malule D, Ramirez-Malule H. The effect of uncontrolled travelers and social distancing on the spread of novel coronavirus disease (COVID-19) in Colombia. Travel Medicine and Infectious Disease. 2020:101699. 4. 4.Matrajt L, Leung T. Evaluating the Effectiveness of Social Distancing Interventions to Delay or Flatten the Epidemic Curve of Coronavirus Disease. Emerg Infect Dis. 2020;26(8). 5. 5.Sobral MFF, Duarte GB, da Penha Sobral AIG, Marinho MLM, de Souza Melo A. Association between climate variables and global transmission oF SARS-CoV-2. Sci Total Environ. 2020;729:138997. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.scitotenv.2020.138997&link_type=DOI) 6. 6.Worldometer. COVID-19 coronavirus pandemic. 2020; [https://www.worldometers.info/coronavirus/](https://www.worldometers.info/coronavirus/). Accessed May 2, 2020. 7. 7.Aggarwal G, Cheruiyot I, Aggarwal S, et al. Association of Cardiovascular Disease with Coronavirus Disease 2019 (COVID-19) Severity: A Meta-Analysis. Current Problems in Cardiology. 2020:100617. 8. 8.Aggarwal G, Lippi G, Michael Henry B. Cerebrovascular disease is associated with an increased disease severity in patients with Coronavirus Disease 2019 (COVID-19): A pooled analysis of published literature. Int J Stroke. 2020:1747493020921664. 9. 9.Aggarwal S, Garcia-Telles N, Aggarwal G, Lavie C, Lippi G, Henry BM. Clinical features, laboratory characteristics, and outcomes of patients hospitalized with coronavirus disease 2019 (COVID-19): Early report from the United States. Diagnosis (Berl). 2020;7(2):91–96. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1515/dx-2020-0046&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32352401&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 10. 10.CDC COVID-19 Response team. Geographic Differences in COVID-19 Cases, Deaths, and Incidence - United States, February 12-April 7, 2020. MMWR Morb Mortal Wkly Rep. 2020;69(15):465–471. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.15585/mmwr.mm6915e4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 11. 11.Adams ML, Katz DL, Grandpre J. Population-Based Estimates of Chronic Conditions Affecting Risk for Complications from Coronavirus Disease, United States. Emerg Infect Dis. 2020;26(8). 12. 12.Dorn AV, Cooney RE, Sabin ML. COVID-19 exacerbating inequalities in the US. Lancet. 2020;395(10232):1243–1244. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)30893-X&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 13. 13.Gausman J, Langer A. Sex and Gender Disparities in the COVID-19 Pandemic. J Womens Health (Larchmt). 2020;29(4):465–466. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/jwh.2020.8472&link_type=DOI) 14. 14.Tarwater PM, Martin CF. Effects of population density on the spread of disease. Complexity. 2001;6(6):29–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/cplx.10003&link_type=DOI) 15. 15.Dalziel BD, Kissler S, Gog JR, et al. Urbanization and humidity shape the intensity of influenza epidemics in U.S. cities. Science. 2018;362(6410):75–79. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjExOiIzNjIvNjQxMC83NSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA3LzMwLzIwMjAuMDcuMjguMjAxNjM5MzEuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 16. 16.Goscé L, Johansson A. Analysing the link between public transport use and airborne transmission: mobility and contagion in the London underground. Environmental Health. 2018;17(1). 17. 17.Troko J, Myles P, Gibson J, et al. Is public transport a risk factor for acute respiratory infection? BMC Infectious Diseases. 2011;11(1):16. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2334-11-16&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21235795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 18. 18.Cooley P, Brown S, Cajka J, et al. The Role of Subway Travel in an Influenza Epidemic: A New York City Simulation. Journal of Urban Health. 2011;88(5):982–995. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11524-011-9603-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21826584&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 19. 19.Anderson EL, Turnham P, Griffin JR, Clarke CC. Consideration of the Aerosol Transmission for COVID-19 and Public Health. Risk Anal. 2020. 20. 20.Park J. Changes in Subway Ridership in Response to COVID-19 in Seoul, South Korea: Implications for Social Distancing. Cureus. 2020;12(4):e7668. 21. 21.Carletti F, Lalle E, Messina F, Ippolito G, Capobianchi MR. About the origin of the first two Sars-CoV-2 infections in Italy: inference not supported by appropriate sequence analysis. J Med Virol. 2020. 22. 22.Lo IL, Lio CF, Cheong HH, et al. Evaluation of SARS-CoV-2 RNA shedding in clinical specimens and clinical characteristics of 10 patients with COVID-19 in Macau. Int J Biol Sci. 2020;16(10):1698–1707. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7150/ijbs.45357&link_type=DOI) 23. 23.Browne A, Ahmad SS, Beck CR, Nguyen-Van-Tam JS. The roles of transportation and transportation hubs in the propagation of influenza and coronaviruses: a systematic review. Journal of travel medicine. 2016;23(1). 24. 24.Ikonen N, Savolainen-Kopra C, Enstone JE, et al. Deposition of respiratory virus pathogens on frequently touched surfaces at airports. BMC Infectious Diseases. 2018;18(1). 25. 25.Cui F, Luo H, Zhou L, et al. Transmission of pandemic influenza A (H1N1) virus in a train in China. Journal of epidemiology. 2011;21(4):271–277. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2188/jea.JE20100119&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21646746&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 26. 26.Brotherton JM, Delpech VC, Gilbert GL, Hatzi S, Paraskevopoulos PD, McAnulty JM. A large outbreak of influenza A and B on a cruise ship causing widespread morbidity. Epidemiology and infection. 2003;130(2):263–271. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0950268802008166&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12729195&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 27. 27.Chao DL, Halloran ME, Obenchain VJ, Longini IM, Jr.. Flute, a publicly available stochastic influenza epidemic simulation model. PLoS Comput Biol. 2010;6(1):e1000656. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1000656&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20126529&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 28. 28.Fong MW, Gao H, Wong JY, et al. Nonpharmaceutical Measures for Pandemic Influenza in Nonhealthcare Settings-Social Distancing Measures. Emerg Infect Dis. 2020;26(5):976–984. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid2605.190995&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) 29. 29.Erbrink C AC. ‘Life Has to Go On’: How Sweden Has Faced the Virus Without a Lockdown. 2020. 30. 30.Jahangiri M, Jahangiri M, Najafgholipour M. The sensitivity and specificity analyses of ambient temperature and population size on the transmission rate of the novel coronavirus (COVID-19) in different provinces of Iran. Sci Total Environ. 2020;728:138872. 31. 31.Yao Y, Pan J, Liu Z, et al. No Association of COVID-19 transmission with temperature or UV radiation in Chinese cities. Eur Respir J. 2020. 32. 32.Bookman EB, McAllister K, Gillanders E, et al. Gene-environment interplay in common complex diseases: forging an integrative model—recommendations from an NIH workshop. Genet Epidemiol. 2011;35(4):217–225. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21308768&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F30%2F2020.07.28.20163931.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000289375400002&link_type=ISI) 33. 33.McMichael AJ. Population, environment, disease, and survival: past patterns, uncertain futures. The Lancet. 2002;359(9312):1145–1148.