Positive correlation between long term emission of several air pollutants and COVID-19 deaths in Sweden ======================================================================================================= * Lars Helander ## Abstract Several recent studies have found troubling links between air pollution and both incidence and mortality of COVID-19, the pandemic disease caused by the virus SARS-CoV-2. Here, we investigate whether such a link can be found also in Sweden, a country with low population density and a relatively good air quality in general, with low background levels of important pollutants such as PM2.5 and NO2. The investigation is carried out by relating normalized emission levels of several air pollutants to normalized COVID-19 deaths at the municipality level, after applying a sieve function using an empirically determined threshold value to filter out noise. We find a fairly strong correlation for PM2.5, PM10 and SO2, and a moderate one for NOx. We find no correlation neither for CO, nor (as expected) for CO2. Our results are statistically significant and the calculations are simple and easily verifiable. Since the study considers only emission levels of air pollutants and not measurements of air quality, climatic and meteorological factors (such as average wind speeds) can trivially be ruled out as confounders. Finally, we also show that although there are small positive correlations between population density and COVID-19 deaths in the studied municipalities (which are for the most part rural and non densely populated) they are either weak or not statistically significant. ## 1 Introduction Air pollution is a global health concern and numerous studies have found links between air pollution and both respiratory and chronic diseases [1, 2, 3, 4]. Since the start of the COVID-19 pandemic, several studies have found links between air pollution and COVID-19 incidence and mortality [5, 6, 7, 8, 9]. The present study aims at determining whether there is a link between air pollution and COVID-19 deaths at the municipality level in Sweden. We propose that Sweden is a suitable candidate for this kind of ecological study, due to its long tradition of high quality statistics coupled with the fact that the COVID-19 pandemic has not been met here with severe governmental mitigation measures such as lockdown or school closures or other coercive measures such as obligations to wear face masks. Also, regional responses to the pandemic have not varied much throughout the country, thus encouraging the disease to run its natural course and reach all corners of the country. The method of the present study, which will be detailed in Section 2, consists in a first step of relating normalized emission levels to normalized COVID-19 deaths at the municipality level. Normalization refers here to relating the figures to the number of inhabitants of each municipality. A positive correlation thus means that municipalities with relatively high emissions and few inhabitants will account for relatively more COVID-19 deaths than those with lower emissions or more inhabitants. In Sweden, municipalities with high emissions and few inhabitants are traditionally called “bruksorter” (mill towns), and are centered around one or several industries (paper mill, steel mill, factory, mine, etc.) playing a major role in the lives of the inhabitants. The industries are required by law to report yearly emission levels of hazardous substances to a public register. In a second step, we introduce a threshold value *t**q* for each pollutant *q*. Municipalities with normalized emissions of *q* below the threshold value *t**q* are excluded from the calculation of the correlation coefficient. The rationale for using a threshold is to remove noise, as COVID-19 death figures in municipalities with low emissions are likely to be driven by other factors than those emissions. Municipalities with high emissions and many inhabitants (such as big cities) may thus fall under the threshold value *t**q* even though the emissions very well may have an impact on COVID-19 mortality. However, excluding those cases should not lower the relevance of the study’s results. ## 2 Method Emission [10], census [11] and population density [11] data at the municipality level were retrieved directly from Statistics Sweden (SCB), the Swedish government statistics agency with a history dating back to before 1749. COVID-19 deaths registered up to November 30, 2020 [12] at the municipality level were retrieved directly from the Swedish National Board of Health and Welfare (Socialstyrelsen), another government agency. No data for municipalities with 1-3 COVID-19 deaths is given in [12] due to privacy concerns (in this case an ‘X’ is given in the table instead). For those municipalities, we assume for simplicity that there are 2 COVID-19 deaths. Now, let *q* denote the pollutant under study, with *q* ∈ {PM2.5, PM10, SO2, NOx, CO, CO2}. Let *M* denote the set of municipalities in Sweden (|*M* | = 290). For each *m* ∈ *M*, let ![Graphic][1] denote the average emission of *q* in *m* for the years 2008 to 2018 (inclusive), let *p**m* denote the number of inhabitants of *m* for the year 2019 and let *c**m* denote the number of COVID-19 deaths in *m* up to November 30, 2020. The normalized emissions ![Graphic][2] of *q* in *m* is calculated according to: ![Formula][3] The normalized COVID-19 deaths *d**m* in *m* is calculated according to: ![Formula][4] The following threshold values *t**q* for each type of pollutant has been identified empirically using the available data. (No correlation has been discernible for CO or CO2 for any threshold): We now introduce the sets *S**q* per pollutant, representing our samples, with the following definition (using the sieve function ![Graphic][5] to filter out noise): ![Formula][6] For each pollutant *q* we compute Pearson’s correlation coefficient *r* and *p*-value of our sample *S**q* and a 95% confidence interval in the standard way using [13]. To determine any possible relationship between population density and COVID-19 deaths, we let *ρ**m* denote the number of inhabitants per square kilometer in the municipality *m* and we define the set *P* and sets *P**q* for each pollutant *q* as follows: ![Formula][7] ![Formula][8] As a comparison, we also define the set ![Graphic][9] below, which corresponds to the set of municipalities with a population density above 200 inhabitants per square kilometer. The threshold of 200 has been empirically determined, in the same way as the thresholds for the pollutants. ![Formula][10] We then compute Pearson’s correlation coefficient and 95% confidence interval in the same way as before for all these sets (using [13]). ## 3 Results Below in Table 2 follows the main results of our study. For scatter plots, please refer to Appendix A. View this table: [Table 1:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/T1) Table 1: Table of empirically determined threshold values for each pollutant. View this table: [Table 2:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/T2) Table 2: Pearson’s correlation coefficient and 95% confidence interval for the set *S**q* for each pollutant *q*, representing the relationship between emission levels of *q* and COVID-19 deaths, as defined in Section 2. In Table 3 below, we present the correlations between population density and COVID-19 deaths in the different sets of municipalities. View this table: [Table 3:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/T3) Table 3: Pearson’s correlation coefficient and 95% confidence interval for the set *P* and sets *P**q* for each pollutant *q*, representing the relationship between population density and COVID-19 deaths, as defined in Section 2. ![Graphic][11] corresponds to the set of municipalities with a population density above 200 inhabitants per square kilometer. ## 4 Conclusions After applying a sieve function to filter our noise, we have proved statistically significant correlations between the air pollutants PM2.5, PM10, SO2 and NOx and COVID-19 deaths at the municipality level in Sweden. The correlations for the three first substances are fairly strong, and for the last the correlation is more moderate. Our approach is new in the sense that we consider only emission levels of pollutants and not measurements of air quality, which means that we can easily exclude climate and weather (such as wind speeds) as confounding factors. We have also shown that the correlations between population density and COVID-19 deaths in the studied municipalities are either weak or not statistically significant, which means that the observed correlations with air pollution are not due to population density. When considering the country as a whole, there is a weak, but positive, correlation between population density and COVID-19 deaths. If we restrict ourselves to the municipalities with a population density above 200 inhabitants per square kilometer, the correlation becomes more pronounced. ### Limitations A limitation to this ecological study is that we cannot quantify to what extent toxic substances emitted in a municipality are actually inhaled by its inhabitants and thus to what extent they play a relevant role in the observed correlations. As a future improvement, we would like to incorporate meteorological factors at the municipality level, in particular average wind velocities, into our calculations as a means to modelize the speed of dispersal of the polluting substances. Also, we would like to remind the reader that we have only proved a correlation between data in statistical tables, not any causal relationship between air pollution and COVID-19 deaths. Nevertheless, we hope that this novel approach can be of value to the scientific community and to policymakers, as another small piece in the puzzle of the relationship between air pollution and disease burden in our societies. ## Data Availability All data is freely available on the Internet. [https://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_BE\_\_BE0101\_\_BE0101C/BefArealTathetKon/](https://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_BE\__BE0101__BE0101C/BefArealTathetKon/) [https://www.socialstyrelsen.se/globalassets/1-globalt/covid-19-statistik/statistik-over-antal-avlidna-i-covid-19/statistik-covid19-avlidna.xlsx](https://www.socialstyrelsen.se/globalassets/1-globalt/covid-19-statistik/statistik-over-antal-avlidna-i-covid-19/statistik-covid19-avlidna.xlsx) ## A Scatter Plots ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F1) Figure 1: Emissions of PM2.5 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F2) Figure 2: Emissions of PM2.5 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality with emissions *> t*PM2.5. The included municipalities are Arjeplog, Askersund, Bengtsfors, Dorotea, Gotland, Grums, Gällivare, Härjedalen, Kalix, Karlshamn, Kiruna, Kramfors, Kristinehamn, Lindesberg, Lysekil, Mönsterås, Norsjö, Oxelosund, Piteå, Ragunda, Robertsfors, Rättvik, Sorsele, Storuman, Tjörn, Vilhelmina, Vindeln, Ydre, Åsele, Ä lvkarleby, Örnskoldsvik, Överkalix. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F3) Figure 3: Emissions of PM10 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F4) Figure 4: Emissions of PM10 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality with emissions *> t*PM10. The included municipalities are Askersund, Bengtsfors, Berg, Dorotea, Gotland, Grums, Gällivare, Kalix, Kiruna, Kramfors, Kristinehamn, Lindesberg, Lysekil, Mönsterås, Norsjö, Oxelösund, Piteå, Ragunda, Sorsele, Storuman, Tjörn, Vilhelmina, Vindeln, Ydre, Älvkarleby, Örnskoldsvik. ![Figure 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F5) Figure 5: Emissions of SO2 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality. ![Figure 6:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F6.medium.gif) [Figure 6:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F6) Figure 6: Emissions of SO2 (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality with emissions ![Graphic][12]. The included municipalities are Askersund, Bengtsfors, Bromölla, Gotland, Grums, Gällivare, Göteborg, Götene, Hammarö, Kalix, Karlshamn, Kiruna, Lidköping, Luleå, Mönsterås, Oxelösund, Piteå, Rattvik, Skellefteå, Stenungsund, Söderhamn, Timrå, Tjörn, Tranemo, Trelleborg, Älvkarleby, Örnskoldsvik. ![Figure 7:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F7.medium.gif) [Figure 7:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F7) Figure 7: Emissions of CO (tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality. *No correlation found*. ![Figure 8:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F8.medium.gif) [Figure 8:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F8) Figure 8: Emissions of CO2 (1000 tonnes per 10000 inhabitants) related to COVID-19 deaths. One point per municipality. *No correlation found*. ![Figure 9:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/12/09/2020.12.05.20244418/F9.medium.gif) [Figure 9:](http://medrxiv.org/content/early/2020/12/09/2020.12.05.20244418/F9) Figure 9: Population density (inhabitants per square kilometer) related to COVID-19 deaths. One point per municipality. * Received December 5, 2020. * Revision received December 5, 2020. * Accepted December 9, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/) ## References 1. [1].Kim D, Chen Z, Zhou LF, Huang SX. Air pollutants and early origins of respiratory diseases., Chronic Dis Transl Med. 2018 Jun 7;4(2):75–94. doi: 10.1016/j.cdtm.2018.03.003. PMID: 29988883; PMCID: PMC6033955. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cdtm.2018.03.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29988883&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.05.20244418.atom) 2. [2].Romieu I, Samet JM, Smith KR, Bruce N. Outdoor air pollution and acute respiratory infections among children in developing countries. J Occup Environ Med. 2002 Jul;44(7):640–9. doi: 10.1097/00043764-200207000-00010. PMID: 12134528. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00043764-200207000-00010&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12134528&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.05.20244418.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000176845300009&link_type=ISI) 3. [3].Chen H, Goldberg MS. The effects of outdoor air pollution on chronic illnesses., Mcgill J Med. 2009 Jan;12(1):58-64. PMID: 19753290; PMCID: PMC2687917. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19753290&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.05.20244418.atom) 4. [4].Chau, TT., Wang, KY. An association between air pollution and daily most frequently visits of eighteen outpatient diseases in an industrial city. Sci Rep 10, 2321 (2020). [https://doi.org/10.1038/s41598-020-58721-0](https://doi.org/10.1038/s41598-020-58721-0) 5. [5].Wu X, Nethery RC, Sabath BM, Braun D, Dominici F. Exposure to air pollution and COVID-19 mortality in the United States: A nationwide cross-sectional study. medRxiv [Preprint]., 2020 Apr 7:2020.04.05.20054502. doi: 10.1101/2020.04.05.20054502. Update in: Sci Adv. 2020 Nov 4;6(45): PMID: 32511651; PMCID: PMC7277007. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4wNC4wNS4yMDA1NDUwMnYyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMTIvMDkvMjAyMC4xMi4wNS4yMDI0NDQxOC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 6. [6].Coker, E.S., Cavalli, L., Fabrizi, E. et al. The Effects of Air Pollution on COVID-19 Related Mortality in Northern Italy. Environ Resource Econ 76, 611634 (2020). [https://doi.org/10.1007/s10640-020-00486-1](https://doi.org/10.1007/s10640-020-00486-1) 7. [7].Cole MA, Ozgen C, Strobl E. Air Pollution Exposure and Covid-19 in Dutch Municipalities. Environ Resour Econ (Dordr). 2020 Aug 4:1–30. doi: 10.1007/s10640-020-00491-4. Epub ahead of print. PMID: 32836849; PMCID: PMC7399597. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10640-020-00491-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32836849&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F12%2F09%2F2020.12.05.20244418.atom) 8. [8]. Andrea Pozzer, Francesca Dominici, Andy Haines, Christian Witt, Thomas Mnzel, Jos Lelieveld. Regional and global contributions of air pollution to risk of death from COVID-19, Cardiovascular Research, Volume 116, Issue 14, 1 December 2020, Pages 22472253, 9. [9]. Bo Pieter, Johannes Andree. Incidence of COVID-19 and Connections with Air Pollution Exposure: Evidence from the Netherlands, medRxiv 2020.04.27.20081562; doi: [https://doi.org/10.1101/2020.04.27.20081562](https://doi.org/10.1101/2020.04.27.20081562) 10. [10].SCB, Air emissions by region (municipality, LAU2) and subject. Year 2008 - 2018, [http://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_MI\_\_MI1301\_\_MI1301B/UtslappKommun/](http://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_MI\__MI1301__MI1301B/UtslappKommun/). 11. [11].SCB, Population density per sq. km, population and land area by region and sex. Year 1991 - 2019, [https://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_BE\_\_BE0101\_\_BE0101C/BefArealTathetKon/](https://www.statistikdatabasen.scb.se/pxweb/en/ssd/START\_\_BE\__BE0101__BE0101C/BefArealTathetKon/). 12. [12].Socialstyrelsen, Statistics on number of COVID-19 deaths, [https://www.socialstyrelsen.se/globalassets/1-globalt/covid-19-statistik/statistik-over-antal-avlidna-i-covid-19/statistik-covid19-avlidna.xlsx](https://www.socialstyrelsen.se/globalassets/1-globalt/covid-19-statistik/statistik-over-antal-avlidna-i-covid-19/statistik-covid19-avlidna.xlsx) 13. [13]. Zhiya Zuo, Calculate Pearson Correlation Confidence Interval in Python, [https://zhiyzuo.github.io/Pearson-Correlation-CI-in-Python/](https://zhiyzuo.github.io/Pearson-Correlation-CI-in-Python/) March 31, 2018. [1]: /embed/inline-graphic-1.gif [2]: /embed/inline-graphic-2.gif [3]: /embed/graphic-1.gif [4]: /embed/graphic-2.gif [5]: /embed/inline-graphic-3.gif [6]: /embed/graphic-3.gif [7]: /embed/graphic-4.gif [8]: /embed/graphic-5.gif [9]: /embed/inline-graphic-4.gif [10]: /embed/graphic-6.gif [11]: T3/embed/inline-graphic-5.gif [12]: F6/embed/inline-graphic-6.gif