Abstract
Background SARS-CoV-2 has spread rapidly across the globe during the first several months of 2020, resulting in a pandemic. Substantial, non-discriminatory limitations have been imposed on air travel to inhibit further spread of the virus. However, as disease prevalence and incidence decreases, more specific control measures will be sought so that commercial air travel can operate yet not impose a high threat of COVID-19 resurgence. We considered the risk posed by different locations to initiate a resurgence of COVID-19 at such times.
Methods We use modelled global air travel data for October (just before a second wave of COVID-19 might be expected) and population density to analyse the risk posed by 1364 airports to initiate a COVID-19 outbreak. We use a probabilistic, branching-process based approach that considers the volume of air travelers between airports and the R0 of each location, scaled by population density. This exercise is performed globally as well as specifically for two potentially vulnerable locations: Africa and India.
Results We show that globally, many of the airports posing the highest risk are in China and India. An outbreak of COVID-19 in Africa is most likely to originate in a passenger travelling from Europe. On the other hand, the highest risk to India is from domestic travellers. Our results are robust to changes in the underlying epidemiological assumptions.
Conclusions Variation in flight volumes and destinations creates a non-uniform distribution of the risk different airports pose to resurgence of a COVID-19 outbreak. We suggest the method presented here as a tool for the estimation of this risk. Our method can be used to inform efficient allocation of resources, such as tests identifying infected passengers, so that they could be differentially deployed in various locations.
Introduction
The coronavirus disease 2019 (COVID-19), caused by the Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has been spreading rapidly since early cases were detected In Wuhan, China during December 2019 [1]. On March 11, the WHO declared the COVID-19 outbreak a pandemic [2]; i.e. it is now deemed to be on a global scale, with significant autochthonous transmission in different countries. The implications of such a declaration can have is that tackling the COVID-19 pandemic might require international coordination and collaboration [3]. Up to date, COVID-19 cases have been observed in over 100 territories, in all inhabited continents [4]. Such a pervasive spread is partly a consequence of the innate transmissibility of the virus, which is expressed by the relatively high basic reproduction number point estimates ranging from 1.4 to as high as 6.5 (with uncertainty bounds providing an even wider range) [5], [6]. However, an additional prominent reason for the especially wide and rapid spread of COVID-19 is likely to be the current high prevalence of international travel.
Although the spread of COVID-19 through maritime transport has received substantial media coverage due to the Diamond Princess and other cruise ships [7], most of the imported cases outside China have likely originated from air travel [8]. These have spurred the applications of various outbreak control measures regarding air travel, including quarantining individuals returning via air travel from specific countries, or regardless of their departure locations [9]. Consequently, many flights have been cancelled, causing sharp decreases in flight volumes [8]. Such outbreak control measures are non-selective, and are hence costly, inconvenient and substantially hamper general international travel. As such, they might be resorted to mostly when other epidemic control options are not available. A less restrictive possibility to control the COVID-19 spread is to concentrate efforts of detection and isolation of cases to specific locations. Risk assessments of scenarios of COVID-19 spread to new locations, or retrospective analysis or importation to certain locations, have been previously performed [10]–[14]. These studies mainly focused on the introduction of COVID-19 into highly vulnerable locations, often with public health infrastructures that may not respond efficiently to contain an outbreak. On the other hand, locations can also be determined as dangerous by their likelihood to disseminate infected individuals and initiate novel or reoccurring COVID-19 outbreaks in other locations. Investing additional resources in individual screening at locations defined as dangerous in this sense might therefore prove efficient in preventing spillover from countries which do not completely cancel international flights. In addition to being currently relevant, such a strategy might also be important when SARS-CoV-2 is suppressed to low-frequency or even undetectable circulation. Under this scenario, narrower public health interventions would be needed, and increased focus would be pointed at preventing new or recurrent outbreaks.
In this study, we develop and examine a method for discriminating airports with a high potential of initiating a COVID-19 outbreak upon the introduction of an infected passenger. We apply this method to modelled flight data, and examine the risk of a new outbreak initiation globally, as well as in two regions where the effects of a sustained COVID-19 outbreak may be especially devastating: Africa and India. Our results are analysed and found robust to various disease outbreak modelling assumptions as well as to seasonal changes in flight patterns.
Methods
Data
We approximated monthly data of airport connectivity by the model presented in [15]. Briefly, this data is obtained by applying a Poisson regression model for the number of passengers on covariates such as city connectivity, purchasing power index [16], population and weather. This process has yielded information regarding n = 1364 major airports scattered throughout the world. The analyses presented in the text rely on air travel during October. This month was chosen as it approximates the beginning of major seasonal respiratory diseases [17] and is hypothesized to be the potential time of resurgence of COVID-19 [18]. However, similar results are obtained for different months as well (see supplementary material Figure S1). Continent and country information for each airport were retrieved from [19].
Human population density data was retrieved from NASA [20]. For each airport i = 1, … n an associated density, ρi was calculated. First, the maximal population density within a 130km edged square centered at each airport was found. Then, an average of a 30km edged square centered at said location of maximal density was calculated. This heuristic of density estimation is meant to approximate the density of a major nearby location to which an infected passenger is likely to arrive. For example, our heuristic method found the maximal density location of JFK, EWR and LGA (three major airports serving New York City) to be located in downtown Manhattan. Similarly, the two airports serving Berlin (TXL, SXF) and two airports serving Tokyo (NRT, HND) were correctly assigned the same maximal population density in their corresponding metropolitans.
Outbreak probability
Briefly, we assume a low prevalence of COVID-19 in each country, and use the number of passengers transferring from and to different airports to calculate the probability of arrival of infected individuals to each destination; in each destination, we scale R by the local density; calculate the risk of an outbreak using a branching process-based method; and, finally, combine these estimates to yield the risk each airport poses for a COVID-19 outbreak initiation. More details follow.
We used population density estimates to adjust the reproduction number, the expected number of cases generated by a single infected individual in the population, with the rationale being that individual contact rates scale approximately proportional to population density [21], which is consistent with another COVID-19 reproductive number estimate [22]. The reproduction number for region i, Ri, was calculated as Ri = Rw • ρi /ρw,where ρi is the density of region i, ρw is the estimated density associated with Wuhan Tianhe International Airport, and Rw is the estimated reproductive number in Wuhan. The value of Rw jwas set to 3 in all the results in the manuscript, as per recent estimates of the basic reproduction number [5]. A sensitivity analysis for Rw values of 2 and 4 was conducted to reflect the variance in the estimates and potentially lower values in previously exposed populations, and yielded similar results (see supplementary material Figure S3).
After obtaining the basic reproductive number for each location i, we can calculate the probability of no outbreak in that location based on the arrival of a single infected individual, pi, according to the formula derived from a branching process model [23] pi = 1/Ri. We note that this formula remains valid when the population has previously been exposed to the virus - the basic reproduction number can be simply replaced with the effective reproduction number. Such cases are covered by the sensitivity analysis performed on lower basic reproductive number values. Furthermore, this formula is correct for an exponential infectious period, which is often implicitly assumed in infectious diseases dynamic models [24]. Nonetheless, we performed a sensitivity analysis on the general case of a gamma distributed infectious period, with the appropriate correction [23], [25], while varying the shape parameter from the exponential case k = 1 to a a high value of k = 6 (see supplementary material Figure S2 for details). Results were qualitatively similar. In addition, we have performed a sensitivity analysis for the existence of superspreaders [26]. The derived formula for pi in this case is more complex (see derivation in supplementary material), but our results remained robust to this generalization as well (see supplementary material Figure S4).
To calculate the risk each airport poses to an initiation of a COVID-19 outbreak, we assume a fraction q of the population is infected. We assume that q is small when our method is used to mitigate risk of outbreak initiation or resurgence, probably in the order of magnitude of 10 −5 − 10 −6, since otherwise control measures including complete flight restriction are likely to be implemented. We denote the number of individuals travelling from airport k to airport i by nk,i. Therefore, the probability of no outbreak initiation from airport k (to any airport it is connected to) is where i goes over the range of airports connected to airport k. Since q is small, this is well approximated by . As raising to a positive power q preserves order (a > b >0 implies aq > bq), we can conclude the relative order of risk between two airports k and l, and πk and πl, does not depend on q (as long as q is small enough to preserve the approximation above). This independence of q enables us to rank the risk of outbreak initiation, 1 − πk, between different airports regardless of the value of q. For ease of presentation, in our results we choose a value of q = 10−6 yielding low numbers of infected individuals.
Lastly, we turn to highlight two low-income regions; Africa and India. For each region, we estimated the probability of an outbreak conditioned on infected individuals arriving in an airport in that region. The calculation of the probability of arrival is identical to the one described above, with the exception of restricting flights only to those destined to said regions.
We give the interested reader full access to our results in the supplementary material, including a table of airport risks corresponding to Figure 1.b (TableS1), a table of airport characteristics (including the associated density, R and probability of outbreak following the introduction of one infected individual - all in TableS2), and a full table of the risks by different airports (TableS3). For reproducibility purposes, we provide complete access to our code on GitHub, with detailed instructions on running it at https://github.com/yairdaon/infections. Code was run using python 3.7.5.
Results
We heuristically examine our method by identifying which places have the highest risk of transmission from Wuhan Tianhe International (WUH) Airport. Reassuringly, focal locations to the COVID-19 outbreak appeared among the top-5 airports most likely to have an outbreak originating in WUH, including Beijing, Shanghai and Shenzhen.
We start our analysis by examining the global risk of outbreak by airport. That is, every airport is assigned a risk of initiating an outbreak in any location, according to the ariport’s flight patterns and the destination’s risk of outbreak. First, we observe the strong connectivity clusters focused in Europe, east Asia, and the United States (Figure 1.a). We further note that risk is high in east Asia with all but two of the top ten highest-risk airports originating in that region (the exceptions being London Heathrow and Dubai airport; Figure 1). Calculating the average risk of the ten highest-risk airports in each continent yields the following risk order: Asia (0.51), Europe (0.28), North America (0.23), South America (0.14), Africa (0.06), and Oceania (0.04).
The estimates of risks in our results greatly vary between airports. For example, the risk estimated for Beijing Capital airport - the airport with the highest risk to initiate an outbreak - is approximately 0.74. On the other hand, Shanghai Hongqiao airport, ranked 17th, has a risk estimate which is lower by a factor of 2. The variability is even more extreme when we consider only the risk to initiate an outbreak in Africa. This risk is halved between the top ranked airport (Johannesburg, 0.1) to the fifth (Abuja, 0.05). When considering the risk of initiating an outbreak in India, it is halved between New Delhi airport (0.42) and the fourth ranked Hyderabad airport (0.19).
Since the locations most vulnerable to be severely affected by a COVID-19 outbreak are low- and middle-income countries [13], we turned our focus to examine the risk of initiating an outbreak in two locations of high interest: Africa and India. Africa is relatively disconnected from the rest of the world. Its strongest connections to locations outside the continent include connection to central Europe, with a relatively strong connectivity to France (mainly through Morocco), England, and the Arab peninsula (Figure 2.a). The risk posed by different airports to initiate a COVID-19 outbreak in Africa is highest at Johannesburg, Paris, Dubai, London, Abuja (Nigeria), Jedda (Saudi Arabia), Frankfurt, Amsterdam, Cairo and Istanbul (Figure 2.b). Contrary to Africa, in India we do observe the largest risk to initiate an outbreak arising from domestic flights, originating within India itself. Of the ten highest-risk airports with respect to India, eight are within the borders of India (the exceptions being airports in Dubai and Singapore). The same trend continues in the following twenty highest-risk airports - fifteen of the top twenty are in India. We further investigated the seasonal changes in the connectivity between airports. These did not yield substantial differences (see supplementary material Figure S1).
Discussion
Our study highlights the importance of quantifying the risk of COVID-19 outbreaks through the introduction of COVID-19 cases from international air travel. We show that (a) East Asia is the area most prone to initiating an outbreak; (b) West Europe is the largest threat to outbreak initiation in Africa; and (c) India’s population density and frequent domestic travel put it at high risk for outbreaks from infected passengers arriving from within the country. Our work complements the results of a recent study, showing that the spread of COVID-19 in Europe closely followed air travel patterns and that the severe travel restrictions implemented there resulted in substantial decreases in the disease’s spread [27]. The mentioned study has demonstrated that such restrictions often came into effect too late after the reporting of COVID-19 cases. In accordance, our study emphasizes that implementation of strict epidemiological control measures to prevent the spread of COVID-19 is likely not to be feasible or timely. We suggest that an additional layer of control measures, such as thorough screening of infected hosts, should be differentially implemented based on an airport’s risk to initiate a COVID-19 outbreak. Unfortunately, rapid, non-laboratory based methods of COVID-19 infection detection are still lacking, and screening relies on manifestation of symptoms (e.g. body temperature). Hence, individuals with asymptomatic or paucisymptomatic infections may still slip through even high coverage rates of screening at airports, until newer methods of screening are developed. Current estimates for the frequency of such individuals and their infectivity are still uncertain [28], [29], but high frequency of such individuals may regardless force the introduction of new control measures [26], [30].
Furthermore, we compare the risks of two locations potentially vulnerable to the effects of a COVID-19 outbreak - Africa and India. We observe that western European cities are considerably more prone to initiate an outbreak in Africa than cities within Africa itself. In India, the opposite is true - the cities most prone to initiate an outbreak in India are within the borders of the country. This contrast is surprising, considering that India and Africa are similar in terms of economic development and population size. One reason for this might be India’s high inner connectivity and population density, which makes outbreaks likely in many Indian cities (see supplementary material Figure S5).
The transmission of various viral pathogens is affected by seasonal changes in temperature and humidity [31]. Unfortunately, such effects have yet to be accurately estimated for SARS-COV-2, although some initial investigations in this direction have been made recently [32]–[34]. Hence, our study did not consider the potential effects that weather can have on the estimated risk. However, if such changes will be estimated in the future, they could be easily incorporated in our framework, by scaling the different R values according to the estimated effects of weather conditions in each location, and recalculating the risk. This could potentially change some of the risk estimates we present, especially of regions mostly connected to either the Northern or Southern hemisphere. In such regions, the differences in weather are expected to be the greatest in different seasons, and these are indeed often expressed in different seasonality patterns of infectious diseases [35].
Application of the risk estimation we perform, and other assessments relying on air travel data, can be utilized in real-time only if flight travel data is publicly available and easily accessible. Unfortunately, we were not able to obtain such data, and our model relied on previously modelled estimates (see Methods). Therefore, we emphasize the need in recent, or even live air travel data, and its importance to facilitate research and improved decision making for international COVID-19 outbreak control. Such data are crucial for real time management of crises of a scale we are currently observing.
To conclude, our method provides an estimate of the risk to initiate a COVID-19 outbreak posed by travel from different airports. It combines air travel volume with destination population density, and does not rely on assumptions on COVID-19 incidence, as long as it is not very high. We note that as much as connectivity between different locations is important, it is only one component determining the risk of an outbreak. Our risk measure combines connectivity weighted by the probability to initiate an outbreak in a non-linear fashion, giving rise to a realistic risk measure for outbreaks. Our results demonstrate the heterogeneity in the risk of outbreak initiation on both global, as well as more local scales. As the COVID-19 pandemic will subside, full air travel shutdown will be deemed costly and at times infeasible. Therefore, we suggest future control and screening measures to be applied differentially, according to the estimated risks different locations pose to outbreak initiation.
Data Availability
All data used are available in the supplementary material and in a github repository.
Funding
This work was supported by The Raymond and Beverly Sackler Post-Doctoral Scholarship (YD) and by a Junior Research Fellowship from Christ Church, Oxford (RNT).