Abstract
Transmission chains within cities provide an important contribution to case burden and economic impact during the ongoing COVID-19 pandemic, and should be a major focus for preventive measures to achieve containment. Here, at very high spatio-temporal resolution, we analysed determinants of SARS-CoV-2 transmission in a medium-sized European city. We combined detailed epidemiological, mobility, and socioeconomic data-sets with whole genome sequencing during the first SARS-CoV-2 wave. Both phylogenetic clustering and compartmental modelling analysis were performed based on the dominating viral variant (B.1-C15324T; 60% of all cases). Here we show that transmissions on the city population level are driven by the socioeconomically weaker and highly mobile groups. Simulated vaccination scenarios showed that vaccination of a third of the population at 90% efficacy prioritising the latter groups would induce a stronger preventive effect compared to vaccinating exclusively senior population groups first. Our analysis accounts for both social interaction and mobility on the basis of molecularly related cases, thereby providing high confidence estimates of the underlying epidemic dynamics that may readily be translatable to other municipal areas.
Introduction
The COVID-19 pandemic has affected the lives of billions of people globally. Efforts to understand the transmission of SARS-CoV-2 have been undertaken at different scales including at global level1–3, across continents (Europe and North America4), and within countries (such as Austria5, Brazil6, France7, Iceland 8, South Africa9, and Thailand10). The ultimate goal of COVID-19 molecular epidemiological studies is to understand and contain the pandemic. In order to reach this global goal, local interventions are effective to cut transmission chains in family, social, and small communities networks11, 12 and work best in well-defined (phylogenetically or epidemiologically) clusters. However, most infections are acquired from unknown sources and transmitted cryptically along the transmission chain and are the real drivers of the pandemic. Hence, it is essential to identify the key determinants and transmission routes at the city-level, which will ultimately guide the planning of vaccination campaigns. For this, spatially and temporally highly resolved and diverse data are required including detailed information on transmission chains, and on determinants such as demographic, socioeconomic, and interaction patterns that influence both population mobility and social interaction behaviour13. Molecular surveillance and epidemiology are indispensable to fully understand determinants driving transmission. However, such data are difficult to obtain. Studies on SARS-CoV-2 that have focused on selected, larger cities (>300k inhabitants) outside of Europe (Beijing14, Boston15, Houston16, and New York City17–19 either lack resolution in terms of sequenced positive cases (incomplete transmission chains), or have a paucity of socioeconomic or mobility data that naturally stratify a city (incomplete determinants). Of those studies describing the distribution of cases together with changes in mobility within cities, none to date rigorously study socioeconomic differences between city quarters as determinants of transmission 16, 18–20. An integrated model considering multiple factors (epidemiological, geographic, mobility, socioeconomic, transmission dynamics information) could add profound insights into the determinants underpinning transmission, rather than solely evaluating transmission dynamics. Understanding these determinants will be important for optimizing the delivery of vaccines to those areas and individuals that drive disease transmission and help to maximize the impact of an initially limited amount of vaccine doses.
In this study, we aim to guide vaccination strategies through the identification of determinants of SARS-CoV-2 transmission within a medium-sized European city. Medium-sized cites of less than 300k inhabitants encompass a considerable amount of the urban population (41%)21 and hence play an important role for the spread of the pandemic. Basel-City (175,350 inhabitants) serves here as a representative example of these. As in most other European cities, urban quarters within BaselCity differ in demographics, socioeconomic factors, housing structure, and mobility. We combine and complement epidemiological, mobility, and socioeconomic datasets with whole genome sequencing (WGS) data – all at very high spatial (statistical blocks: a city block partitioned by e.g. streets, rivers) and temporal (day-by-day) resolution for each case reported within Basel-City during the first COVID-19 wave – and serology data to account for the fraction of unreported cases. Specifically, we employ phylogenetic cluster analysis22 based on transmissions within the dominant virus variant (B.1-C15324T) that was responsible for 60% of the infections in Basel-City23 during the first wave with an ordinary differential equation (ODE) susceptible-exposed-infected-recovered (SEIR) model. This model includes a city-wide mobility network of real-world measured data to identify routes of transmission. Dynamic variations in the social interaction behaviour of the population of Basel were also included to estimate the temporal change in the effective reproduction numbers. The model fit was performed for partitions of the city based on different socioeconomic determinants (living space per person, share of 1-person households, median income, and share of senior residents). Finally, we applied the SEIR-model for the simulation of different mobility reduction and vaccination scenarios within the city, to demonstrate the impact of a targeted vaccination strategy accounting for the identified determinants.
Results
SARS-CoV-2 spread and clustering within the city
Over the time course of the first wave from 25th February, 2020 until 22nd April, 2020, the University Hospital Basel performed 7073 PCR tests (Basel-City residents only) in context of the obligatory screening of all in-coming patients irrespective of the reason for their hospital visit, as well as from an associated voluntary walk-in test center, hence samples include symptomatic (mild and severe) and asymptomatic cases. Of these, 750 were positive for SARS-CoV-2 (10.6% positivity rate) (Figure 1A, B) and only 0.7% of the sequenced cases were nosocomial infections. We successfully sequenced the whole-genomes of 411 samples. Epidemiological (occupation in a health service job, resident of a care home, contact to positive cases, onset of symptoms, place of infection) and demographic (age, address, occupation, income) were gathered for most of these cases.
We inferred the lineage identity24 and observed 29 viral lineages in Basel-City (Figure S8), with 247 genomes (60.0%) belonging to a viral variant that we henceforth refer to as B.1-C15324T23 (Figure 1A, B). Applying a genetic divergence threshold, a total of 128 phylogenetic clusters, and 70 clusters that belong to lineage B.1-C15324T (Figure 1C) were determined. Mapping this data across the 19 city quarters, we found that most within- and among-quarter transmission clusters were spread randomly, except for significant within-quarter transmissions in four quarters: Riehen, Bruderholz, Am Ring, and Iselin. The results indicate that a significant number of clusters are unique to these latter quarters. Only few quarters shared more clusters between each other than expected by chance as indicated by the coloured links (Figure 1D). Breaking up the statistical blocks within the city into tertiles of socioeconomic determinants (1: low, 2: intermediate, 3: high, N/A: no available data or censored for privacy reasons), we observe that the viruses shared among blocks with either the highest living space per person, lowest share of 1-person households, highest median income, or highest seniority are more likely to belong to the same phylogenetic cluster (Figure 1E). This indicates that largely retired or rather affluent people from Riehen, Bruderholz, Am Ring, and Iselin transmitted the virus in their social networks either in the same quarter or in the same socioeconomic rank. In contrast, positive cases that belong to lower socioeconomic tertiles (either lower income, less living space, or younger age) are less likely to be members of the same tight phylogenetic cluster indicating cryptic transmission from and to those cases across all socioeconomic levels and across the city.
Spatio-temporal variation of mobility and social interaction patterns
We studied both the spatial and temporal variation in mobility based on a high-resolution, multi-modal traffic model obtained from the traffic department of Basel-City. All following results are presented for a partition of the set of housing blocks into tertiles of low (T1), medium (T2) and high (T3) median income if not declared otherwise (see supplementary material for other socioeconomic partitions). Figure 2A (and Figures S1-S3) depicts this partition, which serves to compute both the intratertile and inter-tertile mobility from the Basel-City traffic model25, and the corresponding mobility graph. Importantly, the statistical blocks per tertile do not form a single, geographically connected, entity. We observe that mobility varies by transport modality and tertile (Figure 2A inset). For example, within T1 the share of private motorized traffic and mobility in general is more pronounced than in the tertiles of higher median income T2 and T3.
The full mobility matrix serves as input to our SEIR model (see methods, eq. (1-7)) to simulate the contribution of mobility to SARS-CoV-2 transmission. The summed edge weights of the mobility graph account for this mobility contribution and the obtained results are shown in Figure 2B for each partition. We observe that low and median income populations are more mobile than their wealthier counterparts. Moreover, there is little mobility within areas with a low share of 1-person households, a result of the predominantly peripheral location of the relevant statistical blocks (see Figure S5). For living space per person or percentage of senior citizens, mobility was comparable between tertiles with a trend towards higher mobility within the younger population groups.
Dynamic changes in mobility were assessed by agglomerating normalized traffic counts for public and private transport modalities (Figure 2C). There was a clear drop in mobility for both public and private transport modes around the onset of the national lockdown date (12th March, 2020). The decrease was more pronounced for public transport, resulting in a weighted average mobility drop of approximately 50% (Figure 2D). Figure 2D also shows the dynamic change in social interaction contribution to B.1-C15324T case numbers obtained from the estimation of the effective reproductive number using a Kalman filter26, 27. Despite noticeable fluctuation, social interaction contribution decreased on average over time. This data also reflects variation in case reporting which affected the estimated effective reproductive number. Importantly, since the B.1-C15324T was eventually eradicated, a final social interaction contribution of zero was expected. These results, in terms of spatial and temporal variation, serve as input to the following SEIR-model.
Spatio-temporal spread of the epidemic and its socioeconomic determinants
Using a SEIR epidemiological model considering both sequenced and non-sequenced/unreported cases, as well as mobility and social interactions, the dynamic spread of the first wave was fitted for each of the socioeconomic tertiles with the relevant mobility graphs. A bootstrap approach was used to account for stochastic data uncertainty while we accounted for systematic errors due to the fraction of unreported cases by carrying out a survey for serum SARS-CoV-2 specific IgG amongst samples from Basel residents from that time period. Unreported cases appeared to be a driving force of the transmission, as 1.9% of 2,019 residents showed detectable antibody levels, corresponding to 77.2% unreported cases overall, and 87.5% for the sequenced B.1-C15324T strain. Figures 3A-C (Figures S4-S6A-C) show the SEIR-model fit to data for each tertile. The corresponding dynamic change of the effective reproductive number (Reff) is given in Figures 3D-F (Figures S4-S6D-F). Independent of the underlying partition, the model provided adequate fits (RMSE < 4.5) and we observe a drop in Reff following the dynamic changes in mobility and social interaction. Importantly, there was a significant difference (2% achieved significance) in Reff between statistical blocks of the highest and lowest median income. The significance level was here scored based on a comparison to 99 random partitioning of the statistical blocks (see Table S1). For all socioeconomic partitions these differences are summarized as histograms in Figures 3G-J. We found that blocks with a higher median income (2% achieved significance), or higher living space per person (1% achieved significance), or lower share of 1-person households (2% achieved significance) had a significantly lower Reff (< 1.7) relative to the maximum Reff observed in the relevant partition. A partitioning based on the share of senior residents did not result in significant differences in Reff (45% achieved significance). Differences in Reff are due to two factors: the effective mobility contribution (Figure 2B), and the modelled reproductive number (R, eq.(8)). In particular, the tertile with the lowest share of 1-person households (T1) showed less mobility compared to T2 and T3, leading to significant differences in Reff. In contrast, mobility in the T1 and T3 tertiles of living space per person were similar (Figure 2B), yet differences in Reff were significant, indicating that the transmission was not dominated by mobility alone.
Impact of mobility changes and modelling of future vaccination scenarios
We simulated the developments of the first wave of the epidemic under the assumption of different mobility scenarios and modelled two future vaccination strategies. All estimated parameters (including Reff, see methods for details) were employed for this scenario building. Figure 4A (Figures S4-S6G) displays the results for mobility scenarios as observed with up to 50% mobility reduction (scenario MO), 100% mobility (scenario M1), and no mobility (scenario M2). Peak case numbers (April 12th) would have been approximately three times higher in the case of no reduction in mobility (M1). However, the decrease in peak case numbers assuming zero mobility (M2) as modelled, was not as pronounced. Mobility reduction hence played a vital role for the containment of SARS-CoV-2 during the first wave.
To study the optimized delivery of vaccines we used data-driven estimates for number of vaccines and their efficacy (70%28 and 90%29, 30) in the early 2021 period. Scenarios were compared based on absolute case numbers and on the time to reach 50% of intensive care unit (ICU) capacity as quantifiable endpoint to judge the degree of burden on the healthcare system. We model the scenario of a single exposed individual being introduced into the Basel-City population. Figure 4B (Figures S4-S6H) shows the results for an outbreak scenario (denoted as V1) in which a specific fraction of the population (33% or 66%) will be vaccinated at either 70% or 90% efficacy. As expected, we show that higher vaccine efficacy or higher population fraction vaccinated reduces the slope and plateau (i.e. overall reduction of case burden) of the epidemic curve. Based on these results, vaccination of a third of the city’s population at 90% efficacy would cause a decrease in the slope of the epidemic curve resulting in approximately 36 day delay to reach 50% ICU capacity. It should be noted, that vaccination of the population at random is an artificial scenario, applied here only to demonstrate the impact of vaccination efficacy relative to the population fraction vaccinated. This scenario will serve in the following as a baseline comparison for two more realistic vaccination strategies given in Figure 4C, and D (Figures S4-S6I). Figure 4C shows a scenario where selected demographics with lowest income, that had less options to socially distance and hence were more likely to be exposed to and/or transmit the virus (reflected by a higher Reff, scenario V2), were prioritized for vaccination. With this strategy, the slope of the epidemic curve would be reduced compared to randomly vaccinating the same number of subjects from the whole population leading to a substantial further delay of approximately 30 days to reach the 50% ICU capacity mark (see S9 for more detailed illustration regarding the development of ICU occupancy under the different vaccination scenarios), and an overall reduction of the case burden for the whole city of up to 6.9 folds relative to V1.
Figure 4D shows a scenario where priority was given to the population group with the highest share of senior residents (aged > 64), which had lower mobility than the rest of the population (Figure 2B) but constitutes 60% of ICU cases (scenario V3). We observe that scenario V3 resulted in a steeper epidemic curve and would yield 50% ICU capacity at a similar time as a random vaccination strategy. However, the total number of cases at this time would be approximately double in scenario V3 compared to V1.
Discussion
This analysis evaluates complementary aspects of the spread of SARS-CoV-2 within a medium-sized (<500k residents) city, including local transmission analysed by phylogenetic tree inference and clustering, and the overall spread described by a compartmental SEIR-model. We harness these rich and detailed data sets for the optimization of vaccination strategies within a city. The main strength of this study lies within the high degree of diverse and detailed data available, which included on a per case basis, whole genome sequencing of the virus, the residential address, socioeconomic data, symptom onset (if any), estimated place and time of infection as well as contact tracing information. This was complemented by high resolution, both spatially and temporally, mobility data as well as serology data for the estimation of the fraction of unreported cases, in one comprehensive study. Despite the small size of Basel-City compared to previous city-models that focused on large metropolises20, we benefit from high data density and quality with respect to the number of residents, and address here a representative example of a European, medium-sized city for which studies are currently lacking.
By basing our modelling efforts only on sequenced data, despite reducing the number of included cases, transmission dynamic analysis did not need to account for potential new introductions. This is a strong advantage of our approach over others15, 31–33 since a continuum model, such as a SEIR-model, is based on the assumption of inherently related cases which is difficult to control in the absence of sequencing information. Another strength of our modelling strategy was the accounting for unreported cases which were the driving force of infection outside the observed clusters. Our 77% estimate of unreported cases overall (not limited to B.1-C15324T) fall within the range of previous reports within Europe8, 34.
The use of mobility and socioeconomic data in our models is also unique among studies published to date. Our analysis is based on information regularly collected and analysed by the statistical office of Basel city, providing a high spatial and temporal resolution network of the inner city mobility patterns. In contrast to the evaluation of mobile phone data which has frequently been used by others19, 20, 35, 36 our mobility information is not restricted to groups with a device but represents long-term analysis of different traffic streams. Such data is not subjected to privacy legislation and is hence expected to be more readily available for other cities, making our analysis transferable. We do not hold information on the duration and specific location of individuals, but a continuum estimate of population mixing that aligns well with the concept of a compartmental ODE model. We enrich this information with time-variable data on the degree of capacity utilization for the specific study period.
Based on these data, we evaluated the effective reproductive number for different socioeconomic groups of the population, rather than at a purely geographical level. We found that socioeconomic brackets characterized by low median income and a smaller living space per person, were associated with a significantly larger effective reproductive number than the socioeconomically stronger groups. This observation is in line with previous results35, suggesting that population groups from a weaker socioeconomic background (here low median income) are more mobile and at higher risk for SARS-CoV-2 infection/transmission. This SEIR-model analysis complements our findings based on the analysis of phylogenetic clusters. We observed clusters predominantly within higher socioeconomic groups, implying that these are spread within the same social network. It is likely that those individuals are retired, or have had the ability to work from home during the first wave, a pattern that has been observed also in other cities37. In contrast, infections within lower socioeconomic groups may result from multiple sources. This observation aligns with the possibility that low socioeconomic status may relate to jobs requiring higher personal contact, and compulsory mobility38, which has been shown to increase the risk of infection by 76%39.
Mobility and the reduction thereof has been suggested as a proxy to evaluate the reduction of the spread of SARS-CoV-2 6, 32, 36; however, it is clear that in addition to a change in mobility patterns there has also been a change in the hygiene and social interaction behaviour of the population, encouraged by national campaigns (e.g. in Switzerland35). In our SEIR-model we separate these two contributions allowing for an easier translation of our model for scenario building, such as for vaccination. Assuming limited availability of vaccines in the early phase after licensing and market introduction, implying that immunizing the required percentage of population to achieve herd immunity (49% for a perfectly effective vaccine40 and a basic reproduction number of 1.95 corresponding to the WHO estimate) is infeasible, it is essential to strategically optimize vaccine distribution. According to our model the largest reduction in case numbers would occur in a scenario where priority is given to those population groups that cannot effectively reduce their contacts or that are highly mobile corresponding to low median income groups. The overall reduction in case burden by choosing a targeted distribution is estimated reach up to 6.9 folds compared to a random vaccine distribution strategy. Also, the significant delay of transmission dynamics provides additional time to increase overall vaccination coverage throughout the population. By making a well-informed choice for the distribution of vaccines, an over proportional effect can be achieved. We showed that the senior citizen population was not driving the transmission of SARS-CoV-2 during the first wave. Hence, although vaccinating high-risk groups would reduce the number of hospitalized and ICU patients in the short term, the spread of the pandemic would be more effectively contained by vaccinating the transmission drivers. A vaccine would be most effective if applied in areas where it is not possible to trace transmission chains, as is the case for mobile and socioeconomically weaker groups. In contrast, we observed that cases within the more senior population groups clustered, indicating that contact tracing strategies would be efficient for this group where transmission chains were detected and contained. By restricting vaccination to risk groups only, a larger fraction of the general population will be exposed to SARS-CoV-2 implying that contact and travel restrictions remain vital to contain transmission. Such measures come at great economic cost41. Also, reliable estimations on the efficacy of vaccines in different age groups is yet to be established and it remains uncertain if the vaccines will reach the reported 42, 43 high efficacy also in the senior population. We did not account for this effect in our SEIR-model and performed vaccination scenario building accounting only for parameters based on data-driven estimates. However, in the case of a less effective vaccine in senior population groups this would intensify the differences between the simulated scenarios in favour of vaccination of the transmission drivers. Our vaccine scenario simulations are based on a simple, yet data-driven model that is currently specific to Basel-City and we further did not account for vaccination ramp-up, groups with co-morbidity, or differences in vaccine efficacy directly as has been suggested by others 44. Despite these simplifications our results are in line with previous suggestions44 and, ideally after independent validation for a different city, provide valuable insight into how transmission dynamics could be harnessed to guide vaccination strategy. In conclusion, high-resolution city-level epidemiological studies are essential for understanding factors affecting pandemic transmission chains and delivering effective, thereby supporting tailored public health information campaigns and vaccination distribution strategies at the municipal level. We here provided an example of such an analysis within a European city and suggest that the findings and modelling approaches presented may be readily translated to other cities.
Data Availability
The SEIR-model code used for this submission will be available on https://github.com/BorgwardtLab/BaselEpi.git. Code that was used for phylogenetic inference and calculation of significance of clusters in specified groups is available at https://github.com/appliedmicrobiologyresearch. SARS-CoV-2 whole genomes from Basel-City are available at gisaid.com and at European Nucleotide Archive (ENA) under accession number PRJEB39887.
Methods
Ethical statement
Ethical approval was given by the local ethical committee Ethik Kommission Nordwest und Zentralschweiz (EKNZ No. 2020-00769, to be found at https://ongoingprojects.swissethics.ch) and the project was registered at clinicaltrial.gov under NCT04351503.
City data
The Canton of Basel-City, an administrative area consisting of Basel, Bettingen, and Riehen, is located in the North-Western part of Switzerland, bordering Germany and France, and comprises 36.95 square kilometers in size. An estimated 34,000 commuters cross the borders from and to France and Germany daily45, together with another 85,000 inner-Swiss commuters45. Basel-City is divided into 19 urban quarters and had a mean population of 175,350 inhabitants during the last five years (+/- 1,737 inhabitants) (Figure S7). For subsequent analyses, a total of 1,078 statistical (housing) blocks (a city block partitioned by e.g. streets, rivers) were identified within the city quarters. The University hospital Basel is a tertiary hospital with a capacity of 700 regular and 44 intensive care unit (ICU) beds.
PCR testing and whole-genome sequencing
PCR testing was available rapidly and frequent testing was established and supported by local guidelines by the end of February, 2020, before the first case arrived46. In total 7073 PCR tests from Basel-City residents were performed at the University hospital Basel (750 of which were SARS-CoV-2 positive) dating between 25th of February and 22nd of April, 2020. The ratio of negative to positive PCR tests changed during the local epidemic with a median of 10.6% positive PCR tests (Figure 1B). We successfully sequenced SARS-CoV-2 whole genomes from 411 unique patients (54.8% of all cases). Of these, 247 (247/411, 60%) could be attributed to contain the C15324T mutation in the B.1 lineage (and therefore called B.1-C15324T) characteristic to the virus variant that originated in this tri-national area23.
Geographic mapping and socioeconomic stratification
Each individual PCR test from (N = 7073), irrespective of the result, was linked to the patient’s place of residence anonymized at the scale of statistical blocks in ArcMap 10.7 (by ESRI). To explore the statistical association between SARS-CoV-2 transmission and socioeconomic factors, we employed data provided by the Canton of Basel City’s office for statistics for the year of 2017 (most recent data available), that specified the values for various socioeconomic indicators for each statistical block (except for those blocks where privacy legislation did not permit the sharing of such information). The indicators under study were (i) the living space (per capita in m2), (ii) the share of 1-person private households, (iii) the median income (CHF), and (iv) the population seniority (percentage of senior citizens aged over 64 years per block). According to these socioeconomic indicators, blocks were allocated to one of three socioeconomic city tertiles (T1:≤33rd percentile, T2: 33rd to 66th percentile T3:>66th percentile) where possible (e.g. Figure 2A). In general, sparsely populated blocks displayed a maximum of three positive cases and had to be excluded from analysis. All following analyses with respect to socioeconomic factors were based on these city partitions.
Phylogenetic inference and cluster analysis
Whole-SARS-CoV-2-genomes from Basel-City patients were assembled using our custom analysis pipeline COVGAP23 (github.com/appliedmicrobiologyresearch CoV-2/). Global sequences and metadata were downloaded from GISAID47, 48 (as of October 22,2020; 155,278 consensus sequences). Sequences with more than 10 percent N’s (27,013) and with incomplete dates (43,466) were removed. 84,799 sequences remained which were joined with Basel genomes. Filtering for the period of interest until April 22nd retained 39,913 genomes, of which 411 are from Basel-City residents dating from February 26th (first case) to April 22nd, 2020. To infer relatedness among the viral genomes and spread of SARS-CoV-2 in Basel-City, a time-calibrated phylogeny that was rooted to the first cases in Wuhan, China from December 2019, was inferred using a subset of the global genomes. For subsetting, we included 30 genomes per country and month, whereby all genomes from Basel-City were retained, totalling 3,495 genomes, using the nextstrain software v.2.0.0 (nextstrain.org) and augur v.8.0.03 as described in detail in 23. The resulting global phylogeny was used to infer phylogenetic clusters in Basel-City. First, polytomies, which are caused by identical genomes in the tree were resolved using ETE3 v.3.1.149. Cluster Picker v.1.2.3 22 was then used to identify clusters in the resolved tree (options 0, 0, 4e-4, 5). Identified clusters were consolidated with epidemiological data (occupation in a health service job, resident of a care home, contact to positive cases, onset of symptoms, place of infection) to confirm the suitability of the divergence parameter. Cluster Matcher v.1.2.422 was then used to combine ancillary geographic (quarter), and socioeconomic (share of 1-person households, median income, living space, seniority) information that were subdivided into tertiles (1: low, 2: intermediate, 3: high, N/A: no available data or censored for privacy reasons) on identified clusters. To test whether related genomes in Basel-City cluster according to a) quarter, b) living space per person, c) share of 1-person households, d) median income, or e) seniority a custom python-script for a random permutation test was performed50 (github.com/appliedmicrobiologyresearch/Influenza-2016-2017). The results for clustering within and among urban quarter and tertiles in socioeconomic determinants were visualized using circos v.0.6951.
Serology
We determined SARS-CoV-2 antibody responses in a total of 2,019 serum samples collected from individuals between 25th of February and 22nd of May, 2020, to account for sero-conversion. The cohorts included healthy blood donors and patients from the University Hospital Basel with residency in Basel-City, and Viollier AG Zurich, a lab medicine provider contracted for the SARS-CoV-2 PCR testing. We used SARS-CoV-2 serology testing to detect specific IgG and IgM antibodies (Cobas6800, Roche). This is an electro-chemiluminescence immunoassay (ECLIA) that targets the nucleocapsid. Serology information was used to estimate the fraction of unreported cases as follows: An estimated 1.88% (38/2,019) of the Basel-City population was infected with SARS-CoV-2. Of these 60% would be attributed to the B.1-C15324T strain, leading to a percentage of 88% of unreported/unsequenced cases to consider.
Mobility data
We employed the official traffic model provided by the traffic department of Basel-City 25. The latter consists of the 2016 average A-to-B traffic on a grid of ∼1400 counting zones for four transport modalities: foot, bike, public motorized transport and private motorized transport. We further obtained weekly averages of pass-by traffic for the same count zones over the period of the first wave of the pandemic for the categories of combined foot and bike traffic, as well as private motorized traffic. Additionally, weekly public-transport passenger loads were provided by both the Swiss Federal Rail Company and the local public transport services. From these datasets, we computed the spatio-temporal variation of mobility within the city as follows. Spatial variation was obtained by aggregating A-to-B traffic between the aforementioned counting zones, first to the statistical block level (by identifying the nearest block with respect to a zone’s centroid), and second to tertile level via a statistical-block’s association with a socioeconomic indicator tertile. This resulted in a there-by-three mobility matrix Mjk whose diagonal entries represent within-tertile mobility, while off-diagonal entries represent inter-tertile mobility. This matrix was normalized to one since only relative differences were relevant in our model. Temporal variation was obtained by computing the weighted sum of the private transport time-series provided by the traffic department and the public transport time-series provided the Swiss Federal Rail Company and the local public transport services. This sum was then normalized and smoothed with a uni-variate spline resulting the final time-series for temporal mobility variation employed in our model (denoted as αmob(t)), see Figure 2 D).
Dynamic changes in social interaction
SARS-CoV-2 transmission is contact-based. While the number of contacts potentially taking place within a day and a city is largely influenced by human mobility as estimated above, the risk of a contact becoming a transmission event is further determined by the precautions taken by the two individuals being in contact (such as washing hands, wearing masks, distance keeping). Both aspects together–mobility and risk-mitigating social behaviour within a (sub-)population–eventually result in an effective, time-dependent, reproductive number characterizing the virus’s transmission within that (sub-)population. Hence, there are three relevant time-series: changes in the overall effective reproductive number, in mobility, and in social behaviour. While the computation of the temporal variations in mobility was described above, the time-dependent effective reproductive number is obtained by means of applying a Kalman filter 26, 27 to the piece-wise linearised time-series of daily confirmed cases of individuals having contracted the B.1-C15324T variant of SARS-CoV-2. Assuming a multiplicative model, the time-dependence of residual transmission risk stemming from lack of precaution in social interaction (denoted as αsoc(t)), is obtained by point-wise division of the time-dependence of the effective reproductive number by the mobility time-series (depicted in Figure 2 D). Thus we are adhering to the logic that in the extreme case of zero mobility, no transmission can take place despite a finite risk of transmission rooted in a lack of precautions, while on the other hand in the case of zero risk of transmission due to perfect precautions, no transmission can take place despite non-zero mobility. Such logic dictates the choice of a multiplicative rather than additive model.
SEIR-model
Similar to previous work 20, 52, 53 the spread of the epidemic within the city of Basel was described using a compartmental two-arm SEIR model including both sequenced, and unsequencend/unreported cases. We accounted for susceptibles (S), exposed (E, incubation time Tinc), and pre-symptomatic yet infectious cases independent of reported/unreported status (P, reproductive number R). After a presymptomatic time TinfP, cases were separated according to the estimated proportion of reported and sequenced cases, psq, into either reported infectious (I), or unreported infectious (Ui, reproductive number R). Since our data did not include information on recovered patients, a ‘recovered’ compartment was not included following compartment I. It was assumed that reported cases remained isolated. The unreported compartment on the other hand transitions to recovery (Ur) after an infectious time TinfU. To allow for connectivity and exchange between the defined partitions, cross contamination was included through the mobility matrix Mjk described above. Over the course of the studied period the Swiss government imposed a partial lockdown over the country resulting in notable temporal variation of mobility and social interaction patterns of the population of Basel-City which we account for with two time dependent weighting factors αmob(t) and αsoc(t) introduced above. In summary this resulted in the following system of ODEs for the spread within each socioeconomic tertile j: For all compartments, the initial number of susceptibles was fixed to the relevant population. All other compartments were initialized as zeros, with the exception of a seed in the exposed compartment corresponding to the first reported cases. In summary, our model is based only on six free model parameters, including the respective reproductive number per tertile Rj (three parameters, range [0, 20]), the initial number of exposed in a single tertile (range [0, 5]), and the infectious times TinfU and TinfP (range [1.5, 12] days). We assumed a latency period Tinc and infectious time prior to symptom onset TinfP of two days each 54.
Fitting procedure and evaluation
In total 247 cases within the time period from the 25th of February until the 22nd of April were included in this analysis. For all data a seven day moving window average was taken to account for reporting bias on weekends, and cumulative numbers of infected cases (compartment I) were calculated. Due to the loss of single sequencing plate, missing numbers on the 29th, 30th and 31st of March were imputed by assuming a constant ratio of the B.1-C15324T strain amongst the sequenced samples. Simulations were initialized on the 22nd of February, the estimated date of the occurrence of the initial exposed cases23.
The ODE system was implemented in python (version 3.8.) using the scipy functions odeint to iteratively solve the system of equations and minimize (with L-BFGS algorithm, cut-off tolerance of 10−7) for parameter fitting based on the average sum of squared differences between the logarithm of estimated and recorded cases. Points with cumulative case numbers below 15 were not scored in the cost-function since the continuum assumption of the model may not be satisfied for small case numbers. The fit was performed simultaneously for all four socioeconomic partitions to account for the shared parameters TinfU (obtained 1.8 days) and TinfP (obtained 2.1 days). Fit performance was evaluated based on root mean squared error (RMSE) between predicted and recorded case numbers. Data uncertainty and the corresponding influence on parameter estimation were assessed by bootstrapping. For each of 50 bootstraps each data point was randomly shifted according to numbers drawn from a normal distribution centered around zero and standard deviation 0.3. Results are shown as mean values over bootstraps and bootstrap uncertainty bands. We compare effective reproductive numbers corresponding to the normalization of R by the the effective mobility contribution (Σ k Mjk):
Scenario simulation
The impact of mobility relative to social interaction was analysed by recalculating the predicted epidemic trajectory under the constraint of constant intra-urban mobility (αmob(t) = 1, scenario M1) or fully restricted (αmob(t) = 0, scenario M2) mobility, corresponding to perfect isolation of the affected city areas. These scenarios were compared to the baseline of the actual reduction in mobility (scenario M0).
Vaccination scenarios were simulated as for both 90% and 70% effective vaccines resembling current vaccine candidate data 42, 43. This was achieved by moving the fraction of the vaccinated population from the susceptible to the recovered compartment Ur and calculating the spread of the pandemic with constant effective reproductive number and intra-city mobility. We accounted for a change in social interaction behaviour following vaccination by assigning a mean social interaction score of the vaccinated and not-vaccinated population (αsoc,vacc(t) = 1, αsoc,novacc(t) = 0.5). Mobility was modelled as 100% (αmob(t) = 1). Two scenarios were investigated and compared to the no vaccine scenario (V0): i) vaccination of a fixed population fraction (one or two thirds) randomly throughout the population (scenario V1), ii) vaccination of the corresponding number of individuals from the socioeconomic city areas presenting the highest reproductive number (scenario V2). These were compared to the simulation of no vaccination (scenario V0). In order gauge the benefit of a particular vaccination scenario, we calculated the time to reach 50% of intensive care unit (ICU) capacity. The University Hospital Basel has a total of 44 ICU beds. During the first wave, 4.5% of reported SARS-CoV-2 positive cases were admitted to ICU, and their median length of ICU stay was 5.9 days (IQR, 1.5-12.9). If considering additional unreported cases (captured by serological testing), the percentage of patients requiring ICU admission was around 1%. Of all SARS-CoV-2 patients with ICU stay, 40% were younger than 64 years resulting in a probability of an under 64 year old infected case to be admitted to ICU of 0.5%.
Author contributions
AE, HHH, KB devised the project. SCB and JK developed and performed the mathematical modelling. MSt performed and interpreted phylogenetic analyses. SCB, JK and MSt led the writing and revising of the report. AM provided the genome assembly pipeline. TR and HSS prepared viral RNA for sequencing. MyB and RSS provided geographical expertise. KKS collected clinical data. KR, DAT, AG, AKS, MSch analysed serology samples. DC and OD provided serology samples from Viollier AG. KL provided virological expertise. AB provided serology samples from the blood transfusion service. JB, STS and SF provided public health and epidemiological expertise. HP, MSi, CHN, RB, MB provided clinical expertise and valuable discussion on the results. NR, UH, JB provided epidemiological expertise. All author reviewed and edited the manuscript.
Competing Interests
The authors declare that they have no competing financial interests.
Correspondence
Correspondence and requests for materials should be addressed to Adrian Egli, adrian.egli{at}usb.ch, Clinical Bacteriology and Mycology, University Hospital Basel, Petersgraben, CH-4031 Basel, Switzerland, Phone: +41 61 556 57 49.
Data Availability
The SEIR-model code used for this submission will be available on https://github.com/BorgwardtLab/BaselEpi.git. Code that was used for phylogenetic inference and calculation of significance of clusters in specified groups is available at https://github.com/appliedmicrobiologyresearch. SARS-CoV-2 whole genomes from Basel-City are available at gisaid.com and at European Nucleotide Archive (ENA) under accession number PRJEB39887.
Supplementary Material
Acknowledgements
We greatly appreciate the input and data received from Construction- and Traffic department Canton Basel-City, Baselland Transport AG, Basler Verkehrs-Betriebe, Autobus AG, SBB AG, Statistical Office of the Canton of Basel-City, and want to specifically thank Björn Lietzke (Statistical Office of the Canton of Basel-City), Madeleine Imhof (Statistical Office of the Canton of Basel-City), Michael Redle (Construction- and Traffic department Canton Basel-City), Kathrin Grotrian (Construction- and Traffic department Canton Basel-City), Matthias Hofmann (Basler Verkehrs-Betrieb), Roman Stingelin (Autobus AG), Nadine Ruch (SBB AG) and Stefan Burtschi (Baselland Transport AG) for their support. We thank Christine Kiessling, Magdalena Schneider, Elisabeth Schultheiss, Clarisse Straub, and Rosa-Maria Vesco (University Hospital Basel) for excellent technical assistance with next generation sequencing. Computations were performed at sciCORE (http://scicore.unibas.ch) scientific computing facility at the University of Basel.
We thank all authors, who have shared their genomic data on GISAID, especially the Stadler Lab from ETH Zurich for sharing Swiss sequences. A full table (csv) outlining the originating and submitting labs is included as a supplementary file.
Footnotes
↵+ These authors share senior authorship