Abstract
Background The ongoing COVID-19 epidemic that has spread widely in China since December 2019 is now generating local transmission in several countries including Singapore of February 19, 2020.
This highlights the need to monitor transmission potential of unfolding SARS-CoV-2 outbreaks in real time. In particular, five major COVID-19 clusters have emerged in Singapore.
Methods Here we estimate the effective reproduction number, Rt, for Singapore from the daily case series of imported and autochthonous cases by date of symptoms onset, after adjusting the local cases for reporting delays, using a generalized growth model and employing a renewal equation. We also derive the reproduction number from the distribution of cluster sizes using a branching process analysis.
Results As of February 19th, 2020, our results indicate that the mean effective reproduction number Rt is at 1.1 (95% CI: 1.1,1.3) while an estimate of Rt based on cluster size distribution is at 0.63 (95% CI: 0.4, 1.05).
Conclusion These estimates underscore positive impact of containment efforts in Singapore while at the same time suggest the need to sustain social distancing and active case finding efforts to stomp out all active chains of transmission.
Background
The ongoing COVID-19 epidemic started with a cluster of pneumonia cases of unknown etiology in Wuhan, China back in December 2019 (1, 2). The initial cases have been linked to a wet market in the city of Wuhan, pointing to an animal source of the epidemic (3). Subsequently, rapid human-to-human transmission of the disease was confirmed in January 2020, and the etiological agent was identified as severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) due to its genetic similarity to the SARS-CoV discovered in 2003 (1, 4). The total case tally has reached 75,700 infections including 2128 deaths as of February 19, 2020 (1, 2). As the virus continues to spread in the human population, obtaining an accurate “real-time” picture of the epidemic’s trajectory is complicated by several factors including reporting delays and changes in the case definition (5, 6). Although the epidemic appears to be on a downward trend in China, cases of the novel coronavirus have been reported in 26 countries around the globe, and local transmission is ongoing in Singapore, Japan, Korea and Thailand (7). Detailed epidemiological data from these countries is needed to monitor transmission potential of SARS-CoV-2 in near real-time.
Outside of China, Singapore is one of the hardest hit countries with the COVID-19 epidemic (8), reporting its first symptomatic imported case (a 66 years old Chinese male) on January 23, 2020 (9). Up until February 19, 2020, Singapore has reported 84 laboratory confirmed cases, including 22 reported case importations from Wuhan China (8). Imported cases include six evacuees who departed China between January 30 and February 9, 2020 (10, 11). On February 4, 2020, the Ministry of Health of Singapore reported its first local cluster of COVID-19, which was linked to the Yong Thai Hang shop (12). A total of 9 clusters with 1 or more COVID-19 local cases have been reported thus far. Table 1 summarizes the characteristics of the clusters.
Characteristics of the largest COVID-19 outbreak in Singapore as of February 19, 2020.
Although large-scale community transmission has not been reported in Singapore, the novel coronavirus can rapidly spread in confined and crowded places, as illustrated by a large cluster of COVID-19 cases that has been linked to the Grace Assembly of God Church, which has an average attendance of 4,800 (13). In China, substantial hospital-based transmission of SARS-CoV-2 has been reported, with a total of 1700 cases reported among healthcare workers (14, 15). This pattern aligns well with past outbreaks of SARS and MERS (16), including substantial nosocomial transmission during the 2003 SARS outbreak in Singapore (17). Thus, the distributions of cluster sizes for SARS and MERS outbreaks are characterized by high heterogeneity (16). To minimize the risk of hospital-based transmission of SARS-CoV-2, the Ministry of Health of Singapore has restricted the movement of patients and staff across hospitals (18). Also, because multiple unlinked community cases of COVID-19 have been reported (19), various social distancing measures have been put in place including advising the public against large social gatherings in order to mitigate the risk of community transmission (20, 21). These social distancing measures reduce the risk of onward transmission not only within Singapore, but also beyond the borders of this highly connected nation (22).
The reproduction number is a key threshold quantity to assess the transmission potential of an emerging disease such as COVID-19 (23, 24). It quantifies the average number of secondary cases generated per case. If the reproduction number is below 1.0, infections occur in isolated clusters as self-limited chains of transmission, and persistence of the disease would require continued undetected importations. On the other hand, reproduction numbers above 1.0 indicate sustained community transmission (16, 24). Using epidemiological data and mathematical modeling tools, we are monitoring the transmission potential of SARS-CoV-2 in Singapore in real-time, and here we report results as of February 19, 2020. Specifically, we assess the effective reproduction number from the daily case series of imported and autochthonous cases by date of symptoms onset after adjusting for reporting delays, and we also derive an estimate of the reproduction number based on the characteristics of the clusters of COVID-19 in Singapore.
Methods
Data
We obtained the daily series of confirmed cases of COVID-19 in Singapore as of February 19, 2020. Individual-level case details including the dates of symptoms onset, the date of reporting, and whether the case is autochthonous (local transmission) or imported are publicly available from the Ministry of Health, Singapore (25). Clusters consisting of one or more cases according to the infection source were also assembled from case descriptions obtained from field investigations conducted by the Ministry of Health, Singapore (25). Single imported cases or unlinked cases are analyzed as clusters of size 1.
Adjusting for reporting delays
As an outbreak progresses in real time, epidemiological curves can be distorted by reporting delays arising from several factors that include: (i) delays in case detection during field investigations, (ii) delays in symptom onset after infection, (iii) delays in seeking medical care, (iv) delays in diagnostics and (v) delays in processing data in surveillance systems (26). However, it is possible to adjust epidemic curves by reporting delays using statistical methods (27). The reporting delay for a case is defined as the time lag in days between the date of onset and date of reporting. Here we adjusted the COVID-19 epidemic curve of local cases by reporting delays using a non-parametric method that employs survival analysis techniques (28-30). The 95% prediction limits are derived according to Lawless et al. (31). For this analysis, we exclude 6 imported cases and 3 local cases without dates of symptoms onset.
Effective reproduction number from case incidence
We assess the effective reproduction number over time, Rt, which quantifies the temporal variation in the average number of secondary cases generated per case during the course of an outbreak after considering multiple factors including behavior changes, cultural factors, and the implementation of public health measures (16, 24, 32). Estimates of Rt>1 indicate sustained transmission; whereas, Rt <1 implies that the outbreak is slowing down and the incidence trend is declining. Hence, maintaining Rt <1 is required to bring an outbreak under control. Using the reporting delay adjusted incidence curve, we estimate the most recent estimate of Rt for COVID-19 in Singapore. We first characterize daily incidence of local cases using the generalized growth model (GGM) after adjusting for imported cases. This model characterizes the growth profile via two parameters: the growth rate (r) and the scaling of the growth parameter (p). The GGM can reproduce a range of growth dynamics, including constant growth (p=0), sub-exponential or polynomial growth (0<p<1), and exponential growth (p=1) (33). The generation interval is assumed to follow a gamma distribution with a mean of 4.41 days and a standard deviation of 3.17 days based on refs. (34, 35). We denote the local incidence at calendar time ti by Ii, the raw incidence of imported cases at calendar time ti by Ji, and the discretized probability distribution of the generation interval by ρi. Then, we can estimate the effective reproduction number by employing the renewal equation (36-38)
In this equation the numerator represents the new cases Ii, and the denominator represents the total number of cases that contribute to the new cases Ii at time ti. Parameter 0≤α≤1 represents the relative contribution of imported cases to the secondary disease transmission. We perform a sensitivity analyses by setting α = 0.15 and α =1.0(39). Next, in order to derive the uncertainty bounds around the curve of Rt,we estimate Rt for 300 simulated curves assuming a Poisson error structure (40).
Reproduction number (Rt) from the analysis of cluster sizes
In the branching process analysis, the number of transmissions caused by each new infection is modeled as a negative binomial distribution. This is parameterized by the effective reproduction number, Rt, and the dispersion parameter, k. The reproduction number provides the average number of secondary cases per index case, and the dispersion parameter varies inversely with the heterogeneity of the infectious disease. This parameterization allows variation in the both the strength and heterogeneity of disease transmission, where a lower dispersion parameter indicates higher transmission heterogeneity. Assuming each cluster of infection is initiated by a single infection, branching process theory provides an analytic representation of the size distribution of cluster sizes (41). This permits direct inference of the maximum likelihood estimate and confidence interval for Rt and k.
Results
Incidence data and reporting delays
The COVID-19 epidemic curve by the date of reporting and stratified for local and imported incidence case counts is shown in Figure 1. It is worth noting that the majority of the imported cases occurred at the beginning of the outbreak in Singapore. The cumulative curve shows an upward trajectory with an average of 3.4 new cases reported per day (Figure 1). Out of 22 imported cases, only 4 cases have been linked to secondary cases. Meanwhile, a total of 61 autochthonous cases have been reported as of February 19, 2020 including 8 cases that remain unlinked to any known transmission chains.
Unadjusted local and imported Incidence cases by date of reporting as of February 19, 2020. The solid line represents the cumulative cases count for the COVID-19 cases in Singapore by the date of reporting.
The reporting-delay adjusted epidemic curve of local cases by date of symptoms onset is shown in Figure 2, and the distribution for reporting delays is displayed in Figure 3. The gamma distribution provided a reasonable fit to the distribution of overall reporting delays and estimated a mean reporting delay at 6.9 days (95% CI: 5.9, 7.9) (Figure 3). We also found that imported cases tend to have shorter mean reporting delays compared to local cases (mean 3.5 vs. 8.3 days, Wilcoxon test, p<0.001) as imported cases tend to be identified more quickly.
Reporting delay adjusted incidence for the COVID-19 outbreak in Singapore accounting for local cases as of February 19, 2020. Blue bars represent the raw incidence, red solid line represents the adjusted incidence, red dotted lines represent the 95% lower and upper bound of the adjusted incidence.
Distribution of reporting delays for all cases as of February 19, 2020 represented as a histogram. The red circle represents the mean of reporting delays and the horizontal line represents the 95% CI assuming reporting delays have a gamma distribution.
Transmission clusters
As of February 19 2020, nine different clusters of COVID-19 cases with 1-22 cases per cluster have been reported in Singapore. A schematic diagram and characteristics of the COVID-19 clusters in Singapore are given in Figure 4 and Table 1. The geographic location of the five largest clusters accounting for 53.6% of the total cases is shown in Figure 5 whereas the corresponding distribution of cluster sizes is shown in Figure 6.
Cluster network of the cases in Singapore for the COVID-19 global epidemic as of February 19, 2020. The pink circles represent the imported cases where as the blue circles represent cases with no travel history to China. The larger dotted circles represent the COVID-19 disease clusters. Each blue arrow represents the direction in which the disease was transmitted. Pink arrows represent immediate family. Dates below the circles are the dates of case reporting.
Map depicting the spatial distribution of the 5 largest COVID-19 clusters in Singapore; Grand Hayatt cluster, Yong Thai Hang cluster, Grace Assembly of God Church, Seletar Aerospace cluster and The Life Church and Missions cluster as of February 19, 2020.
Distribution of COVID-19 cluster sizes in Singapore as of February 19, 2020.
Yong Thai Hang cluster
This cluster with 9 cases was the first to be reported in Singapore. It has nine traceable links, including eight Chinese and one Indonesian national associated with the visit on January 23, 2020 of Chinese tourists to the Yong Thai Hang health products store, a shop that primarily serves the Chinese population. Four shop employees and the tour guide were first identified as a cluster on February 4, 2020 (12, 42, 43). The tour guide subsequently infected her husband, a newborn and the domestic helper (43). No further cases have been added to this cluster as of February 8, 2020.
Grand Hayatt hotel
This cluster with 3 local cases was the second cluster to receive international attention, as it originated from a business meeting held at the Grand Hayatt hotel (44). Four international cases associated with this cluster had left Singapore before the local cluster was identified. All Singaporean residents associated with this cluster have recovered as of February 19, 2020 (44). No additional cases have been added to this cluster as of February 8, 2020.
The Life Church and Missions cluster
This cluster comprising of 6 cases was associated with The Life Church and Missions in Paya Lebar on February 7, 2020. This cluster is composed of two imported cases who are related to each other and four local cases. No further cases have been added to this cluster as of February 19, 2020.
Seletar Aerospace Heights cluster
This cluster with 5 Bangladeshi work pass holders was identified on February 9, 2020. No further cases have been added to this cluster as of February 15, 2020.
Grace Assembly of God Church cluster
The fifth and largest cluster composed of 22 local cases has been linked to the two branches of Grace Assembly of God Church at Tanglin and Bukit Batok, which was identified on February 12, 2020 (42, 45). This church has about 4800 people in attendance on average over the weekend. While the church has momentarily closed, field investigations have not led to conclusive evidence regarding super-spreading transmission (46). No further cases have been added to this cluster as of February 19, 2020.
Cluster A
The first case of this cluster comprising 2 imported and one local case was identified on January 30, 2020. The two imported cases are related to each other. No further cases have been added in this cluster as of February 10, 2020.
Cluster B
The first case of this cluster composed of 3 local cases was identified on February 14, 2020. No further cases have been added in this cluster since February 18, 2020.
Cluster C
This first case of this cluster composed of 4 local cases was identified on February 12, 2020. No further cases have been added to this cluster since February 17, 2020.
Cluster D
The first case of this cluster composed of two local cases who are related to each other was identified on February 9, 2020. No further cases have been added in this cluster since February 16, 2020.
Reproduction Numbers
Using the delay-adjusted incidence curve of local cases, we estimated the intrinsic growth rate r at 0.9 (95% CI: 0.4,1.7) and the scaling of growth parameter p at 0.4 (95%CI: 0.2,0.6), indicating sub-exponential growth dynamics of COVID-19 (Figure 7). Our R estimate was not sensitive to changes in parameter α. Specifically, our mean effective reproduction number Rt was estimated at 1.1 (95% CI:1.1, 1.3) when α =0.15 and 1.1 (95% CI:1.1, 1.2) when α =1(Figure 7). It is worth noting that ignoring the presence of imported cases, our estimates would have been slightly underestimated (by about 0.1 in absolute value).
Individual level reproduction number with 95% CI estimated by adjusting for the imported cases α =0.15. Estimates for growth rate (r) and the scaling of the growth rate parameter (p) are also provided. The plot at the bottom depicts the fit of the Generalized Growth Model to the Singapore adjusted incidence data assuming Poisson error structure.
Based on the entire distribution of cluster sizes, we jointly estimated the reproduction number R and the dispersion parameter k. Fitting the negative binomial distribution to the cluster data in the empirical distributions of the realizations during the early stages of the outbreak in Singapore, the reproduction number is estimated at 0.6 (95% CI: 0.4, 1.1) after adjusting for the independent and identically distributed observation probability of 50%. The dispersion parameter is estimated at 0.4 (95% CI: 0.1, Inf.) indicating higher transmission heterogeneity.
Discussion
Overall, current estimates of transmission potential in Singapore, based on two different data sources and different methods, suggest the potential for further local transmission of SARS-CoV-2 in Singapore. While large-scale transmission has not been reported, the fact that asymptomatic and subclinical cases are now well documented for COVID-19 (47) suggests that our estimates may be underestimated (48). On the other hand, it is not clear if asymptomatic or subclinical cases are as infectious as symptomatic cases. Indeed, we have reported that multiple local cases have yet to be traced to existing transmission chains. Additional data collected during the course of the outbreak will help obtain an improved picture of the transmission dynamics (8). These findings emphasize the need to strengthen public health interventions including active case contact tracing activities in countries with emerging transmission of SARS-CoV-2. It is worth noting that current data suggests that imported cases have contributed little to secondary cases in Singapore, with most of the imported cases dating back to the early phase of the epidemic. However, there are examples such as the Grand Hyatt Singapore conference cluster and the Yong Thai Hang cluster that were linked to imported sources, and the original sources had left Singapore before these local clusters emerged (44, 49).
Our Rt estimates for Singapore are substantially lower than mean estimates reported for the COVID-19 epidemic in China, which range from: 2.2-7.1 (50-53). This indicates that containment efforts are having a significant impact in Singapore. However, some differences in the reproduction numbers reported for the epidemic in China may result from different methods, differences in data sources, and time periods used to estimate the reproduction number. Similarly, a recent study has shown an average reporting delay of 6.1 days in China (54) which agrees with our mean estimate for local cases (6.9 days). Moreover, the scaling parameter for growth rate (p) indicates a sub-exponential growth pattern in Singapore, reflecting the effective isolation and control strategies in the region. This is consistent with a sub-exponential growth pattern for Chinese provinces excluding Hubei (p=0.67), as estimated by a recent study (55); whereas, an exponential growth pattern was estimated for Hubei (p∼1.0) (55).
A previous study (56) on the 2015 MERS outbreak in South Korea reported a substantial potential for superspreading despite a subcritical Rt. The lower estimate of the dispersion parameter in our study also indicate significant transmission heterogeneity in Singapore. Super-spreading events of MERS-CoV and SARS-CoV associated with nosocomial outbreaks have been observed in the past and attributed to substantial diagnostic delays (16, 30). Although the average time from onset of symptoms to diagnosis for COVID-19 patients in Singapore is 7.9 days and no super-spreading event has been observed yet, the dispersion parameter, k<1, indicates the probability of observing large clusters and the potential for super-spreading (56, 57). Therefore, public health measures enacted by public health authorities in Singapore that advise the public to avoid mass gatherings and confined places are crucial to prevent disease amplification events. However, the presence of asymptomatic cases in the community represent an ongoing threat (58, 59) although it is not currently known if subclinical cases are less infectious. This highlights the need for rapid testing suspected cases for rapid detection and isolation of confirmed cases. To achieve this goal, public health authorities in Singapore are reactivating 900 GP clinics (60). Investigations are also underway to identify additional cases linked to the “Grace Assembly of God” cluster in Singapore. While most clusters have stabilized (no recent additional cases in most clusters), this cluster is still being consolidated (49, 61).
Beyond Singapore, COVID-19 clusters have been reported in Germany and on the Diamond Cruise ship in Japan (62-65). Moreover, Singapore has also produced secondary chains of disease transmission beyond its borders (22). Although Singapore has been detecting and isolating cases with diligence, our findings from the analysis underscore the need for continued and sustained containment efforts to prevent large-scale community transmission including nosocomial outbreaks. Overall, the current situation in Singapore highlights the need to investigate the unlinked and asymptomatic cases that could be a potential source of secondary cases. Although Singapore has a highly efficient contact tracing mechanism in place (22), continued epidemiological investigations and active case finding efforts are needed to control the outbreak.
Our study is not exempt from limitations. First, the outbreak is still ongoing and we continue to monitor the transmission potential of COVID-19 in Singapore. Second, onset dates are missing in nine cases, which were excluded from our analyses. Third, we cannot rule out that additional cases will be added to existing clusters, which may lead to underestimating the reproduction number based on the cluster size distribution. Lastly, the cluster size analysis is modeled under the assumption that there is one primary case (i.e. all cases can be connected eventually to one person), rather than being multiple imported cases. Additional work could be conducted to account for the uncertainty regarding the primary number of cases per cluster and the possibility of ongoing transmission in some clusters.
Conclusion
This is the first real-time study to estimate the evolving transmission potential of SARS-CoV-2 in Singapore. Our current findings point to ongoing transmission of SARS-CoV-2, although the reproduction number appears to be slightly above the epidemic threshold of 1.0. These estimates underscore positive impact of containment efforts in Singapore while at the same time suggest the need to enhance social distancing and active case finding efforts to stomp out all active chains of transmission.
List of abbreviations
COVID-19
SARS-CoV-2
Ethics approval and consent to participate
Not applicable
Consent for publication
Not applicable
Conflict of Interest
The authors declare no conflicts of interest.
Funding
G.C. is supported by NSF grants 1610429 and 1633381.
Data declaration
All data are publicly available.
Author Contributions
A.T, S.B., P.Y. and G.C. analyzed the data. A.T., Y. L, P.Y and S.M. retrieved and managed data; A.T and G.C wrote the first draft of the manuscript. All authors contributed to writing and revising subsequent versions of the manuscript. All authors read and approved the final manuscript.