Abstract
We analyzed the epidemic doubling time of the 2019-nCoV outbreak by province in mainland China. Mean doubling time ranged from 1.0 to 3.3 days, being 2.4 days for Hubei (January 20-February 2, 2020). The trajectory of increasing doubling time by province indicated that social distancing measures slowed the epidemic with some success.
To the editor
Our ability to estimate basic reproduction numbers for novel infectious diseases is hindered by the dearth of information about their epidemiological characteristics and transmission mechanisms (1). More informative metrics could synthesize real-time information about the extent to which the epidemic is expanding over time. Such metrics would be particularly useful if they rely on minimal data of the outbreak’s trajectory (2).
Epidemic doubling times characterize the sequence of times at which the cumulative incidence doubled (3). Here we analyze the evolution of the doubling times and the number of times the cumulative incidence doubles, associated with the novel coronavirus (2019-nCoV) outbreak by province in mainland China (4), from January 20 (when provinces outside Hubei started reporting cases) through February 2, 2020. See Technical Appendix for a sensitivity analysis applied to data from December 31, 2019 through February 2, 2020. If an epidemic is growing exponentially with a constant growth rate r, the doubling time should remain constant, where doubling time = (ln 2) / r. An increase in doubling time could mean the epidemic has slowed down, assuming that the underlying reporting rate remained unchanged (see Technical Appendix and Figure S1).
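The relation between growth rate and doubling time stated above follows from setting C(t) = C0 · exp(r·t) and solving C(t + T) = 2·C(t) for T. A minimal Python sketch of that conversion is given below; the function names are illustrative and not taken from the authors' code, and the conversion holds only under the idealized assumption of exponential growth at a constant rate r.

```python
import math


def doubling_time_from_growth_rate(r: float) -> float:
    """Doubling time (days) for exponential growth at constant rate r (per day)."""
    if r <= 0:
        raise ValueError("growth rate must be positive for a doubling time to exist")
    return math.log(2) / r


def growth_rate_from_doubling_time(doubling_time: float) -> float:
    """Inverse relation: constant growth rate (per day) implied by a doubling time (days)."""
    if doubling_time <= 0:
        raise ValueError("doubling time must be positive")
    return math.log(2) / doubling_time


if __name__ == "__main__":
    # Illustration only: the growth rate that a constant 2.4-day doubling time would imply.
    print(round(growth_rate_from_doubling_time(2.4), 3))
```

Under that assumption, a constant doubling time of 2.4 days would correspond to a growth rate of about 0.29 per day.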
Cumulative incidence data from December 31, 2019 through February 2, 2020 were retrieved from the official webpages of the provincial health commissions and of the National Health Commission of China (5). They were double-checked against the provincial numbers reported by the Centre for Health Protection, Hong Kong, where available (6). Whenever discrepancies arose, the respective provincial government sources were deemed authoritative. Tibet was excluded from further analysis because it had only one case as of February 2, 2020, and thus a doubling time could not be calculated. All data analyzed are publicly available.
From January 20 through February 2, 2020, the mean doubling time of the cumulative incidence ranged from 1.0 day (Hunan and Henan) to 3.3 days (Hainan) (Figure 1A). In Hubei, it was estimated as 2.4 days. The cumulative incidence of Hubei doubled 5 times (Figure 1B). Provinces in which the cumulative incidence doubled ≥5 times and the mean doubling time was <2 days included Chongqing, Fujian, Heilongjiang, Henan, Hunan, Jiangxi, Shandong, Shanghai, Shanxi, Sichuan, Yunnan, and Zhejiang. These provinces experienced faster and consistent epidemic growth (Figures 1 and S2).
The doubling time of the aggregate cumulative incidence of all non-Hubei provinces increased over time (Figure S3), which suggested sub-exponential growth of the epidemic outside Hubei. The gradual, piecemeal increase in doubling time could be explained by the practice of self-quarantine since the Chinese New Year and the different levels of intra- and inter-provincial travel restrictions imposed across China since the travel quarantine of Wuhan (imposed on January 23, 2020) (7).
The limitations of our study include the incompleteness of the cumulative incidence data as reported by mainland Chinese authorities. One potential reason for underreporting is underdiagnosis, due to the lack of diagnostic tests, healthcare workers and other resources. Differential underreporting across provinces could have biased the data. However, as long as the reporting rate remains constant over time within the same province, the calculation of doubling times remains reliable. That said, increased awareness and increased availability of diagnostic tests might have improved the reporting rate over time, which might artificially shorten the doubling time. Nevertheless, apart from Hubei, the majority of mainland China has reported cases only since January 20, 2020, which was when the Chinese authorities openly acknowledged the seriousness of the outbreak. Therefore, the bias due to increased awareness might be small to negligible.
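The reporting-rate argument in the preceding paragraph can be made explicit with a short derivation. The notation here is ours and is introduced only for illustration: C(t) is the true cumulative incidence, ρ is the fraction of cases reported (assumed constant within a province), and C̃(t) is the reported cumulative incidence.

```latex
\tilde{C}(t) = \rho\, C(t), \quad 0 < \rho \le 1
\;\Longrightarrow\;
\frac{\tilde{C}(t_2)}{\tilde{C}(t_1)}
  = \frac{\rho\, C(t_2)}{\rho\, C(t_1)}
  = \frac{C(t_2)}{C(t_1)}
\quad \text{for all } t_1 < t_2 .
```

Reported incidence therefore doubles between two times exactly when true incidence does, so a constant reporting rate leaves the doubling times unchanged. If instead the reporting rate ρ(t) rises over time, the ratio of reported counts is multiplied by ρ(t_2)/ρ(t_1) > 1, so reported incidence doubles sooner than true incidence and the apparent doubling time is shortened.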
Conclusions
We analyzed the epidemic doubling time of the 2019 novel coronavirus outbreak by province in mainland China. The mean doubling time of cumulative incidence in Hubei was 2.4 days (January 20 through February 2, 2020), but the mean doubling times of Henan, Hunan, and Shandong were the lowest.
The trajectory of increasing doubling time by province indicated that social distancing measures adopted in China slowed the epidemic with some success.
Data Availability
All data analyzed are publicly available, aggregated data. We will attach the data that generated the results to the final published version of this manuscript.
First author(s) biography
Kamalich Muniz-Rodriguez, MPH, is a doctoral student at the Jiann-Ping Hsu College of Public Health, Georgia Southern University. Her research interests include infectious disease epidemiology, digital epidemiology and disaster epidemiology.
Gerardo Chowell, PhD, is Professor of Epidemiology and Biostatistics, and Chair of the Department of Population Health Sciences at Georgia State University School of Public Health. As a mathematical epidemiologist, Prof Chowell studies the transmission dynamics of emerging infectious diseases, such as Ebola, MERS and SARS.
Disclaimer
This article does not represent the official positions of the Centers for Disease Control and Prevention, the National Institutes of Health, or the United States Government.
Technical appendix
Additional information on our motivation, scope and methods
Motivation
R0 is a widely used indicator of transmission potential in a totally susceptible population and is driven by the average contact rate and the mean infectious period of the disease (1). Yet, it only characterizes transmission potential at the onset of the epidemic and varies geographically for a given infectious disease according to local healthcare provision, outbreak response, as well as socioeconomic and cultural factors. Furthermore, estimating R0 requires information about the natural history of the infectious disease. Thus, our ability to estimate reproduction numbers for novel infectious diseases is hindered by the dearth of information about their epidemiological characteristics and transmission mechanisms. More informative metrics could synthesize real-time information about the extent to which the epidemic is expanding over time. Such metrics would be particularly useful if they rely on minimal data of the outbreak’s trajectory.
Scope and definition
We restricted our analysis to mainland China in this paper. A ‘province’ herein encompasses three different types of political sub-divisions of mainland China, namely, a province, a directly administered municipality (Beijing, Chongqing, Shanghai, and Tianjin) and an autonomous region (Guangxi, Inner Mongolia, Ningxia, Tibet, and Xinjiang). Our analysis does not include Hong Kong Special Administrative Region and Macau Special Administrative Region, which are under effective rule of the People’s Republic of China through the so-called ‘One Country, Two Systems’ political arrangements. Likewise, our analysis does not include Taiwan, which is de facto governed by a different government (the Republic of China).
Data apart from epidemic data
Provincial demographic, transportation and socioeconomic data were obtained from the National Bureau of Statistics of China (2) and other sources (see Table S2).
Doubling time calculation
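As the epidemic grows, the times at which cumulative incidence doubles are given by $t_{d_i}$ such that $2C(t_{d_i}) = C(t_{d_{i+1}})$, where $t_{d_0} = 0$, $C(t_{d_0}) = C_0$, and $i = 0, 1, 2, 3, \ldots, n_d$, where $n_d$ is the total number of times cumulative incidence doubles (Figure S1). The actual sequence of “doubling times” is defined as follows (Figure S1):
$$d_j = \Delta t_{d_j} = t_{d_j} - t_{d_{j-1}}, \quad j = 1, 2, 3, \ldots, n_d.$$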
The doubling time calculation was conducted using MATLAB R2019b (MathWorks, Natick, MA). Multiple linear regression analyses were conducted using R version 3.6.2 (R Core Team). The significance level was set a priori at α = 0.05.
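The doubling time calculation described in this section can be sketched in Python as follows. This is not the authors' MATLAB code. It is a minimal illustration under two assumptions of ours: the first reported count is taken as the baseline, and the time at which each successive doubling is reached is located by linear interpolation between consecutive reports. The function names are hypothetical.

```python
from typing import List, Sequence


def doubling_times(days: Sequence[float], cumulative: Sequence[float]) -> List[float]:
    """Return the sequence of doubling times (in the units of `days`).

    `days` are the report times and `cumulative` the cumulative case counts.
    The baseline is the first count; each doubling time is the gap between
    the times at which successive doublings of that baseline are reached.
    """
    if len(days) != len(cumulative) or len(days) < 2:
        raise ValueError("need two or more matching time points and counts")
    baseline = cumulative[0]
    if baseline <= 0:
        raise ValueError("first cumulative count must be positive")

    crossing_times = [days[0]]  # time at which the baseline count is observed
    target = 2 * baseline       # next level the cumulative count must reach
    k = 1
    while k < len(days):
        prev_c, cur_c = cumulative[k - 1], cumulative[k]
        if cur_c >= target:
            # Assumption: linear interpolation inside the reporting interval.
            # prev_c < target <= cur_c holds here, so the denominator is positive.
            frac = (target - prev_c) / (cur_c - prev_c)
            crossing_times.append(days[k - 1] + frac * (days[k] - days[k - 1]))
            target *= 2  # stay on this interval: it may contain several doublings
        else:
            k += 1
    return [later - earlier for earlier, later in zip(crossing_times, crossing_times[1:])]


def mean_doubling_time(days: Sequence[float], cumulative: Sequence[float]) -> float:
    """Arithmetic mean of the doubling times; requires one or more doublings."""
    d = doubling_times(days, cumulative)
    if not d:
        raise ValueError("cumulative incidence never doubled")
    return sum(d) / len(d)


if __name__ == "__main__":
    # Made-up counts for illustration only, not data from any province.
    example_days = [0, 1, 2, 3, 4]
    example_counts = [10, 15, 30, 35, 50]
    print(doubling_times(example_days, example_counts))
    print(mean_doubling_time(example_days, example_counts))
```

A series that never doubles yields an empty list, which mirrors the exclusion of a province with a single reported case: no doubling time can be calculated for it.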
Additional information on our results and discussion
Demographic, transportation and socioeconomic factors
We fitted multiple linear regression models with the latest doubling time, the mean doubling time, and the slope of the doubling time over the number of times the cumulative incidence doubled as the dependent variables, respectively. We included population density, average temperature in January, average household size, and subnational Human Development Index in all models. We included passenger traffic and the provincial capital’s distance from Wuhan, by railway (models group A) and by highway (models group B), respectively. However, none of the independent variables was found to be statistically significantly associated with any of the dependent variables (all p > 0.05; Table S2).
Sensitivity analysis
We performed a sensitivity analysis by extending our analysis to data from December 31, 2019, when Hubei first reported a cluster of pneumonia cases of unexplained etiology that turned out to be 2019-nCoV. The only difference between the sensitivity analysis and the main analysis is the inclusion of Hubei data from December 31, 2019 through January 19, 2020, because all other provinces started to report cases on January 20, 2020. Accordingly, the only differences in results were found for Hubei: the mean doubling time was 3.85 days (Figures S4, S6), and the cumulative incidence in Hubei doubled 8 times from December 31, 2019 through February 2, 2020 (Figures S5, S6). The first doubling time of Hubei (Figure S5) was high, reflecting that real-time data were unavailable before mid-January. Only from January 17, 2020 onward did data reporting become increasingly transparent and timely.
In our sensitivity analysis, we fitted the same multiple regression models previously described, with the mean doubling time and the slope of the doubling time over the number of times the cumulative incidence doubled as dependent variables. We included population density, average temperature in January, average household size, and subnational Human Development Index in all models. We included passenger traffic and the provincial capital’s distance from Wuhan, by railway and by highway, respectively. However, none of the independent variables was found to be statistically significantly associated with any of the dependent variables (all p > 0.05; results not shown).
Authors’ contributions
Project management: Dr. Gerardo Chowell, Dr. Isaac Chun-Hai Fung and Ms. Kamalich Muniz-Rodriguez
Manuscript writing: Dr. Isaac Chun-Hai Fung and Dr. Gerardo Chowell
Manuscript editing and data interpretation: Ms. Kamalich Muniz-Rodriguez, Dr. Gerardo Chowell, Dr. Isaac Chun-Hai Fung, Dr. Lone Simonsen
MATLAB code and Figure S1: Dr. Gerardo Chowell
Doubling time calculation using MATLAB and Figures S2, S3, S4 and S5: Ms. Kamalich Muniz-Rodriguez, Dr. Gerardo Chowell and Dr. Isaac Chun-Hai Fung
Statistical analysis in R: Dr. Isaac Chun-Hai Fung
Data management and quality check of epidemic data entry: Ms. Kamalich Muniz-Rodriguez
Entry of epidemic data for countries and territories outside mainland China (including Hong Kong, Macao and Taiwan): Ms. Kamalich Muniz-Rodriguez and Ms. Sylvia K. Ofori
Entry of epidemic data for provinces in mainland China: Ms. Manyun Liu (from the early reports, up to Jan 24, 2020 data), Ms. Po-Ying Lai (since Jan 25, 2020 data to today), Mr. Chi-Hin Cheung (since Jan 27, 2020 data to today), and Ms. Kamalich Muniz-Rodriguez and Dr. Isaac Chun-Hai Fung (whenever there is a back-log).
Retrieval of epidemic data from official websites (downloading and archiving of China’s national and provincial authorities’ press releases): Ms. Manyun Liu and Dr. Dongyu Jia
Retrieval of statistical data from the official website of National Bureau of Statistics of the People’s Republic of China: Mr. Chi-Hin Cheung
Retrieval of publicly available statistical data from various sources: Ms. Yiseul Lee, Dr. Isaac Chun-Hai Fung
Acknowledgement
GC acknowledges support from NSF grant 1414374 as part of the joint NSF-NIH-USDA Ecology and Evolution of Infectious Diseases program. ICHF acknowledges salary support from the National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention (19IPA1908208). This article is not part of ICHF’s CDC-sponsored projects.
Footnotes
Email addresses: km11200{at}georgiasouthern.edu (K. Muniz-Rodriguez); gchowell{at}gsu.edu (G. Chowell); westerpants{at}gmail.com (C.-H. Cheung); djia{at}georgiasouthern.edu (D. Jia); pylai{at}bu.edu (P.-Y. Lai); ylee97{at}student.gsu.edu (Y. Lee); ml16842{at}georgiasouthern.edu (M. Liu); so01935{at}georgiasouthern.edu (S. K. Ofori); kroosa1{at}student.gsu.edu (K. M. Roosa); lone{at}gwu.edu (L. Simonsen); cfung{at}georgiasouthern.edu (I. C.-H. Fung)
References
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.