Abstract
COVID-19 epidemic doubling time by Chinese province was increasing from January 20 through February 9, 2020. The harmonic mean of the arithmetic mean doubling time estimates ranged from 1.4 (Hunan, 95% CI, 1.2-2.0) to 3.1 (Xinjiang, 95% CI, 2.1-4.8), with an estimate of 2.5 days (95% CI, 2.4-2.6) for Hubei.
Main text
Our ability to estimate the basic reproduction number of emerging infectious diseases is often hindered by the paucity of information about the epidemiological characteristics and transmission mechanisms of new pathogens (1). Alternative metrics could synthesize real-time information about the extent to which the epidemic is expanding over time. Such metrics would be particularly useful if they rely on minimal and routinely collected data that capture the trajectory of an outbreak (2).
Epidemic doubling times characterize the sequence of intervals at which the cumulative incidence doubles (3). If an epidemic is growing exponentially with a constant growth rate r, the doubling time remains constant and equals to (ln 2)/r. An increase in the doubling time indicates a slowdown in transmission if the underlying reporting rate remains unchanged (Technical Appendix) (4).
Here we analyzed by province the number of times COVID-19 cumulative incidence doubled and the evolution of the doubling times in mainland China (5), from January 20 (when nationwide reporting began) through February 9, 2020. Province-level daily cumulative incidence data were retrieved from provincial health commissions’ websites. Two sensitivity analyses based on a longer and a shorter time period respectively were conducted (Technical Appendix). Tibet was excluded from further analysis because there had only been one case reported during the study period.
From January 20 through February 9, the harmonic mean of the arithmetic means of the doubling times estimated from cumulative incidence ranged from 1.4 (95% CI, 1.2, 2.0) days (Hunan) to 3.1 (95%CI, 2.1, 4.8) days (Xinjiang). In Hubei, it was estimated as 2.5 (95% CI, 2.4, 2.6) days. The cumulative incidence doubled 6 times in Hubei. The harmonic mean of the arithmetic means of doubling times in all of mainland China except Hubei was 1.8 (95% CI, 1.5, 2.3) days. Provinces with a harmonic mean of the arithmetic means of doubling times <2d included Fujian, Guangxi, Hebei, Heilongjiang, Henan, Hunan, Jiangxi, Shandong, Sichuan, and Zhejiang (Figures 1 and S1).
As the epidemic progressed, it took longer for the cumulative incidence in mainland China (except Hubei) to double itself, which indicated an overall sub-exponential growth pattern outside Hubei (Figures S1, S2). In Hubei, the doubling time decreased and then increased. A gradual increase in the doubling time coincided with the social distancing measures and intra-and-inter-provincial travel restrictions imposed across China since the implementation of quarantine of Wuhan on January 23 (6).
Our estimates of doubling times are shorter than prior estimates of 7.4 days (95% CI, 4.2-14) (5), 6.4 days (95% CrI, 5.8-7.1) (7), and 7.1 days (95% CI, 3.0-20.5) (8) respectively. Li et al. covered cases reported by January 22 (5). Wu et al. statistically inferred case counts in Wuhan by internationally exported cases as of January 25 (7). Volz et al. identified a common viral ancestor on December 8, 2019 using Bayesian phylogenetic analysis and fitted an exponential growth model to provide the epidemic growth rate (8). Our estimates are based on cumulative confirmed case count by reporting date by province from January 20 through February 9.
Our study is subject to limitations, including underreporting of cases (9). One reason for underreporting is underdiagnosis, due to lack of diagnostic tests, healthcare workers, and other resources. Further, underreporting is likely heterogeneous across provinces. As long as reporting remains invariant over time within the same province, the calculation of doubling times remains reliable; however, this is a strong assumption. Growing awareness of the epidemic and increasing availability of diagnostic tests might have strengthened reporting over time, which could have artificially shortened the doubling time. Nevertheless, apart from Hubei and Guangdong (first case reported on January 19), nationwide reporting only began on January 20, and at this point, Chinese authorities had openly acknowledged the magnitude and severity of the epidemic. Due to a lack of detailed case data describing incidence trends for imported and local cases, we focused our analysis on the overall trajectory of the epidemic without adjusting for the role of imported cases on the local transmission dynamics. Indeed, it is likely that the proportion of imported cases was significant for provinces that only reported a few cases; their short doubling times in the study period could simply reflect rapid detection of imported cases. However, with the data until February 9, only two provinces had a cumulative case count of <40 (Table S1). It would be interesting to investigate the evolution of the doubling time after accounting for case importations if more detailed data becomes available.
To conclude, we observed an increasing trend in the epidemic doubling time of COVID-19 by Chinese province from January 20 through February 9, 2020. The harmonic mean of the arithmetic means of doubling times of cumulative incidence in Hubei during the study period was estimated at 2.5 (95% CI, 2.4, 2.6) days.
Data Availability
All data analyzed is publicly available aggregated data. The data is provided in the Technical Appendix.
Disclaimers
This article does not represent the official positions of the Centers for Disease Control and Prevention, the National Institutes of Health, or the United States Government.
Author Bio
Kamalich Muniz-Rodriguez, MPH, is a doctoral student at the Jiann-Ping Hsu College of Public Health, Georgia Southern University. Her research interests include infectious disease epidemiology, digital epidemiology and disaster epidemiology.
Gerardo Chowell, PhD, is Professor of Epidemiology and Biostatistics, and Chair of the Department of Population Health Sciences at Georgia State University School of Public Health. As a mathematical epidemiologist, Prof Chowell studies the transmission dynamics of emerging infectious diseases, such as Ebola, MERS and SARS.
Technical appendix
Additional information on our motivation, scope and methods
Motivation
R0 is a widely used indicator of transmission potential in a totally susceptible population and is driven by the average contact rate and the mean infectious period of the disease (1). Yet, it only characterizes transmission potential at the onset of the epidemic and varies geographically for a given infectious disease according to local healthcare provision, outbreak response, as well as socioeconomic and cultural factors. Furthermore, estimating R0 requires information about the natural history of the infectious disease. Thus, our ability to estimate reproduction numbers for novel infectious diseases is hindered by the paucity of information about their epidemiological characteristics and transmission mechanisms. More informative metrics could synthesize real-time information about the extent to which the epidemic is expanding over time. Such metrics would be particularly useful if they rely on minimal data on the outbreak’s trajectory (2).
Scope and definition
Our analysis is restricted to mainland China in this paper. A ‘province’ herein encompasses three different types of political sub-divisions of mainland China, namely, a province, a centrally (literally, ‘directly’) administered municipality (Beijing, Chongqing, Shanghai, and Tianjin) and an ‘ethnic minority’ autonomous region (Guangxi, Inner Mongolia, Ningxia, Tibet, and Xinjiang). Our analysis does not include the Hong Kong Special Administrative Region and the Macau Special Administrative Region, which are under the effective rule of the People’s Republic of China through the so-called ‘One Country, Two Systems’ political arrangements. Likewise, our analysis does not include Taiwan, which is de facto governed by a different government (the Republic of China).
Data sources
Daily cumulative incidence data were retrieved from provincial health commissions’ websites (Table S8). Data were double-checked against the cumulative national total published by the National Health Commission (3), data compiled by the Centre for Health Protection, Hong Kong, when available (4) and by John Hopkins University Center for Systems Science and Engineering (5). Whenever discrepancies arose, provincial government sources were deemed authoritative.
Doubling time calculation and its relationship with growth rate of an epidemic
As the epidemic grows, the times at which cumulative incidence doubles are given by such that where and i = 0,1,2,3, … nd where nd is the total number of times cumulative incidence doubles. The actual sequence of “doubling times” are defined as follows:
To quantify parameter uncertainty, we used parametric bootstrapping with a Poisson error structure around the harmonic mean of doubling times dj to obtain the 95% confidence interval. See references (6-8) for further details.
If we assume homogeneous mixing (equal probability of acquiring infection through contacts) and exponential growth, then, C(t2) = C(t1)exp(rt), and therefore, ln(C(t2)/C(t1)) = rt. When C(t2)/C(t1) = 2 and thus t is the doubling time, i.e. t = td, ln2 = rtd. Therefore, the doubling time, td, equals to (ln2)/r. See Vynnycky and White (9), panel 4.1, p.74 for further explanation.
Additional details on methods
Doubling time calculation was conducted using MATLAB R2019b (Mathworks, Natick, MA). Figures were created either using R version 3.6.2 (R Core Team) or MATLAB 2019b. Significance level in this manuscript was a priori decided to be α = 0.05.
Additional information on our results and discussion
Cumulative incidence over time
From Figure S7 to Figure S10, we provided plots of cumulative incidence over time (left panel) and semi-log plots with log10-transformed cumulative incidence over time (right panel) for a total of 8 provinces with a relatively high number of cases, namely, the epicenter Hubei, followed by (in alphabetical order) Fujian, Guangdong, Heilongjiang, Henan, Hubei, Hunan, Jiangxi and Shandong. If the epidemic is growing exponentially, the log10-transformed cumulative incidence over time will be a linear curve. If social distancing would have an impact, the slope of the semi-log plot would decrease, indicating a decreasing epidemic growth rate.
Harmonic mean of the harmonic mean
In this study, we also presented the harmonic mean of the harmonic means of the estimates of the epidemic doubling times. The harmonic means of the epidemic doubling times are shorter than their arithmetic means. From January 20 through February 9, the harmonic mean of the harmonic means of the doubling times estimated ranged from 0.5 (95% CI, 0.2, 1.3) days for Guangxi, to 2.3 (95% CI, 2.3, 2.4) days for Hubei. The harmonic mean of the harmonic means of doubling times in mainland China except Hubei were 1.2 (95% CI, 1.0, 1.4) days.
Further discussion
The slowing-down of the epidemic as represented in increasing epidemic doubling times in our study is also consistent with a study by Benjamin F. Maier and Dirk Brockmann, “Effective containment explains sub-exponential growth in confirmed cases of recent COVID-19 outbreak in Mainland China” (pre-print available at arXiv. 2020:2002.07572). They also identified sub-exponential growth of the outbreak across provinces, as mass quarantine and restriction of travels across mainland China began since January 23, 2020.
Sensitivity analysis #1
We performed a sensitivity analysis by expanding our data analysis to the data since December 31, 2019, when Hubei first reported a cluster of pneumonia cases with unexplained etiology that turned out to be COVID-19. The only difference between the sensitivity analysis and the main analysis is the inclusion of Hubei and Guangdong data from December 31, 2019, through January 19, 2020, because nationwide reporting started on January 20, 2020. The only differences in results were found for Hubei and Guangdong. For Hubei, the harmonic mean of the arithmetic mean of the doubling times was 4.06 (95% CI, 3.85-4.33); the harmonic mean of the harmonic means of the doubling times for Hubei was 2.28 (95% CI, 2.08-2.56); and the cumulative incidence in Hubei doubled nine times from December 31, 2019, through February 9, 2020 (Table S5, Figures S3, S4, S12, S13, S14). The first doubling time of Hubei (Figure S3) was high, reflecting that real-time data was unavailable before mid-January. It was only by January 17, 2020, onwards when data reporting become increasingly transparent and timely.
Sensitivity analysis #2
We also performed a sensitivity analysis by restricting our data analysis to the data from January 23, 2020 through February 9, 2020, to allow for the time that all the other provinces to ramp up their testing. January 23 was also the day when the Chinese authorities to put the city of Wuhan on ‘lockdown’ and major inter-provincial travel restrictions were put in place. When we changed the start date of our study period from January 20 (main analysis) to January 23, 2020 (sensitivity analysis #2), the epidemic doubling time of the aggregate cumulative incidence of mainland China (except Hubei) increased from 1.79 (95% CI, 1.52, 2.25) to 2.90 (95% CI, 2.62, 3.24) (harmonic mean of the arithmetic means), and from 1.18 (95% CI, 0.96, 1.42) to 1.98 (95% CI, 1.82, 2.17) (harmonic mean of the harmonic means) (Table S7, Figure S5, S6). Apart from the epidemic doubling time of the aggregate cumulative incidence of mainland China (except Hubei), we did not observe significant differences by province between results in the main analysis and sensitivity analysis #2. Therefore, our results should be robust for the purpose of this study.
Authors’ contributions
Project management: Dr. Gerardo Chowell, Dr. Isaac Chun-Hai Fung and Ms. Kamalich Muniz-Rodriguez
Manuscript writing: Dr. Isaac Chun-Hai Fung and Dr. Gerardo Chowell
Manuscript editing and data interpretation: Ms. Kamalich Muniz-Rodriguez, Dr. Gerardo Chowell, Dr. Isaac Chun-Hai Fung, Dr. Lone Simonsen, and Dr. Cecile Viboud
MATLAB code and methods of doubling time estimation: Dr. Gerardo Chowell
Doubling time calculation using MATLAB and presentation of results: Ms. Kamalich Muniz-Rodriguez, Dr. Gerardo Chowell and Dr. Isaac Chun-Hai Fung
Statistical analysis in R: Dr. Isaac Chun-Hai Fung, Ms. Kamalich Muniz-Rodriguez
Data management and quality check of epidemic data entry: Ms. Kamalich Muniz-Rodriguez, Dr. Isaac Chun-Hai Fung
Curation of epidemic data for countries and territories outside mainland China (including Hong Kong, Macao and Taiwan): Ms. Kamalich Muniz-Rodriguez and Ms. Sylvia K. Ofori
Curation of epidemic data for provinces in mainland China: Ms. Manyun Liu (from the early reports, up to Jan 24, 2020 data), Ms. Po-Ying Lai (since Jan 25, 2020 data to today), Mr. Chi-Hin Cheung (since Jan 27, 2020 data to today), and Ms. Kamalich Muniz-Rodriguez and Dr. Isaac Chun-Hai Fung (whenever there is a back-log).
Retrieval of epidemic data from official websites (downloading and archiving of China’s national and provincial authorities’ press releases): Ms. Manyun Liu and Dr. Dongyu Jia (at the very beginning of our project)
Retrieval of statistical data from the official website of National Bureau of Statistics of the People’s Republic of China: Mr. Chi-Hin Cheung
Retrieval of publicly available statistical data from various sources: Ms. Yiseul Lee, Dr. Isaac Chun-Hai Fung
Table S8: Ms. Manyun Liu, Dr. Isaac Chun-Hai Fung Map creation: Ms. Kimberlyn M. Roosa
Assistance provided to Dr. Fung and Ms. Muniz-Rodriguez: Ms. Sylvia K. Ofori
Acknowledgements
GC acknowledges support from NSF grant 1414374 as part of the joint NSF-NIH-USDA Ecology and Evolution of Infectious Diseases program. ICHF acknowledges salary support from the National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention (19IPA1908208). This article is not part of ICHF’s CDC-sponsored projects.