Abstract
The COVID-19 pandemic has stressed healthcare care systems throughout the world. Understanding clinical progression of cases is a key public health priority that informs optimal resource allocation during an emergency. Using data from Shenzhen, China, where all cases were monitored in hospital and symptom profiles and clinical and lab results were available starting from early stages of clinical course, we characterized clinical progression of COVID-19 cases and determined important predictors for faster clinical progression to key clinical events and longer use of medical resources. Epidemiological, demographic, laboratory, clinical, and outcome data were extracted from electronic medical records. We found that those who progressed to the severe stage, developed acute respiratory distress syndrome, and were admitted to the intensive care unit (ICU) progressed on average 9.5 days (95%CI 8.7,10.3), 11.0 days (95%CI 9.7,12.3), and 10.5 days (95%CI 8.2,13.3) after symptom onset, respectively. We estimated that patients who were admitted to ICUs remained there for an average of 34.4 days (95%CI 24.1,43.2) and the average time on a ventilator was 28.5 days (95%CI 20.0,39.1) among those requiring mechanical ventilation. The median length of hospital stay was 21.3 days (95%CI, 20.5, 22.2) for the mild or moderate cases who did not progress to the severe stage, but increased to 52.1 days (95%CI, 43.3, 59.5) for those who required ICU admission. Clear characterization of clinical progression informs planning for healthcare resource allocation during COVID-19 outbreaks and provides a basis that helps assess the effectiveness of new treatment and therapeutics.
Main
The epidemic of coronavirus SARS-CoV-2 has led to 1.6 million infections and over 100,000 deaths over 4 months after the first case was detected1, causing severe shortage of essential medical supplies and equipment, medical staff, and hospital beds 2. Complementary to data from COVID-19 epicenters like Wuhan (China) or Lombardy (Italy), data from places where healthcare capacity was not exceeded and patients were treated early and free of charge has the potential to shed light on the near complete clinical trajectory of cases. Clear characterization of COVID-19 clinical trajectory under the current standard of care informs planning for healthcare resource allocation during COVID-19 outbreaks and provides a basis that helps assess the effectiveness of new treatment and therapeutics.
Here, we use rich data on clinical progression of all COVID-19 cases diagnosed and treated in the only designated hospital in Shenzhen, China. Because all clinically confirmed cases, including a sizable portion detected through contact tracing, were required to be hospitalized for isolation purposes regardless of their clinical presentation and symptom profile, this dataset allows us to examine clinical progression of cases without the considerable selection bias typically seen in hospital-based studies. We estimate time from symptom onset to key clinical events, such as first clinical diagnosis, progression to severe clinical stages, development of acute respiratory distress syndrome (ARDS), admission to the critical care unit (ICU), and discharge. We also estimate duration hospitalized, in the ICU, and on ventilators. We determine the key predictors of faster clinical progression to a series of clinical events and longer use of healthcare resources.
Methods
Study design and study population
This single-centre, observational study was conducted at Shenzhen Third People’s Hospital, which is the designated hospital to treat all patients with COVID-19 in Shenzhen. We prospectively collected data of all 420 patients diagnosed and hospitalized with COVID-19 in Shenzhen between January 11th and March 10th 2020, regardless of their clinical severity and symptom profile.
Data collection
Epidemiological, demographic, laboratory, clinical, and outcome data were extracted from electronic medical records using a standardised data collection form. All information was updated as of April 7th 2020. Data were reviewed by multiple reviewers (BS, JC, JZ, PZ), and any disagreement between reviewers was resolved by consultation with an attending physician (JM). Data from patients with severe or critical clinical assessment were extracted by an attending physician (JM) and reviewed with a reviewer (CH) to ensure data quality.
We obtained information on demographic characteristics, mode of detection, and date of onset for each symptom. Date of symptom onset before admission was self-reported and date of symptom onset after admission was recorded by an attending physician. Clinical severity was defined based on guidelines issued by the National Health Commission of the People’s Republic of China, and the severity definition was generally consistent over time (see supplemental table 3 for guidelines in each version) 7. We recorded clinical severity at initial diagnosis and date of severity progression. Clinical severity was assessed daily when mild or moderate and was assessed twice a day when severe or critical.
Dates of ICU admission and discharge were recorded, as well as dates beginning and ending invasive ventilator use. Patients were eligible for discharge from the hospital or transfer to a non-COVID ward for treatment if they met all of the following: 1) no fever for over 3 days, 2) drastic improvement in respiratory symptoms, 3) pulmonary imaging showing significant reduction in inflammation, and 4) two consecutive negative RT-PCR results from respiratory sampling conducted over one day apart 7. We counted time in non-COVID wards for treatment of COVID-19 related complications towards the duration of hospitalization.
We recorded patients’ self-reported medical history (See table 1 for a list of baseline comorbidities). We reviewed laboratory results and created binary variables indicating presence of any abnormalities and the date when such abnormalities were detected (see Table1 for a complete list of lab indicators and abnormality cutoff). The x-ray computed tomography (CT) results were extracted from radiological examinations. We recorded the lowest cycle threshold values from the available RT-PCR testings and the date when the testing was performed. We also recorded complications developed during hospitalization.
The primary endpoints in this study include patients’ time from symptom onset to clinical progression beyond the moderate stage, ICU admission, invasive ventilator use, and discharge. We also examined other endpoints including time to when PaO2/FiO2 dropped under 300mmHg and time to developing ARDS.
Statistical analyses
We estimated cumulative incidence of developing key clinical events in the presence of competing risks (i.e., death and hospital discharge) using the Aalen-Johansen estimator 8,9. Outcomes included progression to severe stage, low PaO2/FiO2 ratio, ARDS, ICU admission, use of invasive ventilator, and hospital discharge.
We estimated patients’ duration of hospitalization by calculating the area above the cumulative incidence of hospital discharge or death, which was estimated using the Aalen-Johansen estimator 8,9. In addition, we estimated the time patients admitted to the ICU ultimately spent in the ICU as the area between the cumulative incidence functions for ICU entry and ICU exit among those who were admitted into ICU. We treated death while in the ICU and ICU discharge as a composite event when estimating cumulative incidence of ICU exit. Similarly, we estimated the duration of invasive ventilator use among patients requiring ventilation, as the area between the cumulative incidence functions for ventilator initiation and ventilator discontinuation. We treated death during ventilator use and withdrawal of ventilator support as a composite event when estimating ventilator discontinuation. We compared times in each state estimated using the nonparametric approach described above with times estimated using a parametric accelerated failure time models to examine improvements in precision seen when invoking a parametric approach (see Text S2 for detailed method). We used bootstrap simulation to construct confidence intervals (2000 bootstrap simulations for time to recovery and time to ARDS, and 200 bootstrap simulations for time to ICU entry/discharge and ventilator use/withdrawal). 95% confidence intervals were the 2.5th and 97.5th percentiles of the distribution of point estimates from the bootstrap samples.
We used competing risk regressions according to the methods of Fine and Gray 10 to estimate subdistribution hazard ratios comparing the rate of clinical progression between subgroups that were defined a priori (See Table 3 for the list of subgroups). We compared the rate of clinical progression to severe stage, ICU admission, ARDS, and hospital discharge between subgroups. Except for the models where time to hospital discharge was the outcome, hospital discharge and death were treated as competing events and end of study as administrative censoring for those still in treatment.
We used a flexible approach to stratify cases into three risk strata for the purpose of visualizing different clinical trajectories. We constructed a random survival forest model (RSF) and divided cases into low, medium, and high risk groups based on tertiles of RSF out-of-bag predictions. The candidate predictors used in the RSF model included 1) demographic information, 2) baseline comorbidities, 3) symptom profile, lab and CT results within 5 days of any symptom onset. Hospital discharge was treated as a competing event. All four deaths occurred after cases progressed beyond the severe stage, thus they were not treated as competing events. We calculated AUC over time since symptom onset (tAUC), providing a measure of model performance across all possible classification thresholds and based on the observed number of cases entering the severe stage by each time point 11. We fit 1000 trees, considered 5 random splits for each candidate splitting variables, and used the log-rank test as the split function.
Results
Characteristics of cases and description of baseline comorbidities
Four-hundred and twenty cases were admitted and hospitalized to Shenzhen Third People’s Hospital between January 11th and March 10th, 2020 (Figure S1). Fifteen percent (63/420) of cases were detected through contact tracing (Table 1). On average, the first clinical diagnosis occurred 1.9 days (95% CI 1.6,2.3) and hospitalization occurred 4.2 days (95%CI 3.8,4.6) after symptom onset.
Of the 420 cases, there were approximately equal numbers of males (47.6%, n=200) and females (52.4%, n=220) (Table 1). A large portion (38.8%, n=163) were under the age of 40 and the majority (83.1%, n=349) were detected through symptom-based surveillance. 21.9% (92/420) of cases had at least one self-reported comorbidity on admission, with hypertension (n=49) and diabetes (n=24) being the most prevalent.
Fever, cough, sputum production were the most common initial symptoms, with 68.6% (288) of patients showing fever, 60.0% (252) with cough, and 31.2% (131) with sputum production within 5 days of initial symptom onset (see supplemental figure for onset time distribution of each symptom). 25.2% (106) of patients never had fever, and 26.9% (113) of patients never had cough.
At the initial clinical assessment, 23 patients (5.5%) were clinically mild, the vast majority (93.8%, n=394) were moderate, and only 3 patients were clinically severe or critical (Figure S2).
Clinical progression of cases
We estimated the proportion of the initially mild or moderate cases in each stage (mild/moderate, severe, ICU, death or discharge) over time following symptom onset, taking into account patients both transitioning into and out of each stage (Figure 1A). The total number of patients in the severe stage reached its peak 12 days after symptom onset. Among the 417 patients who were classified as mild or moderate at the time of initial assessment, 21.6% (90/417) progressed to the severe stage. 9.6% (95%CI, 6.8%, 12.4%) progressed to the severe stage within 7 days after symptom onset, and 20.4% (95%CI, 16.5%, 24.3%) progressed within 14 days (Figure 1 and 2). Those who progressed to the severe stage progressed on average 9.5 days (95%CI 8.7,10.3) after symptom onset. Among the 417 patients who were classified as mild or moderate at the time of initial assessment, 8.6% (36/417) developed ARDS. 2.6% (95%CI 1.1%,4.2%) developed ARDS within 7 days from symptom onset, and 7.7% (95%CI 5.1%, 10.2%) within 14 days. Those who developed ARDS developed ARDS on average 11.0 days (95%CI 9.7,12.3) after symptom onset.
As of April 7th, 19 patients had been admitted to ICU, among which 18 patients required invasive mechanical ventilation support, 4 patients died (1 patient died after initial hospital discharge with viral clearance), and 4 patients remained hospitalized in critical condition. We estimated that among the 417 patients who were classified as mild or moderate at the time of initial assessment, 3.3% (95%CI, 1.6%, 5.1%) of patients required ICU admission within 14 days from symptom onset (same for patients who required mechanical ventilators support). Those who required ICU admission were admitted into ICU on average 10.5 days (95%CI 8.2,13.3) after symptom onset. Using data from the 19 patients who were admitted into ICU, we estimated the average time in ICU was 34.4 days (95%CI 24.1,43.2) (Table 2). Using data from the 18 patients who required mechanical ventilator support, we estimated the average time on a ventilator was 28.5 days (95%CI 20.0,39.1).
The median length of hospital stay was 21.3 days (95%CI, 20.5, 22.2) for the mild or moderate cases who did not progress to the severe stage, and increased to 30.3 days (95%CI, 26.7, 31.4) for cases who reached the severe stage but did not enter ICU and 52.1 days (95%CI, 43.3, 59.5) for the cases who required ICU admission. Of note, patients in Shenzhen were required to be hospitalized for about 2 weeks for isolation; 21% (86/417) of the cases that were initially mild or moderate were discharged within 14 days from hospitalization, 34% (140/417) within 16 days. The duration of hospitalization for the mild or moderate cases was likely inflated as a result. All patients who were clinically mild at the time of initial assessment stayed mild until discharge (Figure S1).
Characteristics associated with difference in clinical progression
We then identified a priori-defined patients’ characteristics that were associated with faster clinical progression. We found that having hypertension and diabetes at baseline was strongly associated with faster clinical progression to various clinical events, including progression to the severe stage (sHR=3.2, 95%CI 2.1,5.1 for hypertension and sHR=3.0, 95%CI 1.6,5.4 for diabetes), to developing ARDS (sHR=3.6, 95%CI 1.8,6.9 for hypertension and sHR=5.5, 95%CI 2.7,11.3 for diabetes), and to ICU admission (sHR=4.6, 95%CI 1.8,11.6 for hypertension and sHR=4.7, 95%CI 1.6,14.1 for diabetes) (Table 3). Having more baseline comorbidities was also associated with a higher rate of clinical progression to these events (Table 3). Although many lab abnormalities measured within 5 days of symptom onset were strongly predictive of faster clinical progression, including low lymphocyte count, low platelet count, high concentration of C-reactive protein, and high concentration of D-dimer, notably, a low PaO2/FiO2 ratio close to symptom onset was very strongly associated with faster clinical progression. We observed a 22.3 times (95%CI 8.4, 58.8) increase in the subdistribution hazard of ICU admission among those with early measures of low PaO2/FiO2 ratio (Table 3).
Older age was one of the most important predictors of faster clinical progression (Table 3). All four patients who died were male over the age of 60. About half (46.3%, 50/108) of cases aged 60 or above progressed to the severe or critical stage (Table 1). Although the vast majority of those under the age of 40 did not progress beyond the moderate stage, 6% (9/163) of cases in this younger age group became clinically severe or critical and none of them had any known underlying comorbidities.
Sex was strongly associated with clinical progression of cases, though the difference by sex was mostly driven by the difference among older patients. We did not observe a significant difference in time to ICU admission between males and females under the age of 60 (sHR = 0.7, 95%CI 0.1,4.3 comparing males with reference to females; Table S1, Figure 2b). However, we observed a noteable difference in progression to require ICU admission between males and females aged 60 or above; males in this older age group had a 10-fold increase in the subdistribution hazard of ICU admission compared to females in the same age group (sHR=10.5, 95%CI 1.0, 108.6), despite the similar sex-specific age distribution in this age group (males: mean 67, IQR 63,69 vs. females: mean 65, IQR 62,66) (Table S1). Differences in baseline comorbidities between older males and females did not explain the disparity; after adjusting for having any underlying comorbidity, the subdistribution hazard ratio remained unchanged (Table S1). Similarly, we found that males aged 60 or above had a lower rate of hospital discharge compared to females in the same age group, and no significant difference by sex in the younger age group (Figure 2c). However, we did not find significant disparities in clinical progression to severe stage or to developing ARDS by sex in any age group (Table S1).
Using linear regression, we found that the minimum RT-PCR cycle threshold values for the severe cases were significantly lower than the mild cases after adjusting for time of sample collection with reference to symptom onset. We observed a general trend of lower minimum cycle threshold values in patients with more severe clinical presentation and in patients in the older age group though the association with age was not statistically significant (Figure S5).
Risk strata for clinical progression
Based on the random survival forest results, the most important predictors of faster progression to the severe stage were low PaO2/FiO2 ratio, low platelet count, and high C-reactive protein concentration (Figure S8a). We observed very different clinical trajectories of patients in each risk group (Figure 1b), highlighting the effectiveness of risk stratification produced by the RSF model. In the low-risk group, no case required ICU admission and only 3% (95%CI, 0.1%,6%) of cases became severe within 14 days from symptom onset. Whereas in the high-risk group, we estimated that 43% (95%CI, 35, 52) cases became severe and 9% (95%CI, 4%,14%) required ICU admission within 14 days from symptom onset. All but one case that required ICU admission were classified into the high-risk group, and we estimated that the duration in ICU for those in the high-risk group was 35.1 days (95%CI, 26.1,45.1) (Table 2, Figure S8). Risk stratification produced by a RSF model that excluded lab and CT results from the candidate predictors also differentiated clinical trajectory of patients well (Figure S7, Figure S8 c-d).
Conclusions
The analysis of clinical data from COVID-19 patients in Shenzhen Third People’s hospital provides insights into clinical progression of cases starting early in the course of infection. We estimate the proportion of cases in each severity stage over 80 days following symptom onset. We present patient characteristics associated with faster clinical progression and longer use of medical resources.
Because treatment of COVID-19 was free of charge and hospitalization for all cases was mandatory in Shenzhen, we expect that our results are not strongly affected by the selection bias that plagues many hospital-based studies. Therefore, our results provide a clear picture of the composition of cases in a city in terms of disease severity and the clinical trajectories of these cases. Although healthcare resources in Shenzhen were rarely overwhelmed by influx of COVID-19 patients, we show that the trajectory of clinical progression of cases in Shenzhen were similar to the trajectory in Wuhan, China, with mean time to ICU admission and mean time to developing acute respiratory distress syndrome to be around 10 days 3,4. Patient characteristics previously reported to be associated with ARDS and death 5 including hypertension, diabetes, and various lab results were also highly predictive of faster clinical progression to various key clinical events including progression to severe stage, ARDS, and ICU admission. Previous studies reported that most cases were males and median age was over 45 3,6 and a higher percentage of men required ICU care 3. We further show differences in clinical progression between males and females that were primarily driven by the stark difference in the older age group independent of differences in baseline comorbidities.
This study has a number of limitations. Dates of symptom onset were extracted from physicians’ notes that were not recorded to explicitly ascertain information on symptom onset. When the date of symptom onset could not be determined, we assumed onset date was the date of initial diagnosis. We performed sensitivity analyses to assess patient characteristics for clinical progression to severe stage and ICU admission and the results remained qualitatively the same (Table S4). We also performed the RSF analyses using symptom profile and lab results within 5 days of initial diagnosis as candidate predictors, and the integrated AUC remained relatively unchanged (Figure S9). Our estimates of duration of ICU stay and ventilator use were somewhat imprecise, likely due to the small sample size for those reaching the critical stage. Even though estimating duration using our non-parametric approach showed improvement in precision compared to times estimated using parametric accelerated failure time models, the estimated time within subgroups needs to be interpreted with caution because of the small sample size (Table S3). Finally, the association presented between clinical progression and patient characteristics should not be interpreted as causal given the variation in treatment and numerous confounders that were not accounted for in this study.
We demonstrate that patient characteristics near symptom onset have tremendous potential to inform COVID-19 triage, grouping patients into risk sets with different outlook of clinical progression. While our RSF model performs well based on out-of-bag predictions (Figure S8; Text S4), we would be highly cautious of triaging patients in other settings using the important variables identified here due to our limited sample size. However, this is an important first step towards an applicable triage risk screening tool once well recorded data on clinical courses for more patients become available. Strategic response and allocation of medical resources for ongoing outbreaks may also benefit from a dynamic risk scoring system that incorporates new patient-level lab and symptom information as it is updated over time.
In conclusion, we provided quantitative characterization of the clinical progression of COVID-19 patients beginning from early clinical stages. Our estimates form the basis for assessing effectiveness of new treatments and inform planning for healthcare resource allocation during COVID-19 outbreaks.
Data Availability
The data that support the findings of the study are not publicly available. Summary data may be available from the authors upon reasonable request with permission from Shenzhen Third People's Hospital.
Author Contributions
QB, Z Wu, JE conducted statistical analyses and drafted the figures
QB, Z Wu, JE, AA, JL, LK, CH, JM, GZ drafted and provided critical feedback on the manuscript
BS, JC, JZ, PZ, CH, JM, Z Wang, CY extracted the data
QB, TM, JL, LL, GZ, TW conceived the study
Competing Interests statement
Authors declare no conflict of interests
Ethics Statement
This work was conducted in support of an ongoing public health response, hence was determined not to be human subjects research after consultation with the Johns Hopkins Bloomberg School of Public Health IRB. The study was approved by the ethics committees of Shenzhen Third People’s Hospital.
Acknowledgements
We thank all patients and their families involved in the study; as well as the front line medical staff and public health workers who collected this critical data. GZ was funded by the National Science and Technology Major Project for Control and Prevention of Major Infectious Diseases of China (No. 2017ZX10103004). CH, PZ, CY, BS, TW, JC, TM were funded by the Emergency Response Program of Harbin Institute of Technology (HITERP010) and Emergency Response Program of Peng Cheng Laboratory (PCLERP001). JL and QB were funded by a grant from the US Centers for Disease Control and Prevention (NU2GGH002000).