Abstract
Background This is the first study on prognostication in an entire cohort of laboratory-confirmed COVID-19 patients in the city of Hong Kong. Prognostic tool is essential in the contingency response for the next wave of outbreak. This study aims to develop prognostic models to predict COVID-19 patients’ clinical outcome on day 1 and day 5 of hospital admission.
Methods We did a retrospective analysis of a complete cohort of 1,037 COVID-19 laboratory-confirmed patients in Hong Kong as of 30 April 2020, who were admitted to 16 public hospitals with their data sourced from an integrated electronic health records system. It covered demographic information, chronic disease(s) history, presenting symptoms as well as the worst clinical condition status, biomarkers’ readings and Ct value of PCR tests on Day-1 and Day-5 of admission. The study subjects were randomly split into training and testing datasets in a 8:2 ratio. Extreme Gradient Boosting (XGBoost) model was used to classify the training data into three disease severity groups on Day-1 and Day-5.
Results The 1,037 patients had a mean age of 37.8 (SD±17.8), 53.8% of them were male. They were grouped under three disease outcome: 4.8% critical/serious, 46.8% stable and 48.4% satisfactory. Under the full models, 30 indicators on Day-1 and Day-5 were used to predict the patients’ disease outcome and achieved an accuracy rate of 92.3% and 99.5%. With a trade-off between practical application and predictive accuracy, the full models were reduced into simpler models with seven common specific predictors, including the worst clinical condition status (4-level), age group, and five biomarkers, namely, CRP, LDH, platelet, neutrophil/lymphocyte ratio and albumin/globulin ratio. Day-1 model’s accuracy rate, macro- and micro-averaged sensitivity and specificity were 91.3%, 84.9%-91.3% and 96.0%-95.7% respectively, as compared to 94.2%, 95.9%-94.2% and 97.8%-97.1% under Day-5 model.
Conclusions Both Day-1 and Day-5 models can accurately predict the disease severity. Relevant clinical management could be planned according to the predicted patients’ outcome. The model is transformed into a simple online calculator to provide convenient clinical reference tools at the point of care, with an aim to inform clinical decision on triage and step-down care.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study received no external funding.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Research Ethics Committee(Kowloon Central / Kowloon East)
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The de-identified datasets generated and analysed during the current study are not publicly available for patient privacy protection as their disclosure at granular level may entail the risk of subject re-identification. Aggregate data are available from the corresponding author on reasonable request. The calculator tool which is developed based on the prognostic models' results of this study will be available for online public access after the study being published.
List of Abbreviations
- A
- albumin
- A/G
- ratio albumin-globulin ratio
- AI
- Artificial Intelligence
- AII
- airborne infection isolation
- ALP
- alkaline phosphatase
- ALT
- alanine aminotransferase
- AST
- aspartate aminotransferase
- CMS
- Clinical Management System
- COVID-19
- Coronavirus Disease 2019
- CRP
- C-reactive protein
- ECMO
- extracorporeal membrane oxygenation
- eNID
- Electronic Notification of Infectious Disease
- G
- globulin
- HA
- Hospital Authority
- HK
- Hong Kong
- ICU
- intensive care unit
- ILI
- influenza-like illness
- LDH
- lactate dehydrogenase
- MPV
- mean platelet volume
- N/L
- ratio neutrophil-lymphocyte ratio
- NDORS
- Notifiable Diseases and Outbreak Reporting System
- PCT
- procalicitonin
- RT-PCR
- reverse-transcription polymerase chain reaction
- SARS
- Severe Acute Respiratory Syndrome
- SD
- standard deviation
- WBC
- white blood cell count
- WHO
- World Health Organization
- XGBoost
- Extreme Gradient Boosting