Abstract
Objectives To develop a prognostic model to identify and quantify risk factors for mortality among patients admitted to the hospital with COVID-19.
Design Retrospective cohort study. Patients were randomly assigned to either training (80%) or test (20%) sets. The training set was used to fit a multivariable logistic regression. Predictors were ranked using variable importance metrics. Models were assessed by C-indices, Brier scores, and calibration plots in the test set.
Setting Optum® de-identified COVID-19 Electronic Health Record dataset.
Participants 17,086 patients hospitalized with COVID-19 between February 20, 2020 and June 5, 2020.
Main outcome measure All-cause mortality during hospital stay.
Results The full model that included information on demographics, comorbidities, laboratory results and vital signs had good discrimination (C-index = 0.87) and was well calibrated, with some overpredictions for the most at-risk patients. Results were generally similar on the training and test sets, suggesting that there was little overfitting.
Age was the most important risk factor. The performance of models that included all demographics and comorbidities (C-index = 0.79) was only slightly better than a model that only included age (C-index = 0.76). Across the study period, predicted mortality was 1.2% for 18-year olds, 8.4% for 55-year olds, and 28.6% for 85-year olds. Predicted mortality across all ages declined over the study period from 21.7% by March to 13.3% by May.
Conclusion Age was the most important predictor of all-cause mortality although vital signs and laboratory results added considerable prognostic information with oxygen saturation, temperature, respiratory rate, lactate dehydrogenase, and white blood cell count being among the most important predictors. Demographic and comorbidity factors did not improve model performance appreciably. The model had good discrimination and was reasonably well calibrated, suggesting that it may be useful for assessment of prognosis.
Competing Interest Statement
All authors are employees of Genentech, a member of the Roche Group.
Funding Statement
This study is funded by Genentech, a member of the Roche group.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The use of Optum de-identified COVID-19 database was reviewed by the New England Institutional Review Board (IRB) and was determined to be exempt from broad IRB approval, as this study did not involve human subject research.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Data for this study was licensed from Optum. While we are unfortunately not able to publicly share the data, the Supplement (with all code and output for the manuscript as well as supplementary analyses) is available at https://github.com/phcanalytics/covid19-prognostic-model.