ABSTRACT
Background and Purpose Stroke is a leading cause of death and disability worldwide. Predicting which patients are at risk for a prolonged length of stay (LOS) could assist in coordination of care and serve as a rough measure of clinical recovery trajectory. During the acute stroke period, there is a disruption in the fidelity of the blood-brain barrier and cerebral autoregulation, and we hypothesize that trends in physiologic parameters early in a patient’s hospital course may be used to predict which patients are increased risk for a prolonged LOS. In this work we sought to create a model to predict prolonged LOS (defined as ≥ 7 days) from patient data available at admission as well as routinely collected physiologic (pulse, blood pressure, respiratory rate, temperature), and other data from the first 24 hours of admission.
Methods This retrospective cohort study included stroke patients admitted to an urban comprehensive stroke center between 2016-2019. Data included common physiological parameters (pulse, temperature, blood pressure, respirations, and oxygen saturation) as well as demographic and comorbidity data. Raw time series data were transformed into statistical features for modeling. Logistic regression, random forest, and XGBoost models were trained on data collected during the first 24 hours after hospital admission to predict prolonged LOS and evaluated on a held-out test set.
Results A total of 2,025 patients were included. Using an XGBoost classifier we obtained a ROC AUC of 0.85 and Precision-Recall AUC of 0.77, with the optimal operating point achieving an accuracy of 0.80, sensitivity of 0.78, specificity of 0.81.
Conclusions The model suggests that prolonged LOS can be predicted with reasonable accuracy using clinical data obtained within the first 24 hours of hospitalization. This approach could provide the basis for development of a risk score and augment the care coordination process.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
American Heart Association Innovative Project Award.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study has been reviewed and acknowledged by the Johns Hopkins IRB
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
NON-STANDARD ABBREVIATIONS AND ACRONYMS
- AUC
- Area under receiver operating characteristic curve
- EHR
- Electronic health record
- FPR
- False positive rate
- GCS
- Glasgow Coma Scale
- GLM
- Generalized linear model
- LOS
- Length of stay
- LR
- Logistic regression
- PR
- Precision-recall
- RF
- Random forest
- ROC
- Receiver operating characteristi
- TPR
- True positive rate
- XGB
- XGBoost