Predicting Opportunities for Improvement in Trauma Care: A Registry-Based Cohort Study

Jonatan Attergrim; Kelvin Szolnoky; Lovisa Strömmer; Olof Brattström; Gunilla Whilke; Martin Jacobsson; Martin Gerdin Wärnberg

doi:10.1101/2023.01.19.23284654

Abstract

Importance Trauma quality improvement programs relies on peer review of patient cases to identify opportunities for improvement. Current state-of-the-art systems for selecting patient cases for peer review use audit filters that struggle with poor performance.

Objective To develop models predicting opportunities for improvement in trauma care and compare their performance to currently used audit filters.

Design, Setting and Participants This single-center registry-based cohort study used data from the trauma centre at Karolinska University Hospital in Stockholm, Sweden, between 2013 and 2023. Participants were adult trauma patients included in the local trauma registry. The models predicting opportunities for improvement in trauma care were developed using logistic regression and the eXtreme Gradient Boosting learner (XGBoost) with an add-one-year-in expanding window approach. Performance was measured using the integrated calibration index (ICI), area under the receiver operating curve (AUC), true positive rates (TPR) and false positive rates (FPR). We compared the performance of the models to locally used audit filters.

Main outcome measure Opportunities for improvement, defined as preventable events in patient care with adverse outcomes. These opportunities for improvement were identified by the local peer review processes.

Results A total of 8,220 patients were included. The mean (SD) age was 45 (21), 5696 patients (69%) were male, and the mean (SD) injury severity score was 12 (13). Opportunities for improvement were identified in 496 (6%) patients. The logistic regression and XGBoost models were well calibrated with ICIs (95% CI) of 0.032 (0.032-0.032) and 0.033 (0.032-0.033). Compared to the audit filters, both the logistic regression and XGBoost models had higher AUCs (95% CI) of 0.72 (0.717-0.723) and 0.75 (0.747-0.753), TPR (95% CI) of 0.885 (0.881-0.888) and 0.904 (0.901-0.907), and lower FPR (95% CI) of 0.636 (0.635-0.638) and 0.599 (0.598-0.6). The audit filters had an AUC (95% CI) of 0.616 (0.614-0.618), a TPR (95% CI) of 0.903 (0.9-0.906), and a FPR (95% CI) of 0.671 (0.67-0.672).

Conclusion and Relevance Both the logistic regression and XGBoost models outperformed audit filters in predicting opportunities for improvement among adult trauma patients and can potentially be used to improve systems for selecting patient cases for trauma peer review.

Key point Question: How does the performance of machine learning models compare to audit filters when screening for opportunities for improvement, preventable events in care with adverse outcomes, among adult trauma patients?

Findings: Our registry-based cohort study including 8,220 patients showed that machine learning models outperform audit filters, with improved discrimination and false-positive rates. Compared to audit filters, these models can be configurated to balance sensitivity against overall screening burden.

Meaning: Machine learning models have the potential to reduce false positives when screening for opportunities for improvement in the care of adult trauma patients and thereby enhancing trauma quality improvement programs.

Introduction

Trauma is a leading cause of death and disability worldwide (1,2). Peer review of patient cases, sometimes referred to as performance improvement, is a critical component of trauma quality improvement programmes (3–5). This review ideally involves representatives from all disciplines and professions involved in trauma care to identify opportunities for improvement, which are preventable events in patient care with adverse outcomes (6).

The current state-of-the-art systems for selecting patient cases for peer review uses audit filters, sometimes in combination with individual human screening (7). Audit filters are sentinel events in patient care that are associated with suboptimal care and potentially poor patient outcomes, such as delays in key interventions or unexpected deaths (3,8). When such an event occurs, it should trigger the peer review process. This process is then followed by the implementation of corrective actions (8).

It has long been known that audit filters perform poorly in this context (9). Replacing filters with trauma mortality prediction models has failed (10–12), likely because they were not developed to predict opportunities for improvement. No published research has evaluated prediction models for opportunities for improvement. We therefore aimed to develop models predicting opportunities for improvement in trauma care and compare their performance to currently used audit filters.

Methods

Design

We conducted a registry-based cohort study using all trauma patients included in both the Karolinska University Hospital trauma registry and the trauma care quality database between 2013 and 2022. The study was approved by the Swedish Ethical Review Authority (approval numbers 2021-02541 and 2021-03531).

Study Setting and Population

The trauma center at the Karolinska University Hospital in Solna, Sweden, manages approximately 1500 acute trauma patients each year (13).

The Karolinska University Hospital trauma registry, part of the Swedish Trauma Registry (13), includes all patients admitted to the Karolinska University Hospital with trauma team activation, regardless of injury severity score (ISS), as well as patients admitted without trauma team activation but found to have ISS of more than 9. The registry includes data on vital signs, times, injuries and interventions and demographics according to the European consensus statement, the Utstein template (14). The care quality database includes data relevant to the peer review process, including audit filters, identified opportunities for improvement, and proposed corrective actions.

The peer review process has evolved over time, but since 2017, a specialized nurse reviews the medical records of all trauma patients and flags patients with potential opportunities for improvement using a set of audit filters (Supplement E2 eTable 1) and clinical experience. A second nurse performs a more in-depth review of all flagged patients. Patients with suspected opportunities for improvement are then reviewed at a multidisciplinary conference, where the final decision on the presence of opportunities for improvement is made. All patients who die are reviewed in a separate conference that evaluates the preventability of the death and determines the presence of any opportunities for improvement. Before 2017, the process was less formalized, and a small group of clinicians involved in trauma care identified opportunities for improvement.

Eligibility Criteria

We included all patients screened for opportunities for improvement from the trauma registry and trauma care quality database between 1 January 2013 and 31 December 2022. Patients younger than 15 years were excluded because their clinical and review pathways differ from those of adults.

Outcome

The models’ outcome is the presence of any opportunities for improvement, as determined by the peer review process. The identified opportunities for improvement are further grouped into clinical judgment errors, delays in treatment or diagnosis, missed diagnoses, technical errors, preventable deaths or other errors.

Sample Size Considerations

The relationship between the number of predictors and required sample size for different learners has not been well researched expect for logistic regression (15,16). We used these guidelines to inform the number of predictors that we could include in our models, and we estimated that with a sample size of 3452, which is equivalent to 80% of the available data from 2017–2020, would support 45 parameters, assuming a 6% event rate, a r2 of 0.11 and a target shrinkage of 0.9.

Predictors

We selected predictors based on current audit filters, standard demographics, previous research and expert opinion (17). The categorical predictors were gender, type of emergency procedure, highest level of care, reprioritization, type of trauma alarm, discharge destination and death within 30 days. The continuous predictors included age, vital signs on arrival, time to CT and intervention, ISS and length of stay. These final set of predictors comprised 17 variables with 45 corresponding parameters. eTable 2 (Supplement E2) shows all 17 predictors.

Statistical Analysis Methods

The statistical analyses were conducted using R (18). We developed several prediction models around different learners, available in the supplementary material (Supplement E1 and E2), where we include the best-performing model: eXtreme Gradient Boosting (XGBoost), and the more interpretable model: Logistic Regression. XGBoost builds on the principles of gradient boosting, incorporating various algorithmic optimizations, including parallel tree boosting, to efficiently solve a range of machine learning problems such as classification and regression (19).

To evaluate the models, we used an add-one-year-in expanding window approach to best represents how the models would have performed if implemented prospectively. The years 2017-2022 were all used as separate validation hold-out sets in an iterative fashion. In each iteration, all years prior to the current validation sample were used as training data. The training data were then split, and 80% of the data were used for training and 20% for calibration. We estimated 95% confidence intervals (CIs) for all performance metrics through a bootstrap of 1000 resamples for each validation sample.

Data preprocessing and imputation

We developed a preprocessor that rescaled continuous predictors using Yeo-Johnson’s power transformation (20) and recorded categorical predictors into dummy variables via one-hot encoding. Predictors with near-zero variances were excluded. Missing continuous predictors were imputed using the mean of the predictor, and a missing indicator feature was created for each. Categorical predictors were imputed by introducing an ‘unknown’ category. If blood pressure or respiratory rate data were missing but corresponding revised trauma score categorical values were available, we imputed the missing data using the mean of all patients in that category. The preprocessor was initially run on the training sample for each split to learn metrics and prevent data leakage. The trained preprocessor model was then applied independently to both the training and validation samples. To balance the training samples, we used the adaptive synthetic algorithm (21), which generates synthetic data, enabling us to upsample the opportunities for improvement outcomes at a balanced 1:1 ratio between outcome classes.

Model development

We developed the logistic regression and XGBoost (19) models using the learners as implemented in the Tidymodels framework (22). All model hyperparameters were optimized on the training sample of each split using five-fold cross-validation through iterative Bayesian optimization, encompassing all the parameters provided by the tidymodels framework.

Performance measurements

The prediction models and audit filters performance were assessed and compared in terms of calibration, discrimination, as well as true and false positive rates in each validation sample. Calibration was measured using the integrated calibration index (ICI) (23) and discrimination was measured using the area under the receiver operating characteristic curve (AUC). The ICI was not calculated for the audit filters because they cannot estimate a probability of opportunities for improvement.

To determine the class probability cutoff for the two prediction models model, we first configured them using Platt scaling on a 20% holdout sample from the training samples. We then determined the cutoff that produced a 95% true positive rate on this configuration sample and applied it to the holdout validation sample, called “TPR_95%”. Additionally, we conducted an analysis to establish an “optimal” cutoff threshold by identifying the point on the ROC curve that maximizes the trade-off between sensitivity and specificity, called “balanced configuration”.

Predictor importance

We calculated the predictor importance for the prediction models using permutation feature importance (24) on the nonresampled validation samples. The importance of a feature was thus calculated by taking the average AUC performance when shuffling a feature’s data five times and comparing it to the model’s performance on nonshuffled data.

Code availability

The code used in this study is publicly available online: https://github.com/noacs-io/predicting-ofi-in-trauma under the MIT License.

Results

Participants

Out of the 13879 patients in the registry included between January 2013 and December 2022, 8220 (59.87%) patients had been reviewed regarding the presence of opportunities for improvement, which were identified in 496 (6%) patients. The most common categories of opportunities for improvement where clinical judgment errors (n=176, 35%) followed by inadequate resources (n=111, 22%). Out of the 718 deaths, 42 (6%) where considered possible preventable making up 9% of all opportunities for improvement. Figure 1 details inclusions and exclusions as well as the frequency of each category of opportunities for improvement. eTable 3 (Supplement E2) shows the specific opportunities for improvement. Patients with opportunities for improvement (mean = 49 years, SD = 21) were slightly older than patients without opportunities for improvement (mean = 45 years, SD = 21). The ISS was greater in patients with opportunities for improvement (mean = 19, SD = 11) than in patients without opportunities for improvement (mean = 12, SD = 13), and patients with opportunities for improvement had longer times (mean = 271 minutes, SD = 323) from hospital arrival to the first major intervention than patients without opportunities for improvement (mean = 251 minutes, SD = 353). Treatment frequencies also differed, with the biggest difference being radiological interventions, where patients with opportunities for improvement (n=32, 6%) had more interventions than those without opportunities for improvement (n=69, 1%). Table 1 shows the characteristics of all included patients.

Figure 1.

Flowchart describing the exclusions made and the process of accessing trauma patients from arrival until the decision for OFI.

View this table:

Table 1. Demographic and clinical characteristics of patients screened for OFI

View this table:

Table 2. Performance Metrics

Model Development, Specification and Performance

The frequency of opportunities for improvement varied between 2017 and 2022, with the highest occurring in 2017 (n=112, 9%) and the lowest in 2018 (n=36, 3%). Annual characteristics are provided in eTable 4 (Supplement E2). Figure 2 provides the number of patients and opportunities for improvement for each year and corresponding training datasets.

Figure 2. Annual AUC values for each model. Sample sizes and OFI number for each training and test set.

Year wise model performance and sample sizes for the expanding window add one year in analysis. A) Mean area under the curve (AUC) per model and year. Lines represent 95% confidence intervals. For any given year, the AUCs were calculated with that year as the test set and all preceding years, starting with 2013, as the training set. For example, for 2019 the AUCs were calculated using 2019 as the test set and 2013-2018 as the training set. B) Opportunities for improvement (OFI) and sample sizes per year. The training sample and the test sample sizes includes the OFI in each sample respectively. For any given year, the training OFI and training sample size rows are the total number of patients with OFI and total number of patients in all preceding years respectively. The test OFI test sample size rows are the number of patients with OFI and the total number of patients in a specific year. For example, for 2019 the training OFI is the total number of patients with OFI 2013-2018, the training sample size is the total number of patients 2013-2018, the test OFI is the number of patients with OFI in 2019 and the test sample size the number of patients in 2019.

ISS was the most important predictor, followed by highest level of care. Figure 3 shows the average predictor importance for all years between 2017 and 2022 for all predictors.

Figure 3. Average permuted variable importance for each model

The calculated, model-agnostic, permuted feature importance calculated using the AUC as the scoring metric. Variable importance is measured as AUC change when a variable is permuted. The model values are the average within that model between the years 2017 and 2022. The “Mean” value is the mean of all models. Definition of abbreviations: PH = pre-hospital; ED = emergency department, GCS = Glasgow Coma Scale, GOS = Glasgow Outcome Scale.

The logistic regression and XGBoost models were well calibrated with ICIs (95% CI) of 0.032 (0.032-0.032) and 0.033 (0.032-0.033). When averaging the results from each year, the audit filters had an AUC (95% CI) of 0.616 (0.614-0.618), a TPR (95% CI) of 0.903 (0.9-0.906), and a FPR (95% CI) of 0.671 (0.67-0.672). Compared to the audit filters, both the logistic regression and XGBoost models had higher AUCs (95% CI) of 0.72 (0.717-0.723) and 0.75 (0.747-0.753).

The XGBoost based model had a significantly lower FPR (95% CI) of 0.599 (0.598-0.6) while still retaining a superior TPR (95% CI) of 0.904 (0.901-0.907). The logistic regression model similarily displayed a superior FPR (95% CI) of 0.636 (0.635-0.638), however to the cost an inferior TPR (95% CI) of 0.885 (0.881-0.888).

In the TPR_95% configuration, the XGBoost model achieved a TPR (95% CI) of 0.904 (0.901-0.907) with a significantly lower FPR (95% CI) of 0.599 (0.598-0.6) compared to audit filters. The logistic regression model achieved a TPR (95% CI) of 0.885 (0.881-0.888) and a FPR (95% CI) of 0.636 (0.635-0.638). Both models demonstrated good calibration, with ICIs (95% CI) of 0.033 (0.032-0.033) for XGBoost and 0.032 (0.032-0.032) for logistic regression. In the balanced configuration, the XGBoost model had a TPR (95% CI) of 0.502 (0.496-0.507) and a FPR (95% CI) of 0.186 (0.185-0.187). The logistic regression model showed a TPR (95% CI) of 0.501 (0.496-0.507) and a FPR (95% CI) of 0.218 (0.217-0.219).

Figure 3 shows annual AUC values between 2017 and 2022. eFigures 1, eFigure 2 and eFigure 3 in the supplemental shows the annual TPR, FPR values and receiver operating characteristic curves between 2017 and 2022.

Discussion

Both the XGBoost and logistic regression prediction models outperformed audit filters in predicting opportunities for improvement among adult trauma patients, with XGBoost showing the best overall performance. The performances of both models were modest, and the audit filters exhibited poor performance. Unlike audit filters, these models can be configured towards specific goals where we tested two configuration strategies: one prioritizing a higher TPR with a moderate reduction in FPR, and another accepting a moderate loss in TPR for a substantial reduction FPR. This adaptability allows these models to better balance identifying opportunities for improvement and managing the screening burden, which in combination with a superior overall performance offers potential advantages over traditional audit filters.

Limitations

Importantly, our models’ results are most likely falsely low due to two limitations. First, the add on year in approach to simulate prospective implementation resulted in small sample sizes between 2017 and 2020, leading to poorer configuration. Second, these models are only evaluated against opportunities for improvement identified within the current peer review system. The low frequency of opportunities for improvement in this study, compared to previous studies, suggests potential false negatives (10,25,26). These false negatives would favor the models and increase their performance, as they were missed by the current audit filter and peer review system.

While defined as a binary variable, opportunities for improvement includes a diverse set of outcomes ranging from preventable deaths to lacking communication. The heterogeneity of these outcomes represents a range of clinical events, each likely correlating to different predictors. In addition, machine learning models struggle to handle rare events, and despite being an aggregate of all previously identified errors, the opportunities for improvement frequency is only 6%; as a result, opportunities for improvement is a considerable predictive challenge.

Another potential risk is a “data shift”. Due to feasibility, mortality and morbidity reviews and corresponding corrective actions can focus only on a subset of opportunities for improvement at any given time. Hence, a correctly flagged opportunities for improvement might not be registered since the system must prioritize other areas in need of correction. If human resources could be removed from basic screening tasks by reducing false positives, they could possibly be allocated toward more in-depth reviews, reducing the need to prioritize opportunities for improvement subgroups.

Interpretation and generalisability

While the use of audit filters when screening for opportunities for improvement remain the current state-of-the-art technique for trauma quality improvement programs their effectiveness, especially in in the mature trauma system, have long been questioned (8,9). The static nature of audit filters makes them less effective as the trauma system adapts, potentially requiring frequent changes over time. The adaptive nature of machine learning models offers a promising solution, allowing the models and subsequently selected patient cases to change as the models develop over time. While our study suggests this as a possibility, prospective implementation is needed for true evaluation.

The XGBoost model in the TPR_95% configuration had a similar TPR to that of the audit filters but achieved a 11% (n=90) reduction in the annual screening burden. This reduction in false positive are further highlighted when configuring toward balanced performance, reducing the screening burden by 72% (n=572) while identifying 46% (n=28) fewer opportunities for improvement annually. Although the reduction in TPR is not ideal, this trade-off should be considered given that trauma systems may forgo peer review altogether due to the high FPR of audit filters. Thus, the significant reduction in FPR could offer benefits for settings with limited resources.

Additionally, the need for extra human review as a consequence of the high FPR before the mortality and morbidity review risks introducing bias, reducing the intended multidisciplinary approach. Mortality prediction models have been suggested as a solution; however they perform poorly and are only applicable in mortality-related cases (10). The balanced models offer a potential solution where over 80% of the flagged cases contain opportunities for improvement. Cases can therefore be brought directly from standard trauma registries to the mortality and morbidity review without additional human pre-screening. The possibility to configure these models therefore represent a tool for high-yield selection in context that want to include morbidity cases while protecting the intended multidisciplinary approach of the final review.

A systematic review and meta-analysis by Zhang et al. investigated the performance of different machine learning applications and learners in the trauma setting found that similar performance where often found using Logistic regression compared to more complex machine learning models (28). Our study showed that XGBoost had a small, but significant, performance advantage compared to logistic regression however both performed modestly. Instead, a substantial performance increase would probably require both higher quantity and quality of data, e.g., higher-resolution data such as vital sign series and defined, complete and consistent opportunities for improvement classifications. However, in doing so one sacrifices external validity and general feasibility compared to the models in our study, which are easily applicable in settings with standard registries following the Utstein template (14).

Conclusion

It is important to note that perfect performance is far from expected. Comparing these models to entire systems using a combination of quantitative screening and several human reviews, including a multidisciplinary review, is unfair and not the goal of this paper. Instead, we strive to facilitate quality improvement efforts through a combination of human and artificial intelligence. Compared to audit filters, these models offer increased overall performance and the option to balance and optimize the tradeoff between screening burden and sensitivity goals, giving each trauma quality improvement program the potential to standardize and automate part of the review system in a way that complements human efforts.

Data Availability

The data that support the findings of this study are available following the approval of a project suggesting to use the data by the Swedish Ethical Review Authority and the appropriate bodies at the Karolinska University Hospital. More information is available on request from the corresponding author, J. Attergrim.

Contributors

M.G.W. and J.A. obtained funding and conceptualized the study. M.G.W., J.A. and K.S. drafted the study protocol. M.G.W., J.A., K.S. and M.J. wrote the statistical analysis plan. J.A. and K.S. performed the statistical analysis and model development. J.A. and K.S. drafted the manuscript, which was critically revised by all the authors. All the authors read and approved the final manuscript. J.A., K.S. and M.G.W. are guarantors. J.A. and K.S. contributed equally to this work. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

Funding

This work was supported by the Swedish Society of Medicine, grant number SLS-973387, and by “The Swedish Carnegie Hero Fund”. Parts of the results were presented orally and as an abstract at the London Trauma Conference.

Competing interests

All the authors have completed the ICMJE uniform disclosure form at http://www.icmje.org/disclosure-of-interest/ and declare the following: M.G.W. and J.A. received grants related to this study from the Swedish Society of Medicine and from “The Swedish Carnegie Hero Fund”. The authors have no financial relationships with any organizations that might have an interest in the submitted work in the previous three years. The authors have o other relationships or activities could appear to have influenced the submitted work.

The lead author (the manuscript’s guarantors) affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.

Dissemination to participants and related patient and public communities:

The results will be disseminated through local and international conferences. To date, results have presented at the London Trauma Conference (December 2022). Additionally, code for replicating the results and models are publicly available: https://github.com/noacs-io/predicting-ofi-in-trauma

Acknowledgments

The authors thank Liselott Västerbo for her participation in collecting and recording the data and screening for opportunities for improvement. The authors also thank all professionals who participated in the monthly mortality and morbidity conferences.

Footnotes

Orientation to clinical applications rather than specific machine learning modalities. This includes a rewritten manuscript including new figures and tables.

References

1.↵
Roth GA, Abate D, Abate KH, Abay SM, Abbafati C, Abbasi N, et al. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: A systematic analysis for the global burden of disease study 2017. The Lancet [Internet]. 2018 Nov [cited 2022 Dec 17];392(10159):1736–88. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0140673618322037
OpenUrl
2.↵
Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the global burden of disease study 2019. The Lancet [Internet]. 2020 Oct [cited 2022 Dec 17];396(10258):1204–22. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0140673620309259
OpenUrl
3.↵
World Health Organization. Guidelines for trauma quality improvement programmes [Internet]. 2009 [cited 2022 Aug 24] p. 104. Available from: https://www.who.int/publications/i/item/guidelines-for-trauma-quality-improvement-programmes
4.
Santana MJ, Stelfox HT. Development and evaluation of evidence-informed quality indicators for adult injury care. Annals of Surgery [Internet]. 2014 Jan;259(1):186–92. Available from: doi:10.1097/sla.0b013e31828df98e
OpenUrl CrossRef
5.↵
American College of Surgeons. Resources for Optimal Care of the Injured Patient. Chicago, IL 60611-3295: American College of Surgeons; 2022.
6.↵
Vioque SM, Kim PK, McMaster J, Gallagher J, Allen SR, Holena DN, et al. Classifying errors in preventable and potentially preventable trauma deaths: A 9-year review using the joint commission’s standardized methodology. The American Journal of Surgery [Internet]. 2014 Aug [cited 2022 Dec 17];208(2):187–94. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0002961014001688
OpenUrl
7.↵
Hornor MA, Hoeft C, Nathens AB. Quality benchmarking in trauma: From the NTDB to TQIP. Curr Trauma Rep [Internet]. 2018 Jun [cited 2022 Dec 17];4(2):160–9. Available from: http://link.springer.com/10.1007/s40719-018-0127-1
OpenUrl
8.↵
Evans C, Howes D, Pickett W, Dagnone L. Audit filters for improving processes of care and clinical outcomes in trauma systems. Cochrane Injuries Group, editor. Cochrane Database of Systematic Reviews [Internet]. 2009 Oct 7 [cited 2022 Dec 17]; Available from: https://doi.wiley.com/10.1002/14651858.CD007590.pub2
9.↵
Cryer HG, Hiatt JR, Fleming AW, Gruen JP, Sterling J. Continuous Use of Standard Process Audit Filters Has Limited Value in an Established Trauma System. Journal of Trauma and Acute Care Surgery [Internet]. 1996 Sep [cited 2024 Apr 30];41(3):389. Available from: https://journals.lww.com/jtrauma/abstract/1996/09000/continuous_use_of_standard_process_audit_filters.3.aspx
OpenUrl
10.↵
Ghorbani P, Strömmer L. Analysis of preventable deaths and errors in trauma care in a scandinavian trauma level-i centre. Acta Anaesthesiol Scand [Internet]. 2018 Sep [cited 2022 Dec 17];62(8):1146–53. Available from: https://onlinelibrary.wiley.com/doi/10.1111/aas.13151
OpenUrl
11.
Radke OC, Heim C. Recognizing preventable death. Anesthesiology Clinics [Internet]. 2019 Mar [cited 2022 Dec 17];37(1):1–11. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1932227518300880
OpenUrl
12.↵
Heim C, Cole E, West A, Tai N, Brohi K. Survival prediction algorithms miss significant opportunities for improvement if used for case selection in trauma quality improvement programs. Injury [Internet]. 2016 Sep [cited 2022 Dec 17];47(9):1960–5. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0020138316302145
OpenUrl
13.↵
Årsrapporter | SweTrau [Internet]. [cited 2021 Feb 10]. Available from: http://rcsyd.se/swetrau/om-swetrau/arsrapporter
14.↵
Ringdal KG, Coats TJ, Lefering R, Di Bartolomeo S, Steen PA, Roise O, et al. The utstein template for uniform reporting of data following major trauma: A joint revision by SCANTEM, TARN, DGU-TR and RITG. Scand J Trauma Resusc Emerg Med [Internet]. 2008 [cited 2022 Dec 10];16(1):7. Available from: http://sjtrem.biomedcentral.com/articles/10.1186/1757-7241-16-7
OpenUrl
15.↵
Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. Bmj [Internet]. 2020 Mar 18 [cited 2022 Dec 17];m441. Available from: https://www.bmj.com/lookup/doi/10.1136/bmj.m441
16.↵
Smeden M van, Moons KG, Groot JA de, Collins GS, Altman DG, Eijkemans MJ, et al. Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res [Internet]. 2019 Aug [cited 2022 Dec 17];28(8):2455–74. Available from: http://journals.sagepub.com/doi/10.1177/0962280218784726
OpenUrl
17.↵
Albaaj H, Attergrim J, Strömmer L, Brattström O, Jacobsson M, Wihlke G, et al. Patient and process factors associated with opportunities for improvement in trauma care: A registry-based study. Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine [Internet]. 2023 Nov;31(1). Available from: doi:10.1186/s13049-023-01157-y
OpenUrl CrossRef
18.↵
R Core Team. R: A language and environment for statistical computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2020. Available from: https://www.R-project.org/
19.↵
Chen T, Guestrin C. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining [Internet]. Acm; 2016 [cited 2022 Dec 17]. p. 785–94. Available from: https://dl.acm.org/doi/10.1145/2939672.2939785
20.↵
Yeo I-K. A new family of power transformations to improve normality or symmetry. Biometrika [Internet]. 2000 Dec 1 [cited 2022 Dec 17];87(4):954–9. Available from: https://academic.oup.com/biomet/article-lookup/doi/10.1093/biomet/87.4.954
OpenUrl
21.↵
Haibo He, Yang Bai, Garcia EA, Shutao Li. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence) [Internet]. Hong Kong, China: Ieee; 2008 [cited 2022 Dec 17]. p. 1322–8. Available from: http://ieeexplore.ieee.org/document/4633969/
22.↵
Kuhn M, Wickham H. Tidymodels: A collection of packages for modeling and machine learning using tidyverse principles. [Internet]. 2020. Available from: https://www.tidymodels.org
23.↵
Austin PC, Steyerberg EW. The integrated calibration index (ICI) and related metrics for quantifying the calibration of logistic regression models. Statistics in Medicine [Internet]. 2019 Sep 20 [cited 2023 Jan 4];38(21):4051–65. Available from: https://onlinelibrary.wiley.com/doi/10.1002/sim.8281
OpenUrl
24.↵
Fisher A, Rudin C, Dominici F. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. J Mach Learn Res. 2019;20:177.
OpenUrl
25.↵
Roy N, Kizhakke Veetil D, Khajanchi MU, Kumar V, Solomon H, Kamble J, et al. Learning from 2523 trauma deaths in india-opportunities to prevent in-hospital deaths. BMC Health Serv Res [Internet]. 2017 Dec [cited 2022 Dec 17];17(1):142. Available from: http://bmchealthservres.biomedcentral.com/articles/10.1186/s12913-017-2085-7
OpenUrl
26.↵
Sanddal TL, Esposito TJ, Whitney JR, Hartford D, Taillac PP, Mann NC, et al. Analysis of preventable trauma deaths and opportunities for trauma care improvement in utah. Journal of Trauma: Injury, Infection & Critical Care [Internet]. 2011 Apr [cited 2022 Dec 17];70(4):970–7. Available from: https://journals.lww.com/00005373-201104000-00032
OpenUrl
27.
Cardosi JD, Shen H, Groner JI, Armstrong M, Xiang H. Machine learning for outcome predictions of patients with trauma during emergency department care. BMJ Health Care Inform [Internet]. 2021 Oct [cited 2022 Dec 17];28(1):e100407. Available from: https://informatics.bmj.com/lookup/doi/10.1136/bmjhci-2021-100407
OpenUrl
28.↵
Zhang T, Nikouline A, Lightfoot D, Nolan B. Machine learning in the prediction of trauma outcomes: A systematic review. Annals of Emergency Medicine [Internet]. 2022 Nov [cited 2022 Dec 17];80(5):440–55. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0196064422003353
OpenUrl

View the discussion thread.

Posted August 20, 2024.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Health Systems and Quality Improvement

Subject Areas

All Articles

Addiction Medicine (382)
Allergy and Immunology (699)
Anesthesia (189)
Cardiovascular Medicine (2833)
Dentistry and Oral Medicine (325)
Dermatology (242)
Emergency Medicine (427)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1008)
Epidemiology (12534)
Forensic Medicine (10)
Gastroenterology (800)
Genetic and Genomic Medicine (4413)
Geriatric Medicine (400)
Health Economics (712)
Health Informatics (2840)
Health Policy (1046)
Health Systems and Quality Improvement (1045)
Hematology (373)
HIV/AIDS (893)
Infectious Diseases (except HIV/AIDS) (13956)
Intensive Care and Critical Care Medicine (827)
Medical Education (412)
Medical Ethics (114)
Nephrology (461)
Neurology (4168)
Nursing (220)
Nutrition (615)
Obstetrics and Gynecology (784)
Occupational and Environmental Health (721)
Oncology (2195)
Ophthalmology (623)
Orthopedics (254)
Otolaryngology (317)
Pain Medicine (265)
Palliative Medicine (81)
Pathology (485)
Pediatrics (1171)
Pharmacology and Therapeutics (487)
Primary Care Research (481)
Psychiatry and Clinical Psychology (3639)
Public and Global Health (6754)
Radiology and Imaging (1484)
Rehabilitation Medicine and Physical Therapy (863)
Respiratory Medicine (897)
Rheumatology (430)
Sexual and Reproductive Health (431)
Sports Medicine (368)
Surgery (471)
Toxicology (57)
Transplantation (200)
Urology (173)

[1] 1.↵
Roth GA, Abate D, Abate KH, Abay SM, Abbafati C, Abbasi N, et al. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: A systematic analysis for the global burden of disease study 2017. The Lancet [Internet]. 2018 Nov [cited 2022 Dec 17];392(10159):1736–88. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0140673618322037
OpenUrl

[2] 2.↵
Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the global burden of disease study 2019. The Lancet [Internet]. 2020 Oct [cited 2022 Dec 17];396(10258):1204–22. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0140673620309259
OpenUrl

[3] 3.↵
World Health Organization. Guidelines for trauma quality improvement programmes [Internet]. 2009 [cited 2022 Aug 24] p. 104. Available from: https://www.who.int/publications/i/item/guidelines-for-trauma-quality-improvement-programmes

[4] 4.
Santana MJ, Stelfox HT. Development and evaluation of evidence-informed quality indicators for adult injury care. Annals of Surgery [Internet]. 2014 Jan;259(1):186–92. Available from: doi:10.1097/sla.0b013e31828df98e
OpenUrl CrossRef

[5] 5.↵
American College of Surgeons. Resources for Optimal Care of the Injured Patient. Chicago, IL 60611-3295: American College of Surgeons; 2022.

[6] 6.↵
Vioque SM, Kim PK, McMaster J, Gallagher J, Allen SR, Holena DN, et al. Classifying errors in preventable and potentially preventable trauma deaths: A 9-year review using the joint commission’s standardized methodology. The American Journal of Surgery [Internet]. 2014 Aug [cited 2022 Dec 17];208(2):187–94. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0002961014001688
OpenUrl

[7] 7.↵
Hornor MA, Hoeft C, Nathens AB. Quality benchmarking in trauma: From the NTDB to TQIP. Curr Trauma Rep [Internet]. 2018 Jun [cited 2022 Dec 17];4(2):160–9. Available from: http://link.springer.com/10.1007/s40719-018-0127-1
OpenUrl

[8] 8.↵
Evans C, Howes D, Pickett W, Dagnone L. Audit filters for improving processes of care and clinical outcomes in trauma systems. Cochrane Injuries Group, editor. Cochrane Database of Systematic Reviews [Internet]. 2009 Oct 7 [cited 2022 Dec 17]; Available from: https://doi.wiley.com/10.1002/14651858.CD007590.pub2

[9] 9.↵
Cryer HG, Hiatt JR, Fleming AW, Gruen JP, Sterling J. Continuous Use of Standard Process Audit Filters Has Limited Value in an Established Trauma System. Journal of Trauma and Acute Care Surgery [Internet]. 1996 Sep [cited 2024 Apr 30];41(3):389. Available from: https://journals.lww.com/jtrauma/abstract/1996/09000/continuous_use_of_standard_process_audit_filters.3.aspx
OpenUrl

[10] 10.↵
Ghorbani P, Strömmer L. Analysis of preventable deaths and errors in trauma care in a scandinavian trauma level-i centre. Acta Anaesthesiol Scand [Internet]. 2018 Sep [cited 2022 Dec 17];62(8):1146–53. Available from: https://onlinelibrary.wiley.com/doi/10.1111/aas.13151
OpenUrl

[11] 11.
Radke OC, Heim C. Recognizing preventable death. Anesthesiology Clinics [Internet]. 2019 Mar [cited 2022 Dec 17];37(1):1–11. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1932227518300880
OpenUrl

[12] 12.↵
Heim C, Cole E, West A, Tai N, Brohi K. Survival prediction algorithms miss significant opportunities for improvement if used for case selection in trauma quality improvement programs. Injury [Internet]. 2016 Sep [cited 2022 Dec 17];47(9):1960–5. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0020138316302145
OpenUrl

[13] 13.↵
Årsrapporter | SweTrau [Internet]. [cited 2021 Feb 10]. Available from: http://rcsyd.se/swetrau/om-swetrau/arsrapporter

[14] 14.↵
Ringdal KG, Coats TJ, Lefering R, Di Bartolomeo S, Steen PA, Roise O, et al. The utstein template for uniform reporting of data following major trauma: A joint revision by SCANTEM, TARN, DGU-TR and RITG. Scand J Trauma Resusc Emerg Med [Internet]. 2008 [cited 2022 Dec 10];16(1):7. Available from: http://sjtrem.biomedcentral.com/articles/10.1186/1757-7241-16-7
OpenUrl

[15] 15.↵
Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. Bmj [Internet]. 2020 Mar 18 [cited 2022 Dec 17];m441. Available from: https://www.bmj.com/lookup/doi/10.1136/bmj.m441

[16] 16.↵
Smeden M van, Moons KG, Groot JA de, Collins GS, Altman DG, Eijkemans MJ, et al. Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res [Internet]. 2019 Aug [cited 2022 Dec 17];28(8):2455–74. Available from: http://journals.sagepub.com/doi/10.1177/0962280218784726
OpenUrl

[17] 17.↵
Albaaj H, Attergrim J, Strömmer L, Brattström O, Jacobsson M, Wihlke G, et al. Patient and process factors associated with opportunities for improvement in trauma care: A registry-based study. Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine [Internet]. 2023 Nov;31(1). Available from: doi:10.1186/s13049-023-01157-y
OpenUrl CrossRef

[18] 18.↵
R Core Team. R: A language and environment for statistical computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2020. Available from: https://www.R-project.org/

[19] 19.↵
Chen T, Guestrin C. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining [Internet]. Acm; 2016 [cited 2022 Dec 17]. p. 785–94. Available from: https://dl.acm.org/doi/10.1145/2939672.2939785

[20] 20.↵
Yeo I-K. A new family of power transformations to improve normality or symmetry. Biometrika [Internet]. 2000 Dec 1 [cited 2022 Dec 17];87(4):954–9. Available from: https://academic.oup.com/biomet/article-lookup/doi/10.1093/biomet/87.4.954
OpenUrl

[21] 21.↵
Haibo He, Yang Bai, Garcia EA, Shutao Li. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence) [Internet]. Hong Kong, China: Ieee; 2008 [cited 2022 Dec 17]. p. 1322–8. Available from: http://ieeexplore.ieee.org/document/4633969/

[22] 22.↵
Kuhn M, Wickham H. Tidymodels: A collection of packages for modeling and machine learning using tidyverse principles. [Internet]. 2020. Available from: https://www.tidymodels.org

[23] 23.↵
Austin PC, Steyerberg EW. The integrated calibration index (ICI) and related metrics for quantifying the calibration of logistic regression models. Statistics in Medicine [Internet]. 2019 Sep 20 [cited 2023 Jan 4];38(21):4051–65. Available from: https://onlinelibrary.wiley.com/doi/10.1002/sim.8281
OpenUrl

[24] 24.↵
Fisher A, Rudin C, Dominici F. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. J Mach Learn Res. 2019;20:177.
OpenUrl

[25] 25.↵
Roy N, Kizhakke Veetil D, Khajanchi MU, Kumar V, Solomon H, Kamble J, et al. Learning from 2523 trauma deaths in india-opportunities to prevent in-hospital deaths. BMC Health Serv Res [Internet]. 2017 Dec [cited 2022 Dec 17];17(1):142. Available from: http://bmchealthservres.biomedcentral.com/articles/10.1186/s12913-017-2085-7
OpenUrl

[26] 26.↵
Sanddal TL, Esposito TJ, Whitney JR, Hartford D, Taillac PP, Mann NC, et al. Analysis of preventable trauma deaths and opportunities for trauma care improvement in utah. Journal of Trauma: Injury, Infection & Critical Care [Internet]. 2011 Apr [cited 2022 Dec 17];70(4):970–7. Available from: https://journals.lww.com/00005373-201104000-00032
OpenUrl

[27] 27.
Cardosi JD, Shen H, Groner JI, Armstrong M, Xiang H. Machine learning for outcome predictions of patients with trauma during emergency department care. BMJ Health Care Inform [Internet]. 2021 Oct [cited 2022 Dec 17];28(1):e100407. Available from: https://informatics.bmj.com/lookup/doi/10.1136/bmjhci-2021-100407
OpenUrl

[28] 28.↵
Zhang T, Nikouline A, Lightfoot D, Nolan B. Machine learning in the prediction of trauma outcomes: A systematic review. Annals of Emergency Medicine [Internet]. 2022 Nov [cited 2022 Dec 17];80(5):440–55. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0196064422003353
OpenUrl