Recent Advancements in Machine Learning-Based Bloodstream Infection Prediction: A Systematic Review and Meta-analysis of Diagnostic Test Accuracy

Rajeev Bopche; Jan Kristian Damås

doi:10.1101/2024.04.15.24305877

Abstract

Purpose Bloodstream infections (BSIs) present significant public health challenges. With the advent of machine learning (ML), promising predictive models have been developed. This study evaluates their performance through a systematic review and meta-analysis.

Methods We performed a comprehensive systematic review across multiple databases, including PubMed, IEEE Xplore, ScienceDirect, ACM Digital Library, SpringerLink, Web of Science, Scopus, and Google Scholar. Eligible studies focused on BSIs within any hospital setting, employing ML models as the diagnostic test. We evaluated the risk of bias with the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) checklist and assessed the quality of evidence using the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) approach. Models reporting the area under the receiver operating characteristic curve (AUROC) were included in the meta-analysis to identify key performance drivers.

Results After screening, a total of 30 studies were eligible for synthesis, from which 41 models and 8 data types were extracted. Most of the studies were carried out in the inpatient settings (n=17; 56%), followed by the emergency department (ED) settings (n=7; 23%), and followed by the ICU settings (n=6; 20%). The reported AUROCs in the hospital inpatients settings, ranged from 0.51-0.866, in the ICU settings AUROCs ranged from 0.668-0.970, and in the emergency department (ED) settings the AUROCs of the models ranged from 0.728-0.844. One study reported prospective cohort study, while two prospectively validated their models. In the meta-analysis, laboratory tests, Complete Blood Count/Differential Count (CBC/DC), and ML model type contributed the most to model performance.

Conclusion This systematic review and meta-analysis show that on retrospective data, individual ML models can accurately predict BSIs at different stages of patient trajectory. Although they enable early prediction of BSI, a comprehensive approach to integrate data types and models is necessary. Systematic reporting, externally validated, and clinical implementation studies are needed to establish clinical confidence.

1. INTRODUCTION

Bloodstream infections (BSIs) pose a severe public health threat, often escalating to critical conditions such as sepsis and septic shock, particularly if not promptly recognized and treated. The rapid progression of these infections contributes to high morbidity, mortality, and significant healthcare costs, making BSIs a pivotal challenge in clinical care [1, 2]. Currently available clinical decision tools for BSIs and sepsis are primarily based on changes in vital signs and abnormal blood test results, which lack sufficient accuracy [3, 4]. Enhancing prediction capabilities could lead to more efficient resource allocation and cost reduction. Accurate initial stratification helps direct resources to high-risk BSI patients and minimizes unnecessary tests and treatments for those at low risk. Moreover, ambiguous results from blood cultures with contaminants can prolong hospital stays and lead to unnecessary antibiotic administration [5, 6]. Therefore, improving the predictive value of these tests and reducing their unnecessary use are critical for optimizing healthcare resource allocation and minimizing costs.

Despite the extensive application of machine learning (ML) across various healthcare diagnostics, its potential in BSI prediction has not been fully exploited, particularly when compared to its use in predicting sepsis, where it has been more thoroughly researched [7–14]. This gap underscores a critical oversight; early detection of BSIs can significantly limit their progression to sepsis, thus reducing patient suffering and healthcare expenses [16]. This systematic literature review (SLR) and meta-analysis (MA) seek to bridge this gap by highlighting recent advancements in ML models for BSI prediction. Our analysis spans a range of algorithms and examines the influence of different data types used within these models, providing a comprehensive overview of the current landscape and suggesting directions for future research.

1.1. Related works

Recent systematic reviews on ML for BSI prediction have illuminated the capabilities of validated models in clinical environments. For example, Eliakim-Raz et al. (2015) emphasize the variability in study populations, model parameters, and validation processes, which highlights the complexity in creating universally effective ML models for BSI prediction [16]. In contrast, the review by Coburn et al. (2012) focuses on the critical clinical and laboratory markers needed to determine the necessity for blood cultures in suspected bacteremia cases, further supporting the role of ML in enhancing diagnostic accuracy and reducing false positives [2]. These studies demonstrate the untapped potential of ML to streamline diagnostic processes and improve patient care in BSI contexts, advocating for its increased integration into BSI prediction strategies.

2. METHODS

The guidelines provided by the Preferred Reporting Items for Systematic Reviews and Meta-Analysis of Diagnostic Test Accuracy studies (PRISMA-DTA) statement [15] was followed for the conduct of this literature review and meta-analysis.

2.1. Search Strategy

The literature search spanned nine databases to cover a wide array of relevant literature. The databases included: PubMed, IEEE Xplore Digital Library, ScienceDirect, ACM Digital Library, SpringerLink, Web of Science, Scopus, and Google Scholar. These databases were selected to encompass a vast range of medical and computer science literature, including journal articles and conference proceedings. To ensure a comprehensive and relevant literature retrieval, we first identified key terms and their possible alternatives. This approach ensures coverage of a broad spectrum of pertinent studies. The core search terms, along with their alternatives, were: Bloodstream Infection (BSI) – Blood culture, Bacteremia, Blood culture test; Prediction – Diagnosis, Identification, Detection; Machine Learning – Artificial Intelligence, Deep Learning, Supervised Learning, Computational Intelligence; Clinical Data – Electronic Health Records (EHR), Electronic Patient Records (EPR), Electronic Medical Records (EMR). Adjustments were made to the query to accommodate the syntax of different database search engines. The search query was constructed by combining these terms using Boolean operators:

− (Bloodstream Infection OR Blood culture OR Bacteremia) AND (Prediction OR Diagnosis OR Identification OR Detection) AND (Machine Learning OR Artificial Intelligence OR Deep Learning OR Supervised Learning OR Computational Intelligence) AND (Clinical Data OR Electronic Health Records OR Electronic Patient Records OR Electronic Medical Records)

The selection of relevant studies involved a multi-step process. First, The execution of the search query in each database. Then, Initial screening based on titles and abstracts to exclude non-relevant records. In the third step, detailed screening involving full-text reviews was carried out. The reasons for exclusion were documented. Following this, references in selected articles were examined to identify additional relevant studies, implementing snowballing technique for accessing new literature. This structured approach ensured comprehensive coverage of the topic, allowing the retrieval of documents that met the inclusion criteria for data extraction and analysis to address the research questions of this SLR.

2.2. Study Selection

2.2.1. Inclusion Criteria

- Study Focus: This review specifically targeted studies that utilized machine learning models for the prediction of bacteremia, BSI, or positive blood culture results. The primary objective was the development or validation of these predictive models.

- Types of Publications: The review was confined to peer-reviewed journal articles and conference papers.

- Target Population: The studies included focused on adult populations.

- Model Specification: Included studies clearly described the ML model(s) used, including the type of algorithm, data inputs, and validation methods.

- Outcome Measures: Studies reported specific outcomes related to the accuracy, sensitivity, specificity, or predictive value of the ML models for BSI prediction.

2.2.2. Exclusion Criteria

- Focus on Sepsis and Severe Sepsis: Studies aimed primarily at predicting the onset of sepsis or severe sepsis, rather than BSI, were excluded. This was to concentrate on predictive research for BSI, an antecedent condition, rather than sepsis, a consequent condition.

- Pediatric Studies: Research focusing on neonatal, pediatric, or infant populations was excluded, as the review was limited to adult populations.

- Post-Diagnosis Prediction: Studies focusing on predicting outcomes or complications for patients already diagnosed with BSI were excluded. The emphasis was on the initial prediction of BSI.

- Non-Machine Learning Models: Research employing statistical or other predictive methodologies without using ML algorithms was not included.

- Non-Peer Reviewed Literature: Studies that had not undergone a peer review process, including grey literature, technical reports, and unpublished manuscripts, were excluded.

- Opinion Pieces and Theoretical Works: Commentaries, opinion pieces, and purely theoretical works lacking empirical data or validation were omitted.

2.2.3. Timeframe and Language Restrictions

- Language Restrictions: The review included only studies published in English. This criterion was applied to ensure the feasibility of thorough analysis and comprehension of the research findings.

- Timeframe of Publication: The review focused on studies published within the last five years, capturing the most recent advancements in machine learning applications for bloodstream infection prediction.

2.3. Data Extraction

A structured approach was used to extract relevant data from identified studies. The data extraction process involved the following key elements:

− Study Identification: Details including authors, year of publication, and title of the study.
− Population Characteristics: Information on the patient demographics, clinical settings, and specific population subgroups studied.
− Dataset Description: Description of the dataset used in the study, including size, source, prevalence rates, and period of data collection.
− Algorithms Employed: Identification of the specific ML algorithms used in the study.
− Reported Metrics: Reporting on the performance metrics used in the study.
− Key Findings and Predictors: Summary of the main findings, including key predictors identified for BSI prediction.

2.4. Quality Assessment

To evaluate risk of bias, we used the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) criteria [17]. Domains included patient selection, index test, reference standard, and flow and timing. In line with the recommendations from the QUADAS-2 guidelines, questions per domain were tailored for this paper and can be found in (Supplementary Information). If one of the questions was scored at risk of bias, the domain was scored as high risk of bias. At least one domain at high risk of bias resulted in an overall score of high risk of bias, only one domain scored as unclear risk of bias resulted in an overall score of unclear risk of bias for that paper. We used the Grading of Recommendations Assessment, Development and Evaluation (GRADE) methodology to assess the quality of evidence per hospital setting for all studies reporting the area under the curve of the receiver operating characteristic (AUROC) as their performance metric [18]. In line with the GRADE guidelines for diagnostic test accuracy, we included the study design, domains risk of bias (limitations), and inconsistency of the results. One level of evidence was deducted for each domain with serious concerns or high risk of bias, no factors increased the level of evidence (Supplementary Information). Overall level of evidence is expressed in four categories (high, moderate, low, very low).

2.5. Meta-analysis

For meta-analysis we conducted univariate and multivariate random effects model analysis fitted using restricted maximum likelihood estimation (REML) for the contribution of covariates towards model performance [19]. We grouped medical data types utilized to train the models such as (demographics as features, vital signs as features, laboratory tests as features, use of textual data, use of time series data, etc.) and included them as covariates. We performed pooled regression analysis to study the most significant covariates per study settings.

3. RESULTS

3.1 Study selection

A comprehensive literature search was carried out from November 2023 to March 2024, yielding 348 articles post-duplication removal. Following an in-depth assessment for eligibility, 30 studies conformed to our inclusion criteria, references [20–49]. The selection process, including reasons for exclusion at each stage, is outlined in the PRISMA flow diagram (Figure 1). Notably, the majority (29 out of 30) of the included articles employed retrospective study designs, with only two conducting prospective validations of their models [40, 41]. A single study presented a prospective cohort design [29].

Fig 1.

PRISMA flow diagram illustrating the systematic review methodology, depicting the screening, eligibility assessment, and inclusion of studies, alongside exclusion justifications at each phase

3.2 Study characteristics

The studies predominantly took place in inpatient settings (56%, n=17), with the emergency department (ED) (23%, n=7) and intensive care units (ICU) (20%, n=6) also represented. Within inpatient settings, nine studies examined general populations [20–28], and others focused on specific patient cohorts including those with central venous catheters (CVC) [30, 31], systemic inflammatory response syndrome (SIRS) [29], and various other conditions. In ED settings, seven studies were identified [37–43], two of which addressed specific patient groups. ICU-focused research numbered six studies [44–49], again with two concentrating on specific patient subsets. Bacteremia was the primary condition investigated in 24 studies, alongside candidemia and central line-associated bloodstream infection (CLABSI) in others. Contaminants were generally considered negative results, except in two studies [23, 45]. Table 1 presents a synthesis of the study characteristics, including target conditions, patient/sample sizes, data sources, and prevalence, categorized by setting.

View this table:

Table 1. Key characteristics and data sources of selected studies

Most of the reviewed studies were conducted within a single institution; however, three studies utilized datasets encompassing two hospital systems [20, 22, 44], and two studies expanded their analysis to incorporate multi-center data [41, 49]. External validation, which is critical for the generalizability of findings, was performed in four studies [38, 40, 41, 44]. Compliance with the Transparent Reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) guidelines, which enhance the reliability of predictive modeling, was confirmed in five studies [20, 28, 37, 39, 41, 50]. Data diversity is key in model training and validation. Within this context, three studies utilized the publicly available Medical Information Mart for Intensive Care (MIMIC) database [41, 44, 48], while four studies included Complete Blood Count/Differential Count (CBC/DC) data [24, 25, 34, 40]. Furthermore, two of these studies also incorporated Cell Population Data (CPD) [24, 40], highlighting the integration of detailed hematologic parameters. Unstructured data utilization was observed in one study that utilized textual data [23]. In terms of data accessibility, most studies employed proprietary hospital data. Data sharing policies varied: one study explicitly stated that their data would not be shared [47], one study offered openly available data [34], and ten studies indicated that deidentified data could be provided upon reasonable request [24, 25, 27, 28, 29, 38, 39, 41, 42, 49], thus contributing to transparency and reproducibility. Notably, six articles did not specify the number of patients included in their analysis [22, 24, 25, 28, 31, 48].

3.3. Models and data types

The reported AUROCs in the hospital inpatients settings ranged from 0.51-0.866, while models on select group of participants like HD patients [32], patients with low Procalcitonin (PCT) [35], and HIV patients [36] achieved high AUROCs of 0.91, 0.92, and 0.91 respectively. For the emergency department (ED) settings the AUROCs of the models ranged from 0.728-0.844. For the ICU settings AUROCs ranged from 0.668-0.970. All articles reported high prediction performance (AUROC > 0.7) except for one article [26]. All except two articles [23, 34] reported AUROC as their performance metric. Figure 2 presents a horizontal bar chart that delineates the distribution of AUROC values across the various studies reporting AUROC, with the bar length indicating the range of AUROC values reported across various ML models utilized in the study. The gradient color scheme distinguishes the studies based on the clinical settings, ED (red), inpatient (blue), and ICU (yellow). On the right axis the number of patients, or the number of BC episodes/samples analyzed in the study. Figure 3 depicts the utilization frequency of different ML algorithms categorized into tree-based models, traditional ML algorithms, and neural networks. Notably, tree-based models such as RF and XGB displayed a higher incidence of application. LR was predominantly favored among traditional ML algorithms and within neural networks, ANN and MLP were the commonly employed architectures. Figure 4 presents a horizontal bar graph enumerating the occurrence of various performance metrics in the selected studies. The AUROC was the most reported metric. Figure 5 illustrates the types and diversity of data inputs used with the ML models across the selected studies via a stacked bar chart. The derived risk factors represent the derived clinical features included in the models. The ML algorithms and top predictors for each study grouped by settings are given in Table 2.

Fig 2.

Horizontal bar chart depicting the spread of AUROC values reported in various studies. The studies are listed on the vertical axis, and each bar’s horizontal extension represents the range of minimum and maximum AUROC value achieved among the various models employed in the study. The color gradient red, indicates studies in ED, blue, indicating hospital inpatient settings, and the yellow area covers the studies in ICU settings.

Fig 3.

Stacked bar chart displaying the occurrence of MLalgorithms in BSI prediction studies across three categories: Tree-based models, Traditional ML Algorithms, and Neural Networks. For Tree-based models: Random Forest (RF), Extreme Gradient Boosting (XGB), Gradient Boosting Machine (GBM), Light Gradient Boosting Machine (LGBM), Extra Trees (ET), Gradient Boosted Trees (GBT), Classification and Regression Trees (CART), and Categorical Boosting (CatBoost). Traditional ML Algorithms include Logistic Regression (LR), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Decision Tree (DT), Naive Bayes (NB), Generalized Linear Model (GLM), Linear Discriminant Analysis (LDA), Multiple Discriminant Analysis (MDA), and Elastic Net Regression (ENR). Neural Networks are represented by Artificial Neural Network (ANN), Multilayer Perceptron (MLP), Bayesian Neural Network (BNN), Deep Learning (DL), Recurrent Neural Network (RNN), Gated Recurrent Unit (GRU), Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), Attention-based Bidirectional Long Short-Term Memory (ABiLSTM), and Denoising Autoencoder (DAE)

Fig 4.

Horizontal bar graph summarizing the frequency of performance metrics reported in ML studies for BSI prediction. The metrics are displayed in descending order of occurrence, starting with Area Under the Receiver Operating Characteristic curve (AUC), followed by Sensitivity, Specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV), Accuracy, F1-scores, Area Under the Precision-Recall Curve (AUPRC), Precision, Brier scores, Likelihood Ratio (LR), Recall, Akaike Information Criterion (AIC), False Positive Rate (FPR), J-Statistic, Cohen’s Kappa, F-measure, and Diagnostic Odds Ratio (DOR)

Fig 5.

Stacked bar chart representing the types of data used in machine learning models for predicting bloodstream infections, across various studies listed on the vertical axis. Data types include demographics, vital signs, laboratory tests, textual data, derived risk factors, time-series data, Complete Blood Count/Differential Count (CBC/DC), and Cell Population Data (CPD). Each bar’s length indicates the number of data types used in each study, providing a comparison of data diversity across the research

View this table:

Table 2. ML algorithms and key predictors in the selected studies

3.4. Data challenges and strategies

BSIs may be relatively rare compared to the number of non-infection cases in a dataset. This imbalance can lead models to become biased towards predicting the majority class, reducing their effectiveness in identifying true infection cases. Most studies reported imbalanced dataset with prevalence rates of BSI as given in the Table 1. To overcome challenges with data imbalance, Synthetic Minority Over-sampling Technique (SMOTE) was widely used for augmenting the minority class in the dataset by generating synthetic samples [21, 30, 49, 51]. Goh et al. (2022) employed oversampling, undersampling, and random oversampling (ROSE) methods for model development [42]. Lien et al. (2022) and Van Steenkiste et al. (2019) employed Precision-Recall Area Under Curve (PRAUC) metric for a more accurate assessment of model performance in imbalanced datasets [25, 45].. The study by Garnica et al. (2021) encountered significant issues with missing data across the patient records used [27]. The types of missing data were classified into three categories: Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR) [52]. They employed separate class method to represent the missing data, ensuring the ML models could handle these cases without dropping significant amounts of data. Using patient data for training ML models can raise concerns about privacy and data security. Ensuring patient anonymity and complying with regulations can limit the accessibility and use of certain data. Boerman et al. (2022) faced difficulty with the limitation of not being able to use free-text data such as physician and nurse reports due to privacy concerns [39]. To ensure patient privacy and compliance with data protection regulations researchers can implement effective deidentification of patient records, involving elimination or alteration of direct identifiers, such as names, age, gender, or location, which could be combined to identify an individual.

3.5. Quality of evidence and risk of bias

The risk of bias for the retrospective diagnostic test accuracy studies was assessed following the QUADAS-2 guidelines, and the results are shown in Table 3. For patient selection, six articles (20.6%) presented high risk of bias and four articles (13.7%) presented unclear risk of bias for failing to describe the study population and patient selection. For index test 12 articles scored a high risk of bias (41.3%) and five article (17.2%) scored unclear risk of bias, due to non-reporting of data splits and cross-validation strategies. Three articles (10.3%) scored high risk of bias and one score unclear risk of bias for not specifying the reference standards. One article presented unclear risk of bias in the flow and timing.

View this table:

Table 3. Risk of Bias Assessment Results According to QUADAS-2 Criteria

The GRADE evidence profile was estimated by pooling studies based on settings. In this analysis we only considered studies focusing on general adult patient population and did not consider studies reporting specific study population. The GRADE evidence profile is given as Table 4. The studies in ICU and Inpatients settings aggregates were considered at high risk of bias and studies in ED settings aggregates were considered unclear risk of bias. One level of evidence was deducted for observational study design and Inconsistencies due to heterogeneity. Consequently, the quality of evidence for each of the settings was scored as low. Additionally, the outcome column distinguishes AUROC values for high and unclear risk of bias studies. High risk of bias studies reported lowest AUROC values in the inpatient settings and reported highest AUROC values in the ED and ICU settings.

View this table:

Table 4. GRADE Evidence Profile and Quality Assessment of Studies by Setting

3.6. Meta-analysis

In the meta-analysis, only models presented in articles with target condition bacteremia and general patients population across the three settings were considered. Models for specific disease population, models not reporting AUROC metric, and models predicting CLABSIs were not considered in the meta-analysis. A total of 41 models and 9 covariates were included in the meta-analysis. The univariate and multivariate random effect model significant (p-value>0.05) results are shown in Table 5. The random effect model analysis shows that laboratory tests, use of time-series data, use of CBC/DC data, BiLSTM model, XGB model, and RF model positively contributed to the AUROC. We performed a pooled analysis by setting, to identify the best covariates for each. The results of the regression analysis are given as Table 6. For Inpatient setting, demographics (includes age), laboratory tests, and use of CBC/DC data positively contributed to the model performance and use of vital signs negatively impacted the model performance. In contrast in the ICU setting, vital signs, laboratory tests, derived risk factors, use of time-series data, and BiLSTM model employed for sequential data positively impacted the performance metrics.

View this table:

Table 5. Univariate and Multivariate Analysis of Predictive Performance Covariates

View this table:

Table 6. Pooled Regression Analysis Results by Study Setting

4. DISCUSSIONS

This is the first systematic review and meta-analysis that has integrated findings from diverse clinical settings to assess the performance of ML models in predicting BSIs. This work corroborates the growing evidence that ML models are viable tools for enhancing diagnostic accuracy and potentially reducing reliance on traditional blood cultures, a point underscored by contemporary studies [3, 6]. Significantly, our meta-analysis revealed that models performed variably across different clinical environments, with AUROC scores ranging widely. In the inpatient settings, AUROCs ranged from 0.51 to 0.866, demonstrating moderate to high diagnostic accuracy. This performance variance was even more pronounced in the ICU settings, where AUROCs spanned from 0.668 to 0.970, suggesting that certain models are highly effective while others may require further refinement. The variability in model performance aligns with the findings from previous studies which have highlighted the complexity involved in creating universally effective ML models for BSI prediction [16]. This analysis not only validates the potential of ML in clinical diagnostics but also highlights the critical need for tailored approaches depending on specific hospital settings and patient populations. Such differentiation in model efficacy emphasizes the importance of context-specific model development and deployment, which should be informed by the distinct dynamics and needs of each clinical environment. Furthermore, the integration of various data types, such as demographic features, laboratory tests, and derived clinical features, has been shown to significantly impact model performance, echoing the findings of recent meta-analyses which advocate for a multi-faceted approach to data integration within ML models to enhance diagnostic precision and reliability [53].

4.1. Clinical relevance and model performance

The findings from our study underscore the potential of ML models to enhance the diagnostic accuracy of bloodstream infections BSIs significantly. This enhancement is crucial, considering the current reliance on traditional blood cultures, which, while standard, are not without their limitations such as delays in results and the potential for contamination [54]. ML models, by leveraging a variety of clinical data, including patient demographics, prior medical history, and laboratory results, can predict BSIs with notable accuracy. This predictive capability is especially valuable in clinical settings where rapid decision-making is critical [55]. By predicting BSIs accurately, ML models can facilitate earlier intervention strategies, potentially leading to improved patient outcomes and reduced mortality rates [56]. Furthermore, the integration of ML into hospital systems offers a pathway to more streamlined resource allocation [57–59]. By accurately identifying patients at high risk of BSIs, hospitals can optimize the use of tests and allocate personnel and medical resources more efficiently [60]. This not only helps in managing hospital resources but also in reducing unnecessary antibiotic use, which is often a knee-jerk response to suspected infections [61].

4.2. Future directions and academic contribution

Our study have laid the groundwork for several promising directions that future investigations could take to advance this critical area of medical informatics. While retrospective studies form the bulk of current research on ML models for BSI prediction, there is a pronounced need for prospective studies. Such studies will allow for real-time data collection and model validation, offering insights that are often obscured in retrospective analyses [12]. This shift could also facilitate the adjustment of models in accordance with dynamic clinical environments, ensuring that they remain robust and reliable over time. The potential of ML models to be integrated into real-time clinical decision support systems represents a significant leap towards operationalizing AI in everyday clinical practice [62]. However, achieving this requires rigorous testing and validation of these models within clinical settings to ensure they perform reliably when interfaced with live data streams [63, 64]. There is substantial scope for exploring new predictive variables that could enhance the predictive accuracy of ML models. The incorporation of genomic data, patient mobility patterns, and real-time monitoring data could provide new insights into infection risk factors, potentially leading to more sophisticated and accurate prediction models [65, 66]. To build trust and validate the efficacy of ML models in clinical settings, systematic reporting and external validation are essential [67, 68]. Models need to be tested across diverse demographics and varied clinical environments to assess their universality and reliability [69–72]. The work of Fleuren et al. (2020) and Moor et al. (2021) highlights the importance of such validation in confirming the utility and accuracy of predictive models for sepsis, which can be paralleled in BSI prediction [12, 13]. As ML applications in healthcare continue to expand, it is crucial to consider the regulatory and ethical implications of their use [73, 74]. Ensuring patient privacy, securing data, and maintaining transparency in AI decision-making processes are paramount [75, 76]. Future research must also address these aspects to foster a safe and trustful adoption of AI technologies in healthcare.

4.2. Strengths and Limitations

Our systematic review and meta-analysis are grounded in a comprehensive, methodologically robust approach that integrates a variety of data sources and analytical techniques. The extensive database search across multiple platforms including PubMed, IEEE Xplore, and Scopus ensures a broad capture of relevant studies, reducing the risk of publication bias. This wide-ranging data acquisition is supplemented by our application of the QUADAS-2 framework and GRADE methodology, which enhances the credibility of our findings by systematically assessing the risk of bias and the quality of evidence across studies. Furthermore, the synthesis of data into figures and comprehensive tables enables clear visual representation and understanding of the applications of ML models across varied clinical settings. Our findings are supported by rigorous statistical analysis, including univariate and multivariate models, which reveal key performance drivers and validate the predictive power of ML models for bloodstream infections. Despite these strengths, several limitations must be acknowledged. A significant proportion of the included studies utilized retrospective designs. While retrospective studies provide valuable historical insights, they are inherently limited by the data available, often lacking the prospective validation needed to confirm the efficacy of predictive models under current clinical conditions. This design limitation impacts the generalizability of our findings, as the models might perform differently when deployed in real-time environments. To enhance the generalizability and applicability of future research, several strategies can be adopted. Prospective study designs should be prioritized, as they allow for the real-time evaluation and adjustment of ML models, ensuring that the models are tested and validated under varied clinical conditions. This approach not only tests the robustness of the models but also helps in fine-tuning them for practical deployment. Furthermore, multi-center studies involving diverse populations and settings should be encouraged to test the efficacy of these predictive models across different demographics and operational conditions. Such studies can help identify and mitigate any population-specific biases, thereby enhancing the models’ applicability and reliability. Lastly, ongoing validation and systematic reporting should be integral to future research efforts. By continuously assessing and reporting the performance of ML models, researchers can ensure that these tools remain effective and relevant in the ever-evolving clinical landscape. By addressing these limitations and building on the strengths of our current approach, future research can significantly advance the field of ML in medical diagnostics, particularly in the prediction and management of bloodstream infections.

5. CONCLUSIONS

Our systematic review and meta-analysis have critically evaluated the efficacy of ML models in predicting BSIs, a crucial domain where timely and accurate diagnosis can significantly influence clinical outcomes. ML models, especially those incorporating diverse data types such as laboratory results and demographic information, demonstrated a capacity to predict BSIs with a high degree of accuracy. The integration of these models into clinical settings can potentially reduce the reliance on traditional blood cultures, which, while foundational, are hampered by delays and susceptibility to contamination. This shift could streamline diagnostic workflows and enhance the speed and precision of infection response interventions, thus improving patient care and outcomes. To advance the use of ML in predicting BSIs, future research should focus on prospective studies and the development of real-time, adaptive ML systems that can be integrated seamlessly into clinical decision-support frameworks. Furthermore, external validation of these models across diverse populations and settings is essential to bolster clinical confidence and foster wider adoption. In conclusion, while ML models hold significant promise for transforming BSI diagnosis, their successful implementation will depend on meticulous model development, validation, and customization tailored to the nuanced demands of different healthcare settings. This holistic approach will be crucial in overcoming the current challenges and fully realizing the potential of ML in clinical diagnostics.

Data Availability

All data produced in the present work are contained in the manuscript

Author’s contributions

RB conceptualized and designed the study with inputs from JKD and ØN. RB performed database search and literature analysis. RB wrote the initial and subsequent drafts of the manuscript, which other authors reviewed and approved.

Funding

Financial support for this study was provided by the Computational Sepsis Mining and Modelling project through the Norwegian University of Science and Technology Health Strategic Area.

Acknowledgements

We would like to thank the researchers at Mid-Norway Centre for Sepsis Research for valuable discussions and feedback.

Appendix List of Medical Abbreviations

ALP: Alkaline Phosphatase
ALT: Alanine Aminotransferase
APS III: Acute Physiology Score III
ART: Antiretroviral Therapy
AST: Aspartate Aminotransferase
AKI: Acute Kidney Injury
BC: Blood Cultures
BDG: Beta-D-Glucan
BUN: Blood Urea Nitrogen
CBC: Complete Blood Count
CHG: Chlorhexidine Gluconate
CKD: Chronic Kidney Disease
CLABSI: Central Line-Associated Bloodstream Infection
COPD: Chronic Obstructive Pulmonary Disease
CPD: Cephalopelvic Disproportion
CRP: C-reactive Protein
CRRT: Continuous Renal Replacement Therapy
CVC: Central Venous Catheter
DBP: Diastolic Blood Pressure
DIFF: Differential Count
DM: Diabetes Mellitus
ESR: Erythrocyte Sedimentation Rate
GCS: Glasgow Coma Scale
Hb: Hemoglobin
HR: Heart Rate
ICU: Intensive Care Unit
LMR: Lymphocyte to Monocyte Ratio
NLR: Neutrophil to Lymphocyte Ratio
OASIS: Oxford Acute Severity of Illness Score
PCT: Procalcitonin
PLR: Platelet to Lymphocyte Ratio
PLT: Platelet Count
PPI: Proton Pump Inhibitor
PT: Prothrombin Time
RR: Respiratory Rate
SAPS II: Simplified Acute Physiology Score II
SBP: Systolic Blood Pressure
WBC: White Blood Cell Count

References

[1].↵
Goto, M., & Al-Hasan, M. N. (2013). Overall burden of bloodstream infection and nosocomial bloodstream infection in North America and Europe. Clinical Microbiology and Infection, 19(6), 501–509.
OpenUrl CrossRef PubMed
[2].↵
Coburn, B., Morris, A. M., Tomlinson, G., & Detsky, A. S. (2012). Does this adult patient with suspected bacteremia require blood cultures?. JAMA, 308(5), 502–511. doi:10.1001/jama.2012.8262
OpenUrl CrossRef PubMed Web of Science
[3].↵
Wong A, Otles E, Donnelly JP, et al. External Validation of a Widely Implemented Proprietary Sepsis Prediction Model in Hospitalized Patients. JAMA Intern Med. 2021;181(8):1065–1070. doi:10.1001/jamainternmed.2021.2626
OpenUrl CrossRef PubMed
[4].↵
Schootman, M., Wiskow, C., Loux, T., Meyer, L., Powell, S., Gandhi, A., & Lacasse, A. (2022). Evaluation of the effectiveness of an automated sepsis predictive tool on patient outcomes. Journal of critical care, 71, 154061. doi:10.1016/j.jcrc.2022.154061
OpenUrl CrossRef
[5].↵
Bates, D. W., Cook, E. F., Goldman, L., & Lee, T. H. (1990). Predicting bacteremia in hospitalized patients. A prospectively validated model. Annals of internal medicine, 113(7), 495–500. doi:10.7326/0003-4819-113-7-495
OpenUrl CrossRef PubMed Web of Science
[6].↵
Sakarikou, C., Altieri, A., Bossa, M. C., Minelli, S., Dolfa, C., Piperno, M., & Favalli, C. (2018). Rapid and cost-effective identification and antimicrobial susceptibility testing in patients with Gram-negative bacteremia directly from blood-culture fluid. Journal of microbiological methods, 146, 7–12. doi:10.1016/j.mimet.2018.01.012
OpenUrl CrossRef
[7].↵
Buch, V. H., Ahmed, I. & Maruthappu, M. Artificial intelligence in medicine: Current trends and future possibilities. Br. J. Gen. Pract. 68, 143–144. doi:10.3399/bjgp18X695213 (2018).
OpenUrl FREE Full Text
[8].
Rajpurkar, P., Chen, E., Banerjee, O., et al. AI in health and medicine. Nat Med 28, 31–38 (2022). doi:10.1038/s41591-021-01614-0
OpenUrl CrossRef PubMed
[9].
A. Baldominos, A. Puello, H. Oğul, T. Aşuroğlu and R. Colomo-Palacios, "Predicting Infections Using Computational Intelligence – A Systematic Review," in IEEE Access, vol. 8, pp. 31083–31102, 2020, doi: 10.1109/ACCESS.2020.2973006.
OpenUrl CrossRef
[10].
Hassan, N., Slight, R., Weiand, D., Vellinga, A., Morgan, G., Aboushareb, F., & Slight, S. P. (2021). Preventing sepsis; how can artificial intelligence inform the clinical decision-making process? A systematic review. International journal of medical informatics, 150, 104457. doi:10.1016/j.ijmedinf.2021.104457
OpenUrl CrossRef PubMed
[11].
Islam, K. R., Prithula, J., Kumar, J., Tan, T. L., Reaz, M. B. I., Sumon, M. S. I., & Chowdhury, M. E. H. (2023). Machine Learning-Based Early Prediction of Sepsis Using Electronic Health Records: A Systematic Review. Journal of clinical medicine, 12(17), 5658. doi:10.3390/jcm12175658
OpenUrl CrossRef
[12].↵
Fleuren, L. M., Klausch, T. L. T., Zwager, C. L., Schoonmade, L. J., Guo, T., Roggeveen, L. F., Swart, E. L., Girbes, A. R. J., Thoral, P., Ercole, A., Hoogendoorn, M., & Elbers, P. W. G. (2020). Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive care medicine, 46(3), 383–400. doi:10.1007/s00134-019-05872-y
OpenUrl CrossRef PubMed
[13].↵
Moor, M., Rieck, B., Horn, M., Jutzeler, C. R., & Borgwardt, K. (2021). Early Prediction of Sepsis in the ICU Using Machine Learning: A Systematic Review. Frontiers in medicine, 8, 607952. doi:10.3389/fmed.2021.607952
OpenUrl CrossRef
[14].↵
Deng, H. F., Sun, M. W., Wang, Y., Zeng, J., Yuan, T., Li, T., Li, D. H., Chen, W., Zhou, P., Wang, Q., & Jiang, H. (2021). Evaluating machine learning models for sepsis
[15].↵
McInnes, M. D. F., Moher, D., Thombs, B. D., McGrath, T. A., Bossuyt, P. M., and the PRISMA-DTA Group, Clifford, T., Cohen, J. F., Deeks, J. J., Gatsonis, C., Hooft, L., Hunt, H. A., Hyde, C. J., Korevaar, D. A., Leeflang, M. M. G., Macaskill, P., Reitsma, J. B., Rodin, R., Rutjes, A. W. S., Salameh, J. P., … Willis, B. H. (2018). Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement. JAMA, 319(4), 388–396. doi:10.1001/jama.2017.19163
OpenUrl CrossRef PubMed
[16].↵
Eliakim-Raz, N., Bates, D. W., & Leibovici, L. (2015). Predicting bacteraemia in validated models--a systematic review. Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases, 21(4), 295–301. doi:10.1016/j.cmi.2015.01.023
OpenUrl CrossRef PubMed
[17].↵
Whiting PF, Rutjes AWS, Westwood ME et al. (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529. doi:10.7326/0003-4819-155-8-201110180-00009
OpenUrl CrossRef PubMed Web of Science
[18].↵
Schünemann, H. J., Oxman, A. D., Brozek, J., Glasziou, P., Jaeschke, R., Vist, G. E., Williams, J. W., Jr., Kunz, R., Craig, J., Montori, V. M., Bossuyt, P., Guyatt, G. H., & GRADE Working Group (2008). Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ (Clinical research ed.), 336(7653), 1106–1110. doi:10.1136/bmj.39500.677199.AE
OpenUrl FREE Full Text
[19].↵
Higgins, J. P., Thompson, S. G., & Spiegelhalter, D. J. (2009). A re-evaluation of random-effects meta-analysis. Journal of the Royal Statistical Society. Series A, (Statistics in Society), 172(1), 137–159. doi:10.1111/j.1467-985X.2008.00552.x
OpenUrl CrossRef PubMed Web of Science
[20].↵
Bhavani, S. V., Lonjers, Z., Carey, K. A., Afshar, M., Gilbert, E. R., Shah, N. S., Huang, E. S., & Churpek, M. M. (2020). The Development and Validation of a Machine Learning Model to Predict Bacteremia and Fungemia in Hospitalized Patients Using Electronic Health Record Data. Critical Care Medicine, 48(11), E1020–E1028. doi:10.1097/CCM.0000000000004556
OpenUrl CrossRef
[21].↵
Lee, K. H., Dong, J. J., Jeong, S. J., Chae, M.-H., Lee, B. S., Kim, H. J., Ko, S. H., & Song, Y. G. (2019). Early detection of bacteraemia using ten clinical variables with an artificial neural network approach. Journal of Clinical Medicine, 8(10). doi:10.3390/jcm8101592
OpenUrl CrossRef
[22].↵
Lee, K. H., Dong, J. J., Kim, S., Kim, D., Hyun, J. H., Chae, M.-H., Lee, B. S., & Song, Y. G. (2022). Prediction of Bacteremia Based on 12-Year Medical Data Using a Machine Learning Approach: Effect of Medical Data by Extraction Time. Diagnostics, 12(1). doi:10.3390/diagnostics12010102
OpenUrl CrossRef
[23].↵
Cheng, M., Zhao, X., Ding, X., Gao, J., Xiong, S., & Ren, Y. (2020). Prediction of blood culture outcome using hybrid neural network model based on electronic health records. BMC medical informatics and decision making, 20(Suppl 3), 121. doi:10.1186/s12911-020-1113-4
OpenUrl CrossRef
[24].↵
McFadden, B. R., Inglis, T. J. J., & Reynolds, M. (2023). Machine learning pipeline for blood culture outcome prediction using Sysmex XN-2000 blood sample results in Western Australia. BMC Infectious Diseases, 23(1). doi:10.1186/s12879-023-08535-y
OpenUrl CrossRef
[25].↵
Lien, F., Lin, H.-S., Wu, Y.-T., & Chiueh, T.-S. (2022). Bacteremia detection from complete blood count and differential leukocyte count with machine learning: complementary and competitive with C-reactive protein and procalcitonin tests. BMC Infectious Diseases, 22(1). doi:10.1186/s12879-022-07223-7
OpenUrl CrossRef
26.↵
Mahmoud, E., Dhoayan, M. A., Bosaeed, M., al Johani, S., & Arabi, Y. M. (2021). Developing machine-learning prediction algorithm for bacteremia in admitted patients. Infection and Drug Resistance, 14, 757–765. doi:10.2147/IDR.S293496
OpenUrl CrossRef
[27].↵
Garnica, O., Gómez, D., Ramos, V., Hidalgo, J. I., & Ruiz-Giardín, J. M. (2021). Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers. EPMA Journal, 12(3), 365–381. doi:10.1007/s13167-021-00252-3
OpenUrl CrossRef
28.↵
Murri, R., De Angelis, G., Antenucci, L., Fiori, B., Rinaldi, R., Fantoni, M., Damiani, A., Patarnello, S., Sanguinetti, M., Valentini, V., Posteraro, B., & Masciocchi, C. (2024). A Machine Learning Predictive Model of Bloodstream Infection in Hospitalized Patients. Diagnostics (Basel, Switzerland), 14(4), 445. doi:10.3390/diagnostics14040445
OpenUrl CrossRef
[29].↵
Ratzinger, F., Haslacher, H., Perkmann, T., Pinzan, M., Anner, P., Makristathis, A., Burgmann, H., Heinze, G., & Dorffner, G. (2018). Machine learning for fast identification of bacteraemia in SIRS patients treated on standard care wards: a cohort study. Scientific Reports, 8(1). doi:10.1038/s41598-018-30236-9
OpenUrl CrossRef
[30].↵
Rahmani, K., Garikipati, A., Barnes, G., Hoffman, J., Calvert, J., Mao, Q., & Das, R. (2022). Early prediction of central line associated bloodstream infection using machine learning. American journal of infection control, 50(4), 440–445. doi:10.1016/j.ajic.2021.08.017
OpenUrl CrossRef
[31].↵
Beeler, C., Dbeibo, L., Kelley, K., Thatcher, L., Webb, D., Bah, A., Monahan, P., Fowler, N. R., Nicol, S., Judy-Malcolm, A., Judy-Malcolm, A., & Azar, J. (2018). Assessing patient risk of central line-associated bacteremia via machine learning. American Journal of Infection Control, 46(9), 986–991. doi:10.1016/j.ajic.2018.02.021
OpenUrl CrossRef
[32].↵
Zhou, T., Ren, Z., Ma, Y., He, L., Liu, J., Tang, J., & Zhang, H. (2023). Early identification of bloodstream infection in hemodialysis patients by machine learning. Heliyon, 9(7). doi:10.1016/j.heliyon.2023.e18263
OpenUrl CrossRef
[33].
Yoo, J., Kim, S. H., Hur, S., Ha, J., Huh, K., & Cha, W. C. (2021). Candidemia Risk Prediction (CanDETEC) Model for Patients With Malignancy: Model Development and Validation in a Single-Center Retrospective Study. JMIR medical informatics, 9(7), e24651. doi:10.2196/24651
OpenUrl CrossRef
[34].↵
Mooney, C., Eogan, M., Ní Áinle, F., Cleary, B., Gallagher, J. J., O’Loughlin, J., & Drew, R. J. (2021). Predicting bacteraemia in maternity patients using full blood count parameters: A supervised machine learning algorithm approach. International Journal of Laboratory Hematology, 43(4), 609–615. doi:10.1111/ijlh.13434
OpenUrl CrossRef
[35].↵
Su, M., Chen, H., Qiu, J., & Huang, J. (2021). Four Biomarkers-Based Artificial Neural Network Model for Accurate Early Prediction of Bacteremia with Low-level Procalcitonin. Annals of Clinical and Laboratory Science, 51(3), 408–414.
OpenUrl Abstract/FREE Full Text
[36].↵
Wu, L., Xia, D., & Xu, K. (2023). Multi-Clinical Factors Combined with an Artificial Intelligence Algorithm Diagnosis Model for HIV-Infected People with Bloodstream Infection. Infection and Drug Resistance, 16, 6085–6097. doi:10.2147/IDR.S423709
OpenUrl CrossRef
[37].↵
Choi, D. H., Hong, K. J., Park, J. H., Shin, S. D., Ro, Y. S., Song, K. J., Kim, K. H., & Kim, S. (2022). Prediction of bacteremia at the emergency department during triage and disposition stages using machine learning models. American Journal of Emergency Medicine, 53, 86–93. doi:10.1016/j.ajem.2021.12.065
OpenUrl CrossRef
[38].↵
Choi, D. H., Lim, M. H., Kim, K. H., Shin, S. D., Hong, K. J., & Kim, S. (2023). Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty. Scientific Reports, 13(1). doi:10.1038/s41598-023-40708-2
OpenUrl CrossRef
[39].↵
Boerman, A. W., Schinkel, M., Meijerink, L., van den Ende, E. S., Pladet, L. C., Scholtemeijer, M. G., Zeeuw, J., van der Zaag, A. Y., Minderhoud, T. C., Elbers, P. W. G., Wiersinga, W. J., de Jonge, R., Kramer, M. H., & Nanayakkara, P. W. B. (2022). Using machine learning to predict blood culture outcomes in the emergency department: a single-centre, retrospective, observational study. BMJ open, 12(1), e053332. doi:10.1136/bmjopen-2021-053332
OpenUrl Abstract/FREE Full Text
[40].↵
Chang, Y.-H., Hsiao, C.-T., Chang, Y.-C., Lai, H.-Y., Lin, H.-H., Chen, C.-C., Hsu, L.-C., Wu, S.-Y., Shih, H.-M., Hsueh, P.-R., Hsueh, P.-R., & Cho, D.-Y. (2023). Machine learning of cell population data, complete blood count, and differential count parameters for early prediction of bacteremia among adult patients with suspected bacterial infections and blood culture sampling in emergency departments. Journal of Microbiology, Immunology and Infection, 56(4), 782–792. doi:10.1016/j.jmii.2023.05.001
OpenUrl CrossRef
41.↵
Schinkel, M., Boerman, A. W., Bennis, F. C., Minderhoud, T. C., Lie, M., Peters-Sengers, H., Holleman, F., Schade, R. P., de Jonge, R., Wiersinga, W. J., Wiersinga, W. J., & Nanayakkara, P. W. B. (2022). Diagnostic stewardship for blood cultures in the emergency department: A multicenter validation and prospective evaluation of a machine learning prediction tool. EBioMedicine, 82. doi:10.1016/j.ebiom.2022.104176
OpenUrl CrossRef
[42].↵
Goh, V., Chou, Y.-J., Lee, C.-C., Ma, M.-C., Wang, W. Y. C., Lin, C.-H., & Hsieh, C.-C. (2022). Predicting Bacteremia among Septic Patients Based on ED Information by Machine Learning Methods: A Comparative Study. Diagnostics, 12(10). doi:10.3390/diagnostics12102498
OpenUrl CrossRef
[43].↵
Tsai, W.-C., Liu, C.-F., Ma, Y.-S., Chen, C.-J., Lin, H.-J., Hsu, C.-C., Chow, J. C., Chien, Y.-W., & Huang, C.-C. (2023). Real-time artificial intelligence system for bacteremia prediction in adult febrile emergency department patients. International Journal of Medical Informatics, 178. doi:10.1016/j.ijmedinf.2023.105176
OpenUrl CrossRef
[44].↵
Roimi, M., Neuberger, A., Shrot, A., Paul, M., Geffen, Y., & Bar-Lavie, Y. (2020). Early diagnosis of bloodstream infections in the intensive care unit using machine-learning algorithms. Intensive Care Medicine, 46(3), 454–462. doi:10.1007/s00134-019-05876-8
OpenUrl CrossRef PubMed
[45].↵
Van Steenkiste, T., Ruyssinck, J., De Baets, L., Decruyenaere, J., De Turck, F., Ongenae, F., & Dhaene, T. (2019). Accurate prediction of blood culture outcome in the intensive care unit using long short-term memory neural networks. Artificial intelligence in medicine, 97, 38–43. doi:10.1016/j.artmed.2018.10.008
OpenUrl CrossRef PubMed
[46].
Boner, Z., & Moore, C. C. (2022). Deep Learning Risk Prediction of Bloodstream Infection in the Intensive Care Unit. In KDD Undergraduate Consortium, August 14-18, 2022, Washington, D.C… ACM, New York, NY, USA, 7 pages
[47].↵
Pai, K.-C., Wang, M.-S., Chen, Y.-F., Tseng, C.-H., Liu, P.-Y., Chen, L.-C., Sheu, R.-K., & Wu, C.-L. (2021). An artificial intelligence approach to bloodstream infections prediction. Journal of Clinical Medicine, 10(13). doi:10.3390/jcm10132901
OpenUrl CrossRef
[48].↵
Parreco, J. P., Hidalgo, A. E., Badilla, A. D., Ilyas, O., & Rattan, R. (2018). Predicting central line-associated bloodstream infections and mortality using supervised machine learning. Journal of Critical Care, 45, 156–162. doi:10.1016/j.jcrc.2018.02.010
OpenUrl CrossRef
[49].↵
Yuan, S., Sun, Y., Xiao, X., Long, Y., & He, H. (2021). Using Machine Learning Algorithms to Predict Candidaemia in ICU Patients With New-Onset Systemic Inflammatory Response Syndrome. Frontiers in Medicine, 8. doi:10.3389/fmed.2021.720926
OpenUrl CrossRef
[50].↵
Collins, G. S., Reitsma, J. B., Altman, D. G., & Moons, K. G. (2015). Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ (Clinical research ed.), 350, g7594. doi:10.1136/bmj.g7594
OpenUrl CrossRef PubMed
[51].↵
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, 321–357.
OpenUrl CrossRef
[52].↵
Little, R., & Rubin, D. (2019). Statistical analysis with missing data, third edition. Wiley.
[53].↵
Younis HA, Eisa TAE, Nasser M, et al. A Systematic Review and Meta-Analysis of Artificial Intelligence Tools in Medicine and Healthcare: Applications, Considerations, Limitations, Motivation and Challenges. Diagnostics (Basel). 2024;14(1):109. Published 2024 Jan 4. doi:10.3390/diagnostics14010109
OpenUrl CrossRef
[54].↵
Dargère, S., Cormier, H., & Verdon, R. (2018). Contaminants in blood cultures: importance, implications, interpretation and prevention. Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases, 24(9), 964–969. doi:10.1016/j.cmi.2018.03.030
OpenUrl CrossRef PubMed
[55].↵
Habehh, H., & Gohel, S. (2021). Machine Learning in Healthcare. Current genomics, 22(4), 291–300. doi:10.2174/1389202922666210705124359
OpenUrl CrossRef
[56].↵
Zoabi, Y., Kehat, O., Lahav, D. et al. Predicting bloodstream infection outcome using machine learning. Sci Rep 11, 20101 (2021). doi:10.1038/s41598-021-99105-2
OpenUrl CrossRef
[57].↵
Bohr, A., & Memarzadeh, K. (2020). The rise of artificial intelligence in healthcare applications. Artificial Intelligence in Healthcare, 25–60. doi:10.1016/B978-0-12-818438-7.00002-2
OpenUrl CrossRef
[58].
Javaid, M., Haleem, A., Singh, R. P., Suman, R., & Rab, S. (2022). Significance of machine learning in healthcare: Features, pillars and applications. International Journal of Intelligent Networks, 3, 58–73. doi:10.1016/j.ijin.2022.05.002
OpenUrl CrossRef
[59].↵
Bajwa, J., Munir, U., Nori, A., & Williams, B. (2021). Artificial intelligence in healthcare: transforming the practice of medicine. Future healthcare journal, 8(2), e188–e194. doi:10.7861/fhj.2021-0095
OpenUrl Abstract/FREE Full Text
[60].↵
Batko, K., & Ślęzak, A. (2022). The use of Big Data Analytics in healthcare. Journal of big data, 9(1), 3. doi:10.1186/s40537-021-00553-4
OpenUrl CrossRef
[61].↵
Alemkere, G., Tenna, A., & Engidawork, E. (2019). Antibiotic use practice and predictors of hospital outcome among patients with systemic bacterial infection: Identifying targets for antibiotic and health care resource stewardship. PloS one, 14(2), e0212661. doi:10.1371/journal.pone.0212661
OpenUrl CrossRef PubMed
[62].↵
Giordano, C., Brennan, M., Mohamed, B., Rashidi, P., Modave, F., & Tighe, P. (2021). Accessing Artificial Intelligence for Clinical Decision-Making. Frontiers in digital health, 3, 645232. doi:10.3389/fdgth.2021.645232
OpenUrl CrossRef
[63].↵
Antoniadi AM, Du Y, Guendouz Y, Wei L, Mazo C, Becker BA, Mooney C. Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review. Applied Sciences. 2021; 11(11):5088. doi:10.3390/app11115088
OpenUrl CrossRef
[64].↵
Susanto, A. P., Lyell, D., Widyantoro, B., Berkovsky, S., & Magrabi, F. (2024). How Well Do AI-Enabled Decision Support Systems Perform in Clinical Settings?. Studies in health technology and informatics, 310, 279–283. doi:10.3233/SHTI230971
OpenUrl CrossRef
[65].↵
Iqbal, J., Cortés Jaimes, D. C., Makineni, P., Subramani, S., Hemaida, S., Thugu, T. R., Butt, A. N., Sikto, J. T., Kaur, P., Lak, M. A., Augustine, M., Shahzad, R., & Arain, M. (2023). Reimagining Healthcare: Unleashing the Power of Artificial Intelligence in Medicine. Cureus, 15(9), e44658. doi:10.7759/cureus.44658
OpenUrl CrossRef
[66].↵
Xu, Y., Liu, X., Cao, X., Huang, C., Liu, E., Qian, S., Liu, X., Wu, Y., Dong, F., Qiu, C.-W., Qiu, J., Hua, K., Su, W., Wu, J., Xu, H., Han, Y., Fu, C., Yin, Z., Liu, M., Roepman, R., … Zhang, J. (2021). Artificial intelligence: A powerful paradigm for scientific research. The Innovation, 2(4), Article 100179. doi:10.1016/j.xinn.2021.100179
OpenUrl CrossRef
[67].↵
Myllyaho, L., Raatikainen, M., Männistö, T., Mikkonen, T., & Nurminen, J. K. (2021). Systematic literature review of validation methods for AI systems. Journal of Systems and Software, 181, Article 111050. doi:10.1016/j.jss.2021.111050
OpenUrl CrossRef
[68].↵
de Hond, A.A.H., Leeuwenberg, A.M., Hooft, L. et al. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review. npj Digit. Med. 5, 2 (2022). doi:10.1038/s41746-021-00549-7
OpenUrl CrossRef PubMed
[69].↵
Stevens, L. M., Mortazavi, B. J., Deo, R. C., Curtis, L., & Kao, D. P. (2020). Recommendations for Reporting Machine Learning Analyses in Clinical Research. Circulation. Cardiovascular quality and outcomes, 13(10), e006556. doi:10.1161/CIRCOUTCOMES.120.006556
OpenUrl CrossRef PubMed
[70].
Rieke, N., Hancox, J., Li, W. et al. The future of digital health with federated learning. npj Digit. Med. 3, 119 (2020). doi:10.1038/s41746-020-00323-1
OpenUrl CrossRef PubMed
[71].
Moor, M., Banerjee, O., Abad, Z.S.H. et al. Foundation models for generalist medical artificial intelligence. Nature 616, 259–265 (2023). doi:10.1038/s41586-023-05881-4
OpenUrl CrossRef PubMed
[72].↵
Antoniou, T., & Mamdani, M. (2021). Evaluation of machine learning solutions in medicine. CMAJ : Canadian Medical Association journal = journal de l’Association medicale canadienne, 193(36), E1425–E1429. doi:10.1503/cmaj.210036
OpenUrl FREE Full Text
[73].↵
Gerke, S., Minssen, T., & Cohen, G. (2020). Ethical and legal challenges of artificial intelligence-driven healthcare. Artificial Intelligence in Healthcare, 295–336. doi:10.1016/B978-0-12-818438-7.00012-5
OpenUrl CrossRef
[74].↵
Naik, N., Zeeshan, H. B. M., Shetty, D. K., Swain, D., Shah, M., Paul, R., Aggarwal, K., Ibrahim, S., Patil, V., Komal, S., Shetty, S., Rai, B. P., Chlosta, P., & Somani, B. K. (2022). Legal and ethical consideration in artificial intelligence in healthcare: Who takes responsibility? Frontiers in Surgery, 9. doi:10.3389/fsurg.2022.862322
OpenUrl CrossRef
[75].↵
Khalid, N., Qayyum, A., Bilal, M., Al-Fuqaha, A., & Qadir, J. (2023). Privacy-preserving artificial intelligence in healthcare: Techniques and applications. Computers in Biology and Medicine, 158, Article 106848. doi:10.1016/j.compbiomed.2023.106848
OpenUrl CrossRef
[76].↵
Murdoch, B. Privacy and artificial intelligence: challenges for protecting health information in a new era. BMC Med Ethics 22, 122 (2021). doi:10.1186/s12910-021-00687-3
OpenUrl CrossRef

View the discussion thread.

Posted April 16, 2024.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (322)
Allergy and Immunology (624)
Anesthesia (162)
Cardiovascular Medicine (2335)
Dentistry and Oral Medicine (283)
Dermatology (205)
Emergency Medicine (373)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (829)
Epidemiology (11703)
Forensic Medicine (10)
Gastroenterology (693)
Genetic and Genomic Medicine (3687)
Geriatric Medicine (345)
Health Economics (632)
Health Informatics (2365)
Health Policy (925)
Health Systems and Quality Improvement (884)
Hematology (340)
HIV/AIDS (773)
Infectious Diseases (except HIV/AIDS) (13264)
Intensive Care and Critical Care Medicine (765)
Medical Education (364)
Medical Ethics (104)
Nephrology (396)
Neurology (3450)
Nursing (195)
Nutrition (520)
Obstetrics and Gynecology (667)
Occupational and Environmental Health (660)
Oncology (1802)
Ophthalmology (533)
Orthopedics (218)
Otolaryngology (285)
Pain Medicine (231)
Palliative Medicine (66)
Pathology (444)
Pediatrics (1021)
Pharmacology and Therapeutics (424)
Primary Care Research (415)
Psychiatry and Clinical Psychology (3145)
Public and Global Health (6091)
Radiology and Imaging (1256)
Rehabilitation Medicine and Physical Therapy (735)
Respiratory Medicine (821)
Rheumatology (375)
Sexual and Reproductive Health (366)
Sports Medicine (320)
Surgery (396)
Toxicology (50)
Transplantation (171)
Urology (144)

[1] [1].↵
Goto, M., & Al-Hasan, M. N. (2013). Overall burden of bloodstream infection and nosocomial bloodstream infection in North America and Europe. Clinical Microbiology and Infection, 19(6), 501–509.
OpenUrl CrossRef PubMed

[2] [2].↵
Coburn, B., Morris, A. M., Tomlinson, G., & Detsky, A. S. (2012). Does this adult patient with suspected bacteremia require blood cultures?. JAMA, 308(5), 502–511. doi:10.1001/jama.2012.8262
OpenUrl CrossRef PubMed Web of Science

[3] [3].↵
Wong A, Otles E, Donnelly JP, et al. External Validation of a Widely Implemented Proprietary Sepsis Prediction Model in Hospitalized Patients. JAMA Intern Med. 2021;181(8):1065–1070. doi:10.1001/jamainternmed.2021.2626
OpenUrl CrossRef PubMed

[4] [4].↵
Schootman, M., Wiskow, C., Loux, T., Meyer, L., Powell, S., Gandhi, A., & Lacasse, A. (2022). Evaluation of the effectiveness of an automated sepsis predictive tool on patient outcomes. Journal of critical care, 71, 154061. doi:10.1016/j.jcrc.2022.154061
OpenUrl CrossRef

[5] [5].↵
Bates, D. W., Cook, E. F., Goldman, L., & Lee, T. H. (1990). Predicting bacteremia in hospitalized patients. A prospectively validated model. Annals of internal medicine, 113(7), 495–500. doi:10.7326/0003-4819-113-7-495
OpenUrl CrossRef PubMed Web of Science

[6] [6].↵
Sakarikou, C., Altieri, A., Bossa, M. C., Minelli, S., Dolfa, C., Piperno, M., & Favalli, C. (2018). Rapid and cost-effective identification and antimicrobial susceptibility testing in patients with Gram-negative bacteremia directly from blood-culture fluid. Journal of microbiological methods, 146, 7–12. doi:10.1016/j.mimet.2018.01.012
OpenUrl CrossRef

[7] [7].↵
Buch, V. H., Ahmed, I. & Maruthappu, M. Artificial intelligence in medicine: Current trends and future possibilities. Br. J. Gen. Pract. 68, 143–144. doi:10.3399/bjgp18X695213 (2018).
OpenUrl FREE Full Text

[8] [8].
Rajpurkar, P., Chen, E., Banerjee, O., et al. AI in health and medicine. Nat Med 28, 31–38 (2022). doi:10.1038/s41591-021-01614-0
OpenUrl CrossRef PubMed

[9] [9].
A. Baldominos, A. Puello, H. Oğul, T. Aşuroğlu and R. Colomo-Palacios, "Predicting Infections Using Computational Intelligence – A Systematic Review," in IEEE Access, vol. 8, pp. 31083–31102, 2020, doi: 10.1109/ACCESS.2020.2973006.
OpenUrl CrossRef

[10] [10].
Hassan, N., Slight, R., Weiand, D., Vellinga, A., Morgan, G., Aboushareb, F., & Slight, S. P. (2021). Preventing sepsis; how can artificial intelligence inform the clinical decision-making process? A systematic review. International journal of medical informatics, 150, 104457. doi:10.1016/j.ijmedinf.2021.104457
OpenUrl CrossRef PubMed

[11] [11].
Islam, K. R., Prithula, J., Kumar, J., Tan, T. L., Reaz, M. B. I., Sumon, M. S. I., & Chowdhury, M. E. H. (2023). Machine Learning-Based Early Prediction of Sepsis Using Electronic Health Records: A Systematic Review. Journal of clinical medicine, 12(17), 5658. doi:10.3390/jcm12175658
OpenUrl CrossRef

[12] [12].↵
Fleuren, L. M., Klausch, T. L. T., Zwager, C. L., Schoonmade, L. J., Guo, T., Roggeveen, L. F., Swart, E. L., Girbes, A. R. J., Thoral, P., Ercole, A., Hoogendoorn, M., & Elbers, P. W. G. (2020). Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive care medicine, 46(3), 383–400. doi:10.1007/s00134-019-05872-y
OpenUrl CrossRef PubMed

[13] [13].↵
Moor, M., Rieck, B., Horn, M., Jutzeler, C. R., & Borgwardt, K. (2021). Early Prediction of Sepsis in the ICU Using Machine Learning: A Systematic Review. Frontiers in medicine, 8, 607952. doi:10.3389/fmed.2021.607952
OpenUrl CrossRef

[14] [14].↵
Deng, H. F., Sun, M. W., Wang, Y., Zeng, J., Yuan, T., Li, T., Li, D. H., Chen, W., Zhou, P., Wang, Q., & Jiang, H. (2021). Evaluating machine learning models for sepsis

[15] [15].↵
McInnes, M. D. F., Moher, D., Thombs, B. D., McGrath, T. A., Bossuyt, P. M., and the PRISMA-DTA Group, Clifford, T., Cohen, J. F., Deeks, J. J., Gatsonis, C., Hooft, L., Hunt, H. A., Hyde, C. J., Korevaar, D. A., Leeflang, M. M. G., Macaskill, P., Reitsma, J. B., Rodin, R., Rutjes, A. W. S., Salameh, J. P., … Willis, B. H. (2018). Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement. JAMA, 319(4), 388–396. doi:10.1001/jama.2017.19163
OpenUrl CrossRef PubMed

[16] [16].↵
Eliakim-Raz, N., Bates, D. W., & Leibovici, L. (2015). Predicting bacteraemia in validated models--a systematic review. Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases, 21(4), 295–301. doi:10.1016/j.cmi.2015.01.023
OpenUrl CrossRef PubMed

[17] [17].↵
Whiting PF, Rutjes AWS, Westwood ME et al. (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529. doi:10.7326/0003-4819-155-8-201110180-00009
OpenUrl CrossRef PubMed Web of Science

[18] [18].↵
Schünemann, H. J., Oxman, A. D., Brozek, J., Glasziou, P., Jaeschke, R., Vist, G. E., Williams, J. W., Jr., Kunz, R., Craig, J., Montori, V. M., Bossuyt, P., Guyatt, G. H., & GRADE Working Group (2008). Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ (Clinical research ed.), 336(7653), 1106–1110. doi:10.1136/bmj.39500.677199.AE
OpenUrl FREE Full Text

[19] [19].↵
Higgins, J. P., Thompson, S. G., & Spiegelhalter, D. J. (2009). A re-evaluation of random-effects meta-analysis. Journal of the Royal Statistical Society. Series A, (Statistics in Society), 172(1), 137–159. doi:10.1111/j.1467-985X.2008.00552.x
OpenUrl CrossRef PubMed Web of Science

[20] [20].↵
Bhavani, S. V., Lonjers, Z., Carey, K. A., Afshar, M., Gilbert, E. R., Shah, N. S., Huang, E. S., & Churpek, M. M. (2020). The Development and Validation of a Machine Learning Model to Predict Bacteremia and Fungemia in Hospitalized Patients Using Electronic Health Record Data. Critical Care Medicine, 48(11), E1020–E1028. doi:10.1097/CCM.0000000000004556
OpenUrl CrossRef

[21] [21].↵
Lee, K. H., Dong, J. J., Jeong, S. J., Chae, M.-H., Lee, B. S., Kim, H. J., Ko, S. H., & Song, Y. G. (2019). Early detection of bacteraemia using ten clinical variables with an artificial neural network approach. Journal of Clinical Medicine, 8(10). doi:10.3390/jcm8101592
OpenUrl CrossRef

[22] [22].↵
Lee, K. H., Dong, J. J., Kim, S., Kim, D., Hyun, J. H., Chae, M.-H., Lee, B. S., & Song, Y. G. (2022). Prediction of Bacteremia Based on 12-Year Medical Data Using a Machine Learning Approach: Effect of Medical Data by Extraction Time. Diagnostics, 12(1). doi:10.3390/diagnostics12010102
OpenUrl CrossRef

[23] [23].↵
Cheng, M., Zhao, X., Ding, X., Gao, J., Xiong, S., & Ren, Y. (2020). Prediction of blood culture outcome using hybrid neural network model based on electronic health records. BMC medical informatics and decision making, 20(Suppl 3), 121. doi:10.1186/s12911-020-1113-4
OpenUrl CrossRef

[24] [24].↵
McFadden, B. R., Inglis, T. J. J., & Reynolds, M. (2023). Machine learning pipeline for blood culture outcome prediction using Sysmex XN-2000 blood sample results in Western Australia. BMC Infectious Diseases, 23(1). doi:10.1186/s12879-023-08535-y
OpenUrl CrossRef

[25] [25].↵
Lien, F., Lin, H.-S., Wu, Y.-T., & Chiueh, T.-S. (2022). Bacteremia detection from complete blood count and differential leukocyte count with machine learning: complementary and competitive with C-reactive protein and procalcitonin tests. BMC Infectious Diseases, 22(1). doi:10.1186/s12879-022-07223-7
OpenUrl CrossRef

[26] 26.↵
Mahmoud, E., Dhoayan, M. A., Bosaeed, M., al Johani, S., & Arabi, Y. M. (2021). Developing machine-learning prediction algorithm for bacteremia in admitted patients. Infection and Drug Resistance, 14, 757–765. doi:10.2147/IDR.S293496
OpenUrl CrossRef

[27] [27].↵
Garnica, O., Gómez, D., Ramos, V., Hidalgo, J. I., & Ruiz-Giardín, J. M. (2021). Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers. EPMA Journal, 12(3), 365–381. doi:10.1007/s13167-021-00252-3
OpenUrl CrossRef

[28] 28.↵
Murri, R., De Angelis, G., Antenucci, L., Fiori, B., Rinaldi, R., Fantoni, M., Damiani, A., Patarnello, S., Sanguinetti, M., Valentini, V., Posteraro, B., & Masciocchi, C. (2024). A Machine Learning Predictive Model of Bloodstream Infection in Hospitalized Patients. Diagnostics (Basel, Switzerland), 14(4), 445. doi:10.3390/diagnostics14040445
OpenUrl CrossRef

[29] [29].↵
Ratzinger, F., Haslacher, H., Perkmann, T., Pinzan, M., Anner, P., Makristathis, A., Burgmann, H., Heinze, G., & Dorffner, G. (2018). Machine learning for fast identification of bacteraemia in SIRS patients treated on standard care wards: a cohort study. Scientific Reports, 8(1). doi:10.1038/s41598-018-30236-9
OpenUrl CrossRef

[30] [30].↵
Rahmani, K., Garikipati, A., Barnes, G., Hoffman, J., Calvert, J., Mao, Q., & Das, R. (2022). Early prediction of central line associated bloodstream infection using machine learning. American journal of infection control, 50(4), 440–445. doi:10.1016/j.ajic.2021.08.017
OpenUrl CrossRef

[31] [31].↵
Beeler, C., Dbeibo, L., Kelley, K., Thatcher, L., Webb, D., Bah, A., Monahan, P., Fowler, N. R., Nicol, S., Judy-Malcolm, A., Judy-Malcolm, A., & Azar, J. (2018). Assessing patient risk of central line-associated bacteremia via machine learning. American Journal of Infection Control, 46(9), 986–991. doi:10.1016/j.ajic.2018.02.021
OpenUrl CrossRef

[32] [32].↵
Zhou, T., Ren, Z., Ma, Y., He, L., Liu, J., Tang, J., & Zhang, H. (2023). Early identification of bloodstream infection in hemodialysis patients by machine learning. Heliyon, 9(7). doi:10.1016/j.heliyon.2023.e18263
OpenUrl CrossRef

[33] [33].
Yoo, J., Kim, S. H., Hur, S., Ha, J., Huh, K., & Cha, W. C. (2021). Candidemia Risk Prediction (CanDETEC) Model for Patients With Malignancy: Model Development and Validation in a Single-Center Retrospective Study. JMIR medical informatics, 9(7), e24651. doi:10.2196/24651
OpenUrl CrossRef

[34] [34].↵
Mooney, C., Eogan, M., Ní Áinle, F., Cleary, B., Gallagher, J. J., O’Loughlin, J., & Drew, R. J. (2021). Predicting bacteraemia in maternity patients using full blood count parameters: A supervised machine learning algorithm approach. International Journal of Laboratory Hematology, 43(4), 609–615. doi:10.1111/ijlh.13434
OpenUrl CrossRef

[35] [35].↵
Su, M., Chen, H., Qiu, J., & Huang, J. (2021). Four Biomarkers-Based Artificial Neural Network Model for Accurate Early Prediction of Bacteremia with Low-level Procalcitonin. Annals of Clinical and Laboratory Science, 51(3), 408–414.
OpenUrl Abstract/FREE Full Text

[36] [36].↵
Wu, L., Xia, D., & Xu, K. (2023). Multi-Clinical Factors Combined with an Artificial Intelligence Algorithm Diagnosis Model for HIV-Infected People with Bloodstream Infection. Infection and Drug Resistance, 16, 6085–6097. doi:10.2147/IDR.S423709
OpenUrl CrossRef

[37] [37].↵
Choi, D. H., Hong, K. J., Park, J. H., Shin, S. D., Ro, Y. S., Song, K. J., Kim, K. H., & Kim, S. (2022). Prediction of bacteremia at the emergency department during triage and disposition stages using machine learning models. American Journal of Emergency Medicine, 53, 86–93. doi:10.1016/j.ajem.2021.12.065
OpenUrl CrossRef

[38] [38].↵
Choi, D. H., Lim, M. H., Kim, K. H., Shin, S. D., Hong, K. J., & Kim, S. (2023). Development of an artificial intelligence bacteremia prediction model and evaluation of its impact on physician predictions focusing on uncertainty. Scientific Reports, 13(1). doi:10.1038/s41598-023-40708-2
OpenUrl CrossRef

[39] [39].↵
Boerman, A. W., Schinkel, M., Meijerink, L., van den Ende, E. S., Pladet, L. C., Scholtemeijer, M. G., Zeeuw, J., van der Zaag, A. Y., Minderhoud, T. C., Elbers, P. W. G., Wiersinga, W. J., de Jonge, R., Kramer, M. H., & Nanayakkara, P. W. B. (2022). Using machine learning to predict blood culture outcomes in the emergency department: a single-centre, retrospective, observational study. BMJ open, 12(1), e053332. doi:10.1136/bmjopen-2021-053332
OpenUrl Abstract/FREE Full Text

[40] [40].↵
Chang, Y.-H., Hsiao, C.-T., Chang, Y.-C., Lai, H.-Y., Lin, H.-H., Chen, C.-C., Hsu, L.-C., Wu, S.-Y., Shih, H.-M., Hsueh, P.-R., Hsueh, P.-R., & Cho, D.-Y. (2023). Machine learning of cell population data, complete blood count, and differential count parameters for early prediction of bacteremia among adult patients with suspected bacterial infections and blood culture sampling in emergency departments. Journal of Microbiology, Immunology and Infection, 56(4), 782–792. doi:10.1016/j.jmii.2023.05.001
OpenUrl CrossRef

[41] 41.↵
Schinkel, M., Boerman, A. W., Bennis, F. C., Minderhoud, T. C., Lie, M., Peters-Sengers, H., Holleman, F., Schade, R. P., de Jonge, R., Wiersinga, W. J., Wiersinga, W. J., & Nanayakkara, P. W. B. (2022). Diagnostic stewardship for blood cultures in the emergency department: A multicenter validation and prospective evaluation of a machine learning prediction tool. EBioMedicine, 82. doi:10.1016/j.ebiom.2022.104176
OpenUrl CrossRef

[42] [42].↵
Goh, V., Chou, Y.-J., Lee, C.-C., Ma, M.-C., Wang, W. Y. C., Lin, C.-H., & Hsieh, C.-C. (2022). Predicting Bacteremia among Septic Patients Based on ED Information by Machine Learning Methods: A Comparative Study. Diagnostics, 12(10). doi:10.3390/diagnostics12102498
OpenUrl CrossRef

[43] [43].↵
Tsai, W.-C., Liu, C.-F., Ma, Y.-S., Chen, C.-J., Lin, H.-J., Hsu, C.-C., Chow, J. C., Chien, Y.-W., & Huang, C.-C. (2023). Real-time artificial intelligence system for bacteremia prediction in adult febrile emergency department patients. International Journal of Medical Informatics, 178. doi:10.1016/j.ijmedinf.2023.105176
OpenUrl CrossRef

[44] [44].↵
Roimi, M., Neuberger, A., Shrot, A., Paul, M., Geffen, Y., & Bar-Lavie, Y. (2020). Early diagnosis of bloodstream infections in the intensive care unit using machine-learning algorithms. Intensive Care Medicine, 46(3), 454–462. doi:10.1007/s00134-019-05876-8
OpenUrl CrossRef PubMed

[45] [45].↵
Van Steenkiste, T., Ruyssinck, J., De Baets, L., Decruyenaere, J., De Turck, F., Ongenae, F., & Dhaene, T. (2019). Accurate prediction of blood culture outcome in the intensive care unit using long short-term memory neural networks. Artificial intelligence in medicine, 97, 38–43. doi:10.1016/j.artmed.2018.10.008
OpenUrl CrossRef PubMed

[46] [46].
Boner, Z., & Moore, C. C. (2022). Deep Learning Risk Prediction of Bloodstream Infection in the Intensive Care Unit. In KDD Undergraduate Consortium, August 14-18, 2022, Washington, D.C… ACM, New York, NY, USA, 7 pages

[47] [47].↵
Pai, K.-C., Wang, M.-S., Chen, Y.-F., Tseng, C.-H., Liu, P.-Y., Chen, L.-C., Sheu, R.-K., & Wu, C.-L. (2021). An artificial intelligence approach to bloodstream infections prediction. Journal of Clinical Medicine, 10(13). doi:10.3390/jcm10132901
OpenUrl CrossRef

[48] [48].↵
Parreco, J. P., Hidalgo, A. E., Badilla, A. D., Ilyas, O., & Rattan, R. (2018). Predicting central line-associated bloodstream infections and mortality using supervised machine learning. Journal of Critical Care, 45, 156–162. doi:10.1016/j.jcrc.2018.02.010
OpenUrl CrossRef

[49] [49].↵
Yuan, S., Sun, Y., Xiao, X., Long, Y., & He, H. (2021). Using Machine Learning Algorithms to Predict Candidaemia in ICU Patients With New-Onset Systemic Inflammatory Response Syndrome. Frontiers in Medicine, 8. doi:10.3389/fmed.2021.720926
OpenUrl CrossRef

[50] [50].↵
Collins, G. S., Reitsma, J. B., Altman, D. G., & Moons, K. G. (2015). Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ (Clinical research ed.), 350, g7594. doi:10.1136/bmj.g7594
OpenUrl CrossRef PubMed

[51] [51].↵
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, 321–357.
OpenUrl CrossRef

[52] [52].↵
Little, R., & Rubin, D. (2019). Statistical analysis with missing data, third edition. Wiley.

[53] [53].↵
Younis HA, Eisa TAE, Nasser M, et al. A Systematic Review and Meta-Analysis of Artificial Intelligence Tools in Medicine and Healthcare: Applications, Considerations, Limitations, Motivation and Challenges. Diagnostics (Basel). 2024;14(1):109. Published 2024 Jan 4. doi:10.3390/diagnostics14010109
OpenUrl CrossRef

[54] [54].↵
Dargère, S., Cormier, H., & Verdon, R. (2018). Contaminants in blood cultures: importance, implications, interpretation and prevention. Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases, 24(9), 964–969. doi:10.1016/j.cmi.2018.03.030
OpenUrl CrossRef PubMed

[55] [55].↵
Habehh, H., & Gohel, S. (2021). Machine Learning in Healthcare. Current genomics, 22(4), 291–300. doi:10.2174/1389202922666210705124359
OpenUrl CrossRef

[56] [56].↵
Zoabi, Y., Kehat, O., Lahav, D. et al. Predicting bloodstream infection outcome using machine learning. Sci Rep 11, 20101 (2021). doi:10.1038/s41598-021-99105-2
OpenUrl CrossRef

[57] [57].↵
Bohr, A., & Memarzadeh, K. (2020). The rise of artificial intelligence in healthcare applications. Artificial Intelligence in Healthcare, 25–60. doi:10.1016/B978-0-12-818438-7.00002-2
OpenUrl CrossRef

[58] [58].
Javaid, M., Haleem, A., Singh, R. P., Suman, R., & Rab, S. (2022). Significance of machine learning in healthcare: Features, pillars and applications. International Journal of Intelligent Networks, 3, 58–73. doi:10.1016/j.ijin.2022.05.002
OpenUrl CrossRef

[59] [59].↵
Bajwa, J., Munir, U., Nori, A., & Williams, B. (2021). Artificial intelligence in healthcare: transforming the practice of medicine. Future healthcare journal, 8(2), e188–e194. doi:10.7861/fhj.2021-0095
OpenUrl Abstract/FREE Full Text

[60] [60].↵
Batko, K., & Ślęzak, A. (2022). The use of Big Data Analytics in healthcare. Journal of big data, 9(1), 3. doi:10.1186/s40537-021-00553-4
OpenUrl CrossRef

[61] [61].↵
Alemkere, G., Tenna, A., & Engidawork, E. (2019). Antibiotic use practice and predictors of hospital outcome among patients with systemic bacterial infection: Identifying targets for antibiotic and health care resource stewardship. PloS one, 14(2), e0212661. doi:10.1371/journal.pone.0212661
OpenUrl CrossRef PubMed

[62] [62].↵
Giordano, C., Brennan, M., Mohamed, B., Rashidi, P., Modave, F., & Tighe, P. (2021). Accessing Artificial Intelligence for Clinical Decision-Making. Frontiers in digital health, 3, 645232. doi:10.3389/fdgth.2021.645232
OpenUrl CrossRef

[63] [63].↵
Antoniadi AM, Du Y, Guendouz Y, Wei L, Mazo C, Becker BA, Mooney C. Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review. Applied Sciences. 2021; 11(11):5088. doi:10.3390/app11115088
OpenUrl CrossRef

[64] [64].↵
Susanto, A. P., Lyell, D., Widyantoro, B., Berkovsky, S., & Magrabi, F. (2024). How Well Do AI-Enabled Decision Support Systems Perform in Clinical Settings?. Studies in health technology and informatics, 310, 279–283. doi:10.3233/SHTI230971
OpenUrl CrossRef

[65] [65].↵
Iqbal, J., Cortés Jaimes, D. C., Makineni, P., Subramani, S., Hemaida, S., Thugu, T. R., Butt, A. N., Sikto, J. T., Kaur, P., Lak, M. A., Augustine, M., Shahzad, R., & Arain, M. (2023). Reimagining Healthcare: Unleashing the Power of Artificial Intelligence in Medicine. Cureus, 15(9), e44658. doi:10.7759/cureus.44658
OpenUrl CrossRef

[66] [66].↵
Xu, Y., Liu, X., Cao, X., Huang, C., Liu, E., Qian, S., Liu, X., Wu, Y., Dong, F., Qiu, C.-W., Qiu, J., Hua, K., Su, W., Wu, J., Xu, H., Han, Y., Fu, C., Yin, Z., Liu, M., Roepman, R., … Zhang, J. (2021). Artificial intelligence: A powerful paradigm for scientific research. The Innovation, 2(4), Article 100179. doi:10.1016/j.xinn.2021.100179
OpenUrl CrossRef

[67] [67].↵
Myllyaho, L., Raatikainen, M., Männistö, T., Mikkonen, T., & Nurminen, J. K. (2021). Systematic literature review of validation methods for AI systems. Journal of Systems and Software, 181, Article 111050. doi:10.1016/j.jss.2021.111050
OpenUrl CrossRef

[68] [68].↵
de Hond, A.A.H., Leeuwenberg, A.M., Hooft, L. et al. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: a scoping review. npj Digit. Med. 5, 2 (2022). doi:10.1038/s41746-021-00549-7
OpenUrl CrossRef PubMed

[69] [69].↵
Stevens, L. M., Mortazavi, B. J., Deo, R. C., Curtis, L., & Kao, D. P. (2020). Recommendations for Reporting Machine Learning Analyses in Clinical Research. Circulation. Cardiovascular quality and outcomes, 13(10), e006556. doi:10.1161/CIRCOUTCOMES.120.006556
OpenUrl CrossRef PubMed

[70] [70].
Rieke, N., Hancox, J., Li, W. et al. The future of digital health with federated learning. npj Digit. Med. 3, 119 (2020). doi:10.1038/s41746-020-00323-1
OpenUrl CrossRef PubMed

[71] [71].
Moor, M., Banerjee, O., Abad, Z.S.H. et al. Foundation models for generalist medical artificial intelligence. Nature 616, 259–265 (2023). doi:10.1038/s41586-023-05881-4
OpenUrl CrossRef PubMed

[72] [72].↵
Antoniou, T., & Mamdani, M. (2021). Evaluation of machine learning solutions in medicine. CMAJ : Canadian Medical Association journal = journal de l’Association medicale canadienne, 193(36), E1425–E1429. doi:10.1503/cmaj.210036
OpenUrl FREE Full Text

[73] [73].↵
Gerke, S., Minssen, T., & Cohen, G. (2020). Ethical and legal challenges of artificial intelligence-driven healthcare. Artificial Intelligence in Healthcare, 295–336. doi:10.1016/B978-0-12-818438-7.00012-5
OpenUrl CrossRef

[74] [74].↵
Naik, N., Zeeshan, H. B. M., Shetty, D. K., Swain, D., Shah, M., Paul, R., Aggarwal, K., Ibrahim, S., Patil, V., Komal, S., Shetty, S., Rai, B. P., Chlosta, P., & Somani, B. K. (2022). Legal and ethical consideration in artificial intelligence in healthcare: Who takes responsibility? Frontiers in Surgery, 9. doi:10.3389/fsurg.2022.862322
OpenUrl CrossRef

[75] [75].↵
Khalid, N., Qayyum, A., Bilal, M., Al-Fuqaha, A., & Qadir, J. (2023). Privacy-preserving artificial intelligence in healthcare: Techniques and applications. Computers in Biology and Medicine, 158, Article 106848. doi:10.1016/j.compbiomed.2023.106848
OpenUrl CrossRef

[76] [76].↵
Murdoch, B. Privacy and artificial intelligence: challenges for protecting health information in a new era. BMC Med Ethics 22, 122 (2021). doi:10.1186/s12910-021-00687-3
OpenUrl CrossRef

Recent Advancements in Machine Learning-Based Bloodstream Infection Prediction: A Systematic Review and Meta-analysis of Diagnostic Test Accuracy

Abstract

1. INTRODUCTION

1.1. Related works

2. METHODS

2.1. Search Strategy

2.2. Study Selection

2.2.1. Inclusion Criteria

2.2.2. Exclusion Criteria

2.2.3. Timeframe and Language Restrictions

2.3. Data Extraction

2.4. Quality Assessment

2.5. Meta-analysis

3. RESULTS

3.1 Study selection

3.2 Study characteristics

3.3. Models and data types

3.4. Data challenges and strategies

3.5. Quality of evidence and risk of bias

3.6. Meta-analysis

4. DISCUSSIONS

4.1. Clinical relevance and model performance

4.2. Future directions and academic contribution

4.2. Strengths and Limitations

5. CONCLUSIONS

Data Availability

Author’s contributions

Funding

Acknowledgements

Appendix List of Medical Abbreviations

References

Citation Manager Formats

Subject Area