ABSTRACT
Background The contribution of registered nurses (RN) towards safe patient care has been demonstrated by several studies. However, most of the evidence is cross-sectional, hence the inability to demonstrate that staffing levels precede patient outcomes. No reviews have summarised longitudinal studies considering nurse staffing and patient outcomes.
Objectives To synthesise longitudinal studies focusing on associations between nurse staffing levels and patient outcomes.
Methods Systematic review. We conducted our search in 2020 and updated it in July 2021. We searched Medline, CINAHL, Embase and the Cochrane Library. We used the ROBINS-I tool for assessing risk of bias. We reported results narratively grouped by outcome.
Results 28 papers were included. Most studies were either at serious (n=12) or critical (n=6) risk of bias, with 3 studies at low risk of bias. Studies were conducted in a variety of settings and populations. Notwithstanding the limitations, findings are consistent with an overall picture of a beneficial effect from higher RN staffing on preventing patient death. Studies with the greatest risk of bias were judged as most likely to underestimate the effect of higher RN staffing. The evidence is less clear for other patient outcomes, but estimates, though at moderate or serious risk of bias, indicate that higher RN staffing is likely to lead to better patient outcomes. Evidence about the contribution of other nursing staff groups and skill mix of the team is unclear.
Conclusion There is a likely causal relationship between low RN staffing and harm to patients, although uncertainties remain regarding the magnitude of effect. To address these uncertainties, future studies should be conducted in more than one hospital and using standardised measures when reporting staffing levels.
INTRODUCTION
Nursing staff account for nearly half of the health workforce worldwide, and the cost of providing inpatient hospital nursing is one of the main cost drivers for health systems.1 Providing nurse staffing levels that match patient demand is key to deliver cost-effective health services. In the face of increasing financial pressures and budget constraints, registered nurses (RNs) are sometimes viewed simply as a costly labour input, which can often be substituted with lower paid unregistered staff.2 This has raised questions about the contribution of RNs and other nurses in the workforce in ensuring patients receive safe and high-quality care, including preventing deterioration and death among hospital inpatients. Besides RNs, the nursing workforce comprises of unregistered nursing assistants (NAs), and licenced practical nurses/nursing associates (LPNs), who accessed the profession not through a university degree, but rather by completing a formal training programme.
When considering the associations between nurse staffing and patient outcomes in inpatient hospital settings, the breadth of the evidence is apparent, with hundreds of studies published to date, most of which support a conclusion that the higher the RN staffing, the lower the rate of adverse patient outcomes, including death.3 The volume of the evidence has led many to question whether more studies are needed because the implications for policy and practice are clear and the evidence definitive.4 5 However past reviews of this literature,6-9 have noted the preponderance of cross-sectional studies. Such studies are unable to establish that the observed variation in staffing levels, typically measured at a hospital level average, was experienced by the patients whose outcomes were measured, typically aggregated at the hospital level over a year. This approach cannot accurately estimate the nursing care available to individual patients whose outcomes are studied.
The limitations of the evidence are such that for some, a causal relationship is still questionable, although a careful analysis applying epidemiological principles suggests that the body of evidence is consistent with a “cause and effect” relationship.3 6 Nonetheless, the indirect associations reported in most studies means it is impossible to estimate the effect of change in staffing levels without bias.
In recent years, there has been a steady increase in studies using routinely collected data in healthcare, including studies that use electronic rostering systems and patient records to link patients to the staffing levels they are exposed to throughout their hospital stay.10 11 Such studies have the potential to establish the staffing levels that individual patients have been exposed to prior to experiencing the outcome and so directly explore the effect of variation. In addition to removing many potential sources of bias associated with cross-sectional studies, longitudinal studies avoid the intrinsic limitation of cross-sectional studies, as the presumed cause can be shown to precede the outcome of interest. There is no summary of studies using longitudinal designs to explore the impact of staffing levels on patient and organisational outcomes in inpatient hospital settings. Therefore, the aim of this systematic review is to synthesise the evidence on the effect of exposure to variation in nurse staffing levels on subsequent patient outcomes.
METHODS
Eligibility criteria
We included all quantitative, longitudinal studies that measured nurse staffing and assessed the association between variation in staffing and subsequent patient outcome(s). We included prospective, retrospective, cohort, case-control, randomised or quasi-randomised controlled trials, controlled before-and-after studies, interrupted time series, difference-in-difference or panel studies. To be eligible, studies must establish a temporal link between nurse staffing and patient outcomes, such that variation in nurse staffing levels preceded the outcomes patients experienced. Studies with repeated measures which deployed a cross-sectional analysis were excluded except where difference in difference designs were used to show an association between change in staffing and change in outcomes over time. Similarly, we included studies where planned interventions were implemented and natural experiments where the effect of an exogenous ‘shock’ (e.g. changes in legislation or major policies designed to alter staffing levels) involving a change in staffing was studied. We did not specify a list of patient outcomes a priori, but we defined patient outcomes as any outcomes experienced by patients as opposed to staff, families, and the healthcare systems.
Study selection and data extraction
We registered our review protocol in Prospero (Blinded_for_peer_review). We conducted our search in October 2020 and updated it in July 2021. We searched Medline, CINAHL, Embase and the Cochrane Library. The complete search strategy can be found in Appendix 1.
One reviewer (XXX) undertook the first screening to remove duplicates and irrelevant studies. Potentially relevant papers were then further screened with a more detailed assessment of titles and abstracts. At this stage, all other reviewers assessed samples of 10 studies each to ensure that there was consistency, and to identify points of ambiguity and uncertainty in selection criteria. Full texts of studies that remained after this screening were retrieved and detailed assessment was made against the criteria. All full text papers were assessed by XXX and another reviewer, and any disagreements were resolved by discussion and reaching consensus among all reviewers.
From each included study we extracted author(s), year; country and setting; sample size; measure(s) of staffing levels; outcomes and risk-adjustment; findings.
Assessment of risk of bias
We used the ROBINS-I tool for assessing risk of bias in non-randomised studies; this tool focuses on a study’s internal validity, and bias is defined as a tendency for study results to differ systematically from the results expected from a randomised trial, conducted on the same participants and with no flaws in its conduct.12 When using the tool to assess studies of natural variation in staffing we interpreted “intervention” as variation in exposure to nurse staffing levels. The risk of bias due to different methodological aspects (called “domains”) was assessed: confounding, selection of participants into the study, classification of interventions, deviation from intended interventions, missing data, measurement of outcomes, and selection of the reported results. Each domain of the ROBINS-I tool was graded as either low, moderate, serious, or critical risk of bias, and the domain grade with the highest risk of bias determined the overall risk of bias grade. According to the ROBINS-I guidance,12 we excluded studies at critical risk of bias from our synthesis,13-18 but these studies’ characteristics are reported in Appendix 2.
We considered whether bias was likely to over or under estimate nurse staffing effects. Two reviewers independently assessed the risk of bias for each study, and where there was disagreement, this was resolved by collective discussion.
Evidence synthesis
We were unable to identify groups of studies that used measures of staffing and outcomes that were sufficiently comparable to pool in a statistical meta-analysis. Therefore, we performed a narrative synthesis with results grouped by outcome. Where studies performed more than one analysis, we reported results at the analysis level rather than summarising them at the study level. Outcome measures were grouped if they were reported in ten or more analyses using a compatible staffing measure. When less than twelve analyses reported on the same outcome, we grouped outcomes in common themes.
RESULTS
Our search yielded 4,080 records, of which 954 were duplicates and 2,834 were excluded based on title and abstract. We assessed the full text of the remaining 292 studies for eligibility. Of these, 28 were included in the review (see Figure 1).
Studies were published between 2003 and 2021. Most studies collected data from one hospital only (n= 15), but there were some notable exceptions, for example McHugh et al included 55 hospitals,19 Hamilton et al included 54 hospitals,20 and Mark and Belyea included 145 hospitals. Patient samples ranged between 8514 and 489,155.19 In terms of settings, 11 studies were conducted in Intensive Care Units (ICU); three in Acute Medical Units; thirteen in a variety of inpatient wards, which could also include ICUs. Most studies (n=9) were conducted in the US, 5 in the UK, 4 in Canada, 3 each in Italy, Australia and Switzerland, and one in Finland.
The length of the gap between the measurement of staffing levels and the outcome (the “exposure window”) ranged between 6 hours21 and 30 days.15 In studies considering policy implementations, staffing and outcomes relationships were measured one year after implementation.18 19 Thirteen studies considered staffing levels averaged or summed cumulatively for the whole patient stay. 13 16 17 20 22-30 Six studies considered staffing in the early part of the patient stay only, ranging between the first day and the first seven days of the patients’ stay. 23-25 31-33
Nurse staffing levels were measured in a variety of ways, which are summarised in Table 1 Staffing levels and skill mix measures deployed by studies. Some studies used multiple measures of staffing.
Risk of bias
Of the 28 studies, only three were assessed as at overall low risk of bias, seven studies were at moderate risk of bias, and twelve studies were classified as at serious risk of bias. Finally, six studies were at critical risk of bias and were not included in our synthesis.
Overall, we found that most studies had either serious (n=6) or critical (n=3) risk of bias resulting from confounding. I Many studies lacked appropriate adjustment for risk at baseline and/or failed to address time-varying confounding resulting from potential staffing increases as a response to worsening of a patients’ condition. Bias in selection of participants into the study was either low or moderate, apart from two studies of the implementation of staffing policies, which were classified as serious risk of bias due to the selection of units and hospitals which implemented staffing interventions and controls, which was not random. 18 19
In many studies, the direction of any bias was determined to be unpredictable. However, in 9 studies with a serious or critical risk of bias, lack of individual risk-adjustment would likely bias results in favour of lower staffing or attenuate any observed effects from higher staffing. Failure to include known predictors of the outcome in statistical models could have led to estimates favouring lower staffing levels because patients at low risk of experiencing a negative outcome have lower need and may consequently be exposed to lower staffing. 34 35 On the other hand, in two studies failure to control for levels of other staff could lead to an overestimate of the benefits of higher nurse staffing levels because the nurse staffing measure was likely to correlate with staffing by other staff groups and these staff groups have an effect on outcomes. While some other studies had no direct control for other staff groups the effect of the omission was less predictable as the design meant that variation in nurse staffing was unlikely to be correlated with that of other staff groups or else there was evidence presented for a lack of correlation. Detailed assessments can be found in Appendix 3.
Mortality
Most studies and analyses showed higher staffing to be associated with reduced mortality. Twelve studies reported 40 analyses on patient mortality, but two studies (8 analyses) were at critical risk of bias and were excluded from the narrative synthesis.
Focusing on RNs (20 analyses in total), most studies and analyses showed higher staffing to be associated with reduced mortality although there was some inconsistency. Studies favouring higher staffing included larger studies at low/moderate risk of bias. Thirteen analyses from nine studies found that higher RN staffing levels were associated with a reduced risk of in-hospital mortality in adult general patient populations13 23-26 28 30 32 and reduced risk of death in neonatal populations.20 Six analyses from four studies reported associations between RN staffing levels and patient mortality that were not statistically significant, 20 25 26 33 although coefficients, when presented, favoured higher RN staffing. One analysis from one small study with serious risk of bias found that higher RN staffing levels were associated with higher risk of mortality.33
We report all point estimates for RN staffing and mortality in Table 2. Effect sizes were, typically, small. Exposure to low staffing was measured with different thresholds and different exposure windows. The diversity of staffing measures and exposure windows makes any meaningful comparison or synthesis impossible.
Skill mix and non-RN staffing findings were mixed and provide an inconsistent picture with contrasting results. Three analyses (two studies) reported increased mortality when patients were exposed to low NA staffing, 25 32 although Griffiths et al found a non-linear relationship so NA staffing above the norm was also associated with increased risk of mortality.32 Two analyses from two studies reported higher mortality when patients were exposed to higher NA staffing levels, although these were not statistically significant.28 32 Needleman and colleagues also combined RN and NA staffing levels in two analyses and found patients exposed to lower staffing levels from both groups were more likely to die.25 Musy et al found that being exposed to either high or low LPN staffing levels was associated with lower mortality, although only high LPN staffing levels were statistically significant.30 Skill mix was not associated with patient mortality.26 Point estimates for all nurses, NA and LPN staffing and mortality analyses are reported in Table 3.
Focusing on temporary staffing, three analyses from one study in one hospital with 138,133 found that exposure to higher levels of temporary RN and NA staffing were associated with higher patient mortality. However, for RNs this effect was found only when a high number of RN hours, equivalent to between one third and one half of the average ward staffing complement, were from temporary nurses (HR = 1.12; 95% CI = 1.03–1.21).31 The study was at serious risk of bias.
Infections
Overall, there was inconclusive evidence of the effect of nurse staffing levels on infections, with results pointing towards an effect in favour of higher RN staffing levels.
After removing one study at critical risk of bias, 16 analyses from seven studies examined the impact of staffing levels on patients’ infections, including central venous catheter (CVC)-associated bloodstream infections, early and late onset ventilator-associated pneumonia, and healthcare-associated infections. Considering RNs, eight analyses from seven studies reported statistically non-significant results, although point estimates, when available, showed a protective effect of higher RN staffing,22 36-39 except for one analysis.40 41 Four analyses found that being exposed to higher RN staffing was protective of infections in neonatal intensive care settings,40 41 adult intensive care settings37 38 and adult hospital settings.39
While two analyses found no statistically significant relationships (point estimates not reported) between NA staffing and infections,22 39 one analysis found that higher NA staffing levels were associated with lower risk of healthcare-associated infections.39 One analysis found that patients exposed to days with higher levels of temporary nurse staffing were more likely to experience CVC-associated bloodstream infections.22 All available point estimates for the analyses of staffing levels and infections are reported in Table 4.
Other outcomes
In a single site study of 138,133 patients, Griffiths and colleagues examined the effect of staffing levels on a composite outcome of adverse events, including death, cardiac arrest or unplanned ICU admission. They found that being exposed to higher RNHPPD in the first five days of a hospital admission was associated with a reduced hazard of experiencing adverse events (HR = 0.98; 95% CI = 0.96-0.99). Results for NAHPPD were in the opposite direction, although not statistically significant (HR = 1.01; 95% CI = 0.99-1.02).29
Patrician and colleagues examined the effect of staffing levels and skill mix on hospital-acquired pressure injuries in 13 hospitals and 1643 patients through 12 distinct analyses, exploring various exposure windows and skill mix configurations. The majority (10/12) of these analyses found no statistically significant associations, but two analyses found that a skill mix richer in LPNs over the three days (HR = 0.27 (no 95% CI reported)) and one week prior (HR = 0.56 (no 95% CI reported)) to the hospital-acquired pressure injury was protective of hospital-acquired pressure injuries.42
One study in two ICUs (one paediatric and one cardiac) and 11,310 admissions found that patients were more likely to receive positive pressure ventilation following unplanned extubation when there were more RNs in the preceding six hours (OR= 1.53; 95% CI: 1.11-2.12).21
Griffiths and colleagues found that patients’ length of stay in hospital was reduced by a mean of 0.23 days for each additional RN HPPD that a patient experienced (Gamma coefficient = -0.23; 95% CI = - 0.30 - -0.16), while there was a minimal increase in days for each additional NA HPPD a patient was exposed to throughout their stay (Gamma coefficient = 0.076; 95% CI = 0.03 - 0.13).29 Tschannen found that being exposed to higher NHPPD was associated with shorter than expected stays (relative to Diagnoses Related Groups based norms) (B= 2.481, SE= 1.0), but not overall average length of stay (B =0.43, SE = 0.01).27 A single site study of 9,643 patients over the age of 75 who received a cognitive screening, found that, for each day patients were exposed to an additional 0.5 RNHPPD, they were less likely to be readmitted, although this was not statistically significant (OR = 0.94; 95% CI = 0.82–1.06).28
Staffing policies
Of two studies examining the effect of staffing policy interventions,18 19 one was at critical risk of bias. A prospective panel study compared patient outcomes in hospitals that implemented a minimum nurse-to-patient ratio policy. Twenty-seven hospitals were subject to the staffing policy and 28 were not. Reducing workloads by one patient per nurse was associated with a decrease in 30-day mortality (OR= 0.93; 95% CI = 0.86-0.99), 7-day readmissions (OR = 0.93; 95% CI = 0.89-0.97) and length of stay (OR= 0.97; 95% CI = 0.94-0.99).19
DISCUSSION
This is the first systematic review of the effect of nurse staffing levels on patient outcomes focusing on longitudinal studies that can demonstrate a temporal link between exposure to staffing levels and outcomes. While studies were performed in different populations and some were at serious or critical risk of bias, findings are consistent with higher RN staffing reducing the risk of patient death. On the other hand, evidence about the positive contribution of other staff groups and the skill mix of the team is unclear.
This evidence is consistent with conclusions based primarily on cross-sectional evidence;3 but our review is limited to studies that can demonstrate that low staffing levels precede outcomes and thus eliminate several sources of confounding. When considering criteria to establish a causal relationship, there is a fundamental requirement that cause precedes effect.43 44 Although the evidence reviewed here remains observational, if relevant confounders are controlled for in the analyses, the parameter estimates for exposure can be causally interpreted,45 although some risk of bias remains. Studies with the greatest risk of bias were judged as most likely to underestimate the effect of higher nurse staffing.
The diverse measures of exposure and diverse settings and patient populations make it difficult to come to firm conclusions about the size of effects beyond the estimates provided by individual studies. However, while the effect sizes observed for mortality are typically small, the large populations exposed to risk means that these effects are, nonetheless, important. In a study that included 138,000 general medical and surgical admissions to one large hospital over 3 years we estimated that an increase of 1 RN hour per patient day might avoid 657 deaths and reduce bed use by over 30,000 bed days.29
When considering NAs and other grades of nursing staff, the effects observed are mixed, and so a causal inference from these results is more challenging. It seems likely from the evidence that any causal relationship is complex, with the hints of non-linear effects32 suggesting competing causal mechanisms from increased resource as opposed to substitution of RNs. Findings around temporary staffing, albeit from a single study, show that when small proportions of temporary nurses are deployed, possibly to maintain staffing levels, the effect on mortality seems to be beneficial, but when between a third and half of the RNs on shift are temporary, the association with mortality changes direction.
Regarding other patient outcomes, there is more uncertainty due to higher risk of bias in studies; while the direction of the relationships observed remains compatible with a protective effect of higher nurse staffing levels, most studies are at either serious or critical risk of bias.
Recommendations for future research practice
The evidence for the link between nurse staffing levels and mortality is strong from an internal validity perspective, but longitudinal studies using routinely collected data from a larger number of hospitals are needed to increase external validity, improving both the precision and generalisability of estimates. While we did not formally assess external validity, all studies at low risk of bias were conducted in a single hospital, which limits generalisability of findings. Ensuring external validity will be crucial to change policy and practice46 47 around nurse staffing levels.
Considering other patient outcomes, studies that can address the limitations of existing literature are needed. The causal pathway from low staffing to mortality has been theorised and empirically demonstrated, in part mediated by a failure to observe and mobilise response to deterioration.48-51 While much work has been done already about nurse-sensitive outcomes,52-54 the current evidence does not provide a consistent and coherent overview of how nurse staffing affects other patient outcomes. Estimation of the effect of variation in nurse staffing levels on some well-established nurse sensitive outcomes including falls and pressure ulcers is hampered by inadequate risk adjustment models, poor recording, and ascertainment bias in the current research.
In addition, we note that staffing levels were measured in a variety of ways across studies. If approaches to measuring staffing inputs were standardised, or if raw anonymised data were provided alongside papers, more comparable estimates could be combined in meta-analyses. While it is difficult to establish the clear superiority of one staffing effect measure over another, we would encourage all authors to offer analyses using measures based on existing reports in addition to any de-novo measures they derive. Staffing deviation standardised against unit means has been used as an absolute (low staffing) and continuous (hours per patient day relative to the mean) effect measure, which could be derived for all studies irrespective of the methods used to determine staffing requirements.
Limitations of the review
Although our search was extensive, the topic is difficult to capture precisely in structured searches. Although as a team we are familiar with the literature and were thus able to test search strategies, it is possible that we missed some eligible studies. However, it seems unlikely that we would have missed a sufficient number of low risk of bias studies to fundamentally change our conclusions or the general picture of the literature that we have presented here.
CONCLUSIONS
This review identified a likely causal relationship between higher RN staffing levels and reduced patient death. In contrast, we found much less certain evidence about the role of assistant staff, changes in skill mix and grades of nurse other than RNs. While it seems possible that such staff may make some contribution to patient safety, the evidence supports a policy of increases in RNs in the acute setting studied but cannot be used to support substitution of RNs by other grades of staff. That there is a causal relationship between low RN staffing and harm to patients now seems beyond dispute although uncertainties remain regarding the magnitude of effect.
Data Availability
No data included - systematic review
FINANCIAL SUPPORT
This study is funded by the Removed_for_peer_review
COMPETING INTERESTS
The authors declare no competing interests.