Abstract
Background COVID-19 has put unprecedented pressure on healthcare systems worldwide, leading to a reduction of the available healthcare capacity. Our objective was to develop a decision model that supports prioritization of care from a utilitarian perspective, which is to minimize population health loss.
Methods A cohort state-transition model was developed and applied to 43 semi-elective non-paediatric surgeries commonly performed in academic hospitals. Scenarios of delaying surgery from two weeks were compared with delaying up to one year, and no surgery at all. Model parameters were based on registries, scientific literature, and the World Health Organization global burden of disease study. For each surgery, the urgency was estimated as the average expected loss of Quality-Adjusted Life-Years (QALYs) per month.
Results Given the best available evidence, the two most urgent surgeries were bypass surgery for Fontaine III/IV peripheral arterial disease (0.23 QALY loss/month, 95%-CI: 0.09-0.24) and transaortic valve implantation (0.15 QALY loss/month, 95%-CI: 0.09-0.24). The two least urgent surgeries were placing a shunt for dialysis (0.01, 95%-CI: 0.005-0.01) and thyroid carcinoma resection (0.01, 95%-CI: 0.01-0.02): these surgeries were associated with a limited amount of health lost on the waiting list.
Conclusion Expected health loss due to surgical delay can be objectively calculated with our decision model based on best available evidence, which can guide prioritization of surgeries to minimize population health loss in times of scarcity. This tool should yet be placed in the context of different ethical perspectives and combined with capacity management tools to facilitate large-scale implementation.
What is already known on this topic The perspective of maximizing population health, a utilitarian ethical perspective, has been described to be most defendable in times of scarcity. To prioritize surgical patients, literature mainly discusses approaches which are intra-disciplinary (e.g. within gynecological or oncological surgery) and mostly existed of narrative reviews of the literature. Some decision tools were developed, which rely on the consensus of experts on various measures of urgency (e.g. health benefit, or time until inoperable). No approach was found which transparently weighs objective factors in order to quantify a clinically relevant measure of urgency.
What this study adds In contrast to previously developed approaches, our approach transparently and consistently aggregates best available objective evidence across disciplines. This novel aggregated urgency measure can be easily linked with capacity management tools. Our approach can help to minimize health losses when trying to overcome delay in surgeries in times of surgical scarcity, during the COVID-19 pandemic and beyond.
Background
COVID-19 has put unprecedented pressure on healthcare systems worldwide. The healthcare demand of this pandemic supersedes available healthcare capacity, far beyond the demand that was imposed by the 2017 influenza pandemic.1,2 The pressure on the available healthcare capacity impacts the continuity of regular care. Among other things because 1) wards and operating theaters are converted to COVID-19 care facilities,3 2) physicians are deployed to care for COVID-19 patients,4,5 and 3) the fear of contagion with SARS-CoV-2 may leave non-COVID patients reluctant to seek care4,5, as was seen in similar health crises like the SARS epidemic.6
Delay in surgical care may dramatically impact health care quality and accessibility. In the first weeks of the COVID-19 crisis in the Netherlands, 75-90% fewer surgeries were performed compared to previous years.7 The delay in cancer surgery already has made a large impact in the life expectancy of oncological patients.8 Moreover, it may be impossible to treat the whole accumulating group of patients, as estimated for orthopaedic and cardiothoracic surgery in the US.9,10 Because of these problems, hospitals are facing a dilemma: Which patients should be prioritized?
As stated by Emanuel et al., “The question is not whether to set priorities, but how to do so ethically and consistently, rather than basing decisions on individual institutions’ approaches or a clinician’s intuition in the heat of the moment”.2 In practice, individual surgical patients are most often triaged by experts from the respective surgical fields.11 Unfortunately the level of agreement on prioritization between experts is low.12 Additionally, prioritization across disciplines is complicated by the high degree of specialization in modern medicine. Most importantly, this approach does not systematically optimize population health. While the perspective of maximizing population health, a utilitarian ethical perspective,13 has been described to be most defendable in times of scarcity. 2,14–18
To guide prioritization of semi-elective surgeries across disciplines from a utilitarian perspective, our study aims to develop a decision model to estimate the impact of postponing surgery on health.
Methods
Overview
The most frequently performed semi-electivei surgeries in our institute were selected. Data about these surgeries were collected and used in a broadly applicable computer-based model to estimate the effect of surgical delay on survival and health related quality of life (QoL).
Patients and setting
The evaluated surgeries in this study comprised of non-paediatric and non-obstetric, semi-electivei surgeries in Erasmus University Medical Center, an academic tertiary referring hospital in the Netherlands. From the electronic patient registry (ChipSoft, HiX), the number of surgeries, surgery time, length of stay at an intensive care unit (ICU), and length of stay at a non-ICU of all non-urgent surgeries were retrieved from July 2017 to December 2019. Next, two senior clinicians selected the semi-elective surgeries from this list. Finally, the Value Based operation room (OR) team collaborators approved the selection. Ultimately, 43 semi-elective surgeries were selected that were performed more than 80 times during the inclusion interval. Where relevant, mild and severe cases undergoing the surgery were distinguished based on clinical insight of our collaborators.
Input parameters
The model required 7 input parameters: 1) survival rates pre-surgery, 2) survival rate post- surgery, 3) QoL pre-surgery, 4) QoL post-surgery, 5) mean age of patients undergoing the surgery, 6) time until no effect of treatment can be expected on survival or 7) time until no effect of treatment can be expected on QoL (see Appendix A). The class of collected evidence was defined as class I (Randomized Controlled Trials (RCT) or systematic reviews of RCTs), class IIa (Prospective observational studies, before-after studies), class IIb (Retrospective observational studies, expert panels for the utilities, national registries), and class III (expert opinion).
Markov model
A three-state cohort state-transition model was developed. This model simulates a hypothetical cohort of patients over a defined period in fixed time intervals, called cycles, to estimate the average time individuals spend in the various health conditions, called health states.19,20 Individuals could transition between a preoperative state, a postoperative state, and a dead state (Figure 1). Based on the time spent in these states, health benefits, like life years (LYs) or quality-adjusted life-years (QALYs) are calculated.20–22 The entire cohort started in the preoperative state, and was followed their remaining lifespan, until they were 100 years old, using weekly cycles. The transition from the preoperative state to the postoperative state was set to a specific week, depending on the scenario. Scenarios of surgical delay of two weeks were compared with surgical delay up to surgical delay of a year using intervals of ten weeks. In addition, the scenario where patients never received treatment was evaluated: this was modelled by following patients their entire remaining lifespan in the preoperative health state. In all scenarios, the transitions from the pre- and postoperative states to the dead state were based on survival data. A description of the model parameters and assumptions can be found in Appendix C.
Health Effects of Surgery
The effects of delays in surgery on the health outcomes (LYs and QALYs) were evaluated. LYs disregard QoL (QoL = 100%), while QALYs incorporate QoL and are therefore preferred. The health outcomes without surgery were compared to the health outcomes with surgery at 2 weeks and 52 weeks to determine the overall health outcomes associated with surgery and health outcomes lost per 50 weeks. This measure of urgency was converted to loss per month (loss/month) and was used to rank the surgeries, where a high loss/month indicates an urgent surgery.
Analysis
Probabilistic sensitivity analysis (PSA) was used to incorporate parameter uncertainty in the model outcome (Appendix C). Rankings based on health benefits or health loss per unit of time were compared using Spearman’s rank correlation coefficient.
The available surgical resources should also be optimally used. Therefore, the model results were compared visually to the capacity requirements in our hospital, obtained from the electronic patient registry. Since OR room availability was the bottleneck in our hospital during the first COVID-19 wave, this study focused on surgery time.
The model was built with R software23 and adapted from previously published code.24,25 The full model code is available on GitHub: https://github.com/bgravesteijn/Utilitarian-distribution-of-OR-capacity-during-COVID-19.
Results
Data collection
In total, 12 cardiothoracic surgeries were evaluated, along with 23 oncological surgeries, 2 transplantations (liver and living donor kidney), 5 vascular surgeries, and 1 other type of surgery (creation of a shunt to facilitate haemodialysis). These 43 evaluated surgeries comprised of 69% of the total semi-elective program in our hospital.
Survival with treatment was mostly based on national registries (31/43). Survival without treatment was mostly based on data from (inter)national registries (12/43 surgeries, 6 indirectly calculated through the treatment effect), but also frequently from RCT’s (10/43 surgeries, 7 indirectly calculated), and observational studies (9/43 surgeries, 3 indirectly calculated). For 14/43 surgeries, QoL was available through the WHO Global Burden of Disease study.26 For the remaining 29 surgeries, the QoL of the pre- and postoperative health state was estimated by the expert panel as described in Appendix C. For 6/43 surgeries, a “time-to-no-effect-on-QoL” within one year, our maximum period of delaying surgery, was applicable. For 23 surgeries, a “time-to-no-effect-of-treatment-on-survival” was assumed based on qualitative assessment of the literature. Most of these surgeries were oncological surgeries (20/23). The estimates for the time until surgery becomes ineffective was mostly based on class IIb evidence (retrospective and prospective observational studies, see table 1). Input parameters varied widely between surgeries (Figure 2). Appendix A presents all input parameters, their sources26,27,36–45,28,46–55,29,56–65,30,66–75,31,76–82,32–35, and the corresponding model output for each semi-elective surgery.
Quality of Life
The preoperative and postoperative health state of 3 surgeries (one with a mild and severe subgroup) were estimated in both sessions, resulting in 8 double estimates of QoL. The gain in QoL due to surgery was consistent in the two sessions (the standardized mean difference was 0.025, 95% CI: −0.11 – 0.16, Appendix B: table 3 and figure 1).
The maximum expected benefit of surgery, i.e. in a scenario without delay, ranged from 0.48 QALYs (95% CI: 0.32 – 0.83) for resection of muscle invasive bladder cancer to 10.3 QALYs (95% CI: 8.7 – 11.9) for kidney transplantation (Figure 3). The ranking based on QALYs gained by surgery was correlated with the ranking based on LYs gained by surgery: The Spearman rank correlation coefficient between the ranking of surgeries based on LYs and QALYs was 0.45 (p=0.003).
Urgency
Most surgeries had a clear linear descend in terms of QALYs per delay, except surgeries where a time until no effect of treatment on survival was used (figure 1, appendix B).
The urgency of the surgeries ranged from 0.01 QALY loss/month (95% CI: 0.00-0.01) for placing a shunt for dialysis, to 0.23 QALY loss/month (0.09-0.24) for a bypass surgery for Fontaine III/IV peripheral arterial disease (Appendix B: figure 4 and table 1). If the latter would be postponed by a month, patients lose approximately 84 days (0.23*365) spent in perfect health of their remaining expected QALYs gained by surgery.
The most urgent surgeries after bypass surgery for Fontaine III/IV peripheral arterial disease, was transaortic valve implantation (0.15 QALY loss/month, 95% CI: 0.09-0.24). After placing a shunt for patients with end-stage renal disease, the least urgent surgery was resection of thyroid cancer (0.01 QALY loss/month, 95% CI: 0.01-0.02) (Appendix B). Surgeries that were associated with a higher expected QALY benefit, often lost more QALYs per month delay: The Spearman correlation coefficient between the ranking of health benefit, in terms of QALYs, and urgency, in terms of QALY loss/month, was 0.32 (p=0.04).
Urgency was strongly correlated with the LYs gained per surgery: The Spearman rank correlation coefficient between the ranking of surgeries based on LYs and QALY loss/month was 0.79 (p< 0.001).
Capacity
Surgeries that are ranked high in terms of urgency and had relative short surgery time compared to other surgeries include repair of atrial septum defects (surgery time: 74 min [IQR: 56-131], 0.06 QALY loss/month [95% CI: 0.02-0.14]), pacemaker implantations (115 min [82-154], 0.11 QALY loss/month [0.04-0.22]), resection of mild larynx carcinoma (70 min [38-109], 0.07 QALY loss/month[0.04-0.11]), and valve replacements (99 min [77-125]; mitral valve replacement: 0.09 QALY loss/month [0.04-0.15]; aortic valve replacement: 0.09 QALY loss/month [0.06-0.17]) (Figure 5). Liver transplant is relatively urgent but requires an exceptional amount of OR-time (875 min [797-957], 0.08 QALY loss/month [0.07-0.09]) (table 2 Appendix B).
Discussion
Our proposed decision model is an attempt to systematically guide prioritization of surgeries from a utilitarian perspective. Urgency was quantified based on the expected health loss due to surgery delay. Our approach operationalizes ethical values that are the most appropriate in times of scarcity.2 Available evidence suggests that semi-elective surgeries can be ranked based on their urgency using a simple decision model. For survival after surgery, most evidence was based on national registries, while treatment effects were mostly derived from RCTs. The time until no effect of treatment on survival or QoL, however, was most often class IIb/III evidence.
Among the 43 surgeries analysed, bypass surgery for Fontaine III/IV peripheral arterial disease and transaortic valve implantation were the most urgent surgeries. Less urgent surgeries were placement of a shunt for dialysis and resection of thyroid cancer. Liver transplantation shows to be a relatively urgent surgery but requires an exceptionally long surgery time. In times of scarce OR-capacity, this surgery is less efficient in the prevention of QALY loss.
Interestingly, the ranking of urgency is primarily driven by the gain in LY associated with surgery: Surgeries that are associated with substantial gain in LY (e.g. mitral valve replacement), also lose more QALYs per month delay than surgeries that are associated with no gain in LY (e.g. creation of a shunt for haemodialysis). The larger the total health benefit associated with surgery, the more health can potentially be lost by postponing surgery.
To optimize OR triage, our metric for urgency should be weighed against capacity indicators. This is effectively a cost-effectiveness analysis, where resource constraints represent costs. For the scenario where OR-capacity is the most scarce in terms of hospital capacity, urgency can be plotted against surgery time. This simple method revealed that repairing atrial septum defects, pacemaker implantations, resection of mild larynx carcinoma, and valve replacements are the most efficient surgeries in our hospital to perform in this context. However, there are contexts where other types of capacity (e.g. beds or staff) are scarcer, and therefore more relevant to be weighed against urgency.
Although our modelling approach rationalizes and objectively quantifies urgency from a utilitarian perspective, it needs to be complemented by other perspectives to be used effectively in practice. First, an important consideration from the medical perspective may be the availability of alternative treatment strategies. In cancer treatment, (chemo-)radiation or systematic therapy alone may be considered instead of surgery. Second, a financial perspective might also be explored. This perspective might be less relevant in a crisis such as the COVID-19 pandemic, where the bottleneck mainly seems hospital capacity instead of costs. If this approach would be applied to the context of regular care, this perspective might be of increasing importance. Finally, other ethical perspectives (e.g. rule of rescue13) might be explored to assess the viability of our approach, and it needs to be established whether our approach is applicable to all surgical procedures.
There are practical advantages of comparing “average patients” on urgency, despite the fact that there is no such thing as an “average patient”: It prevents our approach from systematically discriminating against a specific group of patients. Our approach would only discriminate if specific socioeconomic groups would suffer more frequently from diseases that are less urgent. It is known that lower socioeconomic groups are more prone to develop diseases that have clear association with unhealthy behaviour, such as lung cancer.83 However, these diseases do not systematically rank low in our approach. Comparing the average patients across specialties on urgency may not be a personalized approach, but it can be tailored to an individual’s context by providing input for shared decision making. We feel that next to a quantitative estimation of urgency from a utilitarian perspective, individual patient’s preferences, social contexts, and operability should also be included in the decision making process.
Since all models are a simplification of reality, our model has several limitations. First, the survival data used were not always derived from high-quality evidence. Although survival with treatment might be validly estimated from national registries, the survival without treatment is harder to be unbiasedly estimated. The surgeries that were evaluated are often part of standard clinical practice. Therefore, data might be biased (e.g. selection bias in the survival without treatment because patients opt for palliative care), or not available (it would be unethical now to perform RCTs evaluating surgery versus no surgery). Instead, best available evidence was used, which in part included evidence from more historical RCTs. As such, data might be biased, and as a result, so are the estimates from our model. Because of this limitation, our approach is simply to aggregate transparently and systematically the best currently available evidence using a model.
Second, it was assumed that all surgeries are successful. The current model does not simulate adverse events, like major bleedings or death due to surgery. Also, the potential reduction of QoL due to these adverse events were not incorporated, nor the QoL reduction of a temporary period of recovery after surgery. Because of these assumptions, the overall QALYs associated with the surgery should not be interpreted as an absolute estimate. They are the maximum possible QALYs that can be acquired by performing the surgery. However, these assumptions were considered reasonable to achieve the main goal of this study: when surgery without delay is compared to surgery with delay, the harm in both scenarios is similar and therefore cancel out.
Third, because the health loss in 50 weeks was simply converted to loss per month, a linear approximation was effectively used to quantify urgency by delaying surgery up to a year. Some surgeries did show a slightly steeper decrease in the period up to 32 weeks delay. The data needed to validly model this decay in QALYs per unit of time for all surgeries likely don’t exist: most of the estimates of time to no effect on survival were based on observational studies, which are likely biased. A more detailed approximation would be possible using a more individualized model which also models the natural grow of tumours, or aneurysms, and validly model the development of metastasis. It was not feasible to develop this for all evaluated surgeries. Instead, we opted for a more pragmatic approach.
Fourth, QoL weights were derived from expert-opinion. In this approach the patient is not involved. Experts interpret the health states and give weights, thereby our approach takes a societal perspective. There are also multiple methodological, ethical, and contextual disadvantages of using QALYs, but it should be noted that most of those discussion are more about utilitarian principles.84
Fifth, the potential impact on QoL of delaying a semi-elective surgery was not included. This impact might differ across surgeries. It might be hypothesized that surgeries performed after already a long disease history (e.g. kidney transplant) might have less “waiting time disutility” than recently diagnosed diseases (e.g. mammacarcinoma).
Part of the input parameters were based on national registry data, but a substantial amount of the input originated from various international sources. Therefore, with some modifications, the model can easily be adapted to different contexts. Therefore, this study can be considered the first step towards a triaging strategy which optimizes surgical benefit in times of scarcity in surgical capacity, such as during the COVID-19 pandemic. To improve validity, it is however essential to periodically review the literature and update the model with higher quality evidence, much like a living systematic review.85 If accepted, a wider range of surgeries should be considered, implementation strategies should be explored and evaluated, and the model should be applied to a variety of settings.
Conclusion
By transparently aggregating best available evidence, our decision model support prioritization of surgical care in times of scarce surgical capacity (e.g. during pandemics) from a utilitarian perspective. Our approach quantifies the expected health loss due to delay for semi-elective surgeries in an academic hospital in the Netherlands. This approach can help to minimize health losses when trying to overcome delay in surgeries across disciplines. This approach is more transparent, more evidence-based, and more consistent than the alternative strategy of triaging based on expert opinion.
It should be noted that evidence from well-controlled comparison studies is often lacking. Instead, adjusted estimates from observational studies are often the best available evidence for benefit of surgery and the effects of delay on survival. Therefore, model inputs should be periodically updated with newer, higher quality evidence.
Finally, our approach should be placed in the context of other ethical perspectives and combined with capacity management tools. If accepted, we believe this tool should be implemented on a large scale, in order to minimize health loss of the accumulating group of patients awaiting surgery.
Data Availability
All source data and code are made available in an online github repository.
https://github.com/bgravesteijn/Utilitarian-distribution-of-OR-capacity-during-COVID-19
Authors contribution
Benjamin Gravesteijn: Conceptualization (Lead), Data curation (Lead), Formal analysis (Lead) Investigation (Lead), Methodology (Lead), Validation (Equal), Visualization (Lead), Writing-original draft (Lead)
Eline Krijkamp: Conceptualization (Lead), Data curation (Equal), Formal analysis (Lead), Investigation (Lead), Methodology (Lead), Validation (Supporting), Visualization (Lead), Writing-original draft (Lead)
Jan Busschbach: Conceptualization (Equal), Investigation (Equal), Methodology (Equal), Supervision (Supporting), Validation (Equal), Writing-review & editing (Equal)
Geert Geleijnse: Conceptualization (Equal), Data curation (Equal), Investigation (Supporting), Supervision (Supporting), Validation (Equal), Visualization (Supporting), Writing-review & editing (Equal)
Isabel Retel Helmrich: Data curation (Equal), Investigation (Equal), Methodology (Supporting), Writing-review & editing (Supporting)
Sophie Bruinsma: Data curation (Equal), Investigation (Equal), Writing-review & editing (Supporting)
Céline van Lint: Data curation (Equal), Investigation (Equal), Writing-review & editing (Supporting)
Ernest van Veen: Data curation (Supporting), Investigation (Supporting), Writing-review & editing (Supporting)
Ewout Steyerberg: Methodology (Supporting), Validation (Equal), Writing-review & editing (Supporting)
Kees Verhoef: Investigation (Supporting), Methodology (Supporting), Validation (Equal), Writing-review & editing (Supporting)
Jan van Saase: Conceptualization (Equal), Investigation (Equal), Supervision (Supporting), Validation (Equal), Writing-original draft (Equal), Writing-review & editing (Equal)
Hester Lingsma: Conceptualization (Lead), Investigation (Lead), Supervision (Lead), Writing-original draft (Equal), Writing-review & editing (Equal)
Rob Baatenburg de Jong: Conceptualization (Lead), Investigation (Lead), Supervision (Lead), Validation (Supporting), Writing-original draft (Equal), Writing-review & editing (Equal)
Funding
No specific funds were rewarded for this project.
Patient and public involvement
Already in the early phases, in the process of the design of this study, patient and public involvement was achieved by communicating our ideas via press-releases, interviews and open discussions. The aim for having this discussion was to identify barriers and facilitators of our approach, also possibly informing the study design.
Of course, it was not appropriate to include patient and/or public in the analysis of the study and writing the manuscript. However, medical experts, representing a part of the public, were included in the process of data collection for the quality of life values and further development of the methods. Moreover, the department of surgery within our hospital, an important end-user of our approach, tested the approach on face-validity and helped writing the manuscript, thereby showing their support.
Currently, we are also presenting our approach to patient federations. They have already expressed widespread interest and support to our approach. Finally, we are discussing with these stakeholders possible other barriers and facilitators for implementation which were not already identified.
Disclosures
Isabel Retel Helmrich and Ernest van Veen are supported by the European Union 7th Framework program (Center-TBI, EC grant 602150). Eline Krijkamp is supported by the Society for Medical Decision Making (SMDM) fellowship through a grant by the Gordon and Betty Moore Foundation (GBMF7853).
Appendix A
An overview per disease of the distribution and source of the input parameters and a graphical representation of the output of the model.
Appendix B
A summary of the estimates of the decision model and an overview of the counts, duration, and length of stay of the included surgeries in our hospital.
Appendix C
In-dept methods description.
Acknowledgement
We are grateful for Lisa Caulley for her suggestions to revise the final manuscript. We are grateful for H. Karreman and C. Van der Velden-van der Graaf for the work they have done for the quality of life data collection. Moreover, we want to thank Ruben Goedhart, Esther van Spronsen and Linda van der Sluijs – van der Beek for extracting the data from the electronic patient registry.
Footnotes
↵** Value Based Operation Room Triage team collaborators: Chris Bangma, Ivo Beetz, Patrick Bindels, Alexandra Brandt-Kerkhof, Danielle van Diepen, Clemens Dirven, Tjebbe Galema, Jeanette Goudzwaard, Mieke Hazes, Sjoerd Lagarde, Harmke Polinder-Bos, Eva Maria Roes, Hanneke Takkenberg, Mark van Vledder
↵i A semi-elective surgery is defined as a surgery that should ideally be performed within three days up to three weeks.