Cost-Effectiveness of Personalized Policies for Implementing Organ-at-Risk Sparing Adaptive Radiation Therapy in Head and Neck Cancer: A Markov Decision Process Approach

Seyedmohammadhossein Hosseinian; Daniel Suarez-Aguirre; Cem Dede; Raul Garcia; Lucas McCullum; Mehdi Hemmati; Aysenur Karagoz; Abdallah S. R. Mohamed; Stephen Y. Lai; Katherine A. Hutcheson; Amy C. Moreno; Kristy K. Brock; Fatemeh Nosrat; Clifton D. Fuller; Andrew J. Schaefer; the Rice/MD Anderson Center for Operations Research in Cancer (CORC); the MD Anderson Head and Neck Cancer Symptom Working Group

doi:10.1101/2024.11.05.24316767

Abstract

Purpose To develop a clinical decision-making model for implementation of personalized organ-at-risk (OAR)-sparing adaptive radiation therapy (ART) that balances the costs and clinical benefits of radiation plan adaptations, without limiting the number of re-plannings per patient, and derive optimal policies for head and neck cancer (HNC) radiation therapy.

Methods and Materials By leveraging retrospective CT-on-Rails imaging data from 52 HNC patients treated at the University of Texas MD Anderson Cancer Center, a Markov decision process (MDP) model was developed to identify the optimal timing for plan adaptations based on the difference in normal tissue complication probability (ΔNTCP) between the planned and delivered dose to OARs. To capture the trade-off between the costs and clinical benefits of plan adaptations, the end-treatment ΔNTCPs were converted to Quality Adjusted Life Years (QALYs) and, subsequently, to equivalent monetary values, by applying a willingness-to-pay per QALY parameter.

Results The optimal policies were derived for 96 combinations of willingness-to-pay per QALY (W) and re-planning cost (RC). The results were validated through a Monte Carlo (MC) simulation analysis for two representative scenarios: (1) W = $200,000 and RC = $1,000; (2) W = $100,000 and RC = $2,000. In Scenario (1), the MDP model’s policy was able to reduce the probability of excessive toxicity, characterized by ΔNTCP ≥ 5%, to zero (down from 0.21 when no re-planning was done) at an average cost of $380 per patient. Under Scenario (2), it reduced the probability of excessive toxicity to 0.02 at an average cost of $520 per patient.

Conclusions The MDP model’s policies can significantly improve the treatment toxicity outcomes compared to the current fixed-time (one-size-fits-all) approaches, at a fraction of their costs per patient. This work lays the groundwork for developing an evidence-based and resource-aware workflow for the widespread implementation of ART under limited resources.

Introduction

Radiation therapy for head and neck cancer (HNC) has been markedly successful in achieving locoregional tumor control over the past decades, mainly due to the predominance of human papillomavirus associated (HPV+) tumors.¹ However, a range of acute and chronic toxicities resulting from radiation injuries to non-targeted organs-at-risk (OARs) are still common sequelae in current standard HNC radiation therapy.^2-9 The risk of normal tissue injury increases with anatomical changes during treatment, such as tumor shrinkage and OAR deformation, which can lead to discrepancies between the planned and actual doses received by the OARs.^10-12 The principle of adaptive radiation therapy (ART) is to account for these anatomical changes by enabling on-treatment adaptations to the radiation plan (i.e., re-planning).^13-16

Adapting the radiation plan at each treatment fraction is an ideal form of implementing ART. However, daily re-planning is impractical with current technology because of the finite availability of crucial resources, such as human experts, required for tasks like segmentation and quality assurance. Recent research has aimed to enable a more automated process, for example, by using artificial intelligence (AI)-based algorithms for auto-segmentation.^17-23 These methods, however, are still developing, and their integration into the clinical workflows is emerging.^24-28 As a result, the implementation of ART in current practice remains limited to a few treatment fractions. Current institutional guidelines for ART in the US adopt a one-size-fits-all approach and recommend re-planning at fixed intervals, typically once mid-treatment, which fails to consider the uncertain trajectory of toxicities that individual patients may experience.^29,30 Thus, determining the optimal timing for personalized plan adaptations remains an urgent yet unmet need in HNC radiation therapy to improve care for cancer patients.

This work builds on the previous findings of our research group regarding optimal implementation of OAR-sparing ART under limited resources, as reported by Heukelom et al.²⁹ and Nosrat et al.³⁰ Heukelom et al.²⁹ investigated the optimal timing for ART with a single re-planning allowance, utilizing daily on-treatment CT imaging with a CT-on-rails device. They introduced the difference in normal tissue complication probability (NTCP) between the planned and delivered dose to OARs as an objective selection strategy for ART. Their findings showed that NTCP calculations based on dose differences (ΔNTCP) at fraction 10 were superior to clinical judgment for personalized implementation of ART. Nosrat et al.³⁰ investigated the optimal timing for re-planning based on ΔNTCP when multiple adaptations were possible, through a Markov Decision Process (MDP) model. They reported a personalized policy for implementing (OAR-sparing) ART in HNC radiation therapy, with each patient allowed a fixed number of re-plannings.

A critical consideration for successful implementation of ART in practice is its financial feasibility. Nosrat et al.³⁰ pioneered applying MDP for personalized ART and successfully identified an optimal re-planning policy for HNC radiation therapy. However, their model assumes the availability of resources for a fixed number of adaptations for every patient. This assumption carries significant financial implications, which may limit the widespread applicability of such policies in clinical settings. Prior research has demonstrated that, while ART can prevent clinically significant toxicities for individual patients, the majority of HNC patients will not require plan adaptations.²⁹ Reserving capacity for even a relatively small number of adaptations for each patient could impose a prohibitive financial burden on the healthcare system and create a barrier to the clinical implementation of personalized policies. To address this shortcoming, this paper presents an MDP model for ART decision-making that incorporates financial considerations and seeks an optimal policy that balances the cost and benefits of ART without restricting the number of re-plannings per patient. MDP is a mathematically rigorous framework for decision-making under uncertainty and determining optimal actions in stochastically evolving systems; it has been successfully applied to find optimal timing for various medical interventions.^31-39 However, its application for decision-making in radiation oncology has largely remained unexplored.⁴⁰

This study serves three specific aims: (1) Develop a clinical decision-making framework for implementing OAR-sparing ART that integrates financial considerations. (2) Determine optimal timing for cost-effective plan adaptations in HNC patients by analyzing data from the University of Texas MD Anderson Cancer Center. (3) Lay the foundation for future research to extend these methodologies to dynamic scenarios, such as online adaptive workflows, with the aim of enabling real-time clinical decision-making.

Methods and Materials

Data

This study presents a secondary cost-effectiveness analysis, conducted as per the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) Consolidated Health Economic Evaluation Reporting Standards (CHEERS)⁴¹. The CHEERS 2022 checklist is provided in Appendix A (Supplementary Materials). We performed in silico analysis of a previously reported dataset published by Heukelom et al.²⁹ and leveraging a recent Markov Decision Process (MDP) elucidation by Nosrat et al.³⁰ which includes data from 52 HNC patients treated with daily CT-on-Rails Image-Guided Radiation Therapy (IGRT) at the University of Texas MD Anderson Cancer Center between 2007 and 2013. Patients were treated with daily kilovoltage CT imaging, and daily CT-image-based replanning was generated to assess “virtual daily replanning” across all-delivered fractions, assuming a “fixed GTV/CTV1” approach (i.e., normal-tissue and weight loss were accounted for, but GTV and high-dose CTV volumes were neither altered nor dose-reduced). This secondary analysis was conducted under institutional review board approval MDA RCR03-0800. These patients received radiation therapy (9-beam step-and-shoot IMRT), either alone or in combination with chemotherapy or cetuximab. The primary cancer sites included oropharynx, nasopharynx, sinonasal region, oral cavity, and larynx. In this cohort, 69% of the patients were male, and 31% were female. The patient characteristics are provided in Appendix B (Supplementary Materials).

To develop a novel MDP model for cost-effective ART implementation, we utilized the published analysis results of Heukelom et al.²⁹ and Nosrat et al.³⁰ on this dataset. Specifically, we leveraged the ΔNTCP calculations of Heukelom et al.²⁹ along with the probabilistic estimates of ΔNTCP trajectories from Nosrat et al.³⁰ For each patient in this cohort, Heukelom et al.²⁹ analyzed the deviation of the delivered (accumulated) dose from the planned dose on a daily basis for multiple OARs and, accordingly, calculated ΔNTCP for xerostomia, dysphagia, parotid gland dysfunction, and tube feeding dependency at 6 months post-treatment. Building on these results, Nosrat et al.³⁰ developed an MDP model, capturing the patient’s state of toxicity by ΔNTCP, and calculated the associated transition probabilities. In a similar manner, we consider ΔNTCP as our decision-making criterion and use the transition probabilities reported by Nosrat et al.³⁰ to develop an MDP model that balances clinical benefits with the cost of plan adaptations. The ΔNTCP values and associated transition probabilities from this dataset are presented in Appendix C (Supplementary Materials).

Decision Model

The MDP model captures the state of toxicity through ΔNTCP. Before treatment begins, the system is at ΔNTCP = 0%, indicating that the expected post-treatment toxicities align with the treatment plan. Throughout the treatment, the patient’s ΔNTCP follows a stochastic trajectory due to uncertain anatomical changes. This uncertainty is captured by transition probabilities, which quantify the likelihood of changes in ΔNTCP from one epoch (e.g., treatment fraction) to the next. At each epoch, the clinician observes the system’s state (i.e., ΔNTCP) and may choose between two actions: (1) re-plan, or (2) continue with the current plan. Re-planning, which incurs a monetary cost, modifies the probabilistic transition towards more favorable outcomes/states. At the end of the treatment, the process culminates in a terminal reward, determined by the system’s final state (i.e., the patient’s end-treatment ΔNTCP).

To capture the trade-off between the cost and clinical benefits of plan adaptations, the terminal reward in our model is defined as the monetary equivalent of the clinical benefits. This is achieved by converting end-treatment ΔNTCPs to Quality Adjusted Life Years (QALYs) and then applying a willingness-to-pay per QALY parameter to determine the monetary equivalent of the QALY gains. An optimal solution to this MDP determines the best action (to re-plan or not) at each decision epoch based on the patient’s toxicity state (ΔNTCP). Such an optimal policy (i.e., the set of optimal actions) will depend on the re-planning cost and clinical benefits.

Components of the MDP model, developed using the dataset described previously, are as follows:

Decision epochs

Given a typical treatment period of 33-35 fractions for HNC, decision epochs were set at fractions 10, 15, 20, and 25 (weekly). Fraction 5 was excluded based on earlier research showing that anatomical changes are unlikely to occur early during treatment.²⁹ Similarly, fraction 30 was omitted due to its proximity to the treatment’s conclusion, where adjustments would have minimal effect on the overall dose delivered to OARs. This choice of decision epochs follows the MDP model of Nosrat et al.³⁰

States

At each decision epoch, the state of the system was represented by ΔNTCP, ranging from 0% to 12%, based on the findings of Heukelom et al.²⁹ The results of Heukelom et al.²⁹ and the transition probability calculations of Nosrat et al.³⁰ are based on the number of patients exhibiting a certain ΔNTCP for any of the considered toxicities (i.e., xerostomia, dysphagia, parotid gland dysfunction, and tube feeding dependency). Consequently, the definition of the states in our model adopts a holistic approach, aiming to protect against all these toxicities.

Actions

At each state, two possible actions were included: (1) re-planning, or (2) continuing with the current plan (no re-planning).

Transition probabilities

The stochastic transition of the system’s state (i.e., ΔNTCP) from one decision epoch to the next is governed by transition probabilities, as a function of the action taken. We leveraged the transition probabilities calculated by Nosrat et al.³⁰ in our model.

Immediate rewards

In the MDP setting, each action at each state may yield an immediate reward. We note that “reward” is standard terminology in the MDP literature. In our application, the immediate reward is the cost of re-planning (only associated with the action “re-planning”), which is represented as a negative number. The cost of re-planning may vary across institutions. To ensure a comprehensive analysis, we considered various costs in our model, ranging from $500 to $2000 (in $100 increments) per re-planning, based on recently reported re-planning costs in the literature.^42,43

Terminal rewards

The terminal reward of the process was calculated based on the terminal state (i.e., end-treatment ΔNTCP), as follows: where T denotes the number of years (post-treatment) that the patient’s quality of life is considered, S is the patient’s T-year survival probability, Q represents a quality-of-life factor used to transform end-treatment ΔNTCP into changes in QALY, which is determined by the type and severity of toxicities, and W denotes the willingness-to-pay per QALY (in $). Details of the derivation of this formula are presented in Appendix D (Supplementary Materials). In our analysis, we used T= 5 years and S = 0.685, as the 5-year overall survival rate of HNC patients.⁴⁴ The literature reports a range of values for Q concerning different radiation toxicities in HNC patients;⁴⁵ see Appendix E (Supplementary Materials). Following our holistic approach for protecting against the toxicities considered by Heukelom et al.²⁹, we used Q = 0.80 in our analysis. The literature reports various values for willingness-to-pay per QALY as well.^40,46 To ensure a comprehensive analysis, we used a range of values for W, including $50,000, $75,000, $100,000, $150,000, $200,000, and $250,000.

Like immediate rewards, terminal rewards in our application have negative values because ΔNTCP represents the extent to which the actual NTCP is worse than the planned NTCP. The objective of MDP is to find a policy that maximizes the sum of immediate and terminal rewards. Since the rewards are negative in our application, this can be viewed as minimizing the patient’s total loss, that is the combination of the monetary equivalent of the patient’s loss in the quality of life and the money spent on re-planning.

The MDP model was implemented in Python and solved using the mdptoolbox library.⁴⁷ The code and related results are publicly available in the online repository of this project.

Results

An optimal policy was calculated for each pair of willingness-to-pay per QALY (W) and re-planning cost (RC). Therefore, a total of 6 × 16 = 96 optimal policies were obtained. For each W-RC pair, the optimal policy indicates the optimal action (0 for “no re-planning” and 1 for “re-planning”) for each ΔNTCP state at each decision epoch. Table 1 demonstrates the optimal policy for W = $200,000 and RC = $1,000, as an example. Since the presentation of the optimal policies (for all 96 cases) is extensive and clinically less informative, we present them based on the concept of re-planning ΔNTCP threshold, as follows. The optimal policies in their extensive form are available in the project’s online repository.

View this table:

Table 1: Optimal policy for W = $200,000 and RC = $1,000. An entry 1 in the optimal policy indicates that the optimal action (at the corresponding treatment fraction and ΔNTCP) is “re-planning” and an entry 0 indicates the optimal action is “no re-planning.”

To obtain clinically interpretable policies, for each optimal policy obtained from the MDP model, we considered the smallest ΔNTCP value that justified re-planning as the re-planning threshold. This converts an optimal policy to a threshold policy that recommends re-planning only if ΔNTCP is greater than or equal to the threshold. For example, ΔNTCP = (3%, 4%, 3%, 4%$) at the treatment fractions 10, 15, 20, and 25, respectively, are the decision-making thresholds associated with the optimal policy presented in Table 1.

Figure 1. summarizes the threshold policies for all the W-RC pairs considered. Each subfigure corresponds to a willingness-to-pay per QALY value, with each row representing a re-planning cost. For each W-RC pair, the policy is depicted by four ΔNTCP thresholds for decision-making at treatment fractions 10, 15, 20, and 25 (i.e., decision epochs). The threshold policy associated with the optimal policy of Table 1 is highlighted with a red box in this figure.

Figure 1.

ΔNTCP (%) thresholds for re-planning at each decision epoch. Each subfigure concerns a fixed value of willingness-to-pay per QALY (W) and presents ΔNTCP thresholds for various re-planning costs (RC). The threshold policy associated with the optimal policy of Table 1 is highlighted with a red box. The threshold of ΔNTCP = 99% in this figure implies “no re-planning” for any ΔNTCP value.

Model Validation

To validate these results, we conducted a Monte Carlo (MC) simulation study to compare the clinical and financial outcomes of the MDP model’s threshold policies against two fixed-time (one-size-fits-all) policies. MC simulation is an established method for cost-effectiveness analysis of medical interventions; it provides probabilistic estimates of the cost and utility of a specific policy and enables the comparison of different policies in these terms.⁴⁰ In our analysis, we considered three re-planning policies: (a) the patient receives a single “re-planning” at fraction 10, regardless of their toxicity status; (b) the patient receives a single “re-planning” at fraction 15, regardless of their toxicity status; (c) the threshold policy of the MDP model, which does not limit the number of re-plannings, is followed. Since the threshold policies depend on W and RC, we examined two representative scenarios with respect to these parameters: (1) W = $200,000 and RC = $1,000; (2) W = $100,000 and RC = $2,000. These scenarios are intended to represent different conditions regarding economic prosperity and resource availability. The MC analysis involved 10,000 simulation runs for each policy.

Figure 2. illustrates the distribution of end-treatment ΔNTCP for policies (a), (b), and (c) with W = $200,000 and RC = $1,000, as well as the case where no re-planning is done for the patient, referred to as policy (d). When policy (d) was followed, meaning no re-planning at all, the expected end-treatment ΔNTCP (mean) was 2.36% (standard deviation (SD) = 2.97%). Following policies (b) and (a)—a single re-planning at fractions 15 and 10, respectively—the expected end-treatment ΔNTCP reduced to 1.57% (SD = 1.92%) and 1.20% (SD = 1.48%), respectively. The MDP model’s threshold policy, i.e., policy (c), led to the expected end-treatment ΔNTCP of 0.98% (SD = 0.82%).

Figure 2.

Distribution of the end-treatment ΔNTCP (%) under the following policies: (a) the patient receives a single “re-planning” at fraction 10; (b) the patient receives a single “re-planning” at fraction 15; (c) the threshold policy of the MDP model for W = $200,000 and RC = $1,000 is followed; (d) no re-planning at all. The number of simulation runs was 10,000.

As highlighted by Heukelom et al.²⁹ the majority of HNC patients in this cohort did not require re-planning, which is manifested through the small mean ΔNTCPs, even under policy (d). However, some patients in this cohort exhibited considerable discrepancies between the planned and actual dose to OARs, resulting in large ΔNTCP values (up to 12%). In this regard, we examined the effectiveness of the threshold policy in preventing large ΔNTCPs. Under policy (d), i.e., no re-planning, the probability of the patient experiencing ΔNTCP ≥ 5% was 0.21. This probability reduced to 0.12 and 0.06 when the policy was to re-plan at fractions 15 and 10, i.e., policies (b) and (a), respectively. Under the MDP model’s threshold policy, i.e., policy (c), the probability of ΔNTCP ≥ 5% was approximately zero.

Regarding the financial aspect, policies (a) and (b) require re-planning once, regardless of the patient’s toxicity status. Therefore, the number of re-planning under these policies is one, implying an average cost of $1,000 per patient for this scenario. The outcome of the MC analysis under the MDP model’s threshold policy is a distribution over all possible number of re-plannings, ranging from 0 to 4 (re-planning at all four decision epochs), which is illustrated in Figure 3. The expected number of re-plannings (mean) under this policy was 0.38 (SD = 0.68), implying an average cost of $380 per patient.

Figure 3.

Distribution of the number of re-plannings under the threshold policy of the MDP model for W = $200,000 and RC = $1,000. The number of simulation runs was 10,000.

Since the MDP model’s threshold policies depend on the willingness-to-pay per QALY and re-planning cost, we repeated the analysis with a different set of values for these parameters; that is W = $100,000 and RC = $2,000. This second scenario resembles lower-income communities with greater resource constraints compared to the previous scenario. Figure 4 illustrates the distribution of end-treatment ΔNTCP along with the distribution of the number of re-plannings when the threshold policy of the MDP model with these parameters was followed. The expected end-treatment ΔNTCP under the threshold policy for this scenario was 1.14% (SD = 1.12%). The probability of experiencing ΔNTCP ≥ 5% was 0.02. The expected number of re-plannings for this scenario was 0.26 (SD = 0.56), implying the average cost of $520 per patient.

Figure 4.

Distribution of the end-treatment ΔNTCP (left) and distribution of the number of re-plannings (right) under the threshold policy of the MDP model for W = $100,000 and RC = $2,000. The number of simulation runs was 10,000.

The MC simulation results are summarized in Table 2.

View this table:

Table 2: Summary of the Monte Carlo simulation results. The willingness-to-pay per QALY and re-planning cost are denoted by W and RC, respectively.

Discussion

The MC simulation analysis results demonstrate that the MDP model’s threshold policies outperform the fixed-time policies in both clinical and financial outcomes. As noted by Heukelom et al.²⁹ most patients in this cohort did not require re-planning due to minimal anatomical changes. Consequently, the expected end-treatment ΔNTCP for all the considered policies falls within a narrow range, potentially masking the significance of the benefits of the MDP model’s threshold policies compared to the fixed-time policies. The main advantage of the MDP model’s threshold policies lies in their ability to prevent excessive toxicities. A subset of patients in this cohort did experience notable anatomical changes and large ΔNTCPs. The analysis indicates that a single re-planning at fraction 15 could approximately halve the probability of patients experiencing ΔNTCP ≥ 5% (from 0.21 to 0.12). If re-planning occurs earlier at fraction 10, this probability drops to less than one third (from 0.21 to 0.06). Following the MDP model’s threshold policy for Scenario 2 reduces the probability to under one tenth (from 0.21 to 0.02), while the threshold policy for Scenario 1 makes it negligible (from 0.21 to 0.00). The choice of ΔNTCP ≥ 5% in our analysis is motivated by the threshold of ΔNTCP = 5% set by the Dutch Society for Radiation Oncology for assigning patients to proton therapy for grade III complications.²⁹ Finally, we note that these results are aligned with the findings of Heukelom et al.²⁹ regarding the clinical advantages of re-planning at fraction 10 compared to fraction 15.

The average cost per patient under each policy demonstrates the financial advantage of the MDP model’s threshold policies. The MDP model’s policies allocate re-planning resources to patients who need them most, without limiting the number of re-plannings per patient, in contrast to the fixed-time policies that imply re-planning for every patient. Since many patients are expected to experience minimal anatomical changes, the expected number of re-plannings per patient under the MDP model’s policy for Scenario 1 was 0.38. Given the re-planning cost of $1,000 in this scenario, this translates to the average cost of $380 per patient—almost one third of the cost incurred if every patient received one re-planning. For Scenario 2, the expected number of re-plannings per patient was 0.26 under the MDP model’s threshold policy, which translates to the expected cost of $520 per patient—nearly a quarter of the cost of fixed-time policies. This result is of a significant importance for financial stakeholders (e.g., insurers) because harnessing the full capacity of ART to improve patient care requires flexible re-planning as needed—not limited to one or two instances—and the MDP analysis demonstrates that this can be achieved with an average of less than one re-planning per patient. Finally, we note that willingness-to-pay per QALY is a measure of the population’s economic prosperity rather than an individual patient’s willingness to pay for their treatment.

As noted earlier, the optimal policies of the MDP model were not necessarily threshold policies. This complicates their clinical interpretation and makes them less amenable to implementation in practice. To address this, we converted the optimal policies into threshold policies by defining the threshold as the smallest ΔNTCP that justifies re-planning. It is important to recognize that the MDP model’s optimal policies are highly dependent on transition probabilities, which are often inferred from limited-size datasets and can be susceptible to noise. This can result in counterintuitive recommendations, such as “no re-planning” at fraction 20 for ΔNTCP = 7% in the optimal policy shown in Table 1, while the recommendation for ΔNTCP between 3% and 6% as well as ΔNTCP greater than or equal to 8% at this fraction are “re-planning.” This discrepancy stems from the transition probabilities calculated based on a cohort of only 52 patients. Thus, the threshold policies not only enhance the clinical interpretability and applicability of the policies derived from the MDP analysis but also incorporate clinical intuition to mitigate the noise inherent in small datasets.

The presented findings are subject to the limitations inherent to a single-institution retrospective in silico analysis. The results are constrained by the inferred transition probabilities based on data gathered from 52 patients. While our dataset is among the most curated and well-accredited datasets relevant to adaptive radiation therapy for HNC, we recognize the significance of sample size for MDP calibration and acknowledge the need for external validation with larger datasets. These findings are also constrained by the limitations of the ΔNTCP calculations and the inference of the transition probabilities, elaborated in Heukelom et al.²⁹ and Nosrat et al.³⁰ Particularly, the dataset did not include adaptations or daily re-optimization in vivo, and the daily dose accumulation was calculated post hoc from high-resolution imaging data. As a result, there were discrepancies, which were omitted, leading to gaps in the resulting NTCPs. As the dataset did not include patients’ records for the entire treatment period, transition probabilities for the missing time points were inferred from available data in conjunction with clinical judgment. The results are also specific to the toxicities considered and the NTCP models used. Novel approaches to NTCP modeling, such as cluster-based methods⁸, can potentially lead to improved policies. Finally, the results are subject to potential inaccuracies and limitations regarding the values obtained from the literature, particularly the parameters used to calculate the monetary equivalent of the clinical gains. Thus, these findings should be considered an informative semi-synthetic use case rather than a definitive basis for large-scale implementation of ART.

The results presented in this paper utilized CT-on-rails volumetric IGRT data for HNC. However, the proposed methodology is applicable to various platforms, notably MR-linacs that offer a compelling use case, as well as other cancer sites. The decision epochs in our model correspond to weekly re-planning intervals, reflecting our current institutional adaptive protocols,¹ but the methodology can be scaled to daily treatment decisions. The model is flexible for use in both resource-rich and resource-limited facilities and is capable of considering the economic and financial standing of the communities they serve. Additionally, the model allows for further treatment personalization, such as incorporating stratified survival probabilities (e.g., HPV+ vs. HPV-tumors) and considering different timeframes for accounting for patients’ quality of life. The presented model focuses on OAR-sparing ART, assuming that the prescribed dose to the tumor remains unchanged during treatment. Incorporating dose adaptations for the tumor and considering changes in tumor control probability (TCP) during treatment presents a promising avenue for future research. Our results employ ΔNTCP as an evidence-based measure of treatment toxicity. We recognize that this setting adds a layer of computational complexity to clinical decision-making. This motivates future research to leverage the presented framework for decision-making based on easily observable treatment outcomes through partially observable Markov decision processes (POMDPs).⁴⁸

Conclusion

This paper presents the first MDP model for implementing patient-specific OAR-sparing ART in HNC that balances the costs and benefits of plan adaptations and renders cost-effective personalized policies. By analyzing data collected from 52 patients treated at MD Anderson Cancer Center with high-quality CT-based replanning, we derived optimal and threshold policies for implementing ART for HNC. Through MC simulation analysis, we demonstrated that the policies obtained from the MDP analysis can significantly outperform fixed-time (one-size-fits-all) policies in terms of both individual toxicity outcomes and the financial burden on the healthcare system. It was discussed that the policies derived from the MDP model allocate resources to patients that need them most, without limiting the number of re-plannings per patient; they can effectively prevent extreme toxicities at an average cost of less than one re-planning per patient. In addition to these results, by providing an evidence-based, resource-aware, and scalable analytical framework for individualized adaptation, we lay the groundwork for future research aimed at developing an online adaptive workflow.

Data Availability

All data produced are available online at https://github.com/DSuarez03/Cost_Effective_ART_MDP.

APPENDIX A CHEERS 2022 Checklist

View this table:

APPENDIX B Patient Characteristics¹

View this table:

APPENDIX C Transition Probabilities²

Action: No re-planning

View this table:

Table C1: Transition probabilities from fraction 0 (F0) to fraction 10 (F10) under “no re-planning.”

View this table:

Table C2: Smoothed transition probabilities from fraction 10 (F10) to fraction 15 (F15) under “no re-planning.” The same probabilities apply to subsequent transitions, i.e., from F15 to F20, from F20 to F25, and from F25 to end-treatment, under “no re-planning.”

Action: Re-planning

View this table:

Table C3: Smoothed transition probabilities from fraction 10 to fraction 15 under “re-planning.”

View this table:

Table C4: Smoothed transition probabilities from fraction 15 to fraction 20 under “re-planning.”

View this table:

Table C5: Smoothed transition probabilities from fraction 20 to fraction 25 under “re-planning.”

View this table:

Table C6: Smoothed transition probabilities from fraction 25 to end-treatment under “re-planning.”

APPENDIX D Terminal Rewards

We consider a patient’s quality of life over T years after treatment. Let A denote the event that the patient’s lifespan exceeds T, and A^c be the complementary event, meaning the patient dies within T years post-treatment. We denote the patient’s T -year survival probability with S, that is P (A) = S and P (A^c) = 1 −S. In the former event, the patient’s quality of life will be Q < 1 if they experience a certain radiation toxicity, with the probability NTCP. Otherwise, their quality of life will be 1, with the probability 1 − NTCP. We naturally assume the quality of life of zero in the latter event. Based on the described chain of conditional probabilities, the patient’s expected quality-adjusted life years (QALY) over T years, denoted by 𝔼 QALY, can be calculated as follows: Therefore, the difference in the patient’s QALY resulting from two distinct normal tissue complica-tion probabilities, denoted by NTCP₁ and NTCP₂, can be expressed as follows: Define ΔNTCP = NTCP₁ − NTCP₂, and let W denote a willingness-to-pay per QALY ($) value. Thus, the corresponding change in the patient’s expected QALY (over T years) can be equivalently expressed in monetary terms as follows: where $(QALY) denotes the monetary equivalence of the patient’s QALY over T years. Because Q < 1, observe that NTCP₂ > NTCP₁ implies Δ𝔼 $(QALY)] < 0.

APPENDIX E Quality-of-Life Factors³

View this table:

Footnotes

↵† Author Responsible for Statistical Analysis: Seyedmohammadhossein Hosseinian, PhD (hosseinian{at}ncsu.edu)
Conflict of Interest Statement: The authors declare no conflicts of interest.
Data Availability Statement: Research data are available at https://github.com/DSuarez03/Cost_Effective_ART_MDP.
CRediT Author Statement: S.H.: Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Software, Supervision, Visualization, Writing − original draft; D.S-A.: Conceptualization, Formal analysis, Investigation, Methodology, Software, Visualization, Writing − original draft; C.D.: Conceptualization, Data curation, Investigation, Methodology, Resources, Writing - review & editing; R.G.: Conceptualization, Investigation, Methodology, Writing - review & editing; L.M.: Conceptualization, Investigation, Writing - review & editing; M.H.: Conceptualization, Investigation, Methodology, Writing - review & editing; A.K.: Conceptualization, Investigation, Methodology, Writing - review & editing; A.S.R.M.: Conceptualization, Data curation, Investigation, Resources, Supervision, Writing - review & editing; S.Y.L.: Conceptualization, Resources, Writing - review & editing; K.A.H.: Conceptualization, Funding acquisition, Investigation, Resources, Writing - review & editing; A.C.M.: Conceptualization, Methodology, Visualization, Writing - review & editing; K.K.B.: Conceptualization, Resources, Writing - review & editing; F.N.: Conceptualization, Data curation, Investigation, Methodology, Writing - review & editing; C.D.F.: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing - review & editing; A.J.S.: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing - review & editing.
↵¹ J. Heukelom, M. E. Kantor, A. S. Mohamed, H. Elhalawani, E. Kocak-Uzel, T. Lin, J. Yang, M. Aristophanous, C. R. Rasch, C. D. Fuller, and J. Sonke. Differences between planned and delivered dose for head and neck cancer, and their consequences for normal tissue complication probability and treatment adaptation. Radiotherapy and Oncology, 142:100–106, 2020.
↵² F. Nosrat, C. Dede, L. B. McCullum, R. Garcia, A. S. Mohamed, J. G. Scott, J. E. Bates, B. A. McDonald, K. A. Wahid, M. A. Naser, R. He, A. C. Moreno, L. V. van Dijk, K. K. Brock, J. Heukelom, S. Hosseinian, M. Hemmati, A. J. Schaefer, and C. D. Fuller. Optimal timing of organs-at-risk-sparing adaptive radiation therapy for head-and-neck cancer under re-planning resource constraints. medRxiv, 2024. DOI: 10.1101/2024.04.01.24305163.
↵³ N. P. Brodin, R. Kabarriti, M. Pankuch, C. B. Schechter, V. Gondi, S. Kalnicki, C. Guha, M. K. Garg, and W. A. Tomé. A quantitative clinical decision–support strategy identifying which patients with oropharyngeal head and neck cancer may benefit the most from proton radiation therapy. International Journal of Radiation Oncology, Biology, Physics, 104(3):540–552, 2019.
⁴ R. E. Kohler, N. C. Sheets, S. B. Wheeler, C. Nutting, E. Hall, and B. S. Chera. Two-year and lifetime cost-effectiveness of intensity modulated radiation therapy versus 3-dimensional conformal radiation therapy for head- and-neck cancer. International Journal of Radiation Oncology, Biology, Physics, 87(4):683–689, 2013.
⁵ B. L. Ramaekers, M. A. Joore, J. P. Grutters, P. Van Den Ende, J. De Jong, R. Houben, P. Lambin, M. Christianen, I. Beetz, M. Pijls-Johannesma, et al. The impact of late treatment-toxicity on generic health-related quality of life in head and neck cancer patients after radiotherapy. Oral oncology, 47(8):768–774, 2011.
⁶ L. Rudmik, W. An, D. Livingstone, W. Matthews, H. Seikaly, R. Scrimger, and D. Marshall. Making a case for high-volume robotic surgery centers: A cost-effectiveness analysis of transoral robotic surgery. Journal of Surgical Oncology, 112(2):155–163, 2015.

References

1.↵
H. Bahig, Y. Yuan, A. S. Mohamed, K. K. Brock, S. P. Ng, J. Wang, Y. Ding, K. Hutcheson, M. McCulloch, P. A. Balter, S. Y. Lai, A. Al-Mamgani, et al. Magnetic resonance-based response assessment and dose adaptation in human papilloma virus positive tumors of the oropharynx treated with radiotherapy (MR-ADAPTOR): An R-IDEAL stage 2a-2b/Bayesian phase II trial. Clinical and Translational Radiation Oncology, 13:19–23, 2018
OpenUrl
2.↵
D. M. Gujral and C. M. Nutting. Patterns of failure, treatment outcomes and late toxicities of head and neck cancer in the current era of IMRT. Oral Oncology, 86:225–233, 2018.
OpenUrl PubMed
3.
M. E. Christianen, I. M. Verdonck-de Leeuw, P. Doornaert, O. Chouvalova, R. J. Steenbakkers, P. W. Koken, C. R. Leemans, S. F. Oosting, J. L. Roodenburg, B. F. van der Laan, et al. Patterns of long-term swallowing dysfunction after definitive radiotherapy or chemoradiation. Radiotherapy and Oncology, 117(1):139–144, 2015.
OpenUrl PubMed
4.
K. A. Hutcheson, Z. Nurgalieva, H. Zhao, G. B. Gunn, S. H. Giordano, M. K. Bhayani, J. S. Lewin, and C. M. Lewis. Two-year prevalence of dysphagia and related outcomes in head and neck cancer survivors: An updated SEER-Medicare analysis. Head & Neck, 41(2):479–487, 2019.
OpenUrl PubMed
5.
C. E. Barbon, C. M. Yao, C. P. Alvarez, R. P. Goepfert, C. D. Fuller, S. Y. Lai, N. D. Gross, and K. A. Hutcheson. Dysphagia profiles after primary transoral robotic surgery or radiation for oropharyngeal cancer: A registry analysis. Head & Neck, 43(10):2883–2895, 2021.
OpenUrl PubMed
6.
C. J. Tsai, T. M. Hofstede, E. M. Sturgis, A. S. Garden, M. E. Lindberg, Q. Wei, S. L. Tucker, and L. Dong. Osteoradionecrosis and radiation dose to the mandible in patients with oropharyngeal cancer. International Journal of Radiation Oncology, Biology, Physics, 85(2):415–420, 2013.
OpenUrl CrossRef PubMed
7.
L. V. van Dijk, A. A. Abusaif, J. Rigert, M. A. Naser, K. A. Hutcheson, S. Y. Lai, C. D. Fuller, S. Mohamed. Normal Tissue Complication Probability (NTCP) prediction model for osteoradionecrosis of the mandible in patients with head and neck cancer after radiation therapy: Large-scale observational cohort. International Journal of Radiation Oncology, Biology, Physics, 111(2):549–558, 2021.
OpenUrl CrossRef PubMed
8.↵
S. Hosseinian, M. Hemmati, C. Dede, T. C. Salzillo, L. V. van Dijk, A. S. Mohamed, S. Y. Lai, A. J. Schaefer, and C. D. Fuller. Cluster-based toxicity estimation of osteoradionecrosis via unsupervised machine learning: Moving beyond single dose-parameter normal tissue complication probability by using whole dose-volume histograms for cohort risk stratification. International Journal of Radiation Oncology, Biology, Physics, 119(5): 1569–1578, 2024.
OpenUrl CrossRef PubMed
9.↵
A. J. Frankart, M. J. Frankart, B. Cervenka, A. L. Tang, D. G. Krishnan, and V. Takiar. Osteoradionecrosis: Exposing the evidence not the bone. International Journal of Radiation Oncology, Biology, Physics, 109(5):1206–1218, 2021.
OpenUrl CrossRef PubMed
10.↵
J. L. Barker, A. S. Garden, K. K. Ang, J. C. ODaniel, H. Wang, L. E. Court, W. H. Morrison, D. I. Rosenthal, K. S. Chao, S. L. Tucker, R. Mohan, and L. Dong. Quantification of volumetric and geometric changes occurring during fractionated radiotherapy for head-and-neck cancer using an integrated CT/linear accelerator system. International Journal of Radiation Oncology, Biology, Physics, 59(4):960–970, 2004.
OpenUrl CrossRef PubMed Web of Science
11.
M. Beltran, M. Ramos, J. J. Rovira, S. Perez-Hoyos, M. Sancho, E. Puertas, S. Benavente, M. Ginjaume, and J. Giralt. Dose variations in tumor volumes and organs at risk during IMRT for head-and-neck cancer. Journal of Applied Clinical Medical Physics, 13(6):3723, 2012.
OpenUrl PubMed
12.↵
W. Chen, P. Bai, J. Pan, Y. Xu, and K. Chen. Changes in tumor volumes and spatial locations relative to normal tissues during cervical cancer radiotherapy assessed by cone beam computed tomography. Technology in Cancer Research and Treatment, 16(2):246–252, 2017.
OpenUrl
13.↵
J. Castelli, A. Simon, C. Lafond, N. Perichon, B. Rigaud, E. Chajon, B. De Bari, M. Ozsahin, J. Bourhis, and R. de Crevoisier. Adaptive radiotherapy for head and neck cancer. Acta Oncology, 57(10):1284–1292, 2018.
OpenUrl
14.
M. Figen, D. Ç. Öksüz, E. Duman, R. Prestwich, K. Dyker, K. Cardale, S. Ramasamy, P. Murray, and M. Şen. Radiotherapy for head and neck cancer: Evaluation of triggered adaptive replanning in routine practice. Frontiers in Oncology, 59(4):960–970, 2020.
OpenUrl
15.
R. Avgousti, C. Antypas, C. Armpilia, F. Simopoulou, Z. Liakouli, P. Karaiskos, V. Kouloulias, E. Kyrodimos, L. A. Moulopoulos, and A. Zygogianni. Adaptive radiation therapy: When, how and what are the benefits that literature provides? Cancer/Radiothérapie, 26(4):622–636, 2022.
OpenUrl
16.↵
B. M. Beadle and A. W. Chan. The potential of adaptive radiotherapy for patients with head and neck cancer–too much or not enough? JAMA Oncology, 9(8):1064–1065, 2023.
OpenUrl PubMed
17.↵
E. Huynh, A. Hosny, C. Guthier, S. F. Bitterman, D. A. Hass-Kogan, B. Kann, H. J. W. L. Aerts, and R. H. Mak. Artificial intelligence in radiation oncology. Nature Reviews Clinical Oncology, 17:771–781, 2020.
OpenUrl PubMed
18.
V. Andrearczyk, V. Oreiller, S. Boughdad, C. Cheze Le Rest, H. Elhalawani, M. Jreige, J. O. Prior, M. Vallières, D. Visvikis, M. Hatt, et al. Overview of the HEKTOR challenge at MICCAI 2021: Automatic head and neck tumor segmentation and outcome prediction in PET/CT images. In 3D Head and Neck Tumor Segmentation in PET/CT Challenge, pages 1–37. Springer, 2021.
19.
J. Ren, J. G. Eriksen, J. Nijkamp, and S. S. Korreman. Comparing different CT, PET and MRI multi-modality image combinations for deep learning-based head and neck tumor segmentation. Acta Oncologica, 60(11):1399–1406, 2021.
OpenUrl
20.
R. R. Outeiral, P. Bos, A. Al-Mamgani, B. Jasperse, R. Simões, and U. A. van der Heide. Oropharyngeal primary tumor segmentation for radiotherapy planning on magnetic resonance imaging using deep learning. Physics and Imaging in Radiation Oncology, 19:39–44, 2021.
OpenUrl
21.
K. A. Wahid, S. Ahmed, R. He, L. V. van Dijk, J. Teuwen, B. A. McDonald, V. Salama, A. S. Mohamed, T. Salzillo, C. Dede, et al. Evaluation of deep learning-based multiparametric MRI oropharyngeal primary tumor auto-segmentation and investigation of input channel effects: Results from a prospective imaging registry. Clinical and Translational Radiation Oncology, 32:6–14, 2022.
OpenUrl
22.
J. Sahlsten, J. Jaskari, K. A. Wahid, S. Ahmed, E. Glerean, R. He, B. H. Kann, A. Mäkitie, C. D. Fuller, M. A. Naser, et al. Application of simultaneous uncertainty quantification and segmentation for oropharyngeal cancer use-case with Bayesian deep learning. Communications Medicine, 4(1):110, 2024.
OpenUrl PubMed
23.↵
L Humbert-Vidan, K. A. Wahid, Z. Kaffey, S. Mirbahaeddin, J. Curiel, S. Acharya, A. Shekha, J. M. Rigert, C. Dede, S. Attia, et al. Deep learning-based auto contouring of mandibular sub-volumes based on the ClinRad system for spatial localization of osteoradionecrosis of the jaw. International Journal of Radiation Oncology, Biology, Physics, 120(2):e141–e142, 2024.
OpenUrl
24.↵
C. A. van den Berg and E. F Meliadò. Uncertainty assessment for deep learning radiotherapy applications. In Seminars in Radiation Oncology, volume 32, pages 304–318. Elsevier, 2022.
OpenUrl
25.
M. Claessens, C. Seller Oria, C. L. Brouwer, B. P. Ziemer, J. E. Scholey, H. Lin, A. Witztum, O. Morin, I. El-Naqa, W. van Elmpt, et al. Quality assurance for AI-based applications in radiation therapy. In Seminars in Radiation Oncology, volume 32, pages 421–431. Elsevier, 2022.
OpenUrl
26.
R. R. Outeiral, N. F. Silvério, P. J. González, E. E. Schaake, T. Janssen, U. A. van der Heide, and R. Simões. A network score-based metric to optimize the quality assurance of automatic radiotherapy target segmentations. Physics and Imaging in Radiation Oncology, 28:100500, 2023.
OpenUrl
27.
Z. Kaffey, K. A. Wahid, D. Farris, L. Humbert-Vidan, T. Netherton, G. Balakrishnan, A. C. Moreno, M. Naser, C. D. Fuller, D. Fuentes, et al. Uncertainty quantification in machine learning radiotherapy applications–A scoping review. International Journal of Radiation Oncology, Biology, Physics, 120(2):e631, 2024.
OpenUrl
28.↵
K. A. Wahid, J. Sahlsten, J. Jaskari, M. J. Dohopolski, K. Kaski, R. He, E. Glerean, B. H Kann, A. Mäkitie, C. D. Fuller, et al. Harnessing uncertainty in radiotherapy auto-segmentation quality assurance. Physics and Imaging in Radiation Oncology, 29:100526, 2024.
OpenUrl
29.↵
J. Heukelom, M. E. Kantor, A. S. Mohamed, H. Elhalawani, E. Kocak-Uzel, T. Lin, J. Yang, M. Aristophanous, C. R. Rasch, C. D. Fuller, and J. Sonke. Differences between planned and delivered dose for head and neck cancer, and their consequences for normal tissue complication probability and treatment adaptation. Radiotherapy and Oncology, 142:100–106, 2020.
OpenUrl PubMed
30.↵
F. Nosrat, C. Dede, L. B. McCullum, R. Garcia, A. S. Mohamed, J. G. Scott, J. E. Bates, B. McDonald, K. A. Wahid, M. A. Naser, R. He, A. C. Moreno, L. V. van Dijk, K. K. Brock, J. Heukelom, S. Hosseinian, M. Hemmati, A. J. Schaefer, and C. D. Fuller. Optimal timing of organs-at-risk-sparing adaptive radiation therapy for head-and-neck cancer under replanning resource constraints. medRxiv, 2024. DOI: 10.1101/2024.04.01.24305163.
OpenUrl Abstract/FREE Full Text
31.↵
J. R. Beck and S. G. Pauker. The Markov process in medical prognosis. Medical Decision Making, 3(4):419–458, 1983.
OpenUrl CrossRef PubMed Web of Science
32.
E. A. Feinberg and A. Shwartz. Handbook of Markov Decision Processes: Methods and Applications. Springer, 2002.
33.
O. Alagoz, L. M. Maillart, A. J. Schaefer, and M. S. Roberts. The optimal timing of living-donor liver transplantation. Management Science, 50(10):1420–30, 2004.
OpenUrl
34.
M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, 2005.
35.
S. M. Shechter, M. D. Bailey, A. J. Schaefer, and M. S. Roberts. The optimal time to initiate HIV therapy under ordered health states. Operations Research, 56(1):20–33, 2008.
OpenUrl CrossRef Web of Science
36.
B. T. Denton, M. Kurt, N. D. Shah, S. C. Bryant, and S. A. Smith. Optimizing the start time of statin therapy for patients with diabetes. Medical Decision Making, 29(3):351–67, 2009.
OpenUrl CrossRef PubMed Web of Science
37.
O. Alagoz, H. Hsu, A. J. Schaefer, and M. S. Roberts. Markov decision processes: A tool for sequential decision making under uncertainty. Medical Decision Making, 30(4):474–483, 2010.
OpenUrl CrossRef PubMed Web of Science
38.
K. M. Kuntz, L. B. Russell, D. K. Owens, G. D. Sanders, T. A. Trikalinos, and J. A. Salomon. Decision models in cost-effectiveness analysis. Cost-Effectiveness in Health and Medicine. Oxford University Press, 2016.
39.↵
S. P. Ng, T. Ajayi, A. J. Schaefer, C. Pollard III., H. Bahig, A. S. Garden, D. I. Rosenthal et al. Surveillance imaging for patients with head and neck cancer treated with definitive radiotherapy: A partially observed Markov decision process model. Cancer, 126(4):749–756, 2020.
OpenUrl PubMed
40.↵
L. B. McCullum, A. Karagoz, C. Dede, R. Garcia, F. Nosrat, M. Hemmati, S. Hosseinian, A. J. Schaefer, and C. D. Fuller. Markov models for clinical decision-making in radiation oncology: A systematic review. Journal of Medical Imaging and Radiation Oncology, 68:610–623, 2024.
OpenUrl
41.↵
D. Husereau, M. Drummond, F. Augustovski, E. de Bekker-Grob, A. H. Briggs, C. Carswell, L. Caulley, N. Chaiyakunapruk, D. Greenberg, E. Loder, et al. Consolidated health economic evaluation reporting standards (CHEERS) 2022 explanation and elaboration: A report of the ISPOR CHEERS II Good Practices Task Force. Value in Health, 25(1):10–31, 2022.
OpenUrl CrossRef PubMed
42.↵
R. J. Walls and C. McLaughlin. Comparison and breakdown of cost in head and neck cancer radiation therapy and adaptive replanning. International Journal of Radiation Oncology, Biology, Physics, 117(2):e632, 2023.
OpenUrl
43.↵
L. Perrier, F. Balusson, M. Morelle, J. Castelli, J. Thariat, K. Bénézery, A. Hasbini, B. Géry, Berger, X. Liem, et al. Cost-effectiveness of weekly adaptive radiotherapy versus standard IMRT in head and neck cancer alongside the ARTIX trial. Radiotherapy and Oncology, 193:110116, 2024.
OpenUrl PubMed
44.↵
A. Barsouk, J. S. Aluru, P. Rawla, K. Saginala, and Alexander Barsouk. Epidemiology, risk factors, and prevention of head and neck squamous cell carcinoma. Medical Sciences, 11(2):42, 2023.
OpenUrl PubMed
45.↵
N. P. Brodin, R. Kabarriti, M. Pankuch, C. B. Schechter, V. Gondi, S. Kalnicki, C. Guha, M. K. Garg, and W. A. Tomé. A quantitative clinical decision–support strategy identifying which patients with oropharyngeal head and neck cancer may benefit the most from proton radiation therapy. International Journal of Radiation Oncology, Biology, Physics, 104(3):540–552, 2019.
OpenUrl PubMed
46.↵
Institute for Clinical and Economic Review. https://icer.org/news-insights/press-releases/semaglutide_evidence_report/. Accessed: 06-12-2024.
47.↵
Markov Decision Process (MDP) Toolbox for Python. https://pymdptoolbox.readthedocs.io/en/latest/. Accessed: 06-12-2024.
48.↵
M. T. Spaan. Partially observable Markov decision processes. In Reinforcement Learning: State-of-the-Art, pages 387–414. Springer Berlin Heidelberg, 2012.

View the discussion thread.

Posted November 05, 2024.

Download PDF

Data/Code

Citation Tools

Subject Area

Oncology

Subject Areas

All Articles

Addiction Medicine (399)
Allergy and Immunology (709)
Anesthesia (201)
Cardiovascular Medicine (2942)
Dentistry and Oral Medicine (334)
Dermatology (249)
Emergency Medicine (440)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1036)
Epidemiology (12743)
Forensic Medicine (12)
Gastroenterology (828)
Genetic and Genomic Medicine (4583)
Geriatric Medicine (417)
Health Economics (729)
Health Informatics (2917)
Health Policy (1069)
Health Systems and Quality Improvement (1078)
Hematology (389)
HIV/AIDS (924)
Infectious Diseases (except HIV/AIDS) (14098)
Intensive Care and Critical Care Medicine (846)
Medical Education (425)
Medical Ethics (115)
Nephrology (469)
Neurology (4353)
Nursing (236)
Nutrition (639)
Obstetrics and Gynecology (805)
Occupational and Environmental Health (735)
Oncology (2268)
Ophthalmology (646)
Orthopedics (258)
Otolaryngology (325)
Pain Medicine (279)
Palliative Medicine (83)
Pathology (501)
Pediatrics (1196)
Pharmacology and Therapeutics (504)
Primary Care Research (496)
Psychiatry and Clinical Psychology (3755)
Public and Global Health (6938)
Radiology and Imaging (1527)
Rehabilitation Medicine and Physical Therapy (905)
Respiratory Medicine (915)
Rheumatology (437)
Sexual and Reproductive Health (443)
Sports Medicine (385)
Surgery (488)
Toxicology (60)
Transplantation (212)
Urology (180)

[1] 1.↵
H. Bahig, Y. Yuan, A. S. Mohamed, K. K. Brock, S. P. Ng, J. Wang, Y. Ding, K. Hutcheson, M. McCulloch, P. A. Balter, S. Y. Lai, A. Al-Mamgani, et al. Magnetic resonance-based response assessment and dose adaptation in human papilloma virus positive tumors of the oropharynx treated with radiotherapy (MR-ADAPTOR): An R-IDEAL stage 2a-2b/Bayesian phase II trial. Clinical and Translational Radiation Oncology, 13:19–23, 2018
OpenUrl

[2] 2.↵
D. M. Gujral and C. M. Nutting. Patterns of failure, treatment outcomes and late toxicities of head and neck cancer in the current era of IMRT. Oral Oncology, 86:225–233, 2018.
OpenUrl PubMed

[3] 3.
M. E. Christianen, I. M. Verdonck-de Leeuw, P. Doornaert, O. Chouvalova, R. J. Steenbakkers, P. W. Koken, C. R. Leemans, S. F. Oosting, J. L. Roodenburg, B. F. van der Laan, et al. Patterns of long-term swallowing dysfunction after definitive radiotherapy or chemoradiation. Radiotherapy and Oncology, 117(1):139–144, 2015.
OpenUrl PubMed

[4] 4.
K. A. Hutcheson, Z. Nurgalieva, H. Zhao, G. B. Gunn, S. H. Giordano, M. K. Bhayani, J. S. Lewin, and C. M. Lewis. Two-year prevalence of dysphagia and related outcomes in head and neck cancer survivors: An updated SEER-Medicare analysis. Head & Neck, 41(2):479–487, 2019.
OpenUrl PubMed

[5] 5.
C. E. Barbon, C. M. Yao, C. P. Alvarez, R. P. Goepfert, C. D. Fuller, S. Y. Lai, N. D. Gross, and K. A. Hutcheson. Dysphagia profiles after primary transoral robotic surgery or radiation for oropharyngeal cancer: A registry analysis. Head & Neck, 43(10):2883–2895, 2021.
OpenUrl PubMed

[6] 6.
C. J. Tsai, T. M. Hofstede, E. M. Sturgis, A. S. Garden, M. E. Lindberg, Q. Wei, S. L. Tucker, and L. Dong. Osteoradionecrosis and radiation dose to the mandible in patients with oropharyngeal cancer. International Journal of Radiation Oncology, Biology, Physics, 85(2):415–420, 2013.
OpenUrl CrossRef PubMed

[7] 7.
L. V. van Dijk, A. A. Abusaif, J. Rigert, M. A. Naser, K. A. Hutcheson, S. Y. Lai, C. D. Fuller, S. Mohamed. Normal Tissue Complication Probability (NTCP) prediction model for osteoradionecrosis of the mandible in patients with head and neck cancer after radiation therapy: Large-scale observational cohort. International Journal of Radiation Oncology, Biology, Physics, 111(2):549–558, 2021.
OpenUrl CrossRef PubMed

[8] 8.↵
S. Hosseinian, M. Hemmati, C. Dede, T. C. Salzillo, L. V. van Dijk, A. S. Mohamed, S. Y. Lai, A. J. Schaefer, and C. D. Fuller. Cluster-based toxicity estimation of osteoradionecrosis via unsupervised machine learning: Moving beyond single dose-parameter normal tissue complication probability by using whole dose-volume histograms for cohort risk stratification. International Journal of Radiation Oncology, Biology, Physics, 119(5): 1569–1578, 2024.
OpenUrl CrossRef PubMed

[9] 9.↵
A. J. Frankart, M. J. Frankart, B. Cervenka, A. L. Tang, D. G. Krishnan, and V. Takiar. Osteoradionecrosis: Exposing the evidence not the bone. International Journal of Radiation Oncology, Biology, Physics, 109(5):1206–1218, 2021.
OpenUrl CrossRef PubMed

[10] 10.↵
J. L. Barker, A. S. Garden, K. K. Ang, J. C. ODaniel, H. Wang, L. E. Court, W. H. Morrison, D. I. Rosenthal, K. S. Chao, S. L. Tucker, R. Mohan, and L. Dong. Quantification of volumetric and geometric changes occurring during fractionated radiotherapy for head-and-neck cancer using an integrated CT/linear accelerator system. International Journal of Radiation Oncology, Biology, Physics, 59(4):960–970, 2004.
OpenUrl CrossRef PubMed Web of Science

[11] 11.
M. Beltran, M. Ramos, J. J. Rovira, S. Perez-Hoyos, M. Sancho, E. Puertas, S. Benavente, M. Ginjaume, and J. Giralt. Dose variations in tumor volumes and organs at risk during IMRT for head-and-neck cancer. Journal of Applied Clinical Medical Physics, 13(6):3723, 2012.
OpenUrl PubMed

[12] 12.↵
W. Chen, P. Bai, J. Pan, Y. Xu, and K. Chen. Changes in tumor volumes and spatial locations relative to normal tissues during cervical cancer radiotherapy assessed by cone beam computed tomography. Technology in Cancer Research and Treatment, 16(2):246–252, 2017.
OpenUrl

[13] 13.↵
J. Castelli, A. Simon, C. Lafond, N. Perichon, B. Rigaud, E. Chajon, B. De Bari, M. Ozsahin, J. Bourhis, and R. de Crevoisier. Adaptive radiotherapy for head and neck cancer. Acta Oncology, 57(10):1284–1292, 2018.
OpenUrl

[14] 14.
M. Figen, D. Ç. Öksüz, E. Duman, R. Prestwich, K. Dyker, K. Cardale, S. Ramasamy, P. Murray, and M. Şen. Radiotherapy for head and neck cancer: Evaluation of triggered adaptive replanning in routine practice. Frontiers in Oncology, 59(4):960–970, 2020.
OpenUrl

[15] 15.
R. Avgousti, C. Antypas, C. Armpilia, F. Simopoulou, Z. Liakouli, P. Karaiskos, V. Kouloulias, E. Kyrodimos, L. A. Moulopoulos, and A. Zygogianni. Adaptive radiation therapy: When, how and what are the benefits that literature provides? Cancer/Radiothérapie, 26(4):622–636, 2022.
OpenUrl

[16] 16.↵
B. M. Beadle and A. W. Chan. The potential of adaptive radiotherapy for patients with head and neck cancer–too much or not enough? JAMA Oncology, 9(8):1064–1065, 2023.
OpenUrl PubMed

[17] 17.↵
E. Huynh, A. Hosny, C. Guthier, S. F. Bitterman, D. A. Hass-Kogan, B. Kann, H. J. W. L. Aerts, and R. H. Mak. Artificial intelligence in radiation oncology. Nature Reviews Clinical Oncology, 17:771–781, 2020.
OpenUrl PubMed

[18] 18.
V. Andrearczyk, V. Oreiller, S. Boughdad, C. Cheze Le Rest, H. Elhalawani, M. Jreige, J. O. Prior, M. Vallières, D. Visvikis, M. Hatt, et al. Overview of the HEKTOR challenge at MICCAI 2021: Automatic head and neck tumor segmentation and outcome prediction in PET/CT images. In 3D Head and Neck Tumor Segmentation in PET/CT Challenge, pages 1–37. Springer, 2021.

[19] 19.
J. Ren, J. G. Eriksen, J. Nijkamp, and S. S. Korreman. Comparing different CT, PET and MRI multi-modality image combinations for deep learning-based head and neck tumor segmentation. Acta Oncologica, 60(11):1399–1406, 2021.
OpenUrl

[20] 20.
R. R. Outeiral, P. Bos, A. Al-Mamgani, B. Jasperse, R. Simões, and U. A. van der Heide. Oropharyngeal primary tumor segmentation for radiotherapy planning on magnetic resonance imaging using deep learning. Physics and Imaging in Radiation Oncology, 19:39–44, 2021.
OpenUrl

[21] 21.
K. A. Wahid, S. Ahmed, R. He, L. V. van Dijk, J. Teuwen, B. A. McDonald, V. Salama, A. S. Mohamed, T. Salzillo, C. Dede, et al. Evaluation of deep learning-based multiparametric MRI oropharyngeal primary tumor auto-segmentation and investigation of input channel effects: Results from a prospective imaging registry. Clinical and Translational Radiation Oncology, 32:6–14, 2022.
OpenUrl

[22] 22.
J. Sahlsten, J. Jaskari, K. A. Wahid, S. Ahmed, E. Glerean, R. He, B. H. Kann, A. Mäkitie, C. D. Fuller, M. A. Naser, et al. Application of simultaneous uncertainty quantification and segmentation for oropharyngeal cancer use-case with Bayesian deep learning. Communications Medicine, 4(1):110, 2024.
OpenUrl PubMed

[23] 23.↵
L Humbert-Vidan, K. A. Wahid, Z. Kaffey, S. Mirbahaeddin, J. Curiel, S. Acharya, A. Shekha, J. M. Rigert, C. Dede, S. Attia, et al. Deep learning-based auto contouring of mandibular sub-volumes based on the ClinRad system for spatial localization of osteoradionecrosis of the jaw. International Journal of Radiation Oncology, Biology, Physics, 120(2):e141–e142, 2024.
OpenUrl

[24] 24.↵
C. A. van den Berg and E. F Meliadò. Uncertainty assessment for deep learning radiotherapy applications. In Seminars in Radiation Oncology, volume 32, pages 304–318. Elsevier, 2022.
OpenUrl

[25] 25.
M. Claessens, C. Seller Oria, C. L. Brouwer, B. P. Ziemer, J. E. Scholey, H. Lin, A. Witztum, O. Morin, I. El-Naqa, W. van Elmpt, et al. Quality assurance for AI-based applications in radiation therapy. In Seminars in Radiation Oncology, volume 32, pages 421–431. Elsevier, 2022.
OpenUrl

[26] 26.
R. R. Outeiral, N. F. Silvério, P. J. González, E. E. Schaake, T. Janssen, U. A. van der Heide, and R. Simões. A network score-based metric to optimize the quality assurance of automatic radiotherapy target segmentations. Physics and Imaging in Radiation Oncology, 28:100500, 2023.
OpenUrl

[27] 27.
Z. Kaffey, K. A. Wahid, D. Farris, L. Humbert-Vidan, T. Netherton, G. Balakrishnan, A. C. Moreno, M. Naser, C. D. Fuller, D. Fuentes, et al. Uncertainty quantification in machine learning radiotherapy applications–A scoping review. International Journal of Radiation Oncology, Biology, Physics, 120(2):e631, 2024.
OpenUrl

[28] 28.↵
K. A. Wahid, J. Sahlsten, J. Jaskari, M. J. Dohopolski, K. Kaski, R. He, E. Glerean, B. H Kann, A. Mäkitie, C. D. Fuller, et al. Harnessing uncertainty in radiotherapy auto-segmentation quality assurance. Physics and Imaging in Radiation Oncology, 29:100526, 2024.
OpenUrl

[29] 29.↵
J. Heukelom, M. E. Kantor, A. S. Mohamed, H. Elhalawani, E. Kocak-Uzel, T. Lin, J. Yang, M. Aristophanous, C. R. Rasch, C. D. Fuller, and J. Sonke. Differences between planned and delivered dose for head and neck cancer, and their consequences for normal tissue complication probability and treatment adaptation. Radiotherapy and Oncology, 142:100–106, 2020.
OpenUrl PubMed

[30] 30.↵
F. Nosrat, C. Dede, L. B. McCullum, R. Garcia, A. S. Mohamed, J. G. Scott, J. E. Bates, B. McDonald, K. A. Wahid, M. A. Naser, R. He, A. C. Moreno, L. V. van Dijk, K. K. Brock, J. Heukelom, S. Hosseinian, M. Hemmati, A. J. Schaefer, and C. D. Fuller. Optimal timing of organs-at-risk-sparing adaptive radiation therapy for head-and-neck cancer under replanning resource constraints. medRxiv, 2024. DOI: 10.1101/2024.04.01.24305163.
OpenUrl Abstract/FREE Full Text

[31] 31.↵
J. R. Beck and S. G. Pauker. The Markov process in medical prognosis. Medical Decision Making, 3(4):419–458, 1983.
OpenUrl CrossRef PubMed Web of Science

[32] 32.
E. A. Feinberg and A. Shwartz. Handbook of Markov Decision Processes: Methods and Applications. Springer, 2002.

[33] 33.
O. Alagoz, L. M. Maillart, A. J. Schaefer, and M. S. Roberts. The optimal timing of living-donor liver transplantation. Management Science, 50(10):1420–30, 2004.
OpenUrl

[34] 34.
M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, 2005.

[35] 35.
S. M. Shechter, M. D. Bailey, A. J. Schaefer, and M. S. Roberts. The optimal time to initiate HIV therapy under ordered health states. Operations Research, 56(1):20–33, 2008.
OpenUrl CrossRef Web of Science

[36] 36.
B. T. Denton, M. Kurt, N. D. Shah, S. C. Bryant, and S. A. Smith. Optimizing the start time of statin therapy for patients with diabetes. Medical Decision Making, 29(3):351–67, 2009.
OpenUrl CrossRef PubMed Web of Science

[37] 37.
O. Alagoz, H. Hsu, A. J. Schaefer, and M. S. Roberts. Markov decision processes: A tool for sequential decision making under uncertainty. Medical Decision Making, 30(4):474–483, 2010.
OpenUrl CrossRef PubMed Web of Science

[38] 38.
K. M. Kuntz, L. B. Russell, D. K. Owens, G. D. Sanders, T. A. Trikalinos, and J. A. Salomon. Decision models in cost-effectiveness analysis. Cost-Effectiveness in Health and Medicine. Oxford University Press, 2016.

[39] 39.↵
S. P. Ng, T. Ajayi, A. J. Schaefer, C. Pollard III., H. Bahig, A. S. Garden, D. I. Rosenthal et al. Surveillance imaging for patients with head and neck cancer treated with definitive radiotherapy: A partially observed Markov decision process model. Cancer, 126(4):749–756, 2020.
OpenUrl PubMed

[40] 40.↵
L. B. McCullum, A. Karagoz, C. Dede, R. Garcia, F. Nosrat, M. Hemmati, S. Hosseinian, A. J. Schaefer, and C. D. Fuller. Markov models for clinical decision-making in radiation oncology: A systematic review. Journal of Medical Imaging and Radiation Oncology, 68:610–623, 2024.
OpenUrl

[41] 41.↵
D. Husereau, M. Drummond, F. Augustovski, E. de Bekker-Grob, A. H. Briggs, C. Carswell, L. Caulley, N. Chaiyakunapruk, D. Greenberg, E. Loder, et al. Consolidated health economic evaluation reporting standards (CHEERS) 2022 explanation and elaboration: A report of the ISPOR CHEERS II Good Practices Task Force. Value in Health, 25(1):10–31, 2022.
OpenUrl CrossRef PubMed

[42] 42.↵
R. J. Walls and C. McLaughlin. Comparison and breakdown of cost in head and neck cancer radiation therapy and adaptive replanning. International Journal of Radiation Oncology, Biology, Physics, 117(2):e632, 2023.
OpenUrl

[43] 43.↵
L. Perrier, F. Balusson, M. Morelle, J. Castelli, J. Thariat, K. Bénézery, A. Hasbini, B. Géry, Berger, X. Liem, et al. Cost-effectiveness of weekly adaptive radiotherapy versus standard IMRT in head and neck cancer alongside the ARTIX trial. Radiotherapy and Oncology, 193:110116, 2024.
OpenUrl PubMed

[44] 44.↵
A. Barsouk, J. S. Aluru, P. Rawla, K. Saginala, and Alexander Barsouk. Epidemiology, risk factors, and prevention of head and neck squamous cell carcinoma. Medical Sciences, 11(2):42, 2023.
OpenUrl PubMed

[45] 45.↵
N. P. Brodin, R. Kabarriti, M. Pankuch, C. B. Schechter, V. Gondi, S. Kalnicki, C. Guha, M. K. Garg, and W. A. Tomé. A quantitative clinical decision–support strategy identifying which patients with oropharyngeal head and neck cancer may benefit the most from proton radiation therapy. International Journal of Radiation Oncology, Biology, Physics, 104(3):540–552, 2019.
OpenUrl PubMed

[46] 46.↵
Institute for Clinical and Economic Review. https://icer.org/news-insights/press-releases/semaglutide_evidence_report/. Accessed: 06-12-2024.

[47] 47.↵
Markov Decision Process (MDP) Toolbox for Python. https://pymdptoolbox.readthedocs.io/en/latest/. Accessed: 06-12-2024.

[48] 48.↵
M. T. Spaan. Partially observable Markov decision processes. In Reinforcement Learning: State-of-the-Art, pages 387–414. Springer Berlin Heidelberg, 2012.

Cost-Effectiveness of Personalized Policies for Implementing Organ-at-Risk Sparing Adaptive Radiation Therapy in Head and Neck Cancer: A Markov Decision Process Approach

Abstract

Introduction

Methods and Materials

Data

Decision Model

Decision epochs

States

Actions

Transition probabilities

Immediate rewards

Terminal rewards

Results

Model Validation

Discussion

Conclusion

Data Availability

APPENDIX A CHEERS 2022 Checklist

APPENDIX B Patient Characteristics1

APPENDIX C Transition Probabilities2

APPENDIX D Terminal Rewards

APPENDIX E Quality-of-Life Factors3

Footnotes

References

Citation Manager Formats

Subject Area

APPENDIX B Patient Characteristics¹

APPENDIX C Transition Probabilities²

APPENDIX E Quality-of-Life Factors³