ABSTRACT
Checkpoint inhibitor therapy of cancer has led to markedly improved survival of a subset of patients in multiple solid malignant tumor types, yet the factors driving these clinical responses or lack thereof are not known. Here we present a translational mathematical model dependent on three key parameters for describing efficacy of checkpoint inhibitors in human cancer: tumor growth rate (α), tumor immune infiltration (Λ), and immunotherapy-mediated amplification of anti-tumor response (µ). The model was validated by fitting it to a compiled clinical tumor response dataset (n = 189 patients) obtained from published anti-PD-1 and anti-PD-L1 clinical trials, and then validated on an additional validation cohort (n = 64 patients) obtained from our in-house clinical trials. The derived parameters Λ and µ were both significantly different between responding versus non-responding patients. Of note, our model appropriately classified response in 81.4% of patients by using only tumor volume measurements and within two months of treatment initiation in a retrospective analysis. The model reliably predicted clinical response to the PD-1/PD-L1 class of checkpoint inhibitors across multiple solid malignant tumor types. Comparison of model parameters to immunohistochemical measurement of PD-L1 and CD8+ T cells confirmed robust relationships between model parameters and their underlying biology, demonstrating reliable methods to inform model parameters directly from biopsy samples, which are conveniently obtainable as early as the start of treatment. Together, these results suggest that the model parameters may serve as early and robust biomarkers of the efficacy of checkpoint inhibitor therapy on an individualized per-patient basis.
SIGNIFICANCE Checkpoint inhibitors have revolutionized cancer immunotherapy, but only a subset of patients with solid tumors responds clinically. The ability to predict tumor responses a priori or soon after starting therapy would allow for personalized and timely adaptive clinical applications of checkpoint inhibitor- based immunotherapy in patients. By applying a mechanistic mathematical model, we show that checkpoint inhibitor therapeutic effectiveness is accurately predictable in most patients within two months after treatment initiation. Our method may be implemented directly into clinical practice, as it relies on standard-of-care imaging and pathology. If successful in prospective studies, this model will improve selection of cancer patients for checkpoint inhibitor therapy, and perhaps for other forms of humoral- or cell-based immunotherapy.
INTRODUCTION
Recent advances in the understanding of immunological pathways responsible for antibody- and/or cell- mediated destruction of tumors has led to the development of unique cancer therapeutics in recent years, leading to markedly improved survival in the setting of previously intractable metastatic melanoma, bladder, kidney, lung, and head and neck cancers, among several other human solid tumor types (1-3). In particular, one of the most successful clinical applications of immune checkpoint inhibitors include antibodies directed against the programmed death protein 1 (PD-1) pathway, which inhibits cellular immune killing of cancer cells via complementary binding of tumor expressed programmed death ligand 1 (PD-L1) to PD-1 on immune cells (4). As a still emerging yet quite compelling immunotherapy approach in contemporary cancer medicine, targeting of immune checkpoints is being extensively investigated in ongoing and upcoming clinical trials aimed at unleashing T-cell activity, augmenting immune recognition against tumor metastases, and boosting immune- memory for long-lasting clinical remission post-treatment (5, 6). While the remarkable potential for checkpoint inhibitors in treating cancer is unequivocally exciting, the combined clinical trial experience has shown that durable effective treatment outcomes occur only in a limited subset of patients (7). Alas, some cancer types are presumed to be minimally immunogenic, resulting in little or no response to this treatment strategy (8). An accumulating body of evidence has established that certain immunological features, including T-cell exhaustion and exclusion (9), senescence markers (10) such as CD27, Tim-3, CD57, and/or T-cell receptor (TCR) repertoires (11) could perhaps reflect or even predict sensitivity and resistance to checkpoint inhibitor-based cancer immunotherapy. However, early attempts at identification of specific pathological biomarkers to predict immunotherapy response have thus far shown somewhat inconsistent results, challenged further by the inherent molecular, cellular, and biophysical diversity of human tumors (12-14). Considering the sheer complexity of the biological interactions between the immune system and the tumor microenvironment, one could reason that the introduction of more sophisticated mathematical analytic techniques would hopefully have potential to enhance the qualitative and quantitative understanding of such interactions and to improve malignant tumor treatment with checkpoint inhibitors, ultimately supporting the development of a predictive clinical tool that could either minimize or overcome this clear unmet need of cancer medicine.
In previous work, mechanistic mathematical models of immunomodulatory interventions on cancer control utilizing coupled ordinary differential equations (ODE) have already allowed the prediction of tumor response after Bacillus Calmette-Guérin (BCG) administration in superficial transitional cell carcinoma of the urinary bladder and the serum prostate-specific antigen response after prostate cancer vaccine administration with considerable accuracy (15-17). Moreover, other investigators have shown that mechanistic models of interleukin-21 (IL-21) therapy schedules based on tumor mass and antigenic properties can predict growth patterns of multiple tumor types on patients receiving personalized doses of IL-21 (18). In another notable work, mathematical modeling of neoantigen fitness based on cancer population evolution and antigen recognition potential was able to predict tumor response and patient survival following treatment with checkpoint blockade therapy, as validated in both melanoma patients treated with anti-CTLA4 and in lung cancer patients treated with anti-PD-1 therapies (19). Finally, modeling of chemotherapy and targeted drug therapy by our group with similar mathematical techniques was able to accurately reproduce entire dose-response curves of tumor cell kill with untargeted cytotoxic and ligand-directed agents (20-24). These examples highlight the previous success of mathematical models to qualitatively or quantitatively discern the underlying biologically and physically relevant, often non-linear processes present in cancer, which may otherwise be missed, and help optimize treatment delivery approaches.
The purpose of the present study is to demonstrate how standard clinical measures, including radiological imaging assessments and immunohistochemical (IHC) analysis of tissue samples, may inform a mechanistic mathematical model to elucidate patient and tumor-specific attributes that would likely benefit from the application of checkpoint inhibitor-based therapy. The mathematical model presented in this report is based on an extensive series of prior methodological reports (20-34), but with rigorous emphasis on parameters related to the cancer immune response to the treatment with checkpoint inhibitors in the setting of patients with solid tumors. In particular, we have developed a model that contains key mechanistic biological and physical processes involved in checkpoint inhibitor therapy, which has been objectively validated against imaging data of patient tumor burden and measured response, defined by both change in total tumor burden and patient survival (35). To date, this modeling work has only examined tumor response on a bulk scale, without confirming the mechanistic links between key model parameters and the underlying biology they describe. In this work, we demonstrate how model parameters may be informed by using cell-scale IHC analysis of biopsied tissues, which are available as early as at the start of treatment, for reliable prediction of therapeutic response. Thus, this represents a key step towards clinical translation, (i) by validating prior results in additional patient cohorts, and (ii) by demonstrating how IHC may be used with the model to predict patient response at times closer to the start of treatment. The predictions obtained through this model were compared against available retrospective clinical data from published trials with monoclonal antibodies blocking the PD-1/PD-L1 pathway for validation, thereby focusing on a quantitative relationship between tumor response and its underlying mechanisms. Taken together, these retrospective results indicate that our predictive mathematical model with the class of PD-1/PD-L1 checkpoint inhibitors has translational merit and that it should therefore be evaluated as an integral predictive biomarker in the setting of carefully designed prospective cancer investigational trials.
METHODS
Mathematical Model
The final mathematical model we used for comparison with clinical data is a single ODE, which determines changes in relative tumor mass over time after initiation of anti-PD1 therapy. It is a simplified and user-friendly version originated from a complex set of partial differential equations, which takes into account spatial relationships within the tumor microenvironment. The full mathematical derivation has been demonstrated elsewhere (35), but its underlying cancer biology is also schematically depicted in Figure 1. It is ultimately represented by: where ρ’ is the normalized tumor mass, t is time, α is a proliferation constant of tumor cells, Λ is the ratio of cancer cells (σ) to effector immune cells multiplied by the ability of immune cells (ψ) to kill cancer cells (namely, the anti-tumor immune state), µ is the effect of immunotherapy on the ability of immune cells to kill cancer cells, and ρ∞ represents the final long-term tumor burden, which may be calculated as: In words, Equation (1) describes the time-course tumor burden under immunotherapy intervention, as a function of three key parameters α, Λ, and µ, which represent the ability of malignant tumor cells to grow, the ability of immune cells to kill cancer cells within the tumor environment, and the potential efficacy of checkpoint inhibitor-based immunotherapy treatment. Equation (2) states that long-term tumor burden is a function of the relationship between tumor growth and kill rates [the greater of which sets the sign of response, i.e., progressive disease (α > μ) or favorable clinical response (α < μ)], as scaled by the parameter μ⋅ ⋀. These three parameters are intrinsically abstract terms, which have multiple components in themselves. Λ is dependent on the density of malignant tumor versus immune cells at the initiation of immunotherapy (t = 0) and on the ability of each immune cell to kill tumor cells. Equation (1) assumes the following five assumptions: (i) Tumor mass grows exponentially with rate α in the absence of immune cell killing, (ii) cancer cells are killed by one type of net effector immune cell (termed 𝜓), (iii) the influx of immune cells and the concentration of checkpoint inhibitor within the tumor environment reaches a drug steady-state on timescales far faster (i.e., from hours to days) than clinical response (i.e., from weeks to months), (iv) all cancer cells are within an average intratumoral movement path length of effector immune cells, and the effect of cytokines in stimulating immune cell function/movement is small, thus spatial diversity effects are negligible in this particular case, and (v) the site of action of the checkpoint inhibitor takes place at the interface between the tumor cell and immune cell, with blocking of one of the immune checkpoint receptors, either on the cancer or on the immune cell, which are deemed necessary and sufficient for blockade of the immune inhibitory pathway.
While one must recognize that although these assumptions certainly represent a gross biological generalization of the true tumor-immune microenvironment complexity, they enable direct comparison with clinical response data, in which measurements of all the parameters required for a full description of immune system reaction to tumors are often not readily available in conventional clinical settings. In this retrospective mechanistic analysis, we start with a simple biophysical model to validate clinical findings and the mechanistic links between high-level model parameters and the underlying biology they describe, with the possible addition of further refining variables as an aspirational goal for future prospective translational or clinical cancer studies. The mathematical simulations and procedures conducted are fully described in the Supplementary Information (Figures S1 and S2); finally, we also address the implications of these arbitrary choices in the Discussion.
Clinical Evaluation and Fitting of the Mathematical Model
To evaluate Equation (1) with clinical data, we initially focused on clinical trials with the class of checkpoint inhibitors that specifically target the PD-1/PD-L1 pathway, although the model may be adaptable to other types of checkpoint inhibitors affecting the CTLA-4 pathway or perhaps even to other cell-based immunotherapies, such as CAR T-cells, as appropriate. In this analysis, tumor volumes over time measurements were obtained from six published clinical trial reports (1, 36-40). As an initial retrospective proof-of-concept, five clinical trials of anti-PD-1 therapy plus another clinical trial of anti- PD-L1 treatment (Table 1) were included, and the combined dataset representing several common solid tumor pathology types (n = 189 cancer patients); this dataset represents our model calibration cohort. An online tool [Web Plot Digitizer (41)] served to extract the individual patient tumor response data over time from the published clinical trials, and subsequently Equation (1) was fit to these individual and collective datasets to derive the parameters α, Λ, and µ.
The fitting procedure was performed in two sequential steps. First, the fastest progressing patient in each trial was fit to a simple exponential function to estimate α for each cancer histopathology (i.e., approximation for uninhibited tumor doubling-time). Although this approximation represents a limiting factor in the study, it was necessary due to lack of pre-treatment tumor measurement in the calibration cohort, which would allow for more accurate estimation of the growth kinetic without clinical intervention. As presented, α may represent the fastest tumor growth potential in the cohort thus and overestimate the true average growth kinetic; however, because it likely represents the patient with the least response to treatment in each cohort, we take it to be a reasonable approximation of tumor growth kinetic unaffected by treatment. Second, this cancer-specific α was used to determine the remaining parameters, Λ and µ, for each individual patient by fitting Equation (1) to time-course tumor burden data (Figure 2 and Figure S1). The curve fitting was performed by using non-linear least squares and the Mathematica function NonLinearModelFit (42). The workflow for deriving the model parameters from the clinical data is depicted in Figure S1.
The results from this model calibration was checked against an additional validation patient cohort from the University of Texas M.D. Anderson Cancer Center (MDACC). Briefly, data from patients (n = 64) with non-small cell lung cancer (NSCLC) treated with pembrolizumab (MK-3475) were obtained from the investigational study NCT02444741. Serial lung tumor measurements were taken on post-contract chest CT images acquired with 2.5 mm slice thickness. All lesions that were previously indexed by the radiologist were included in order to best reproduce actual clinical practice (patients had a range of 1 to 12 indexed lesions; median, 3 indexed lesions). Lesion volumes were estimated as 3D spheres calculated from the geometric mean of long and short axes of each lesion at each follow-up, and then volumes were summed to generate a total volume for all indexed lesions at each follow-up time point. Because imaging data before the start of pembrolizumab treatment was available for each patient within this cohort, we were able to obtain a patient-specific pre-treatment growth rate α for each patient between the imaging immediately preceding start of treatment and at the time of first immunotherapy treatment by using the exponential growth-rate estimation p(t) ≈ eα·t between tumor burdens measured via imaging collected before treatment and at start of treatment. These per-patient α values were then used in Equation (1) to obtain values for Λ and µ, as described above.
Sensitivity Analysis and Model Simulations
A sensitivity analysis was next performed on Equation (1) in two independent ways (Figure S2). First, each of the parameters Λ and µ were individually altered by ±10% while holding the other parameters constant to simulate the change in relative tumor mass (p′). Second, individual values of p′ in the calibration cohort were varied by a uniform random variable by ±0.1 (i.e., 10%) and Equation (1) was re- fitted to these modified data to determine any changes in the derived values for Λ and µ. To determine stability of derived Λ and µ values over time, we also truncated the p′ data at differing time points (namely, t < 30, 60, 120, 200 days), and subsequently re-fitted Equation (1) to this truncated data to determine Λ and µ. The correlation between the derived values of Λ and µ from the truncated datasets and the full composite dataset was then determined by using Spearman correlation analysis. A simulated long-term RECIST v1.1 score was then determined for each patient by projecting the expected tumor burden at t = 700 days (chosen as a time point beyond the last reported follow-up for most of the patients) by using these values of Λ and µ derived from the truncated fits in order to compare the predicted RECIST scores to the known retrospective scores. Finally, full model simulations were performed at different levels of α, Λ and µ to determine values of p′ based on input parameter variation.
Validation of Λ and µ by Tumor-infiltrating Immune Cells and Immunostaining
The parameters Λ and µ derived from fitting the clinical response data were compared to pathologic biomarkers from additional anti-PD-1 or PD-L1 clinical trials (1, 3, 8, 13, 38-40, 43-48), as shown in Table S1. Specifically, we investigated the ability of Λ and µ to correlate with CD8+ tumor-infiltrating lymphocytes (TILs) and tumor PD-L1 expression, respectively. To compare the values of Λ and µ derived from fitting the clinical response data over time with the trials assessing TIL and PD-L1 expression, the extracted cancer patients were categorized into objective responders versus non-responders by applying the Response Evaluation Criteria in Solid Tumors (RECIST) v1.1 (49); note that resist categories determined by lesion diameter (D) or our volume estimation are mathematically comparable by Volume = 4/3π(D/2)3. Specifically, patients from all RECIST categories (49) were condensed into two groups for analysis, with patients who had ≥ 30% reduction in tumor burden (partial or complete response) grouped into the “favorable” response group, and patients with either < 30% disease reduction (stable disease or disease progression) into the “unfavorable” response group. The last recorded time point from the response curve from each individual patient served to designate the RECIST v1.1 category, as this provides the most accurate long-term treatment outcome.
After stratifying each patient into a RECIST v1.1 category, Λ and µ were compared between responding and non-responding patients by using a Wilcoxon rank sum test and compared to the literature either as a continuous variable (Λ) or by using thresholds (µ). For comparison with biomarkers reported in the literature, Λ was converted to an estimated intratumoral CD8+ T-cell count (for details, the interested reader is referred to (35)) by assuming each CD8+ T-cell would kill one tumor cell on average (the assumed mean “fitness” of the immune cell population (50)), and that there were 5,558 cells/mm2 in the tumor microenvironment, as has been quantitatively measured in melanoma (51). For comparison with PD-L1 staining, µ was converted from its raw numerical value to a percentage; no other scaling of the variable was performed as the number of cancer cells bound by anti-PD-1/PD-L1 therapy action is the dominant term in the integral specified in Equation S4 in ref. (35); this approximation and its implications are further examined in Discussion. Assessment of objective response rates (ORR) at standard thresholds of 1% and 5% PD-L1 staining (corresponding to µ) were used in this study. Other alternative PD-L1 cutoffs present in some studies (Table 1 and Table S1) were not included in this analysis due to the relatively small numbers of patients. The biomarker trials used for validation of these parameters are also presented in Table S1.
RESULTS
Quantification of Model Parameters α, Λ and µ
Measured tumor burden data over time from the total collective calibration patient pool (n = 189) after initiating checkpoint inhibitor-based therapy were extracted from the six reported clinical trials (Table 1). Of this population, 55 patients (29%) had objective responses by RECIST v1.1 criteria while 134 patients (71%) demonstrated stable/progressive disease. The derived tumor proliferation constant, α, was determined to range from 0.0034 (tumor doubling-time of ∼200 days) in renal cell carcinoma to 0.0622 (tumor doubling-time of ∼11 days) in non-mismatch repair colon cancer with an average α of 0.018 and average tumor doubling-time of ∼85 days (Table 1). In the validation NSCLC cohort, 25 patients (39%) had objective response, and 39 patients (61%) demonstrated stable/progressive disease, with individual patient proliferation constants (α) ranging from -0.0129 to 0.0602 (note that α < 0 indicates a subset of patients that had a shrinking tumor burden before start of therapy).
Having determined α for each cancer histopathology, the clinical response curves to checkpoint inhibitors for each tumor type were fit to Equation (1) to determine Λ and µ. The average root-mean- square error of fitting Equation (1) to the clinical data was only 0.4%. A sample exponential fit to extracted melanoma time course data are shown (Figure S1, second panel, red curve). A sample of these curve fits for patients with melanoma from the trial by Topalian et al. (40) is depicted in Figure 2. The derived fits from all 189 patient response curves in the calibration cohort yielded a mean ± standard error of the mean (S.E.M.) value of Λ of 0.714 ± 0.257 in patients with partial/complete response and 0.0995 ± 0.0264 in patients with stable/progressive disease (p = 0.119; Wilcoxon 2-tail), while the mean ± S.E.M. value of μ was 0.054 ± 0.014 versus 0.013 ± 0.0012 in partial/complete response versus stable/progressive disease, respectively (p < 0.001). In the validation NSCLC cohort, the mean ± S.E.M. value of Λ was 0.876 ± 0.102 in patients with partial/complete response and 0.0297 ± 0.469 in patients with stable/progressive disease (p < 0.001), while the mean ± S.E.M. value of μ was 0.0529 ± 0.00982 versus -0.0064 ± 0.0032 in partial/complete response versus stable/progressive disease (p < 0.001). These values are depicted in Figure 3. These results suggest that, while Λ and µ give significantly separated classification ranges for partial/complete response versus stable/progressive disease, the specific value of the binary classification threshold is likely a function of the unique disease-drug combination used, as observed in our prior studies (52). This point is explored further in the Discussion.
Confirmation of Model Stability
The parameters α, Λ and µ were held constant during the sensitivity analysis at three different values for each parameter. These values were the minimum, average, and maximum values derived from fitting Equation (1) to the data as described above. Changing Λ and µ from these values ±10% yielded a maximum change in p′ of 9.2% at t = 200 days. Varying p’ data points randomly up to ±0.1 resulted in Spearman correlation coefficients of 0.766 and 0.919 for Λ and µ, respectively, derived from fitting these randomized data versus the actual clinical data. We note that, due to high stability observed to these perturbations in the large calibration cohort (n = 189 patients), we did not repeat this sensitivity analysis in the validation cohort. Λ values derived from fitting truncated p′ data for the calibration cohort at various time points displayed Spearman correlation coefficients with Λ derived from fitting the full data ranging from 0.071 at t < 30 days to 0.730 at t < 200 days, while Spearman correlation coefficients for µ ranged from 0.910 (t < 30 days) to 0.921 (t < 200 days) for all truncated datasets (Table 2). In the validation cohort, Spearman correlation coefficients for Λ ranged 0.800 at t < 30 days to 0.771 at t < 200 days, and 0.800 at t < 30 days to 0.989 at t < 200 days for μ. Simulated RECIST v1.1 categorization (complete/partial response vs. stable/progressive disease) by using the values of Λ and µ derived from fitting the truncated calibration cohort data resulted in 19% misclassification by using data from t < 60 days (n = 177) and 13% misclassification at t < 200 days (n = 189) when compared to the full dataset. In the validation cohort, we observed 18.6% misclassification at t < 60 days (n = 43) and 16.1% misclassification at t < 60 days (n = 62). Predicted normalized tumor mass after initiation of immunotherapy is depicted at different combinations of α, Λ and µ (Figure 4).
Analysis of Model Predictions
Using the binary classification of tumor response of partial/complete response as positive and stable/progressive disease as negative, receiver operator characteristic (ROC) analysis was performed to identify optimal predictive response thresholds for μ and Λ (Figure 3, insets), as higher values of μ and Λ are expected to correspond to a more favorable patient response. Identification of optimal predictive thresholds in the calibration cohort by maximizing the Youden’s J statistic revealed cutoff thresholds where sensitivity (the proportion of correctly identified patients with partial/complete response) was higher for both Λ and μ (0.945 and 0.891, respectively), while specificity was reduced (0.381 for Λ and 0.567 μ). Response prediction thresholds were identified as 0.722 for Λ and 0.00905 for μ. Testing these response threshold values in the validation cohort revealed sensitivity of 0.6 for Λ and for 0.960 μ, and specificity of 0.743 for Λ and 0.769 for μ in the validation cohort. We found that, in the calibration cohort, positive predictive values (PPV; the probability a patient will be a responder when the model predicts they will be a responder: 0.381 for Λ and 0.458 for μ) were lower than the negative predictive values (NPV; 0.889 for Λ and 0.927 for μ), indicating fewer false-negatives (stable/progressive disease) than false-positives (partial/complete response). This trend was confirmed in the validation cohort (a PPV of 0.600 for Λ and 0.723 for μ; an NPV of 0.744 for Λ and 0.968 for μ). Lastly, we note that overall accuracy was lower by both Λ and μ in the calibration cohort than were observed in the validatio cohort; this is presumably due to the misbalance of patients with ORR (29% = 55 patients) versus patients with stable/progressive disease (71%, = 134 patients) while sensitivity (correctly identified patients with ORR) was higher than specificity.
Comparison of Λ and µ with Clinical and Pathologic Data
Conversion of Λ to an estimated intratumoral CD8+ T-cell count yielded a predicted mean ± S.E.M. of 3,970 ± 1,429 cells/mm2 in patients with partial/complete response to checkpoint inhibition and 553 ± 147 cells/mm2 in patients with stable/progressive disease (Figure 5A) in the calibration cohort, and 4,871 ± 567 cells/mm2 vs. 165 ± 2,604 cells/mm2 in patients with partial/complete response vs. stable/progressive disease in the validation cohort. Intratumoral CD8+ T-cell counts referenced from Tumeh et al. (13) are derived from two scenarios, either encompassing data in the pre-treatment setting only (n = 46) or data including pathologic CD8+ T-cell counts encompassing both pre-treatment and on treatment (n = 23). In the pre-treatment setting, the average intratumoral CD8+ T-cell count was 2,632 ± 518 cells/mm2 in patients with partial/complete response to checkpoint inhibitors and 322 ± 133 cells/mm2 in patients demonstrating stable/progressive disease, while including on-treatment plus pre- treatment data yielded an average of 3,770 ± 675 cells/mm2 in patients with partial/complete response and 501 ± 113 cells/mm2 in patients with stable/progressive disease.
For comparison of µ with PD-L1 staining, data were extracted from a total of twelve published clinical trials (n = 975 cancer patients) with data on clinical response by using a PD-L1 staining cutoff of ≥ 1% versus < 1% and 1,492 patients with a cutoff of ≥ 5% versus < 5%. The ORR of patients with these thresholds was 15% for PD-L1 staining < 1% and 28% for PD-L1 staining ≥ 1%, and 21% and 38% for < 5% and ≥ 5%, respectively. The ORR of patients in the calibration cohort by using the same thresholds in the derived values of µ was therefore 10% and 45% for patients with µ < 1% and ≥ 1% respectively, and was 25% and 67% for patients with µ < 5% and ≥ 5%, while in the validation cohort, the model-predicted ORR was 9% and 71% for patients with µ < 1% and ≥ 1% respectively, and was 29% and 100% for patients with µ < 5% and ≥ 5% (Figure 5B).
DISCUSSION
The translational mathematical model introduced here retrospectively investigates the molecular, cellular, and biophysical mechanism(s) behind patient response to immune checkpoint inhibitor therapy, and it explores the potential clinical value of the early prediction of treatment effectiveness. The reductionist approach introduced here simplifies the rather complex biological cross-talk between the immune system with cancer cells to a set of differential equations ultimately dependent on three intrinsic parameters that represent tumor proliferation, intratumoral immune cell presence/killing efficiency (considered as baseline immune cell infiltration), and checkpoint inhibition efficacy; this latter parameter represents the active role of the checkpoint inhibitors in initiating and amplifying the anti- tumor response.
From a mechanistic standpoint, the mathematical model is fully compatible with the current understanding of the biological effect of checkpoint blockade and clinical subtypes of immunotherapy- responsive tumors (12). First, the parameter values derived from our model for intratumoral immune cell infiltration (Λ) and immunotherapy antibody efficacy (µ) correspond with the quantitative pathologic measures of intratumoral CD8+ T-cell infiltration and PD-L1 positive staining (Figure 5). Although parameters Λ and μ are not direct representations of CD8+ T-cell counts or PD-L1 expression, they correlated with measured pathological biomarkers, thus providing strong experimental evidence that these mathematical parameters may be quantified from these biomarkers. This offers a unique method for clinical translation using clinically-available measured quantities to provide personalized prediction of patient response, a potential that we have often found particularly lacking in prior mechanistic modeling research. By demonstrating links between model parameters and known biology, our model may also be used to quantify the associated mechanist underpinnings of treatment failure, with the goal of suggesting clinical interventions that may overcome these deficiencies to improve patient outcome. Unfortunately, the absence of per-patient measurements of CD8+ T-cell counts or PD- L1 expression corresponding to individual patient response to treatment in the cohorts examined prevented analysis of how these parameters perform on a per-patient basis. We are currently collecting in-house a new data set containing both measures for single patients, and the results of this study will be published once complete. We are also investigating how these biomarkers may inform the model to prospectively identify pseudoprogression (a clinical phenomenon wherein lesions are observed to experience rapid volume increase followed by disease response) on a per-patient basis, based on our previous result that retrospectively known long-term parameter values are unique in pseudoprogression (35).
Moreover, the model demonstrates that the rate of immunotherapy response arises from a non-linear interactions between both adequate effector immune cell infiltration in tumors and immunotherapy efficacy within the tumor microenvironment (Figure 4). Mechanistic mathematical modeling across several cancer types (Table 1) demonstrates that the overlying master equation [i.e., Equation (1)] may well be universally applied, while the model parameter values presented here are likely tumor-specific. Indeed, this conclusion may be observed in Figure 3, where Λ and µ are distinctly different for ORR versus stable or progressive disease, but the values are different for the different cohorts; thereby capturing mechanistic differences across the different cancer types. The non-trivial interaction of these inputs predicted by this model is different in comparison to many established and current candidate biomarker studies (which focus on linear immunotherapy responses to just one of these markers, e.g., PD-L1 expression (14)), perhaps providing a plausible explanation why single biomarkers sometimes provide inconsistent predictions across multiple cancer types (14). Similarly, the unique insights presented herein allow for a more direct comparison of the model parameters to common pathologic markers either reported in the literature or as yet to be reported, and provide a different perspective compared to other immune checkpoint inhibitor modeling studies (19, 53, 54).
The high sensitivity at early time points (t < 60 days) demonstrated here indicates this model may provide valuable early identification of cancer patients most likely to benefit from checkpoint inhibition therapy, with a high NPV indicating minimal false-negatives in the group predicted to receive the least benefit from therapy. Therefore, this attribute could serve as an early indicator of the efficacy of checkpoint inhibition by simply using either clinical or imaging measurements of index tumor lesions and by applying the mathematical model to project maximal effect for an individual patient. Further, the model demonstrates robustness and stability in the derived values of Λ and μ to random variations in p′, and by using values of Λ and μ derived from fitting limited data over time (Table 2). This result is in stark contrast to previous mathematical descriptions of immune-mediated killing of cancer cells, where a relatively small change in a single input parameter of merely ±1% could dramatically vary tumor response by up to ±50% (18).
Towards clinical translation of cancer models, a mathematical model with two simple terms (an exponential growth term and a regression term) has previously been used to successfully correlate and predict patient survival (55). The fundamental difference between this prior model (55) and the model reported here is that the quantification of cell death is more mechanistic, i.e., depending on the specific mechanisms (Figure 1) that underlie cancer cell and immunotherapy drug interactions, hence allowing one to gain more mechanistic insight into the treatment system. As demonstrated, model parameters Λ and µ can be used to stratify responding and non-responding patients (Figure 3), which has immediate potential utility as a biomarker for patient selection in prospective clinical trials. Moreover, the model contains biological values that, at least in principle, may either be measured (e.g., tumor burden, growth rate) or changed clinically (µ: e.g., drug dosing or dosing schedule; ⋀: e.g., radiotherapy-induced increase in PD-L1 expression, radiotherapy abscopal effect). Thus, we expect that the model (i) may be informed by using standard-of-care measurements (e.g., Figure 5), and (ii) can provide quantitative information on which parameters must be changed to maximize therapeutic benefit, providing clinicians with potentially valuable decision-making information.
Of course, as with any biomathematical model, this model is based on several educated assumptions about the interactions of immune effectors (CD8+ T-cells) with cancer cells, which naturally does not reflect the sheer magnitude and complexity of the tumor immune microenvironment. These include the presence of immunosuppressive cells (such as neutrophils, myeloid-derived suppressor cells, Treg, and M2-polarized tumor-associated macrophages), which extensively infiltrate into tumor mass and hinder the efficacy of immune checkpoint inhibitors (56). Moreover, effects of other immune cells, such as CD4+ T-cells, are not explicitly included in the current model form; while it is likely that some of the effects from these other cells is captured within the model representation of DC8+ T-cells, this assumption warrants future model development where each immune component is explicitly represented. Some of these assumptions (e.g., lack of immune cell/antibody infiltration terms) are supported with in vitro or in vivo data such as high rates of immune cell binding by anti-PD1 antibodies in the peripheral circulation (40) and local intratumoral (as opposed to systemic myelogenous) expansion of CD8+ T-cells during checkpoint inhibition (13, 57). Other potential model parameters would logistically be quite challenging to measure in an outpatient clinic-ready setting routinely at this time, and their inclusion remains open for follow-up confirmatory retrospective studies and ultimately for future prospective clinical trials. Moreover, the two response parameters derived from the model, Λ and μ, represent abstract terms, which may not precisely model intratumoral CD8+ lymphocyte infiltration and tumor cell PD-L1 staining and potentially can be modified by relevant determinants, such as tumor mutational burden (58) or the human microbiome of the gut (59). The integration of additional prognostic biomarkers and master regulators that intrinsically (e.g., genetic stability) and extrinsically control the efficacy of immunotherapy represent open research areas for future translational investigation.
Finally, a few operational aspects of this work deserve further comment. Response was defined using RECIST v1.1 criteria, in lieu of the more recent adaptation of RECIST criteria to immunotherapy treatment (iRECIST), because the studies used to derive the model parameters in this mechanistic analysis did not incorporate these new criteria, and our use of published data for the calibration cohort limited or precluded us from obtaining a priori knowledge about non-target lesions. These inherent study limitations notwithstanding, we will continue to improve our model to more accurately predict outcomes in the immunotherapy specific setting of individualized cancer patients, potentially through the inclusion of additional biomarkers, as bioactive molecules such as IFNγ, CD206, CX3CR1, CD1D, and iNOS, along with cell-mediated mechanisms known to have an effect on immune checkpoint inhibitor therapy (60, 61). However, we do not anticipate that such improvements and refinements would change the basic findings and the potential value of the mechanistic mathematical model reported in this work. In going forward, one hopes that knowledge broadly applicable across multiple cancer types and/or immune microenvironment control will be serially incorporated to the dynamic model upon their future discovery and validation.
In conclusion, we present a mechanistic model of immune checkpoint inhibition that is able to describe various immunotherapy response profiles, with the inputs to the model correlating with common pathologic biomarkers in current clinical use. An early and robust a priori predictor for checkpoint inhibition response and outcome might provide a glimpse of the immense potential for timely adjustments and therapeutic personalization. Future prospective investigations of this computational science-assisted approach will focus on readily available clinical data as inputs to the model and further refining the complex interplay between the immune system and the cancer environment to extract other important variables for immunotherapy efficacy. Ultimately, the merit of this approach will rely on its future ability to reliably predict early individual patient response with the goal of improving personalized cancer care.
Data Availability
The authors declare that all data supporting the results in this study are available within the paper and its Supplementary Information.
Author Contributions
V.C. and E.J.K. conceived and designed the project. J.D.B., G.V.M., Z.W., E.J.K., and V.C. contributed to development and performed analysis of the model. J.D.B., G.V.M., Z.W., B.C., M.F., N.E., C.C., D.S.H., J.W., N.H., E.A.M., S.A.C., S.H.C., P.Y.P., S.K.L., S.G., R.L.S., R.P., W.A., E.J.K., and V.C. analyzed and interpreted the data. J.D.B., G.V.M., Z.W., B.C., M.F., N.E., C.C., D.S.H., J.W., N.H., E.A.M., S.A.C., S.H.C., P.Y.P., H.M.C., S.K.L., S.G., R.L.S., R.P., W.A., E.J.K., and V.C. wrote and/or edited the manuscript.
Competing interests
The authors declare no conflict of interest.
Acknowledgements
We gratefully acknowledge support from the Andrew Sabin Family Fellowship, Center for Radiation Oncology Research, Sheikh Ahmed Center for Pancreatic Cancer Research, GE Healthcare, Philips Healthcare, and institutional funds from the University of Texas M.D. Anderson Cancer Center. We have also received Cancer Center Support Grants from the National Cancer Institute (P30CA016672 to the University of Texas M.D. Anderson Cancer Center and P30CA072720 the Rutgers Cancer Institute of New Jersey). This research has also been supported in part by grants from the National Science Foundation Grant DMS-1930583 (Z.W., V.C.), the National Institutes of Health (NIH) 1U01CA196403 (Z.W., V.C.), 1U01CA213759 (Z.W., V.C.), 1R01CA226537 (Z.W., R.P., W.A., V.C.), 1R01CA222007 (Z.W., V.C.), U54CA210181 (Z.W., V.C.), and the University of Texas System STARS Award (V.C.). B.C. acknowledges support through the SER Cymru II Programme, funded by the European Commission through the Horizon 2020 Marie Skłodowska-Curie Actions (MSCA) COFUND scheme and the Welsh European Funding Office (WEFO) under the European Regional Development Fund (ERDF). E.K. has also received support from the Project Purple, NIH (U54CA210181, U01CA200468, and U01CA196403), and the Pancreatic Cancer Action Network (16-65-SING). M.F. was supported through NIH/NCI center grant U54CA210181, R01CA222959, DoD Breast Cancer Research Breakthrough Level IV Award W81XWH-17-1-0389, and the Ernest Cockrell Jr. Presidential Distinguished Chair at Houston Methodist Research Institute. R.P. and W.A. received serial research awards from AngelWorks, the Gillson-Longenbaugh Foundation, and the Marcus Foundation. This work was also supported in part by grants from the National Cancer Institute to S.H.C. (R01CA109322, R01CA127483, R01CA208703, and U54CA210181 CITO pilot grant) and to P.Y.P. (R01CA140243, R01CA188610, and U54 CA210181 CITO pilot grant). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.