Evaluating Quality Adjusted Life Years in the Absence of Standard Utility Values- A Dynamic Joint Modelling Approach
====================================================================================================================

* Vishal Deo
* Gurprit Grover

## Abstract

Estimation of Quality Adjusted Life Years (QALYs) is pivotal towards cost-effectiveness analysis (CEA) of medical interventions. Most of the CEA studies employ multi-state decision analytic modelling approach, where fixed utility values are assigned to each disease state and total QALYs are calculated on the basis of total lengths of stay in each state.

In this paper, we have formulated a new approach to CEA by defining utility as a function of a longitudinal covariate which is significantly associated with disease progression. Association parameter between the longitudinal covariate and survival times is estimated through joint modelling of the longitudinal linear mixed effects model and the Weibull accelerated failure time survival model. Metropolis-Hastings algorithm and Monte Carlo integration are used to predict expected survival times of each censored case using the joint model. Fitted longitudinal model is further used to project values of the longitudinal covariate at all time points for each patient. Utility values calculated using these projected covariate values are used to evaluate QALYs for each patient.

Retrospective survival data of HIV/ AIDS patients undergoing treatment at the Antiretroviral Therapy centre of Ram Manohar Lohia hospital in New Delhi is used to demonstrate the implementation of the proposed methodology. A simulation exercise is also carried out to gauge the predictive capability of the joint model in projecting the values of the longitudinal covariate.

The proposed dynamic approach to calculate QALY provides a promising alternative to the popular multi-state decision analytic modelling approach, especially when the standard utility values for different stages of the concerned disease are not available.

Keywords
*   Health Economics
*   Joint Modelling
*   HIV/ AIDS
*   Proxy Utility Function
*   Weibull AFT
*   Metropolis-Hastings
*   Cost-effectiveness Analysis

## 1 Introduction

Calculation of Incremental Cost Effectiveness Ratio (ICER) based on gain in Quality Adjusted Life Years (QALYs) is a common technique for Cost Effectiveness Analysis (CEA) of a treatment regime. Quality of life at any stage of a disease is quantified in terms of utility values. Direct methods like visual analog scale (VAS) and standard gamble (SG) techniques, and indirect methods based on questionnaires, like EQ-5D, SF-6D, HUI2, HUI3 etc, are available to calculate standard utility values for different stages of a disease; refer [1-5]. Use of these utility values requires classification of disease progression into well defined stages. So, as a popular approach, multi-state models are used to estimate average gain in QALYs since the beginning of the treatment till the lifetime time horizon chosen for the study. Transition probabilities between different states of the disease can be estimated by fitting survival models to the transitions. Cox-Proportional Hazard (PH) models [6, 7], parametric survival models [8-11], Bayesian models [12] and flexible parametric models [13] are some of the most frequently applied models for this purpose. Transition probabilities are further used to estimate expected Total Length of Stay (TLOS) in each state till the lifetime time horizon of the study. TLOS is weighed in with respective utility value for each state and summed over all states to obtain total gain in QALY. This procedure for conducting CEA is called Markov/ multi-state Decision-Analytic Modelling; refer [14].

### Limitations imposed by multi-state decision-analytic models

Model states or disease stages are defined on the basis of one or more covariates(s) /biomarker(s) measured repeatedly at specific follow-up time points over the period of the study. These follow-up times can be at equal or unequal intervals, and are most likely to differ from patient to patient. In such cases, the exact time and path of transition from one stage to another is not known, which is likely to induce error in the estimation of transition probabilities and TLOS. Also, by compartmentalizing the progression of disease into predefined static states, we undermine the dynamics of disease progression within each state. For example, in CEA of Antiretroviral Therapy (ART) for HIV/ AIDS patients, stages are defined on the basis of CD4 cell counts measured at each visit of the patient. CD4 ≥ 500 => stage I, 350≤ CD4<499 => stage II, 200 ≤CD4<349 => stage III, and CD4< 200 => stage IV. So, while using multi-state decision-analytic modelling, five states are defined, where the fifth state is an absorbing state corresponding to death and all other states are transient and are accessible from each other. Suppose that a patient is found to be in state II at the start of ART (t=0) and in state IV at next visit (say t= 3 months). Since the information during the interval is missing, the possible transition paths could be II->III->IV or II-IV or II->I->IV and many more. This problem does not exist for diseases where the states are irreversible; however, the problem of lack of information on exact time of transitions still persists. Further, for the sake of discussion, let us define utility values for the five disease states as 1, 0.8, 0.6, 0.4 and 0 respectively. So, if a patient A has a CD4 count of 351 (stage II) at a visit and his/her next visit is after one year, utility value will be taken as 0.8 for the whole one year (i.e. 0.8 QALY gain). Whereas, if a patient B’s CD4 count is recorded as 348 (stage III) at a visit and his/her next visit is after one year, utility value will be considered as 0.6 for that one year (i.e. 0.6 QALY gain). That is, a significant difference in the calculated value of gain in QALY can exist even when, in reality, the two patients may have similar health status (ignoring other patient specific factors). Apart from this, if we assume same CD4 counts for any two patients, before and after an interval of a year, the dynamics of the changes in the CD4 count within that year may differ drastically between the patients. This problem is also not addressed by the multi-state approach.

### Issues with standard state specific utility values

While using standard utility values for specific states of a disease, calculated using direct (VAS, SG) or indirect (EQ-5D, SF-6D, HUI2, HUI3 etc.) methods, following issues may arise.

1.  In multi-state decision-analytic models, utilities obtained from direct methods, i.e. through individual level preferences, may vary significantly from those obtained from indirect methods based on community level preferences, refer [4].

2.  Individual level variability in utility values can exist because of certain factors and covariates like age, sex, BMI, smoking habit, income, social background, medical history etc., see for example [15].

3.  Spatial variability in standard utility scores of different regions is also expected because of the differences in economic, social, political and geographic status of the target populations. Shah et. al. [16] have given a commentary on how geomarkers can be useful in assessing patient level variability in risks. To understand through an example, an HIV patient, in any stage, staying in a developed country is expected to have a better quality of life than that of a patient in the same stage in a developing or underdeveloped country. This is because, apart from having better patient-care systems at place, developed countries also experience a wider social acceptability of HIV patients. However, on account of unavailability of region/ country specific standard utility values, researchers have time and again used standard utility values calculated for other regions/ countries with drastically different socio-economic and demographic conditions to conduct CEA. For example, while calculating gain in QALY for HIV/ AIDS patients undergoing ART in India, Bender et. al. [17] have obtained utility values from the studies in United States.

4.  Using questionnaire based techniques to obtain utility values can be challenging for certain diseases when, because of lack of social acceptability and fear of discrimination, patients are hesitant in revealing their identities.

5.  With fast paced changes in the patient care facilities and socio-economic standard of life, ideally, utility values should be updated regularly. This would consume a lot of financial and temporal resources.

The objective of this paper is to present a non multi-state approach to calculate QALY, using joint-modelling of longitudinal and survival data, and a proxy utility function. It is a more dynamic alternative to the multi-state decision analytic model approach. The idea is to define a function which calculates utility for each patient at each discrete time point (as per the unit of measurement of time considered in the study), say at each month, since the start of the treatment till death or till the lifetime time horizon of the study (for censored cases). This proxy utility function can be defined as a function of time-dependent covariates having statistically significant impact on hazard or survival time. Thus, this approach captures the continuous changes in quality of life of each patient separately, taking care of the patient level variability in utility values. Further, it traces the entire trajectory of disease progression instead of just monitoring transitions between few discrete states. Since, we are considering diseases where longitudinal repeated measures on some covariate(s)/ biomarker(s) are available along with the survival data, joint modelling of longitudinal and survival data is proposed for estimating the association of longitudinal variable(s) with hazard or with survival times. The estimated association parameter(s) is/are used to define weight(s) of the corresponding covariate(s) in the proxy utility function.

Proposed methodologies for the construction of proxy utility function, and for calculating QALY, are discussed thoroughly in the next section. In section 3, implementation of the proposed methodology is demonstrated on a retrospective data of HIV/AIDS patients undergoing ART at Ram Manohar Lohia hospital in New Delhi. A simulation exercise aimed at assessing predictive capability of the joint model is presented in section 4. Discussion on the results and concluding remarks are presented in section 5.

## 2 Methodology

Steps involved in the proposed methodology are listed below in the order of execution.

### 2.1 Joint Modelling of Longitudinal and Survival Data

In most of the studies related to chronic diseases, longitudinal data of patients comprises of repeated-measures on one or more time dependent covariates, in addition to the time-to-event data on the event of interest. For example, in HIV studies, patients’ follow-up data includes time till the occurrence of the event of interest (death or development of AIDS) and repeated measurements on the time-dependent biomarker (or covariate) CD4 lymphocyte count. Separate modelling of the longitudinal process and the time-to-event process fails in establishing association between the longitudinal progression of the covariate and the occurrence of the event. If our interest lies in assessing the impact of time-dependent covariate(s) on survival time, joint modelling serves as a better approach. Separate modelling does not take into account the fact that longitudinal measures on the time-dependent covariate are interrupted by occurrence of the event, thus introducing bias in the estimation; see [18]. Joint modelling approach overcomes this problem by maximizing the log-likelihood corresponding to the joint distribution of the longitudinal and time-to-event outcomes. Lucid descriptions of this approach can be found in the works of Tsiatis and Davidian [19], Yu et al. [20] and Rizopoulos [21].

#### Joint Model specification

Notations used in the specification of sub-models.

*n*: number of cases (patients).

![Graphic][1]</img>: True observed event time for i-th patient.

*Ci*: Censoring time for i-th patient.

*Ti*: event time for the i-th patient, where ![Graphic][2]</img>

![Graphic][3]</img>; is the event indicator which takes value 1 when the event has occurred, and 0 when observation is censored.

*yi*(*t*): value of the observed/ measured longitudinal outcome for patient *i* at time *t*.

*tij*: *j*-*th* occasion (time point) at which longitudinal response variable is observed for *i*-th patient; where (*j* = 1, 2,…, *ni*), i.e. number of times, and the time points at which, longitudinal responses are recorded for a patient can differ from patient to patient.

*yij*: value of the longitudinal outcome for *i*-th patient at *tij*.

*mi*(*t*): true (unobserved) value of the longitudinal outcome for *i*-th patient at time *t*. It is different from *yi*(*t*) as it is assumed to be free of any measurement error which is present in *yi*(*t*).

#### Longitudinal sub-model: Linear mixed effects model

Using the notations explained above, linear mixed effects model for i-th subject can be defined as follows; from [21].

![Formula][4]</img> 
Where, *τ* denotes the vector of the unknown fixed effects parameters, *bi* denotes a vector of random effects, *xi*(*t*) and *ωi*(*t*) denote row vectors of the design matrices for the fixed and random effects respectively, and *∊i*(*t*) denotes the measurement error term which follows Normal distribution and is assumed to be independent of *bi*. As a standard choice, the random effects are assumed to follow multinomial normal distribution.

#### Survival sub-model: Weibull Accelerated Failure Time (AFT) Model

Again, using the notations as mentioned above, the survival sub-model can be defined as follows,

![Formula][5]</img> 
Where, *wi* is a vector of baseline covariates and *θ* is a vector of coefficients corresponding to these baseline covariates. *α* measures the effect of the longitudinal outcome variable, *mi*(*t*), on the log of survival time and is known as the association parameter. The name ‘association parameter’ emphasizes on the fact that this parameter measures the extent of association between the course of the longitudinal outcome variable and the length of the survival time. Association parameter helps in structural implementation of the idea of joint modelling of the longitudinal and survival processes. *σs* is a scale parameter and *∊si* are independently and identically distributed random error terms following a Gumbel distribution, also known as extreme value distribution, see [22]. Under this assumption, survival time, *Ti*, follows Weibull distribution with probability distribution function (pdf), and survival function given in the equations (3) and (4) respectively.

![Formula][6]</img> ![Formula][7]</img> 
Where, ![Graphic][8]</img>, ![Graphic][9]</img>, and ![Graphic][10]</img> Parameters of both sub-models are estimated jointly by maximizing the joint likelihood function of the longitudinal and survival components. In our previous work on using joint modelling for utility calculations, we have used Cox proportional hazards (PH) model as the survival sub-model [23]. Weibull AFT can be considered as an improvement on the Cox PH model, as in the former case, instead of choosing lifetime time horizon of the study for censored cases, we can predict the lifetime for each patient. Thus, carrying out sensitivity analysis for the choice of lifetime time horizon, as suggested by many authors (see for example [24]), will not be required in the case of joint modelling approach with a Weibull AFT sub-model.

### 2.2 Estimating Conditional Expected Survival Time for Each Censored Case

For each censored case, conditional expectation of survival time, given that the patient has survived till time *Ci*, is estimated and taken as patient level lifetime time horizon for calculating QALY. Conditional probabilities, *P*(*Ti=ti*|*Ti > Ci*), are calculated using estimated fixed effect parameters and empirical Bayes realizations of the vector of random effects *bi* from their posterior distribution. Metropolis-Hastings (MH) algorithm with independent proposals from a multivariate normal distribution, with mean centered at current estimate of *bi* and variance-covariance matrix estimated from joint modelling, is used to generate *L* empirical Bayes realizations on the vector of random effects *bi*, where *L* is a large number, say 100 or 1000.

Conditional expected survival times for censored cases are calculated at each of these *L* realizations of *bi*, using Monte Carlo integration technique. Median of these *L* values calculated for a censored case is taken as the estimate of the conditional expected survival time for that case.

### 2.3 Projection of CD4 Cell Count

Using the fitted longitudinal sub-model, CD4 count is projected for all patients at each time unit, after registration into ART till the predicted conditional expected survival time (for censored cases) or till the time of death (observed cases). At each time point, *L* empirical Bayes realizations are generated on the vector of random effects using MH algorithm. These realized values of *bi* and the estimates of fixed effects are used in the model defined in (1) to calculate *L* values of CD4 count at each time point. Mean of these *L* values at a time is taken as the projected/ predicted value of true CD4 count at that time.

### 2.4 Proxy Utility Function

The proposed proxy utility function is defined as a function of relative changes in repeated measures on the longitudinal outcome *yi*(*t*). Utility function for *i*-th patient at time *t*, *Ui*(*t*), is defined as follows.

![Formula][11]</img> 
Where *U0i* is the base utility value of *i*-th patient at the start of the study and is defined as, ![Graphic][12]</img>. Here, *y0i* is the baseline value of the longitudinal covariate and *K* is the cut-off value or reference value of the longitudinal covariate beyond which it is considered to be in medically normal range. ![Graphic][13]</img>, the estimate of association parameter in the survival sub-model, represents the direct impact of changes in longitudinal covariate on the log of survival time and hence, ![Graphic][14]</img> qualifies to act as a weight for determining change in the utility value as a result of relative change in the value of longitudinal covariate. Thus, the first term in the RHS of equation (5) is the baseline utility value and the subsequent terms, included in the summation, calculate the change in utility values at each time point.

This utility function is proxy in the sense that it is not based on patients’ self evaluation of their quality of life. However, it is defined as a function of a longitudinal covariate/ biomarker which is significantly associated with disease progression and acts as a proxy indicator of the quality of life of patients. It should be noted that even in the case of multi-state decision-analytic modelling approach, the longitudinal covariate has a vital role in defining utility values. For example, in the case of HIV/ AIDS patients, different states are defined on the basis of the longitudinal covariate CD4 count, and further, each state is assigned a fixed utility value. That is, in the case of multi-state model, CD4 count decides state and the state, in turn, implies a predetermined utility value. While in our approach, longitudinal progression of CD4 count directly determines utility value at any time. Projected values of CD4 count are used in the utility function defined in (5) to calculate utility values of each patient at each time unit, after registration till death or till the predicted survival time.

### 2.5 Calculation of QALY

Since, utility values are calculated at intervals of one time unit, total gain in QALY of *i*-th patient at a discount rate of *d*% per annum converted per unit time can be stated as follows.

![Formula][15]</img> 

## 3 Implementation of the Proposed Methodology

As a demonstration, the methodology proposed in section 2 has been employed to calculate QALY for HIV/ AIDS patients undergoing treatment at the ART center of Ram Manohar Lohia hospital in New Delhi. Retrospective follow-up data on 3465 patients, enrolled for ART treatment during the period April 2004 to November 2009, and followed till December 2010, is obtained from register records of the hospital. All records are in terms of patient id and in no way, privacy or anonymity of patients is compromised. Cases with incomplete or missing information on factors and covariates which are included in the model, like age, sex, mode of transmission (MOT), smoking habit, drinking habit, haemoglobin, baseline weight etc., have been excluded from the analysis. Only adults of age 18 and above are retained in the study. Cases with no subsequent visit after the date of registration into ART are also not considered for the analysis. After applying all exclusion criteria, only 1520 cases qualified to be included in the study. Dataset contains information on status (dead or alive), time of death, age at the time of registration (baseline age), sex, smoking habit, alcohol consumption, baseline haemoglobin, mode of transmission of HIV (MOT), time of subsequent visits, CD4 count at each visit, and weight measured at each visit. Use of this data complies with the ethical guidelines defined in the National Ethical Guidelines for Biomedical and Health Research-2017 for administrative and secondary data. List of variables included in the longitudinal and survival sub-models are shown in Table 1. Kaplan-Meier curve of the survival data is furnished in Graph 1.

![Graph 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.24.20112102/F1.medium.gif)

[Graph 1:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/F1)

Graph 1: 
Kaplan-Meier curve for survival of HIV/ AIDS patients

View this table:
[Table 1:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/T1)

Table 1: 
Description of factors and covariates included in the models

A linear mixed effects longitudinal sub-model with response variable as CD4 count and a Weibull AFT survival sub-model are jointly fitted using the R package “JM” with method “Weibull-AFT-GH”; see [21]. Estimated models are presented in Table 2. Estimates of variance components of the longitudinal sub-model are provided in Table 3.

View this table:
[Table 2:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/T2)

Table 2: 
Results of jointly fitted models

View this table:
[Table 3:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/T3)

Table 3: 
Estimates of variance components of the longitudinal sub-model

Prior distribution of the random effects, *bi*, in the longitudinal sub-model, is taken as a multivariate normal distribution with a zero mean vector and variance-covariance matrix, *D*, obtained from the estimates of variance components of the joint modelling given in Table 3. For any censored case i, unnormalised posterior pdf of *bi* can be written as follows.

![Formula][16]</img> 
In the expression on the RHS of (7), the first term is the prior pdf of the random effects vector *bi*, the second term is the survival function at time *Ci* given the values of *bi*, and the third term is the pdf of the response variable, CD4 count, given the values of the vector *bi* in the longitudinal sub-model. Given *bi, Yi*(*t*) also follows normal distribution with mean *mi*(*t*) and variance as estimated from joint modelling. MH algorithm with independent proposals from multivariate normal distribution, with mean centered at current estimate of *bi* and variance-covariance matrix estimated from joint modelling, is used to generate empirical Bayes realizations on *bi* from its posterior distribution. It should be noted that MH algorithm does not require the posterior distribution to be normalized. Further, for each uncensored case, following steps are followed.

1.  For *i*-th patient, using MH algorithm, ![Graphic][17]</img> values of random effect vector are realized from the posterior distribution of *bi*.

2.  At each ![Graphic][18]</img>, Monte Carlo integration is applied to find the conditional expected survival time for *i*-th patient given by the expression, ![Formula][19]</img> 

3.  Median of these *L* values, ![Graphic][20]</img>, is taken as the estimate of the conditional expected survival time for the *i*-th patient, say ![Graphic][21]</img>.

4.  For both censored and uncensored cases, at each month, ![Graphic][22]</img>, *L* realizations are generated on the vector *bi* and subsequently, *L* estimates of CD4 count are calculated at each month, say ![Graphic][23]</img>. Average of these *L* values is taken as the final estimated or predicted value of CD4 count at time *t* for *i*-th patient.

5.  To define the proxy utility function, we have taken *K*=800. This cut off value of CD4 count for a healthy person in India is taken on an average after examining the findings of Rungta et al. [25], Shahapur et al. [26], Ramalingam [27], and Uppal et al. [28]. CD4 counts estimated/ predicted in step 4 are used in the equation (5) to calculate utility values of each patient at each month, since registration till death or till the predicted survival time. At a standard discount rate of *d* = 3% per annum, total QALY gained by each patient is calculated using equation (6).

We have implemented these five steps in R through self-developed algorithms and programming codes. Summary of concluding results obtained from these steps is provided in Table 4.

View this table:
[Table 4:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/T4)

Table 4: 
Summary of results on predicted survival times and QALY

## 4 Simulation Exercise

Using the joint model formulation discussed in section 2.1, we have simulated an individual patient level data on survival of 1000 HIV/ AIDS patients assumed to be followed for 60 months since registration into ART. Corresponding values of different baseline factors and covariates, and monthly values of the longitudinal covariate, CD4 count, are jointly simulated along-with the survival times. The R package ‘simsurv’ is used for implementation of the simulation scheme; for the simulation algorithm; refer [29]. We fit joint model to the simulated data assuming that the longitudinal observations on CD4 count are available only till 30 months from the time of registration. That is, we use only partial information on the longitudinal covariate to train the joint model. Models are fitted for the following two cases. Case 1-five observations on CD4 count (one at baseline and rest at four randomly selected time points between 1st and 30th month/ time-to-death) are available for each patient, and Case 2-ten observations on CD4 count (one at baseline and rest at nine randomly selected time points between 1st and 30th month/ time-to-death) are available for each patient. These two cases are considered to assess the sensitivity of the predictions on the number of observations available on the longitudinal covariate. Based on the results of the fitted models, the implementation steps discussed in section 3 are used to predict/ estimate CD4 counts for each month since the time of registration till the 60th month (censored cases) or till the time-to-death (observed cases). Since the proposed definition of utility function depends on the longitudinal CD4 counts, credibility of the proposed method relies largely on the accuracy of prediction of the unobserved CD4 counts. So, the objective of this section is to appraise the capability of the models fitted to the training data to predict the longitudinal measurements on CD4 count beyond the observed points. Comparison between the true and predicted values of the CD4 counts are presented through graphs and the associated error is measured as Root Mean Squared Error (RMSE). For both cases, predicted values of CD4 counts, corresponding 95% credible intervals, and true values of CD4 counts for four randomly chosen censored patients and four randomly chosen observed patients, are provided in the Graphs 2-5. For the same patients, RMSE values for all cases are provided in Table 5.

![Graph 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.24.20112102/F2.medium.gif)

[Graph 2:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/F2)

Graph 2: 
Simulation exercise-Predicted and true CD4 values for censored patients from Case 1. The blue region depicts the 95% credible interval of the predicted CD4 counts.

![Graph 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.24.20112102/F3.medium.gif)

[Graph 3:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/F3)

Graph 3: 
Simulation exercise-Predicted and true CD4 values for censored patients from Case 2. The blue region depicts the 95% credible interval of the predicted CD4 counts.

![Graph 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.24.20112102/F4.medium.gif)

[Graph 4:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/F4)

Graph 4: 
Simulation exercise-Predicted and true CD4 values for observed patients from Case 1. The blue region depicts the 95% credible interval of the predicted CD4 counts.

![Graph 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.24.20112102/F5.medium.gif)

[Graph 5:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/F5)

Graph 5: 
Simulation exercise-Predicted and true CD4 values for observed patients from Case 2. The blue region depicts the 95% credible interval of the predicted CD4 counts.

View this table:
[Table 5:](http://medrxiv.org/content/early/2020/05/25/2020.05.24.20112102/T5)

Table 5: 
RMSE values for all four cases

## 5 Discussion and Conclusion

### Implementation on the retrospective ART data

Mean QALY gained by HIV/ AIDS patients after getting enrolled into ART is 7.46 years (discounted at 3% per annum). Bender et al. [17], while performing CEA for different combinations of drugs in ART in India, found that the average gain in QALY, discounted at 3% per annum, ranged from 115.5 months (9.625 years) to 125.8 months (10.48 years). One of the possible reasons for lower estimate of mean QALY in our study, as compared to that of Bender et al., is that a large proportion of patients included in our study were registered for ART at the AIDS stage, i.e. at the stage when CD4 count was less than 200. The mean baseline CD4 count of the subjects included in our study is 146.0145 (SD=104), while the mean baseline CD4 count reported by Bender et al. is 318 (SD=291). Patients with higher CD4 count at the baseline are expected to have higher survival probabilities. This is apparent from the statistically significant positive value of the association parameter estimated through joint modelling, see Table 2. Further, it is also important to note that the work of Bender et al. exhibits one of the problems discussed in our introduction section regarding the choice of utility values. They have used stage-specific utility values from published reports of US based studies to calculate QALY of patients in India. By using proxy utility function, we have eliminated the bias arising out of unaccounted regional variability in the quality of life.

### Simulation exercise

As can be seen from the Graphs 2-5, the true values of monthly observed CD4 count in the simulated data fluctuate significantly over time. Any real life data on HIV/ AIDS patients can be rarely expected to show such magnitude of fluctuations. That is, the simulated data presents a big challenge for the joint model to produce reliable predictions. Barring few cases, the predicted values of the CD4 counts can be seen to be centred around the mean trend of progression of the true CD4 counts. Predictions based on the second case are closer to the actual values, and subsequently the RMSE values for Case 2 are consistently lower than those for Case 1. The 95% credible intervals of predictions from Case 2 are narrower and better aligned with the true progression of CD4 counts as compared to those from Case 1. These findings corroborate that models fitted using more repeated measures on the longitudinal covariate of each patient show improved prediction capability. Further, predictions for observed cases show relatively much lower RMSE than those for the censored cases.

### Conclusion

The proposed method allows us to carry out CEA without categorizing disease progression into discrete states. It succeeds in gauging the dynamic effects of continuous (or short term) changes in longitudinal covariate on the utility values of the patients. Unlike multi-state decision-analytic modelling approach, this approach facilitates calculation of time-varying utility measures and hence, the total gain in QALY, of each patient. This helps to accommodate for individual level variability in the trajectory of disease progression and the quality of life. This method will be categorically useful in situations where standard utility values for the concerned disease or for the concerned population (or region of study) are not available. Even in many big economies, like India, standard utility values for most of the chronic diseases are still not available.

### Limitations

We have focussed on introducing the proposed methodology and demonstrating its implementation techniques. The simulation exercise included in this paper gives some idea about the prediction capability of the proposed model, but it is not sufficient to conclusively establish its predictive power, and hence the accuracy of utility calculations. This issue of predictive power forms the central objective of our next research work which is currently under progress.

## Data Availability

Data has been obtained with proper permission from competent authorities from the registered records of ART center of Ram Manohar Lohia hospital in New Delhi. To ensure patient confidentiality, the data is not available in public space.

*   Received May 24, 2020.
*   Revision received May 24, 2020.
*   Accepted May 25, 2020.


*   © 2020, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Brazier J, Roberts J, Deverill M. The estimation of a preference based measure of health from the SF-36. Journal of Health Economics. 2002; 21(2): 271–292.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0167-6296(01)00130-8&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11939242&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000174196800006&link_type=ISI) 

2.  2.Feeny DH, Furlong WJ, Torrance GW, Goldsmith CH, Zhu Z, De-Pauw S, et al. Multi-attribute and single-attribute utility functions for the health utilities index mark 3 system. Medical Care. 2002; 40: 113–128.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00005650-200202000-00006&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11802084&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000173594900006&link_type=ISI) 

3.  3.Kind P, Brooks R, Rabin R. EQ-5D Concepts and Methods: A Developmental History Dordrecht: Springer; 2005.
    
    
4.  4.Rashidi AA,  H Aa, Marra CA. Do visual analogue scale (VAS) derived standard gamble (SG) utilities agree with Health Utilities Index utilities? A comparison of patient and community preferences for health status in rheumatoid arthritis patients. Health and Quality of Life Outcomes. 2006; 4(25). [https://doi.org/10.1186/1477-7525-4-25](https://doi.org/10.1186/1477-7525-4-25)
    
    
5.  5.Torrance GW, Feeny DH, Furlong WJ, Barr RD, Zhang Y, Wang Q. Multi-attribute preference functions for a comprehensive health status classification system: Health utilities index mark 2. Medical Care. 1996; 34: 702–722.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/00005650-199607000-00004&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8676608&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996UW38100004&link_type=ISI) 

6.  6.Du X, Li M, Zhu P, Wang J, Hou L, Meng H, et al. Comparison of the flexible parametric survival model and Cox model in estimating Markov transition probabilities using real-world data. PLoS ONE. 2018; 13(8): e0200807. [https://doi.org/10.1371/journal.pone.0200807](https://doi.org/10.1371/journal.pone.0200807)
    
    
7.  7.Williams C, Lewsey JD, Mackay DF, Briggs AH. Estimation of survival probabilities for use in cost-effectiveness analyses: A comparison of a multi-state modeling survival analysis approach with partitioned survival and markov decision-analytic modeling. Medical Decision Making. 2017;: 426–439.
    
    
8.  8.Diaby V, Adunlin G, Montero AJ. Survival modeling for the estimation of transition probabilities in model-based economic evaluations in the absence of individual patient data: A tutorial. PharmacoEconomics. 2014; 32(2): 101–108.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40273-013-0123-9&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24338265&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 

9.  9.Speight PM, Palmer S, Moles DR, Downer MC, Smith DH, Henrikkson M, Augustovski F. The cost-effectiveness of screening for oral cancer in primary care. Health Technology Assessment. 2006; 10(14). Doi: 10.3310/hta10140
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3310/hta10140&link_type=DOI) 

10. 10.Coon JT, Hoyle M, Green C, Liu Z, Welch K, Moxham T, Stein K. Bevacizumab, sorafenib tosylate, sunitinib and temsirolimus for renal cell carcinoma: A systematic review and economic evaluation. Health Technology Assessment. 2010; 14(2). Doi: 10.3310/hta14020
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3310/hta14020&link_type=DOI) 

11. 11.Wu B, Li T, Cai J, Xu Y, Zhao G. Cost-effectiveness analysis of adjuvant chemotherapies in patients presenting with gastric cancer after D2 gastrectomy. BMC Cancer. 2014. Doi: 10.1186/1471-2407-14-984
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2407-14-984&link_type=DOI) 

12. 12.Jackson CH, Sharples LD, Thompson SG. Structural and parameter uncertainty in bayesian cost-effectiveness models. Applied Statistics. 2010; 59(2): 233–253.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20383261&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 

13. 13.Jackson CH, Sharples LD, Thompson SG. Survival models in health economic evaluations: Balancing fit and parsimony to improve prediction. The International Journal of Biostatistics. 2010; 6(1). Doi: 10.2202/1557-4679.1269
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2202/1557-4679.1269&link_type=DOI) 

14. 14.Briggs A, Sculpher M. An introduction to markov modelling for economic evaluation. PharmacoEconomics. 1998; 13(4): 397–409.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2165/00019053-199813040-00003&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10178664&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000072789400003&link_type=ISI) 

15. 15.Zhang P, Brown MB, Bilik D, Ackermann RT, Li R, Herman WH. Health utility scores for people with type 2 diabetes in U.S. managed care health plans. Diabetes Care. 2012; 35: 2250–2256.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czoxMDoiMzUvMTEvMjI1MCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA1LzI1LzIwMjAuMDUuMjQuMjAxMTIxMDIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

16. 16.Shah AN, Simmons J, Beck AF. Adding a vital sign: Considering the utility of place-based measures in health care settings. Hospital Pediatrics. 2018; 8(112): 112–114.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiaG9zcHBlZHMiO3M6NToicmVzaWQiO3M6NzoiOC8yLzExMiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA1LzI1LzIwMjAuMDUuMjQuMjAxMTIxMDIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

17. 17.Bender MA, Kumarasamy N, Mayer KH, Wang B, Walensky RP, Flanigan T, et al. Cost-Effectiveness of Tenofovir as First-Line Antiretroviral Therapy in India. Clinical Infectious Diseases. 2010; 50: 416–425.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/649884&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20043752&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000273500300018&link_type=ISI) 

18. 18.Tseng YK, Hsieh F, Wang JL. Joint modelling of accelerated failure time and longitudinal data. Biometrika. 2005; 92(3): 587–603.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biomet/92.3.587&link_type=DOI) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000231524600007&link_type=ISI) 

19. 19.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica. 2004; 14: 809–834.
    
    
20. 20.Yu M, Law NJ, Taylor JMG, Sandler HM. Joint longitudinal-survival-cure models and their application to prostate cancer. Statistica Sinica. 2004; 14: 835–862.
    
    
21. 21.Rizopoulos D. JM: An R package for the joint modelling of longitudinal and time-to-event data. Journal of Statistical Software. 2010; 35(9). Doi: 10.18637/jss.v035.i09
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18637/jss.v035.i09&link_type=DOI) 

22. 22.Lee ET, Wang JW. Statistical Methods for Survival Data Analysis. 3rd ed.: John Wiley & Sons; 2003.
    
    
23. 23.Deo V, Grover G. A new approach to evaluate quality adjusted life years using proxy utility function-An application to HIV/ AIDS data. Journal of Communicable Diseases. 2019; 51(3): 1–9.
    
    
24. 24.Jackson C, Stevens J, Ren S, Latimer N, Bojke L, Manca A, et al. Extrapolating Survival from Randomized Trials Using ExternalData: A Review of Methods. Medical Decision Making. 2017; 37: 377–390.
    
    
25. 25.Rungta A, Hooja S, Vyas N, Suman R, Rao A, Gupta S. Enumeration of CD4 and CD8 T lymphocytes in healthy HIV seronegative adults of northwest India: A preliminary study. Indian Journal of Pathology and Microbiology. 2008; 51(1): 127–129.
    
    
26. 26.Shahapur PR, Bairy I, Shivananda PG. CD4 and CD8 reference counts in normal healthy south Indian adults. Indian Journal of Medical Microbiology. 2008; 26(3): 280–290.
    
    
27. 27.Ramalingam S, Kannangai R, Zachariah A, Mathai D, Abraham OC. CD4 counts of normal and HIV-infected south Indian adults:Do we need a new staging system? The National Medical Journal of India. 2001; 14: 335–339.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11804363&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 

28. 28.Uppal SS, Verma S, Dhot PS. Normal values of CD4 and CD8 lymphocyte subsets in healthy Indian adults and the effects of sex, age, ethnicity, and smoking. Cytometry Part B: Clinical Cytometry. 2003; 52B: 32–36.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12599179&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F25%2F2020.05.24.20112102.atom) 

29. 29.Crowther MJ, Lambert PC. Simulating complex survival data. The Stata Journal. 2012; 12(4): 674–687.

 [1]: /embed/inline-graphic-1.gif
 [2]: /embed/inline-graphic-2.gif
 [3]: /embed/inline-graphic-3.gif
 [4]: /embed/graphic-1.gif
 [5]: /embed/graphic-2.gif
 [6]: /embed/graphic-3.gif
 [7]: /embed/graphic-4.gif
 [8]: /embed/inline-graphic-4.gif
 [9]: /embed/inline-graphic-5.gif
 [10]: /embed/inline-graphic-6.gif
 [11]: /embed/graphic-5.gif
 [12]: /embed/inline-graphic-7.gif
 [13]: /embed/inline-graphic-8.gif
 [14]: /embed/inline-graphic-9.gif
 [15]: /embed/graphic-6.gif
 [16]: /embed/graphic-11.gif
 [17]: /embed/inline-graphic-10.gif
 [18]: /embed/inline-graphic-11.gif
 [19]: /embed/graphic-12.gif
 [20]: /embed/inline-graphic-12.gif
 [21]: /embed/inline-graphic-13.gif
 [22]: /embed/inline-graphic-14.gif
 [23]: /embed/inline-graphic-15.gif