Optimising covariate allocation at design stage using Fisher Information Matrix for Non-Linear Mixed Effects Models in pharmacometrics

Lucie Fayette; Karl Brendel; France Mentré

doi:10.1101/2025.01.31.25321452

Abstract

This work focuses on designing experiments for pharmacometrics studies using Non-Linear Mixed Effects Models including covariates to describe between-subject variability. Before collecting and modelling new clinical trial data, choosing an appropriate design is crucial. Assuming a known model with covariate effects and a joint distribution for covariates in the target population from previous clinical studies, we propose to optimise the allocation of covariates among the subjects to be included in the new trial. It aims achieving better overall parameter estimations and therefore increase the power of statistical tests on covariate effects to detect significance, and more importantly, clinical relevance or non-relevance of relationships. We suggested dividing the domain of continuous covariates into clinically meaningful intervals and optimised their proportions, along with the proportion of each category for the discrete covariates. We used the Fisher Information Matrix and developed a fast and deterministic computation method, leveraging Gaussian quadrature and copula modelling. The optimisation problem was formulated as a convex problem subject to linear constraints, allowing resolution using Projected Gradient Descent algorithm. Different scenarios for a pharmacokinetics model were explored. We showed the benefit of covariate optimisation in reducing the number of subjects needed to achieve desired power in covariate tests.

1 Introduction

In pharmacometrics longitudinal data collected during clinical trials are leveraged through Non-Linear Mixed Effect Models (NLMEM). In particular, NLMEM allow to quantify the relationships between covariates and parameters [1]. Indeed, covariates are often included in pharmacometrics models to describe predictable between subjects variability, thereby enhancing model fit and/or model-based predictions [2].

Before collecting and modelling the data, a key step is to chose an appropriate design. In NLMEM, a population design refers to the number of subjects and their allocation to elementary designs, an elementary design being a combination of values for the design variables, such as number of samples, sampling times, dosage regimen, etc. Usually, covariates are not considered at this step.

In its Population Pharmacokinetics (PK) Guidance for Industry 2022 [3], the FDA emphasises that “Simulations and optimal design methods can maximize the utility of population PK data collection and analyses” and stresses in particular that these methods can aim to estimate “major covariate effects of interest” with greater precision. Nevertheless, the method proposed for power assessment and sample size computation in this recommendation is clinical trial simulations (CTS), although it is computationally expensive and less exhaustive than optimal design strategies. The traditional theory of optimal experimental design, initially developed for non-linear models [4], have already been imported to the field of NLMEM and pharmacometrics [5, 6]. It relies on Fisher Information Matrix (FIM) computation, through linearization of the structural model, or stochastic simulations or Gaussian approximation [7–9]. Thus, before the start of a new study, given a NLMEM and parameters value, one can compute the expected standard error (SE) of the model parameters and deduce whether enough information will be collected to meet the objectives of the study. Thereafter, design optimisation consist in finding the design of experiments that provides the most informative data for estimating model parameters while respecting some design constraints, such as number of subjects, number of sampling times or samplings times windows. These optimal design strategies require to assume a model and a vector of population parameters values. Methods to compute robust designs to the prior choice of parameter values or/and to the prior choice of model in the context of NLMEM have already been explored [10–13] but will not be considered here.

Various optimisation software especially dedicated to design optimisation in the context of clinical trial using NLMEM have been developed in the past 20 years. Among them we can cite PFIM [14], which was the first software tool proposed in 2001 for evaluating a design without using simulations, especially, the R package PFIM 6.1 [15, 16] (CRAN release October 2024); PopED [17, 18] and NONMEM$DESIGN [19].

Numerous studies have demonstrated the benefits of using this methodology to optimise sampling times but very few studies have addressed the question of which covariate values would give the most information.

Nevertheless, if a study aims to detect whether some covariate relationships are significant or relevant, this design optimisation procedure needs to account for covariates. In a previous work, we explored methods for handling covariates in FIM computation [20]. There are based on Monte-Carlo (MC) approximation, and propose to leverage either an historical covariate dataset, a known covariate distribution or a copula. Their accuracy in predicting uncertainty and power of significance and relevance tests was assessed in a simulation study [20] using a simple PK example and under various scenarios, including different sample sizes, sampling points, and IIV. In addition the FIM computation using a prior covariate dataset was applied to a Phase 3 example with a model including 27 covariate relationships and was found to accurately predict uncertainty even in this more complex setting. These methods have not yet been used for computing the FIM during design optimisation process, but it does not present any particular technical difficulties. In this work we propose to go a step further: assuming a covariate distribution within the target population, we propose, for a fixed design, NLMEM and parameter vector, to optimise the selection of subjects to be included in the future trial from this distribution.

Previous works have looked at how to account for covariates when allocating patients to different arms, and how an unbalanced design could improve efficiency and ethics, including model-based approaches (see for instance [21] for a review). However, these approaches do not use the longitudinal data framework nor NLMEM. In addition, they aim to optimised the design sequentially and seek, for each new patient entering the trial, to allocate them to the best possible arm in terms of efficacy and ethical considerations, while maintaining randomisation. Consequently, it does not correspond to our setting aiming to optimise the design prior to setting up the study. In [22], a new method based of the FIM for optimising design using NLMEM where covariates are design variables is proposed. However, the proposed methods require the FIM to be evaluated for a set of fixed covariate values, without taking into account the distribution of these covariates, so the combinatorial approach can result in a very large number of elementary designs to be evaluated and the optimisation result is discrete, making it difficult to manage continuous covariates. In addition, there are no recommendations on how to take into account multiple covariates that are correlated with each other, or on the practical management of covariate distributions.

To this end, we developed in this work a new method aiming to reduce the computational cost of the FIM computation, to handle continuous and correlated covariates and to optimise their distribution. Indeed, with optimisation purposes, the required number of FIM evaluation increases and run time becomes a challenge. Therefore, avoiding MC simulations is a lever to reduce this cost. As a consequence, we introduce a Gaussian quadrature [23] (GQ) based approach, thus faster and deterministic. We have coupled this approach with the use of copula to describe the distribution of covariates, as it is very flexible and if already available, it can be used without individual data. Regarding optimisation, a hurdle to be overcome is that optimising the distribution of continuous covariates is not trivial as it is unrealistic to recommend including a given number of patients matching an exact value of the covariate. Therefore, we propose to segment the continuous covariates into different intervals and to optimise the proportion of each of them among the trial population. This has the combined advantage to remains realistic in practice while defining discrete parameters to be optimised. Finally, we propose a method for the constrained optimisation of covariates using a projected gradient method. The FIM is computed over a finite set combining elementary designs and covariate distributions and each combination is optimally weighted, given the constraints, using the Projected Gradient Descent (PGD) algorithm [24]. Once covariate distribution is optimised, the sample size can be optimised to reach desired level of precision or desired power in statistical test on covariate effects .

All this methods have been implemented using the R language [25] in a working version of PFIM 6. Scripts are available in the Zenodo repository https://doi.org/10.5281/zenodo.14778034. Based on the previously used simple PK model, we developed a workflow to assess the number of nodes required in the quadrature for an accurate FIM computation. Because prior data are not always available at design step, we used public database from the National Health and Nutrition Examination Surveys (NHANES) database [26] to get prior covariate distributions. Thereafter we explored various scenarios suing a simple PK model demonstrating the benefit of covariate distribution optimisation.

In Section 2, the notations and methods are detailed. In Section 3 we go through the example settings and results are presented in Section 4. Lastly global results are discussed in section 5.

2 Methods

In this section we detail notations and methods on study design and NLMEM. We expose the FIM computation accounting for covariate distribution, and especially we describe the segmentation of continuous covariates for optimisation purposes and how it is handle in the FIM computation. Then, we explain the FIM integration using copula and Gauss-Legendre Quadrature (GLQ). Lastly, we present optimisation of covariate distributions using PGD.

2.1 Notations on study designs and NLMEM

In the following, a population design is denoted Ξ = {N, (ξ₁, …, ξ_N)}, where N is the number of subjects and the ξ_i are their elementary designs.

The observation vector y_i for the i^th subject is modelled as given in equation (1) where f denotes the structural non-linear model and θ_i = u (µ, β, Z_i, η_i), the p-vector of individual parameters. The latter are defined as a function u, depending on fixed effects, composed of the typical values denoted µ and of the covariate effects β, and on individual random effects η_i, where η_i ∼ 𝒩 (0, Ω). Individual parameters also depend on the vector of possibly transformed individual covariates Z_i.

We denote Z_i = (Z_Dis,i, Z_Cont,i) a covariate vector when considered as a random vector, and z_i = (z_Dis,i, z_Cont,i) as a realisation of this random vector. The lower ._Dis refers to the discrete sub-vector of the covariate vector, while ._Cont refers to the continuous sub-vector.

In pharmacometrics modelling, parameters are generally log-normally distributed with additive covariate relationships on the log scale, i.e. log θ_i = log µ + β (z_i − z_pop) + η_i.

The residual error is modelled by the function g, depending on ξ_i, θ_i and some parameters σ, and by the random variable ε_i, with .

The P -vector of population parameters is denoted Ψ = {µ, β, λ}, where λ contains the variance parameters (elements of Ω and of the residual error model σ).

To assess the magnitude of a covariate effect, one can compute the ratio of change in primary parameters when covariate values change relative to a reference value z_ref . Detailed formulas are given in [20], and for a log-normally distributed parameter with an additive covariate relationship on the log scale, the ratio writes if the covariate is binary, and the ratio associated to a given percentile PX of the covariate distribution if continuous writes , where P 50 denotes the median of the covariate distribution.

2.2 Elementary and population FIM

Given an elementary design ξ_i, a population parameters vector Ψ, and a covariate realisation z_i the expected elementary FIM, M_F (ξ_i, Ψ, z_i), is defined as the covariance matrix of the Fisher score (equation (2)) where l(y_i; Ψ, z_i, ξ_i) is the likelihood of the vector of observations y_i for the population parameters Ψ, given the individual vector of covariates z_i and the elementary design ξ_i (equation (3)).

Knowing the distribution p_Z for the covariate vector, the expected FIM can be expressed as an expectation over the covariate vector as given in equation (4).

Assuming that the covariate distribution p_Z is the same for all the elementary design, the expected population FIM is the sum of the elementary FIM (equation (5)).

In the following, we consider that the N subjects have the same elementary design ξ, thus the expected population FIM is M_F (Ξ, Ψ, p_Z) = NM_F (ξ, Ψ, p_Z).

2.2.1 Elementary FIM with discrete covariates only

In the case with only D discrete covariates, each with Q_d, d = 1, …, D categories. The covariate vector follows a discrete distribution with possible values, denoted z_Dis,d, d = 1, …, n^(QD), each having the probability p_Dis,d, d = 1, …, n^(QD). Therefore, the elementary FIM for ξ writes as a weighted sum as given in equation (6).

2.2.2 Elementary FIM with discrete and continuous covariates

In case there are both non-independent discrete and continuous covariates, we denote p_Dis the distribution of the discrete sub-vector, and for each of its possible value z_Dis,d, we denote p_Cont|_{z Dis,d} the distribution of the continuous sub-vector conditionally to z_Dis,d. The elementary FIM for a given ξ is given in equation (7).

2.3 Segmentation of continuous covariates

We assume here that a priori distributions are known for the covariates, and our aim is to distribute the subjects in the study. To achieve this, we suggested to segment the continuous covariates into different intervals, chosen for their clinical meaning. We then balanced the allocation between the different continuous intervals and between the discrete covariate categories. The advantage is that it maintains the continuous nature of the covariate in the FIM calculus, while offering a discrete segmentation, which is useful both for reducing dimensionality during optimisation and for inclusion of subjects based on the optimisation results.

We take the general framework, with respectively D discrete and C continuous covariates. The domain of each continuous covariate, indexed by c, is segmented in intervals denoted . Combining those continuous intervals leads to possible intervals for Z_Cont . Finally, there are L = n^(QD) × n^(ℐ) possible distributions for the covariate vector Z.

Conditionally to Z_Dis = z_Dis,d, the distribution of is denoted . The proportion of subjects within the trial population and for which z_Dis,i = z_Dis,d and is denoted: .

This segmentation leads to the elementary FIM formula given in equation (8), which now depends on x.

We denote x the vector of size L containing the and for a better clarity in the notations, we denote x_l the l^th element of x and the associated distribution for Z. The overall distribution for Z is denoted p_Z(x). These notations result in the FIM formula given in equation (9).

Finally, the multivariate distribution of covariates within the trial is described as the weighting of L multivariate distributions, and it is these proportions that we are seeking to optimise.

2.4 FIM integration using copula and Gauss-Legendre Quadrature

We describe here how copula modelling and GLQ can be leverage to compute an elementary FIM M_F ξ, Ψ, z_Dis, p_{Cont, z} .

The first aspect of our method is to characterise the covariate distribution through copula modelling to account for correlations in FIM computation as already proposed [20].

It is well known that given a random vector (Z₁, Z₂, …, Z_C) which follows the distribution p_{Cont, z}|Dis and has continuous marginals F₁, …, F_C, the random vector (U₁, U₂, …, U_C) = (F₁(Z₁), F₂(Z₂), …, F_C(Z_C)) has marginals uniformly distributed on [0; 1]. By definition, the copula of (Z₁, Z₂, …, Z_C) is the cumulative distribution function (cdf) of (U₁, U₂, …, U_C). The Rosenblatt transform [27] denoted T_{Cont, z}| Dis is defined in equation (10) and its inverse in equation (11).

Therefore, applying the inverse Rosenblatt transform to a vector of random uniform variables return a vector that has the same distribution as (Z₁, Z₂, …, Z_C).

Consequently, M_F ξ, Ψ, z_Dis, p_{Cont, |z}Dis can be expressed using as given in equation (12).

The second aspect is to approximate this integral through a GQ [23]. The latter consists in approximating an integral by a weighted sum of the function to be integrated, evaluated at a number n^(Q) of given points in the integration domain, called quadrature nodes. The advantage of GQ integration over the MC is that, since the nodes are chosen to represent the distribution harmoniously, equivalent accuracy is achieved with fewer GQ nodes than with random draws for MC [28].

In our case the integration domain for each of the C integrals is [0; 1]. Therefore, we apply C GLQ with a substitution, as GLQ allows to integrate over the interval [-1, 1]. Denoting v_q and w_q the q^th node and its weight in GLQ, the substitutions are: and .

Finally, M_F ξ, Ψ, z_Dis, p_{Cont, |z}Dis can be expressed as given in equation (13).

This methodology can thereafter be applied to compute each of the L matrices that appears in equation (9).

2.5 Optimisation of covariate distributions

We describe here the proposed algorithm to optimise the proportions x_l of the L distributions of covariates within the population.

2.5.1 Optimisation criteria

In order to compare the FIM between different designs, a criteria mapping the matrix to a scalar needs to be defined. Among various optimality criterion used in theory of optimal design (see for instance [4]), the D-criterion is a popular one and is wieldy used in the field of pharmacometrics. It uses the determinant of the FIM, as given in equation (14) and maximizing it ensures that the parameter estimates are as precise as possible. Indeed, a larger determinant of the FIM corresponds to a smaller volume of the confidence ellipsoid for the parameter estimates, indicating higher precision.

The comparison of covariate repartition x against a reference x_ref for the D-criterion is done by calculating the D-efficiency E_D (equation (15)).

2.5.2 Optimising covariate proportions using Projected Gradient Descent

Our aim is to maximize the D-criterion, for a fixed design Ξ by balancing between the different covariate distributions weighted by the x_l, and given some constraints on these proportions.

Linear constraints

First, as x represents a vector of proportions, it should sum to 1 and each x_l has to be in [0; 1]. Additional constraints can be set on the x_l, for instance to make sure that a given interval for a continuous covariate does not represent more than a desired proportion and not less than another proportion. Therefore there are both equality and inequality constraints, and the optimisation problem is given in equation (16). We denote 𝒬 the constraint space.

Projected Gradient Descent

As the D-criterion is a convex function, the PGD [24] can be used to solve the problem given in equation (16).

Starting from an initial point x₀ ∈ 𝒬, PGD iterates the equation given in (17), where α_k ∈ [0; ∞] is the gradient step-size at the k^th iteration, ϕ_D the gradient of ∇ϕ_D with respect to x and P_𝒬 is the projection on the constraint space. This projection is also the solution of a minimization problem, given in equation (18).

PGD iterates until a stopping condition is met. The detailed computation for ∇ϕ_D (Ξ, Ψ, p_Z, x_k) when all the subjects have the same elementary design ξ is given in [29] and the result is recall in equation (19), where tr denotes the trace operator.

2.6 Optimisation of sample size

After having optimised the proportion of each covariate distribution, we can compute the sample size needed, NSN, all other things being equal, to reach desired confidence level on parameter estimates or on power of statistical tests on covariate effect.

Denoting SE(Ξ, Ψ, p_Z(x))_β the standard error on a parameter β ∈ Ψ for a design Ξ, population vector Ψ and a covariate distribution defined by p_Z and x, the number of subjects needed NSN to reach a desired level SE^* on this parameter is given in equation (20).

From this, we can calculate the NSN required to achieve the desired power for significance tests on covariates [20, 30], as well as relevance tests on covariate relationships [20] for a given equivalence interval [R_inf ; R_sup]. The non-relevance of a covariate effect is assessed by a TOST procedure at level α, with the null hypothesis is H₀: “the covariate effect is relevant”, i.e. r_l,c ∉ [R_inf ; R_sup] while the alternative hypothesis is H₁: “the covariate is not relevant”, i.e. r_l,c ∈ [R_inf ; R_sup]. This two sided null hypothesis can be split into two, respectively H_0,inf and H_0,sup :

Thus H₀ is rejected if both H_0,inf and H_0,sup are rejected.

For log-normally distributed parameter with additive covariate relationships on the log-scale, and at level 1 − α, the null hypothesis is rejected if and if where depending on the sign of z − z_ref, B_sup and B_inf equal or . The power is given by the equation (21), which holds if .

The NSN to reach a desired non-relevance power P^* is found using a round finder approach to solve equation (22).

2.7 Implementation

All these methods have been implementing using the R language [25]. For FIM evaluation an extension of the package PFIM 6, based on previous work [20] have been done. All scripts are available online at NEWZENODO. The quadrature nodes were computed using the function gauss.quad() from the package statmod [31]. For PGD, the projection on the constraint space was made using the function lsei() from the package limSolve [32].

3 Evaluation example using a simple PK model

In this section we explored an example to demonstrate the benefit of covariate distribution optimisation in reducing the sample size needed to draw conclusions about the relevance of covariate relationships.

3.1 Settings

3.1.1 Model

The PK model was a one compartment model with IV bolus and linear elimination, , with two parameters: the clearance (Cl) and the volume of distribution (V). The residual error was modelled as a combined error model: g = a+bf and exponential random effects without correlation were assumed for V and and . All covariate effects were modelled as additive on the log-scale: log θ_i = log θ_pop + β_θ,ZZ_i + η_θ,i.

Firstly, three univariate scenarios were considered, with the model including either an effect of the Sex (SEX) on V, an effect of the Body Mass Index (BMI) on V, or an effect of the creatinine clearance (CLCR) on Cl. Both BMI and CLCR were first normalised and log transformed, applying respectively log and log , where BMI_pop and CLCR_pop respectively denoted the average BMI and CLCR among Male individuals with Normal RF and Healthy Weight for BMI. These scenarios are hereafter referred to as ‘SEX only’, ‘BMI only’ and ‘CLCR only’. Then, the three effects were taken into account simultaneously, and this scenario is referred to as the ‘Three covariates’ scenario.

3.1.2 Covariate distribution specification

Because covariate data may not be available at design step, we used the public database from the National Health and Nutrition Examination Surveys (NHANES) database [26] to get prior covariate distributions. Covariate data from 2009 to 2020 were extracted, keeping only subjects between 18 and 80 years old, with BMI higher or equal to 18.5 kg/m² and for whom there were no missing data for Age, Body Weight (BW), BMI, Height, Creatinine, and SEX. The body weight was adjusted for ideal body weight computed with Robinson’s formula [33] and the Cockcroft and Gault equation with adjusted body weight [34] was then used to estimate the creatinine clearance, CLCR. 29 463 covariate vectors were selected in the NHANES database and Table 1 provides the summary statistics of these covariates.

View this table:

Table 1: Covariate data summary from the NHANES database, 2009-2020

The BMI was segmented into three usual intervals, namely Obesity if BMI ≥ 30 kg/m², Overweight if 25 ≤ BMI < 30 kg/m² and Healthy Weight if 18.5 ≤ BMI < 25 kg/m². The CLCR was segmented into the four usual intervals to define the renal function (RF), namely Normal RF if CLCR ≥ 90 mL/min, Mild RF if 60 ≤ CLCR < 90 mL/min, Moderate RF if 30 ≤ CLCR < 60 mL/min and Severe RF if CLCR < 30 mL/min. Of note, BMI_pop = 22.9 kg/m² and CLCR_pop = 112.2 mL/min.

While there was a good balance between Male and Female and between the different BMI intervals, Severe RF was poorly represented with only 1.2% of the data.

Combining SEX, BMI intervals and CLCR intervals, there were 2 × 4 × 3 = 24 possible covariate combinations. Nevertheless, those including Severe RF were poorly represented in the dataset (e.g. only 40 subjects are Male, with Severe RF and Obesity). Therefore, Severe RF individuals were pooled into two classes depending on their SEX and without taking into account their BMI status. This pooling was carried out to ensure the adequacy of copula fits. The detailed repartition between the resulted 20 covariate combinations is given in the first panel of Figure 7.

3.1.3 Parameter values

The PK parameter values are given in Table 2, such as ratios of change caused by being a Female on V, being Obese on V and having Severe RF on Cl. With ratios of 0.70, i.e. outside of [0.80; 1.25], both the effect of being a Female on V and having Severe RF on Cl were tested for relevance; while r_{V,BMI (Obesity)} was within the equivalence interval, thus the effect of being Obese on V was tested for non-relevance. It should be noted that r_{V,BMI (Obesity)} and r_{Cl,CLCR (Severe RF)} were computed for the threshold values for belonging to these intervals, namely for 30 kg/m² and 30 mL/min.

View this table:

Table 2: Population PK parameter values used in the evaluation example

View this table:

Table 3: Three Covariates - ϕ_D, D-Efficiency and NSN to reach 80% power in covariate testing with the initial and optimal covariate distributions for the D-criterion for the different sets of constraints

3.1.4 Design

The considered design includes N = 24 subjects receiving 250 mg of drug at time 0, and for each of them, three samples are collected at 1, 4 and 12 hours post-dose. The concentration curves f (t) accounting for univariate covariate effects and with parameters given in Table 2 are shown on Figure 1. Because being a Female reduces the volume, the typical concentration curve for Female is above the Male one. Because the higher the BMI the higher the volume, the typical area for concentration curve in individual with Healthy Weight is above the one for Overweight, which is above the one for Obesity. Regarding CLCR status, the lower the CLCR, the lower the typical curve, since renal impairment decreases CLCR.

Figure 1:

Evolution of the concentration f (t) according to the fixed effects in the three univariate scenarios: the first panel corresponds to an effect of SEX on V, the second panel to an effect of BMI on V and the third panel to an effect of CLCR on Cl.For the second and third panels, the black lines correspond to the limits of the intervals of the continuous covariates.

3.2 FIM evaluation using copula and quadrature

3.2.1 Copula fitting

In the example with SEX only (i.e. without continuous covariate), the population FIM was computed for each SEX and the total FIM was obtained as the weighted sum, each FIM weighted by the proportion of the corresponding SEX in the database. With BMI only, three copula were fitted: one for each of the three datasets corresponding to BMI intervals. In the same way, in the example with CLCR only, four copula were fitted: one for each of the four datasets corresponding to CLCR intervals.

For the example with three covariates, a copula was fitted for each of the 20 covariate combination on the associated subset of data.

All copula were fitted as vine copula, using the R package rvinecopulib0.6.3.1.1 [35]. Copula fit evaluation was conducted through a simulation-based strategy as described in [36] using 100 replicates. The relative error (RE) between summary metrics M_sim in copula-simulated populations and in the NHANES dataset was computed as . Summary metrics were the mean, the standard deviation, the median and the 5^th and 95^th percentiles. Overlaps between simulated and true distributions were also computed such as the simulated and observed correlations between BMI and CLCR. Diagnostics plots were explored using the R package pmxcopula [37] with minor edits.

3.2.2 Quadrature settings

According to the Gauss-Legendre Quadrature rule, the approximation using n^(Q) nodes is exact for integrands that are polynomials of degree 2n^(Q) − 1 or less. Nevertheless, in our case, the integrand is the composition of the FIM and the inverse Rosenblatt transform of the copula describing the continuous covariate distribution, therefore, in a general setting, it is not a polynomial function. Consequently, the quadrature remains an approximation and increasing n^(Q) increases the accuracy.

To determine the number of nodes required in the quadrature for an acceptable accuracy in FIM evaluation, we computed the FIM for n^(Q) between 1 and 25, and looked for stabilization in the D-criterion. The same number of nodes was used for all the continuous covariates and for each copula. As a benchmark, we also computed the FIM using copula with Monte Carlo (MC) simulations. This computation was performed using n^(MC) = 10, 100, 500, 1000, and 2000 Monte Carlo samples drawn from each copula. Additionally, since it is a stochastic method, the process was repeated S = 100 times to assess the uncertainty of the computed FIM. The target value was chosen as the average ϕ_D obtained with 1000, and 2000 Monte-Carlo samples, and the RE on the D-criterion was computed with respect to this value. In the same time, we computed the run time as a function of n^(Q), and the average run time in MC procedure as a function of the number of samples.

This comparison was performed for both BMI only and CLCR only, and because of computation already done, optimisation was then carried out with the higher number of nodes, i.e. n^(Q) = 25. Because of reasonable run time, n^(Q) = 25 was also used for the Three covariate scenario and MC approach using n^(MC) = 500 was also explored as a reference.

3.3 Covariate distribution optimisation

In all scenarios, covariate distribution optimisation was performed using PGD. The step-size α was constant and set to 0.001 and the stopping condition was that the objective function should not change by more than ϵ = 10⁻⁷ between two iterations.

In univariate cases, optimisation was done for various values of the covariate effects, while the other PK parameter values were the same as those given in Table 2.

Different constraints were applied depending on the scenario.

3.3.1 SEX only

The optimal balance between Male and female was explored for β_V,Sex ∈ [−1.20; 0.00] with a 0.05 step, with no other constraint than that the proportions should be within [0; 1] and sum to 1.

3.3.2 BMI only

Optimisation between the three BMI intervals was explored for β_V,BMI ∈ [0.00; 1.00] with a 0.05 step, and for two optimisation settings. First without any other constraints than proportion definition. Then, optimisation among the three BMI intervals was performed with the additional constraint that each interval should represent at least 5% of the population. This setting is hereafter referred to as ‘Lower constraints’.

3.3.3 CLCR only

Optimisation between the four CLCR intervals was explored for β_Cl,CLCR ∈ [0.00; 1.00] with a 0.05 step. First, the two same settings as BMI only were explored. In addition, because Severe RF subjects are rare in the population, a third optimisation scenario was explored by adding to the lower limit constraint an upper limit set at 10% for Severe RF in the trial population. This setting is hereafter referred to as ‘Lower and Upper constraints’.

3.3.4 Three covariates

For the case with the three covariates, the three same optimisation settings as for CLCR only were explored.

3.4 Optimised distributions evaluation

The initial distribution between the SEX, CLCR and BMI intervals corresponds to the observed proportions in the selected subset from the NHANES database.

For each optimised distribution, the D-Efficiency relative to the design with the initial distribution was computed, in addition with the NSN to conclude with a 80% power on the statistical significance for each covariate effects, and the NSN to conclude with a 80% power on the relevance (or non relevance) of relationship for the effect of being a Female on V, of having Obesity on V and of having Severe RF on Cl.

4 Results

4.1 Copula fitting

The performance of copula model was evaluated for each covariate combination.

4.1.1 BMI only

For the scenario with BMI only, mean, standard deviation, median and 5^th and 95^th percentiles of the simulated population agreed with the observed one, with a small RE within ±2.5% (Appendix A.1.1, Figure A.1.1). Visual predicted checks (VPC) were also in accordance with the observed percentiles (Appendix A.1.1, Figure A.1.2).

4.1.2 CLCR only

Similarly, with CLCR only, RE on summary statistics was satisfactory, within ±5% for all the intervals (Appendix A.1.2, Figure A.1.3) and VPC were also in accordance with the observed percentiles (Appendix A.1.2, Figure A.1.4).

4.1.3 Three covariates

As shown in Appendix A.1.3, Figure A.1.5, RE was within ±10% for all the summary statistics for almost all the intervals of BMI and CLCR. Indeed the RE on the 5^th or 95^th percentiles of the simulations were a little higher for sd on BMI for Male, with Obesity and Moderate RF; for sd on BMI, P 5% and P 50% on CLCR for both Male and Female with Moderate RF. The simulated correlations from the copula models were very similar to observed correlations between BMI and CLCR (Appendix A.1.3, Figure A.1.7) and overlap between simulated and observed join distribution for BMI and CLCR was always above 80%, except for Female with Severe RF, for which the overlap was above 80% in only in three quarters of the simulations (Appendix A.1.3, Figure A.1.6). Moreover, the VPC given in Figure 2 show a a good match between simulated and observed level lines, for all combinations.

Figure 2:

Three Covariates - Visual predictive checks based on the contours of the bivariate density between BMI and CLCR. For each of the 20 covariate combination, the virtual population was simulated 100 times from the copula and the 99% prediction intervals of percentile contours of the joint distribution was derived and compared to the contours observed in the NHANES database.

The ribbon areas correspond to the 99% prediction interval of 5^th , 50^th and 95^th percentile contours of the joint distribution of BMI and CLCR. The lines corresponds to the observed 5^th , 50^th and 95^th percentile contours.

Overall, copula fits were judged to be very satisfactory.

4.2 FIM evaluation using quadrature

4.2.1 BMI only

As shown in Figure 3, quadrature method was much faster than MC. Indeed, with 25 nodes, the FIM computation was perform in 1.7 seconds, whereas even with only 100 MC samples,the average run time was 6.4 seconds, and it increased to 63.8 and 126.8 seconds for respectively 1000 and 2000 MC samples.

Figure 3:

BMI only - Run time for ϕ_D computation (top) and relative error (RE) in ϕ_D computation (bottom), with FIM integration using on the left copula and MC and on the right copula and GLQ. The target value was the average ϕ_D obtained with n^(MC) = 1000 and n^(MC) = 2000.

The green line corresponds to 0. The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. The diamond corresponds to the mean. ϕ_D: refers to the D-criterion, FIM: Fisher Information Matrix MC: Monte-Carlo, GLQ: Gauss-Legendre Quadrature

Regarding the accuracy in ϕ_D computation, the first observation is that MC integration led to a non-negligible uncertainty when the number of MC samples n^(MC) is low (Figure 3 and Figure A.2.1 given in Appendix A.2.1). In our example, the uncertainty is however very similar between 500, 1000 and 2000 MC sample, suggesting that 500 MC sample are acceptable for the FIM computation. For the quadrature approach, the RE to the target, defined as the average ϕ_D over the repetitions with 1000 or 2000 MC sample, was bellow 1% starting from 4 quadrature nodes and higher (Figure 3).

4.2.2 CLCR only

Once again, GQ method was much faster than MC with a run time of 2.3 seconds with 25 nodes whereas even with only 100 MC samples, the average run time was 8.6 seconds, and it increased to 86.0 and 170.4 seconds for respectively 1000 and 2000 MC samples (Figure A.2.3 given in Appendix A.2.2).

The uncertainty in ϕ_D computation with the MC approach was also very similar between 500, 1000 and 2000 MC sample, suggesting that 500 MC sample are acceptable for this FIM computation (Figure A.2.2 given in Appendix A.2.2).

With GQ, the relative error to the target was bellow 1% starting from 3 quadrature nodes and higher (Figure A.2.3 given in Appendix A.2.2).

4.2.3 Three Covariates

In this scenario, as the number of covariate combinations was higher, the run time was longer for n^(Q) = 25, 533.0 seconds than the average run time with 500 MC samples: 412.7 seconds (sd = 2.1). Both methods gave very similar results with an average D-criterion of 62.5 with 500 MC samples (sd = 0.2) and 62.5 with n^(Q) = 25 (Figure A.2.4 given in Appendix A.2.3).

4.3 Covariate distribution optimisation

4.3.1 SEX only

Optimal balance between Male and Female for the D-criteria and as a function of β_V,SEX is given in Figure 4. The optimal distribution did not vary a lot and remained close to the 50-50 balance. For β_V,SEX = 0, the optimal balance was perfect equilibrium between Male and Female. When β_V,SEX decreased, the optimal weight for Female increased up to 67.8% for β_V,SEX = −1.20. For the value used in the scenario with the three covariates, i.e. β_V,SEX = −0.35, the optimal Female proportion in this univariate case was 62.0%.

Figure 4:

SEX only - Optimal proportions for the covariate distributions for the D-criterion (top), ϕ_D, D-Efficiency and NSN to reach 80% power (bottom), as a function of β_V,SEX

The purple vertical line corresponds to the value used in the Three covariates scenario. The grey area corresponds to the non-relevance interval. ϕ_D: refers to the D-criterion, NSN: Number of subjects needed

In Figure 4 are given the D-criterion, the D-Efficiency relative to the initial design, the NSN to conclude on the significance of β_V,SEX, and the NSN to conclude on the relevance/non-relevance (when appropriate) of β_V,SEX are given as a function of β_V,SEX for both the initial design (51% of Females) and the Optimal design. In this context, the benefit of using the optimal covariate distribution was very small. As expected, NSN_signif (β_V,SEX) and NSN_relev(V, SEX, (F emale)) decreased when β_V,SEX get further from 0. For the value used in the scenario with the three covariates, the relative D-Efficiency was quite low: only 1.01. The optimal design increased slightly the uncertainty on β_V,SEX therefore NSN_signif (β_V,SEX) which initially was 28 became 30, and similarly NSN_relev(V, SEX, (F emale)) has been increased from 165 to 176.

4.3.2 BMI only

The optimal distribution between the three BMI intervals as a function of β_V,BMI is given in Figure 5. With no constraints on the representation of all the statuses, we can see that the balance was only between the extremes: when β_V,BMI = 0, the optimal was having 53.7% of Obese and 46.3% of Healthy Weight. When the BMI effect increased, the optimal proportion of Obese individual decreased, being 36.0% for β_V,BMI = 1. With the lower constraint of having at least 5% of each interval, the optimal proportion of Overweight was always the minimum of 5%. With this lower constraint, the proportion of Obesity decreased less as the BMI effect increased (-2.2% when β_V,BMI = 0 and -1.7% when β_V,BMI = 1).

Figure 5:

BMI only - Optimal proportions for the covariate distributions for the D-criterion (top), ϕ_D, D-Efficiency and NSN to reach 80% power (bottom), as a function of β_V,BMI

The purple vertical line corresponds to the value used in the Three covariates scenario. The grey area corresponds to the relevance interval. ϕ_D: refers to the D-criterion, NSN: Number of subjects needed

For the value used in the scenario with the three covariates, i.e. β_V,BMI = 0.5, the optimal Obesity proportion without constraint was 39.7% and with the Lower constraints, 38.0%.

The higher β_V,BMI, the more there was benefit in using optimal distribution, no matter the constraint, in terms of overall estimation as the D-Efficiency increased with β_V,BMI, from 1.05 when β_V,BMI = 0 to respectively 1.15 and 1.14 when β_V,BMI = 1 without and with lower constraints (Figure 5). Moreover, the addition of the lower constraints, which makes the distribution more realistic in terms of clinical requirements, was still very valuable, as the ϕ_D curve for the lower constraints was much closer to the ϕ_D curve without constraints than to the curve with the initial distribution. In the same way, the NSN to reach 80% in statistical tests with the lower constraints were closer to those without constraint than to those with the initial distribution. For high effect, the NSN to conclude of the significance of β_V,BMI was all ready low (18 for β_V,BMI = 1) and decreased with optimal distribution (respectively 14 and 15 without and with constraints). Bellow , the ratio of change caused by the Obesity on V was bellow 1.25, and obviously the NSN needed to conclude to the non-relevance of this effect decreased with β_V,BMI, and was minimal in 0. With the initial distribution, NSN_nonrelev(V, BMI, (Obesity)) = 28, and it decreased to 20 and 21 without and with constraints.

For the value used in the scenario with the three covariates, NSN_signif (β_V,BMI) was respectively 70, 52 and 55 for the initial, without constraint and with lower constraints distributions, while the NSN_nonrelev(V, BMI, (Obesity)) was 130, 97, 101.

4.3.3 CLCR only

With CLCR only, a similar pattern as for BMI, with allocation to the extreme statuses was observed without constraint (Figure 6). Thus, for β_Cl,CLCR = 0, the optimal distribution was having 52.2% of Severe RF and 47.8% of Normal RF. When the CLCR effect increased, the optimal proportion of Severe individual decreased, first slowly until β_Cl,CLCR = 0.85 (26.7%) then fell to 4.2% for β_Cl,CLCR = 1. In the same time, while the proportion of Moderate RF was null for β_Cl,CLCR = 0.85, it increased up to 19.3% for β_Cl,CLCR = 1. In other words, when the effect was stronger, less extreme small CLCR values were sufficient and better than the smaller ones.

Figure 6:

CLCR only - Optimal proportions for the covariate distributions for the D-criterion (top), ϕ_D, D-Efficiency and NSN to reach 80% power (bottom), as a function of β_Cl,CLCR

When adding the lower constraint of having at least 5% of each CLCR interval, until β_Cl,CLCR = 0.85, 5% was the optimal for both Mild and Moderate RF. With a greater effect and as previously, the optimal proportion of Moderate increased up to 15.1 for β_Cl,CLCR = 1.

With the additional upper constraint of 10% for the Severe RF, there was a shift from Severe to Moderate, while Mild remained at the lower limit of 5%.

For the value used in the scenario with the three covariates, i.e. β_Cl,CLCR = 0.27, the optimal distribution Normal - Mild - Moderate - Severe were respectively 65.0 - 0 - 0 35.0 %, 58.6 - 5.0 - 5.0 - 31.4 % and 69.4 - 5 - 15.6 - 10.0 % without constraint, with lower constraints, and with lower and upper constraints.

As in the scenario with BMI only, adding the lower constraint on the proportions did not deteriorate too much the performance of the optimal distribution in terms of ϕ_D, D-efficiency and NSN . Constraining Severe RF being bellow 10% reduced the information and therefore increased the NSN but was still valuable compared to the initial distribution. The greater difference in D-Efficiency when using an optimal distribution compared to the initial one was observed for a null effect; because it was in this case than the optimal proportion of Severe RF was the higher while there were only 1.2% Severe in the initial distribution. Above, , the ratio of change caused by the Severe RF on CLCR was above 1.25, and logically the NSN needed to conclude to the relevance of this effect decrease with the increasing in β_Cl,CLCR.

For the value used in the scenario with the three covariates, NSN_signif (β_Cl,CLCR) was respectively 121, 32, 35 and 62 for the initial, without constraint, with lower constraints and with lower and upper constraints distributions, while the NSN_relev(Cl, CLCR, (Severe)) was respectively 684, 179, 194 and 348. Therefore, the use of the more constrained optimal distribution still made it possible to reduce by almost 50% the NSN to conclude on the relevance of the effect of having a Severe RF on Cl.

4.3.4 Three Covariates

The optimal distribution between the 20 covariate combinations in the scenario with three covariates are given in Figure 7 for the initial distribution and for the three optimisation settings. The first observation is that theses distributions were very sparse in the sense that many categories had a weight of 0.

Figure 7:

Three Covariates - Optimal proportions for the covariate distributions for the D-criterion for the different sets of constraints

Because being a Female reduces V while V increases with the BMI, in optimal distributions, Female were more associated with Healthy Weight while Male were associated with Obesity. Indeed, this balance allows to maximise the difference in the theoretical concentration curves, higher for Healthy Female than for Obese Male. In the same way, because having a Severe RF decreases Cl and therefore slows the elimination and leads to higher concentration curves, in optimal distribution, only Female Severe RF were present. We thus find the same tendency to distribute between extremes as in the univariate analysis.

Marginal distributions are given in Appendix A.3 in Table 4. Regarding the SEX, no matter the constraint settings, the optimal balance in close to 66.3% Female, not so far from the 62.0% observed in the univariate case for SEX only.

For the BMI intervals, even without constraint, there were Overweight in the optimal distribution because of the pool of the BMI statuses in the Severe RF combinations. Nonetheless, the proportions of Obesity were close to the univariate scenario: 42.0% without constraint and 39.7% in univariate for the optimisation without constraint, 40.6% and 38.0% with lower constraints, and finally 43.0% with additional upper constraint on Severe RF.

Once again for the CLCR intervals, when there were no constraint, Mild and Moderate RF were absent from the optimal distribution, and the balance Normal - Severe was 68.7 - 31.3% therefore close to 65.0 - 35.0 % from the univariate case. With lower constraints, Mild and Moderate were still to their minimum and with the additional upper constraint, Severe RF was replaced by Moderate RF.

Regarding performances, having both lower and upper constraints remained close to the two other optimisation settings in terms of D-Efficiency and still allowed to reduce the NSN . Indeed the higher NSN to conclude on the significance of the three covariates effects, on the relevance of the effect of being a Female on V and of the effect of having Severe RF on Cl, and on the non-relevance of having Obesity on V was 596 with the initial design and fell to respectively 197 for both without constraint and with lower constraint optimisation and to 229 with the additional upper bound on Severe. Thus, even the more constrained optimisation allowed to reduce the NSN by more than 60%.

5 Discussion

From an application point of view, we proposed in this work to optimise the allocation of the covariates among the patients to be including in a future study. The main objective was to improve the estimation, but we were particularly interested in the effects of the covariates and the power of the statistical tests relating to them, as well as the NSN to conclude on their relevance with a sufficient degree of confidence. For relevance assessment, we used the equivalence region [0.80; 1.25] which is the one used in bioequivalence studies to define whether a dosing adjustment is required but alternative methods have been proposed to chose equivalence regions appropriate to a specific context [2]. This optimisation was carried out between categories for discrete covariates, while for continuous ones, we introduced a segmentation of their domains into intervals and optimised the proportion between them. In terms of methods, we have introduced a new way of integrating the FIM for the covariate distribution by leveraging copula modelling and GLQ, and we solved the optimisation problem using the PGD algorithm. All these methods have been implemented in R based on the PFIM 6.1 package, and the source code is available online at the following link: https://doi.org/10.5281/zenodo.14778034.

We applied the proposed workflow on a PK example with one discrete covariate and two continuous ones. First, univariate analysis showed classical and intuitive results, namely an equal distribution between the more extreme classes and a disappearance of intermediate values in cases where there are no constraints. Constraints were proposed to better reflect real life conditions for patients’ inclusion in clinical studies. Even with these constraints, we showed that in the scenarios with the three covariate effects, the optimal distribution with the most realistic constraints allowed to reduce by more than 60% the NSN to conclude on the clinical relevance (or non-relevance) of the relationships. The challenge in implementing this optimal distribution is the need to include 23 females with Severe RF, while with the initial distribution, it was just 5 subjects without restriction on the SEX. Nevertheless, as the reduction in the sample size is otherwise significant, it may be worth concentrating on this difficult inclusion.

We have shown that using GLQ greatly reduces the computation time compared with MC, while providing good accuracy in FIM computation. However, the choice of the quadrature settings has not been examined in depth. Indeed, we compared the results depending on the number of quadrature nodes with the ones obtained with MC simulations; but using this approach in practice negates all the benefits of run time reduction. In practice, a suggestion can be to increase progressively the number of nodes and just look for a stabilization of the D-criterion without reference. In addition, we have seen that the computation time remains quite low even with a high number of nodes, in our case 25. Nonetheless, this holds for three covariates but if the number of continuous covariates C increases, due to the quadrature of each of them, the number of FIM evaluations (n(Q))^C increases exponentially. This difficulty could be countered, for example, by refining the number of nodes required for each distribution rather than using the same number for all the covariates, or by using adaptive methods that require fewer nodes (see for instance a review in [38]). However, this point should be qualified by the fact that, in practice, it seems unlikely that the number of covariates to be optimised will be very large. Moreover, it would still be faster than CTS, which requires simulation and then estimation on a large number of data sets.

We note that we did not explore the PGD hyper-parameters such as the step size nor some of performance-improving variants because the need was not felt in our situation, but it can be an option in more complex cases. However, in the current version of this algorithm, only linear constraints can be used for an easy projection on the constraint space, while in practice some non-linear ones may be useful (e.g., to impose the power of a test or a SE level on a parameter). To address this, a penalty term could be added to the objective function or another class of algorithms could be explored. Regarding the optimisation criteria, we only used the D-criterion while others exist such as the DS-criterion which allows focus on a specific subset of the parameters (see for instance [13] for an application). In our case, the covariates effects could have been the only parameters of interest. This change of criterion is not straightforward in the sense that its derivative has to be calculated, but apart from this calculation, it presents no particular difficulty.

In this work, we have assumed the same elementary design for all the subjects, but the computations can easily be extended to the case where there are several. It would therefore certainly be more beneficial to adapt the elementary design to the covariate combination. Then, an interesting path may be to jointly optimize the elementary designs and the covariates. If discrete optimization is performed for elementary designs, it can be combined with the covariates and the global process remains the same, only the number of FIM evaluations increases. We can easily imagine that this joint optimisation would enable us to improve the designs, for example in our case, for subjects with poor elimination, having a higher final observation time would guarantee a better estimation. On the other hand, for continuous optimization, the extension is harder. In both cases, it will first be necessary to consider the practical and ethical constraints and discuss the actual implementation with experts.

In our example we used a covariate database and the goodness of fit diagnosis suggested that a large amount of data is required but copulas could have been used directly if available, which is one of their great interests in pharmaceutical applications as they can be made public without threatening anonymity. In addition, the optimization approach can be coupled with another method for FIM computation if preferred.

Regarding FIM computation, we used a linearization approach but methods avoiding this approximation have already been explored using MC and GQ [39] [40]; and future work could extend them to include the integration of the covariate distribution.

Furthermore, we need to point out the possible difficulties in implementing these optimal design distributions. For example, it seems implausible not to include patients because they are not in the most extreme interval, when inclusion processes are already long and costly. The spirit of this optimisation would therefore be rather to aim towards the optimal in practice and above all to be aware of the conditions on the distribution of covariates and on the number of subjects that would be needed to reach a conclusion.

Last but not least, this approach assumes that the model and its parameters are known, including the effects of the covariates, whereas the trial to be designed aims to better estimate them, and is not robust to misspecifications. In the univariate example, we computed optimal allocation and NSN for a range of β and this could be extended into a robust approach as already explored in the NLMEM context [10–13]. Moreover, this methodology could be used to design a Phase 3 study based on Phase 2 results; therefore, the SE from the covariate effects estimates based on the Phase 2 data may be used to specify a prior distribution on the β. This absence of robustness also holds for the covariate distribution assumed for FIM computation and covariate optimisation. In this case, Adaptive Designs and especially Two-stage approaches [10, 29, 41] could be a good avenue to explore for refining the distribution of covariates halfway through inclusion.

Data Availability

All the R scripts are available at the following link: https://doi.org/10.5281/zenodo.14778034

https://doi.org/10.5281/zenodo.14778034

A Appendix

A.1 Copula diagnostics

A.1.1 BMI only

Figure A.1.1:

BMI only - Relative error (RE) of distribution metrics (mean, standard deviation and percentiles) of BMI for the three intervals, as compared to the statistics of the NHANES database. For each BMI interval, the virtual population was simulated 100 times from the copula.

The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. sd refers to the standard deviation, and P 5%, P 50% and P 95% to the 5^th, 50^th and 95^th percentiles respectively.

Figure A.1.2:

BMI only - Visual predictive checks based on the percentile of the BMI distribution. The virtual population was simulated 100 times from the copula and the 99% prediction intervals of the percentiles the distribution was derived and compared to the percentiles observed in the NHANES database.

The ribbon areas correspond to the 99% prediction interval of 5^th , 50^th and 95^th percentiles of the BMI distribution. The lines correspond to the observed 5^th , 50^th and 95^th percentiles of the CLCR distribution.

A.1.2 CLCR only

Figure A.1.3:

CLCR only - Relative error (RE) of distribution metrics (mean, standard deviation and percentiles) of CLCR for the four intervals, as compared to the statistics of the NHANES database. For each CLCR interval, the virtual population was simulated 100 times from the copula.

Figure A.1.4:

CLCR only - Visual predictive checks based on the percentile of the CLCR distribution. The virtual population was simulated 100 times from the copula and the 99% prediction intervals of the percentiles the distribution was derived and compared to the percentiles observed in the NHANES database.

The ribbon areas correspond to the 99% prediction interval of 5^th , 50^th and 95^th percentiles of the CLCR distribution. The lines correspond to the observed 5^th , 50^th and 95^th percentiles of the CLCR distribution.

A.1.3 Three covariates

Figure A.1.5:

Three Covariates - Relative error (RE) of marginal metrics (mean, standard deviation and percentiles) of continuous covariates for the 20 covariate combinations, as compared to the statistics of the NHANES database. For each combination, the virtual population was simulated 100 times from the copula.

Figure A.1.6:

Three Covariates - Overlap metric of 95th density contours of virtual population relative simulated 100 times from the copula to the NHANES database.

Overlap metric of 95^th density contours of virtual population relative to observed population. Virtual population was simulated 100 times. The gray dashed line indicates 85% overlap percentage.

Figure A.1.7:

Three Covariates - Correlations of logBM I and logCLCR pair in the NHANES database (red diamond) and in virtual population relative simulated 100 times from the copula (boxplot).

A.2 FIM integration: MC vs GQ

A.2.1 BMI only

Figure A.2.1:

BMI only - ϕ_D computation with FIM integration using copula and MC (left) and copula and GLQ (right). The green line corresponds to the average ϕ_D obtained with n^(MC) = 1000 and n^(MC) = 2000. The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. The diamond corresponds to the mean. ϕ_D: refers to the D-criterion, FIM: Fisher Information Matrix MC: Monte-Carlo, GLQ: Gauss-Legendre Quadrature

A.2.2. CLCR only

Figure A.2.2:

CLCR only - ϕ_D computation with FIM integration using copula and MC (left) and copula and GLQ (right). The green line corresponds to the average ϕ_D obtained with n^(MC) = 1000 and n^(MC) = 2000. The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. The diamond corresponds to the mean. ϕ_D: refers to the D-criterion, FIM: Fisher Information Matrix MC: Monte-Carlo, GLQ: Gauss-Legendre Quadrature

Figure A.2.3:

CLCR only - Run time for ϕ_D computation (top) and relative error (RE) in ϕ_D computation (bottom), with FIM integration using on the left copula and MC and on the right copula and GLQ. The target value was the average ϕ_D obtained with n^(MC) = 1000 and n^(MC) = 2000. The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. The diamond corresponds to the mean. ϕ_D: refers to the D-criterion, FIM: Fisher Information Matrix MC: Monte-Carlo, GLQ: Gauss-Legendre Quadrature

A.2.3 Three Covariates

Figure Figure A.2.4:

Three covariates - ϕ_D with FIM integration using copula and MC (left) and copula and GLQ (right). The boxplot displays the median, the 25^th and 75^th percentiles, while the whiskers are 5^th and 95^th percentiles. The diamond corresponds to the mean. ϕ_D: refers to the D-criterion, FIM: Fisher Information Matrix MC: Monte-Carlo, GLQ: Gauss-Legendre Quadrature

A.3 Optimisation results: marginal distributions

View this table:

Table 4: Three Covariates - Marginal optimal proportions for the covariate distributions for the D-criterion for the different sets of constraints

Footnotes

Funding statement This work was financed by a CIFRE agreement (Conventions Industrielles de Formation par la Recherche) of the ANRT (Association Nationale de la Recherche et de la Technologie). The CIFRE agreement is a partnership between a public laboratory and a company, here the UMR (Unité Mixte de Recherche) 1137 and Ipsen, respectively.

References

1.↵
Mould, D. R. & Upton, R. Basic concepts in population modeling, simulation, and model-based drug development. CPT: Pharmacometrics & Systems Pharmacology 1, 1–14 (2012).
OpenUrl
2.↵
Sanghavi, K. et al. Covariate modeling in pharmacometrics: General points for consideration. CPT: Pharmacometrics & Systems Pharmacology 13, 710–728 (2024).
OpenUrl PubMed
3.↵
FDA Guidance for Industry Population Pharmacokinetics https://www.fda.gov/media/128793/download. 2022.
4.↵
Atkinson, A., Donev, A. & Tobias, R. Optimum experimental designs, with SAS (OUP Oxford, 2007).
5.↵
Mentré, F. et al. Current use and developments needed for optimal design in pharmacometrics: a study performed among DDMoRe’s European Federation of Pharmaceutical Industries and Associations Members. CPT: Pharmacometrics & Systems Pharmacology 2, 1–2 (2013).
OpenUrl
6.↵
Strömberg, E. A., Nyberg, J. & Hooker, A. C. The effect of Fisher information matrix approximation methods in population optimal design calculations. Journal of Pharmacokinetics and Pharmacodynamics 43, 609–619 (2016).
OpenUrl PubMed
7.↵
Mentre, F., Mallet, A. & Baccar, D. Optimal design in random-effects regression models. Biometrika 84, 429–442 (1997).
OpenUrl CrossRef Web of Science
8.
Retout, S., Mentré, F. & Bruno, R. Fisher information matrix for non-linear mixed-effects models: evaluation and application for optimal design of enoxaparin population pharmacokinetics. Statistics in Medicine 21, 2623–2639 (2002).
OpenUrl CrossRef PubMed Web of Science
9.↵
Bazzoli, C., Retout, S. & Mentré, F. Fisher information matrix for nonlinear mixed effects multiple response models: evaluation of the appropriateness of the first order linearization using a pharmacokinetic/pharmacodynamic model. Statistics in Medicine 28, 1940–1956 (2009).
OpenUrl CrossRef PubMed
10.↵
Dodds, M. G., Hooker, A. C. & Vicini, P. Robust population pharmacokinetic experiment design. Journal of Pharmacokinetics and Pharmacodynamics 32, 33–64 (2005).
OpenUrl CrossRef PubMed Web of Science
11.
Foo, L. K., McGree, J., Eccleston, J. & Duffull, S. Comparison of robust criteria for D-optimal designs. Journal of Biopharmaceutical Statistics 22, 1193–1205 (2012).
OpenUrl PubMed
12.
Loingeville, F., Nguyen, T. T., Riviere, M.-K. & Mentré, F. Robust designs in longitudinal studies accounting for parameter and model uncertainties–application to count data. Journal of Biopharmaceutical Statistics 30, 31–45 (2020).
OpenUrl PubMed
13.↵
Seurat, J., Nguyen, T. T. & Mentré, F. Robust designs accounting for model uncertainty in longitudinal studies with binary outcomes. Statistical Methods in Medical Research 29, 934–952 (2020).
OpenUrl PubMed
14.↵
Bazzoli, C., Retout, S. & Mentré, F. Design evaluation and optimisation in multiple response nonlinear mixed effect models: PFIM 3.0. Computer Methods and Programs in Biomedicine 98, 55–65 (2010).
OpenUrl CrossRef PubMed Web of Science
15.↵
Leroux, R., Seurat, J., Fayette, L., Bach, N. T. & Mentré, F. Design evaluation and optimisation in nonlinear mixed effects models with the R package PFIM 6.0. PAGE (2023).
16.↵
Mentré, F., Leroux, R., Seurat, J. & Fayette, L. PFIM: Population Fisher Information Matrix R package version 6.1 (2024). https://CRAN.R-project.org/package=PFIM.
17.↵
Foracchia, M., Hooker, A. C., Vicini, P. & Ruggeri, A. POPED, a software for optimal experiment design in population kinetics. Computer Methods and Programs in Biomedicine 74 (2004).
18.↵
Nyberg, J. et al. PopED: An extended, parallelized, nonlinear mixed effects models optimal design tool. Computer Methods and Programs in Biomedicine 108 (2012).
19.↵
Bauer, R. J., Hooker, A. C. & Mentre, F. Tutorial for $ DESIGN in NONMEM: Clinical trial evaluation and optimization. CPT: Pharmacometrics & Systems Pharmacology 10, 1452–1465 (2021).
OpenUrl PubMed
20.↵
Fayette, L., Brendel, K. & Mentré, F. Using Fisher Information Matrix to predict uncertainty in covariate effects and power to detect their relevance in Non-Linear Mixed Effect Models in pharmacometrics. medRxiv, 2024–10 (2024).
21.↵
Rosenberger, W. F. & Sverdlov, O. Handling Covariates in the Design of Clinical Trials. Statistical Science 23. doi:10.1214/08-STS269 (Aug. 2008).
OpenUrl CrossRef
22.↵
Bogacka, B., Latif, M. A., Gilmour, S. G. & Youdim, K. Optimum designs for non-linear mixed effects models in the presence of covariates. Biometrics 73, 927–937 (2017).
OpenUrl PubMed
23.↵
Golub, G. H. & Welsch, J. H. Calculation of Gauss quadrature rules. Mathematics of Computation 23, 221–230 (1969).
OpenUrl
24.↵
Calamai, P. H. & Moré, J. J. Projected gradient methods for linearly constrained problems. Mathematical Programming 39, 93–116 (1987).
OpenUrl
25.↵
R Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing (Vienna, Austria, 2021). https://www.R-project.org/.
26.↵
Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS). National Health and Nutrition Examination Survey Data. Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention https://wwwn.cdc.gov/nchs/nhanes/default.aspx. 2009-2020.
27.↵
Rosenblatt, M. Remarks on a multivariate transformation. The Annals of Mathematical Statistics 23, 470–472 (1952).
OpenUrl
28.↵
James, F. Monte Carlo theory and practice. Reports on Progress in Physics 43, 1145 (1980).
OpenUrl
29.↵
Fayette, L., Leroux, R., Mentré, F. & Seurat, J. Robust and adaptive two-stage designs in nonlinear mixed effect models. The AAPS Journal 25, 71 (2023).
OpenUrl PubMed
30.↵
Retout, S., Comets, E., Samson, A. & Mentré, F. Design in nonlinear mixed effects models: optimization using the Fedorov–Wynn algorithm and power of the Wald test for binary covariates. Statistics in Medicine 26, 5162–5179 (2007).
OpenUrl CrossRef PubMed
31.↵
Giner, G. & Smyth, G. K. statmod: probability calculations for the inverse Gaussian distribution. R Journal 8, 339–351 (2016).
OpenUrl
32.↵
Karline Soetaert, Karel Van den Meersche & Dick van Oevelen. limSolve: Solving Linear Inverse Models R package 1.5.1 (2009).
33.↵
Robinson, J. D., Lupkiewicz, S. M., Palenik, L., Lopez, L. M. & Ariet, M. Determination of ideal body weight for drug dosage calculations. American Journal of Hospital Pharmacy 40, 1016–1019 (1983).
OpenUrl Abstract
34.↵
Chin, P., Florkowski, C. & Begg, E. The performances of the Cockcroft-Gault, modification of diet in renal disease study and chronic kidney disease epidemiology collaboration equations in predicting gentamicin clearance. Annals of Clinical Biochemistry 50, 546–557 (2013).
OpenUrl CrossRef PubMed
35.↵
Nagler, T. & Vatter, T. rvinecopulib: High performance algorithms for vine copula modeling. R package version 0.5 5. https://CRAN.R-project.org/package=rvinecopulib (2021).
36.↵
Guo, Y., Guo, T., Knibbe, C. A., Zwep, L. B. & van Hasselt, J. Generation of realistic virtual adult populations using a model-based copula approach. Journal of Pharmacokinetics and Pharmacodynamics, 1–12 (2024).
37.↵
Guo, Y., Guo, T., van Hasselt, J. G. C. & Zwep, L. B. pmxcopula - R package for copula-based covariate simulation. PAGE (2024).
38.↵
Malcolm, M. A. & Simpson, R. B. Local versus global strategies for adaptive quadrature. ACM Transactions on Mathematical Software (TOMS) 1, 129–146 (1975).
OpenUrl
39.↵
Riviere, M.-K., Ueckert, S. & Mentré, F. An MCMC method for the evaluation of the Fisher information matrix for non-linear mixed effect models. Biostatistics 17, 737–750 (2016).
OpenUrl CrossRef PubMed
40.↵
Ueckert, S. & Mentré, F. A new method for evaluation of the Fisher information matrix for discrete mixed effect models using Monte Carlo sampling and adaptive Gaussian quadrature. Computational Statistics & Data Analysis 111, 203–219 (2017).
OpenUrl
41.↵
Lestini, G., Dumont, C. & Mentré, F. Influence of the size of cohorts in adaptive design for nonlinear mixed effects models: an evaluation by simulation for a pharmacokinetic and pharmacodynamic model for a biomarker in oncology. Pharmaceutical Research 32, 3159–3169 (2015).
OpenUrl PubMed

View the discussion thread.

Posted February 02, 2025.

Download PDF

Data/Code

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (407)
Allergy and Immunology (718)
Anesthesia (212)
Cardiovascular Medicine (3027)
Dentistry and Oral Medicine (344)
Dermatology (256)
Emergency Medicine (452)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1068)
Epidemiology (12944)
Forensic Medicine (12)
Gastroenterology (844)
Genetic and Genomic Medicine (4744)
Geriatric Medicine (438)
Health Economics (744)
Health Informatics (3011)
Health Policy (1090)
Health Systems and Quality Improvement (1111)
Hematology (405)
HIV/AIDS (950)
Infectious Diseases (except HIV/AIDS) (14253)
Intensive Care and Critical Care Medicine (872)
Medical Education (448)
Medical Ethics (117)
Nephrology (487)
Neurology (4523)
Nursing (241)
Nutrition (669)
Obstetrics and Gynecology (836)
Occupational and Environmental Health (754)
Oncology (2351)
Ophthalmology (664)
Orthopedics (262)
Otolaryngology (332)
Pain Medicine (294)
Palliative Medicine (85)
Pathology (511)
Pediatrics (1224)
Pharmacology and Therapeutics (515)
Primary Care Research (513)
Psychiatry and Clinical Psychology (3886)
Public and Global Health (7114)
Radiology and Imaging (1579)
Rehabilitation Medicine and Physical Therapy (943)
Respiratory Medicine (937)
Rheumatology (455)
Sexual and Reproductive Health (462)
Sports Medicine (395)
Surgery (501)
Toxicology (63)
Transplantation (216)
Urology (187)

[1] 1.↵
Mould, D. R. & Upton, R. Basic concepts in population modeling, simulation, and model-based drug development. CPT: Pharmacometrics & Systems Pharmacology 1, 1–14 (2012).
OpenUrl

[2] 2.↵
Sanghavi, K. et al. Covariate modeling in pharmacometrics: General points for consideration. CPT: Pharmacometrics & Systems Pharmacology 13, 710–728 (2024).
OpenUrl PubMed

[3] 3.↵
FDA Guidance for Industry Population Pharmacokinetics https://www.fda.gov/media/128793/download. 2022.

[4] 4.↵
Atkinson, A., Donev, A. & Tobias, R. Optimum experimental designs, with SAS (OUP Oxford, 2007).

[5] 5.↵
Mentré, F. et al. Current use and developments needed for optimal design in pharmacometrics: a study performed among DDMoRe’s European Federation of Pharmaceutical Industries and Associations Members. CPT: Pharmacometrics & Systems Pharmacology 2, 1–2 (2013).
OpenUrl

[6] 6.↵
Strömberg, E. A., Nyberg, J. & Hooker, A. C. The effect of Fisher information matrix approximation methods in population optimal design calculations. Journal of Pharmacokinetics and Pharmacodynamics 43, 609–619 (2016).
OpenUrl PubMed

[7] 7.↵
Mentre, F., Mallet, A. & Baccar, D. Optimal design in random-effects regression models. Biometrika 84, 429–442 (1997).
OpenUrl CrossRef Web of Science

[8] 8.
Retout, S., Mentré, F. & Bruno, R. Fisher information matrix for non-linear mixed-effects models: evaluation and application for optimal design of enoxaparin population pharmacokinetics. Statistics in Medicine 21, 2623–2639 (2002).
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Bazzoli, C., Retout, S. & Mentré, F. Fisher information matrix for nonlinear mixed effects multiple response models: evaluation of the appropriateness of the first order linearization using a pharmacokinetic/pharmacodynamic model. Statistics in Medicine 28, 1940–1956 (2009).
OpenUrl CrossRef PubMed

[10] 10.↵
Dodds, M. G., Hooker, A. C. & Vicini, P. Robust population pharmacokinetic experiment design. Journal of Pharmacokinetics and Pharmacodynamics 32, 33–64 (2005).
OpenUrl CrossRef PubMed Web of Science

[11] 11.
Foo, L. K., McGree, J., Eccleston, J. & Duffull, S. Comparison of robust criteria for D-optimal designs. Journal of Biopharmaceutical Statistics 22, 1193–1205 (2012).
OpenUrl PubMed

[12] 12.
Loingeville, F., Nguyen, T. T., Riviere, M.-K. & Mentré, F. Robust designs in longitudinal studies accounting for parameter and model uncertainties–application to count data. Journal of Biopharmaceutical Statistics 30, 31–45 (2020).
OpenUrl PubMed

[13] 13.↵
Seurat, J., Nguyen, T. T. & Mentré, F. Robust designs accounting for model uncertainty in longitudinal studies with binary outcomes. Statistical Methods in Medical Research 29, 934–952 (2020).
OpenUrl PubMed

[14] 14.↵
Bazzoli, C., Retout, S. & Mentré, F. Design evaluation and optimisation in multiple response nonlinear mixed effect models: PFIM 3.0. Computer Methods and Programs in Biomedicine 98, 55–65 (2010).
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Leroux, R., Seurat, J., Fayette, L., Bach, N. T. & Mentré, F. Design evaluation and optimisation in nonlinear mixed effects models with the R package PFIM 6.0. PAGE (2023).

[16] 16.↵
Mentré, F., Leroux, R., Seurat, J. & Fayette, L. PFIM: Population Fisher Information Matrix R package version 6.1 (2024). https://CRAN.R-project.org/package=PFIM.

[17] 17.↵
Foracchia, M., Hooker, A. C., Vicini, P. & Ruggeri, A. POPED, a software for optimal experiment design in population kinetics. Computer Methods and Programs in Biomedicine 74 (2004).

[18] 18.↵
Nyberg, J. et al. PopED: An extended, parallelized, nonlinear mixed effects models optimal design tool. Computer Methods and Programs in Biomedicine 108 (2012).

[19] 19.↵
Bauer, R. J., Hooker, A. C. & Mentre, F. Tutorial for $ DESIGN in NONMEM: Clinical trial evaluation and optimization. CPT: Pharmacometrics & Systems Pharmacology 10, 1452–1465 (2021).
OpenUrl PubMed

[20] 20.↵
Fayette, L., Brendel, K. & Mentré, F. Using Fisher Information Matrix to predict uncertainty in covariate effects and power to detect their relevance in Non-Linear Mixed Effect Models in pharmacometrics. medRxiv, 2024–10 (2024).

[21] 21.↵
Rosenberger, W. F. & Sverdlov, O. Handling Covariates in the Design of Clinical Trials. Statistical Science 23. doi:10.1214/08-STS269 (Aug. 2008).
OpenUrl CrossRef

[22] 22.↵
Bogacka, B., Latif, M. A., Gilmour, S. G. & Youdim, K. Optimum designs for non-linear mixed effects models in the presence of covariates. Biometrics 73, 927–937 (2017).
OpenUrl PubMed

[23] 23.↵
Golub, G. H. & Welsch, J. H. Calculation of Gauss quadrature rules. Mathematics of Computation 23, 221–230 (1969).
OpenUrl

[24] 24.↵
Calamai, P. H. & Moré, J. J. Projected gradient methods for linearly constrained problems. Mathematical Programming 39, 93–116 (1987).
OpenUrl

[25] 25.↵
R Core Team. R: A Language and Environment for Statistical Computing R Foundation for Statistical Computing (Vienna, Austria, 2021). https://www.R-project.org/.

[26] 26.↵
Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS). National Health and Nutrition Examination Survey Data. Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention https://wwwn.cdc.gov/nchs/nhanes/default.aspx. 2009-2020.

[27] 27.↵
Rosenblatt, M. Remarks on a multivariate transformation. The Annals of Mathematical Statistics 23, 470–472 (1952).
OpenUrl

[28] 28.↵
James, F. Monte Carlo theory and practice. Reports on Progress in Physics 43, 1145 (1980).
OpenUrl

[29] 29.↵
Fayette, L., Leroux, R., Mentré, F. & Seurat, J. Robust and adaptive two-stage designs in nonlinear mixed effect models. The AAPS Journal 25, 71 (2023).
OpenUrl PubMed

[30] 30.↵
Retout, S., Comets, E., Samson, A. & Mentré, F. Design in nonlinear mixed effects models: optimization using the Fedorov–Wynn algorithm and power of the Wald test for binary covariates. Statistics in Medicine 26, 5162–5179 (2007).
OpenUrl CrossRef PubMed

[31] 31.↵
Giner, G. & Smyth, G. K. statmod: probability calculations for the inverse Gaussian distribution. R Journal 8, 339–351 (2016).
OpenUrl

[32] 32.↵
Karline Soetaert, Karel Van den Meersche & Dick van Oevelen. limSolve: Solving Linear Inverse Models R package 1.5.1 (2009).

[33] 33.↵
Robinson, J. D., Lupkiewicz, S. M., Palenik, L., Lopez, L. M. & Ariet, M. Determination of ideal body weight for drug dosage calculations. American Journal of Hospital Pharmacy 40, 1016–1019 (1983).
OpenUrl Abstract

[34] 34.↵
Chin, P., Florkowski, C. & Begg, E. The performances of the Cockcroft-Gault, modification of diet in renal disease study and chronic kidney disease epidemiology collaboration equations in predicting gentamicin clearance. Annals of Clinical Biochemistry 50, 546–557 (2013).
OpenUrl CrossRef PubMed

[35] 35.↵
Nagler, T. & Vatter, T. rvinecopulib: High performance algorithms for vine copula modeling. R package version 0.5 5. https://CRAN.R-project.org/package=rvinecopulib (2021).

[36] 36.↵
Guo, Y., Guo, T., Knibbe, C. A., Zwep, L. B. & van Hasselt, J. Generation of realistic virtual adult populations using a model-based copula approach. Journal of Pharmacokinetics and Pharmacodynamics, 1–12 (2024).

[37] 37.↵
Guo, Y., Guo, T., van Hasselt, J. G. C. & Zwep, L. B. pmxcopula - R package for copula-based covariate simulation. PAGE (2024).

[38] 38.↵
Malcolm, M. A. & Simpson, R. B. Local versus global strategies for adaptive quadrature. ACM Transactions on Mathematical Software (TOMS) 1, 129–146 (1975).
OpenUrl

[39] 39.↵
Riviere, M.-K., Ueckert, S. & Mentré, F. An MCMC method for the evaluation of the Fisher information matrix for non-linear mixed effect models. Biostatistics 17, 737–750 (2016).
OpenUrl CrossRef PubMed

[40] 40.↵
Ueckert, S. & Mentré, F. A new method for evaluation of the Fisher information matrix for discrete mixed effect models using Monte Carlo sampling and adaptive Gaussian quadrature. Computational Statistics & Data Analysis 111, 203–219 (2017).
OpenUrl

[41] 41.↵
Lestini, G., Dumont, C. & Mentré, F. Influence of the size of cohorts in adaptive design for nonlinear mixed effects models: an evaluation by simulation for a pharmacokinetic and pharmacodynamic model for a biomarker in oncology. Pharmaceutical Research 32, 3159–3169 (2015).
OpenUrl PubMed

Optimising covariate allocation at design stage using Fisher Information Matrix for Non-Linear Mixed Effects Models in pharmacometrics

Abstract

1 Introduction

2 Methods

2.1 Notations on study designs and NLMEM

2.2 Elementary and population FIM

2.2.1 Elementary FIM with discrete covariates only

2.2.2 Elementary FIM with discrete and continuous covariates

2.3 Segmentation of continuous covariates

2.4 FIM integration using copula and Gauss-Legendre Quadrature

2.5 Optimisation of covariate distributions

2.5.1 Optimisation criteria

2.5.2 Optimising covariate proportions using Projected Gradient Descent

Linear constraints

Projected Gradient Descent

2.6 Optimisation of sample size

2.7 Implementation

3 Evaluation example using a simple PK model

3.1 Settings

3.1.1 Model

3.1.2 Covariate distribution specification

3.1.3 Parameter values

3.1.4 Design

3.2 FIM evaluation using copula and quadrature

3.2.1 Copula fitting

3.2.2 Quadrature settings

3.3 Covariate distribution optimisation

3.3.1 SEX only

3.3.2 BMI only

3.3.3 CLCR only

3.3.4 Three covariates

3.4 Optimised distributions evaluation

4 Results

4.1 Copula fitting

4.1.1 BMI only

4.1.2 CLCR only

4.1.3 Three covariates

4.2 FIM evaluation using quadrature

4.2.1 BMI only

4.2.2 CLCR only

4.2.3 Three Covariates

4.3 Covariate distribution optimisation

4.3.1 SEX only

4.3.2 BMI only

4.3.3 CLCR only

4.3.4 Three Covariates

5 Discussion

Data Availability

A Appendix

A.1 Copula diagnostics

A.1.1 BMI only

A.1.2 CLCR only

A.1.3 Three covariates

A.2 FIM integration: MC vs GQ

A.2.1 BMI only

A.2.2. CLCR only

A.2.3 Three Covariates

A.3 Optimisation results: marginal distributions

Footnotes

References

Citation Manager Formats

Subject Area