Identification of probiotic responders in cross-over trials using the Bayesian statistical model considering lags of effect period
==================================================================================================================================

* Shion Hosoda
* Yuichiro Nishimoto
* Yohsuke Yamauchi
* Takuji Yamada
* Michiaki Hamada

## Abstract

Recent advances in microbiome research have led to the further development of microbial interventions, such as probiotics and prebiotics, which are potential treatments for constipation. However, the effects of probiotics vary from person to person; therefore, the effectiveness of probiotics needs to be verified for each individual. Individuals who show significant effects of the target probiotic are called responders. A statistical model for the evaluation of responders was proposed in a previous study. However, that model does not consider the lag in the effect of the probiotic: the period in which a probiotic is effective is expected to lag behind the period in which it is administered. In this study, we propose a Bayesian statistical model that estimates the probability that a subject is a responder while considering the lag of the effect period. In synthetic dataset experiments, the proposed model outperformed the base model, which did not factor in the lag. Further, we found that the proposed model could single out responders whose status is highly uncertain with respect to the lag between the intake period and the effect period.

## 1 Introduction

Recent advances in microbiome research have resulted in the rapid development of microbial interventions, such as probiotics and prebiotics, which are potential treatments for constipation [1]. Probiotics are defined as living microbes that have a beneficial effect on the host when ingested in sufficient quantities and are reported to improve defecation frequencies and treat constipation [1, 2]. The effects of probiotics vary from person to person [3]. Individuals exhibiting significant effects of probiotics are called “responders” [4], and different individuals respond to different probiotics. These individual differences make it difficult to evaluate the effects of probiotics. Therefore, experimental designs used for probiotic research should take into account the individual differences between subjects.

One experimental design that accounts for such differences is the cross-over trial, in which each subject takes both the target probiotic and the placebo. Specifically, a cross-over trial comprises the following steps: (1) Each individual is first administered a capsule containing the target probiotic or placebo for several days. (2) After a washout period, which lasts several weeks and is set to remove the effects of the former capsule, each individual is administered the other capsule (containing either probiotic or placebo) for a specific period of time. Cross-over trials are widely applied in various fields of research, including research related to probiotics [5], prebiotics [6], neurorehabilitation [7], and spinal manipulation [8]. The main advantage of cross-over trials is that they enable the evaluation of individual differences, which are determined by attributes such as gender, genetics, and habits. Accordingly, datasets obtained from a cross-over trial demand a reasonable analysis method that considers individual differences.

An approach to estimating individual differences was taken in a previous study. Nakamura *et al*. evaluated improvements in defecation frequencies using a Weibull regression model [9]. They revealed individual differences in the improvement of defecation frequency by grouping subjects into three groups: strong responders, weak responders, and non-responders. However, their model relied on the unreasonable assumption that the effects of the target probiotic start on the day when the probiotic is first administered to the subject. A previous study suggested that orally ingested material takes one or more days to be excreted [10]. In addition, it has been estimated that it can take more than ten hours for microbes to increase dramatically [11], and at least two days for food to alter the gut microbiome [12]. Therefore, analyses that do not consider the lag between the intake time and the effect time can lead to a misidentification of responders, especially in short-term intervention experiments.

In this study, we propose a Bayesian statistical model for estimating the efficacy of a target probiotic in improving defecation frequency, by considering the lag between the intake and effect time (Fig. 1). The model considers individual differences in a cross-over trial dataset. The proposed model is based on the segmented linear regression model, which represents each periodic term using linear regression, and has discrete parameters for the number of lag days. The proposed model evaluates the cumulative sum of the number of times a subject defecated. An individual can be evaluated based on the posterior probability that the individual is a responder to probiotics. With the proposed model, we estimated whether each subject is a responder using synthetic datasets and the real dataset used in the previous study [9]. We compared the results of the proposed model with those of a base model that did not consider the lag period. In the synthetic dataset experiments, taking the effect time lag into consideration proved useful. In the real data experiments, the proposed model estimated the posterior distribution while considering the effect lag and led to different conclusions from the base model. We found that the proposed model could eliminate uncertain responders, that is, subjects whose apparent response depends on the assumed lag between the intake period and the effect period.

Figure 1: Schematic illustration of the effect lag in cross-over trials. The time when an effect of the probiotic is observed is delayed compared to the time when the probiotic is ingested. The subject is first administered the placebo capsule and then the target probiotic capsule.

## 2 Materials and methods

### 2.1 Overview

Here, we provide an overview of the proposed model. The proposed model requires a dataset of the cumulative sum of the number of defecation events, collected from a cross-over trial. Figure 1 shows the schematic illustration of the respective cross-over trial, whose dataset was used in this study. The terms of the trial are divided into several periodic terms with respect to capsule intake/effect, and are denoted as “segments.” Here, *S* = 5 is used in this figure and in the dataset used in this study, where *S* is the number of segments. The proposed model is a Bayesian statistical model based on segmented linear models for the cumulative sum of the number of defecation events. The proposed model can take the lag of the effect time (*cf*. Section 1) into account using the number of lag days as discrete parameters.

### 2.2 Generative process

Here, we describe the generative process of the proposed model. The model is defined for a single subject and takes as input the cumulative sum of the defecation frequency *y**i* (*i* = 1 … *N*), where *N* is the number of days in the entire trial term. Let *α* ∈ ℝ, *η* ∈ ℝ, and ![Graphic][1] be the logarithmic defecation frequency during normal periods, the effect of the target probiotic, and the effect of capsules, respectively. Here, the effect of the capsule indicates the effect of ingesting the capsule itself, regardless of its content. That is, the effect of the capsule is observed in both the target probiotic and placebo periods. The prior distributions of *α*, *η*, and ![Graphic][2] are as follows:

![Formula][3]

where WIP denotes the weakly informative prior distribution. We use Cauchy(0, 10) as the weakly informative prior in this study, where Cauchy(*x*, *γ*) denotes the Cauchy distribution with the location parameter *x* and scale parameter *γ*.

Let *µ* and *ν* be the lags of the effect start and end, respectively. *µ* and *ν* are shared by the effect of capsules and probiotics. Specifically, the effects of the capsules and probiotics emerge *µ* days after the subject starts taking the capsule and expire *ν* days after the subject stops taking the capsule. Here, we assume that *µ* and *ν* are up to several days long using the following prior distributions:

![Formula][4]

where DiscreteUniform(*a, b*) denotes the discrete uniform distribution with minimum value *a* and maximum value *b*, and *µ*max and *ν*max are the maximum values of *µ* and *ν*, respectively. We used *µ*max = *ν*max = 5 in this paper.

We shifted the segments in which the subject ingested the capsule using *µ* and *ν* as follows:

![Formula][5]

where *d*1, *d*2, *d*3, and *d*4 represent the start day index of the first capsule, the end day index of the first capsule, the start day index of the second capsule, and the end day index of the second capsule, respectively; and ![Graphic][6], and ![Graphic][7] indicate the effect start day index of the first capsule, the effect end day index of the first capsule, the effect start day index of the second capsule, and the effect end day index of the second capsule, respectively. Let *O, P*, and *T* be the sets of the day indices in the normal, placebo, and target probiotic periods, respectively. *O, P*, and *T* are given by

![Formula][8]

![Formula][9]

where *C* is an indicator of the cross-over type, which shows the order of the capsules of the target probiotic and placebo. The subject is first administered the placebo and then the target probiotic when *C* = 0, and first administered the target probiotic and then the placebo when *C* = 1.

Let *β**i* be the rate of increase in the cumulative sum of the defecation frequency on the *i*-th day. *β**i* depends on the period in which the *i*-th day lies, as given below:

![Formula][10]

The intercept of the *i*-th segment *γ**i* is defined as

![Formula][11]

where *G**t* is the set of the day indices in the *t*-th segment. Here, *γ**i* is calculated such that the regression line passes through the observation point ![Graphic][12]. This calculation enables the precise evaluation of the increase in the cumulative sum of the defecation frequencies in each segment. *γ**i* in the first segment is equal to zero, because the cumulative sum of defecation frequencies is 0 before the first day.

The distribution of the *i*-th day cumulative sum of defecation frequencies *y**i* is as follows:

![Formula][13]

where

![Formula][14]

Here, WIP*>ε* denotes the truncated weakly informative prior distribution, whose domain of definition is *x > ε* with a random variable *x*. We used 0.1 as *ε*.
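The way the lags move the period sets and change the daily rate can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' Stan implementation: the name `zeta` is a placeholder for the capsule effect, the intake window is treated as the half-open range from the start index to the end index, the windows are clipped to the trial length, and the rate is taken to be the exponential of the summed log-scale effects.

```python
import math


def period_sets(n_days, d, mu, nu, c):
    """Split day indices 1..n_days into normal (O), placebo (P), and probiotic (T) sets.

    Assumption: a capsule taken on days [d_start, d_end) is effective on
    days [d_start + mu, d_end + nu), clipped to the trial length.
    """
    d1, d2, d3, d4 = d
    last = n_days + 1
    first = set(range(d1 + mu, min(d2 + nu, last)))
    second = set(range(d3 + mu, min(d4 + nu, last)))
    # C = 0: placebo first, probiotic second; C = 1: the reverse order.
    placebo, target = (first, second) if c == 0 else (second, first)
    normal = set(range(1, last)) - placebo - target
    return normal, placebo, target


def daily_rates(n_days, d, mu, nu, c, alpha, eta, zeta):
    """Rate of increase of the cumulative count on each day (beta_i)."""
    _, placebo, target = period_sets(n_days, d, mu, nu, c)
    rates = []
    for day in range(1, n_days + 1):
        log_rate = alpha              # log frequency in normal periods
        if day in placebo or day in target:
            log_rate += zeta          # capsule effect, whatever the content
        if day in target:
            log_rate += eta           # effect of the target probiotic
        rates.append(math.exp(log_rate))
    return rates


normal, placebo, target = period_sets(85, (29, 43, 71, 85), mu=2, nu=1, c=0)
print(min(placebo), max(placebo), min(target), max(target))  # 31 43 73 85
```

With *µ* = 2 and *ν* = 1, both effect windows start two days after intake begins and run one day past the end of intake, except where the trial itself ends.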

### 2.3 Parameter estimation

We estimated the posterior distribution of the parameters of the proposed model using the No-U-Turn Sampler (NUTS) [13], which is a Markov chain Monte Carlo (MCMC) method. Because NUTS can sample only continuous parameters, we estimate the following posterior distribution marginalized with respect to *µ* and *ν*: ![Formula][15] where ·1:*N* denotes the set ![Graphic][16]. We implemented the parameter estimation algorithm using PyStan ([https://github.com/stan-dev/pystan](https://github.com/stan-dev/pystan)). We ran five MCMC chains, drew 1000 samples per chain, and discarded the first half of the samples of each chain as burn-in, because these samples may depend on the initial values. We set the maximum tree depth of the NUTS algorithm to 15 (the `max_treedepth` option in PyStan). The other hyperparameters were left at their default values.
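Marginalizing the discrete lags amounts to summing the likelihood over every (*µ*, *ν*) pair, weighted by the uniform prior, which is usually done on the log scale. The sketch below illustrates the idea in plain Python. The callable `loglik` is a hypothetical stand-in for the log-likelihood of the data given the continuous parameters and one lag pair, and the lag support 0 … max is an assumption based on the base model using *µ*max = *ν*max = 0.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def marginal_log_lik(loglik, mu_max=5, nu_max=5):
    """Marginalize the discrete lags out of the likelihood.

    loglik(mu, nu) is a hypothetical callable returning
    log p(y | continuous parameters, mu, nu).
    Returns the marginal log-likelihood and the conditional
    posterior weight of every (mu, nu) pair.
    """
    # Uniform prior over all (mu_max + 1) * (nu_max + 1) lag pairs.
    log_prior = -math.log((mu_max + 1) * (nu_max + 1))
    terms = {
        (mu, nu): log_prior + loglik(mu, nu)
        for mu in range(mu_max + 1)
        for nu in range(nu_max + 1)
    }
    total = log_sum_exp(list(terms.values()))
    weights = {pair: math.exp(v - total) for pair, v in terms.items()}
    return total, weights


# Toy log-likelihood that strongly favours mu = 2, nu = 1.
total, weights = marginal_log_lik(
    lambda mu, nu: 0.0 if (mu, nu) == (2, 1) else -10.0
)
best = max(weights, key=weights.get)
print(best)  # (2, 1)
```

The normalized weights are what allow a posterior over *µ* and *ν* to be recovered after sampling, even though the sampler itself only sees continuous parameters.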

### 2.4 Evaluation by scoring improvement of defecation frequency

We defined the following defecation frequency improvement (DFI) score DFI(*µ, ν*): ![Formula][17] where *x**i* is the defecation frequency of the *i*-th day, and *T* and *P* are computed by Eq. (1) and Eq. (2), respectively, for the score parameters *µ* and *ν*. The DFI score indicates the log-ratio of the defecation frequency in the target probiotic period to that in the placebo period.
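A small Python sketch may help make the score concrete. It assumes that the "defecation frequency" of a period is the mean daily count over that period, which matches the log-ratio description but is not confirmed by the formula itself. The function name and the error handling for periods without events are illustrative choices.

```python
import math


def dfi_score(x, target_days, placebo_days):
    """Log-ratio of the mean daily defecation count in T to that in P.

    x[i - 1] is the count on day i; target_days and placebo_days are
    the lag-shifted day-index sets T and P.
    """
    target_mean = sum(x[i - 1] for i in target_days) / len(target_days)
    placebo_mean = sum(x[i - 1] for i in placebo_days) / len(placebo_days)
    if target_mean <= 0 or placebo_mean <= 0:
        raise ValueError("score is undefined when a period has no events")
    return math.log(target_mean / placebo_mean)


# Toy subject: placebo on days 1-10, probiotic on days 11-20,
# but the daily count only doubles from day 13 (a two-day lag).
x = [1] * 12 + [2] * 8
placebo = set(range(1, 11))
no_lag = dfi_score(x, set(range(11, 21)), placebo)    # T without lag
with_lag = dfi_score(x, set(range(13, 21)), placebo)  # T shifted by 2 days
print(round(no_lag, 3), round(with_lag, 3))
```

In this toy case the score computed with the shifted window is larger, because the two pre-effect days no longer dilute the probiotic period.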

### 2.5 Synthetic data experiment

We generated synthetic datasets and estimated the parameters using these datasets to evaluate the performance of the proposed model. We generated *α, η*, ![Graphic][18], *µ*, and *ν* by using the following distribution: ![Formula][19] After computing *β**i* using *α, η*, ![Graphic][20], *µ*, and *ν*, the number of days between the *l*-th and (*l* + 1)-th defecation events of a subject, *v**l* ∈ ℝ₊, was obtained as follows: ![Formula][21] where Gamma(*a, b*) denotes the gamma distribution with the shape parameter *a* and scale parameter *b*, and ![Graphic][22] is the variance of the interval. The mean and variance of this gamma distribution were 1/*β**i* and ![Graphic][23], respectively. We generated *v**l* until ![Graphic][24] exceeded the number of days in each segment and obtained *y**i* by transforming the defecation intervals. We randomly generated datasets 1000 times and estimated the posterior distributions of the parameters once on each dataset. We used (*d*1, *d*2, *d*3, *d*4) = (29, 43, 71, 85), (51, 76, 126, 151), (101, 151, 251, 301), *N* = 85, 151, 301, ![Graphic][25], and *C* = 0 in one half of the subjects and *C* = 1 in the other half of the subjects for each dataset.
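The interval-based simulation can be illustrated with Python's standard library. A gamma distribution with mean 1/*β* and variance *σ*² has shape 1/(*β*²*σ*²) and scale *β**σ*², which is the only derivation the sketch relies on. The function names, the single constant-rate segment, and the binning of event times into whole days are simplifying assumptions rather than the authors' procedure.

```python
import random
from itertools import accumulate


def gamma_params(beta, sigma2):
    """Shape and scale of a gamma distribution with mean 1/beta and variance sigma2."""
    mean = 1.0 / beta
    shape = mean * mean / sigma2
    scale = sigma2 / mean
    return shape, scale


def simulate_segment(beta, sigma2, n_days, rng):
    """Daily event counts for one segment with a constant rate beta."""
    shape, scale = gamma_params(beta, sigma2)
    counts = [0] * n_days
    t = rng.gammavariate(shape, scale)       # time of the first event
    while t < n_days:
        counts[int(t)] += 1                  # bin the event into its day
        t += rng.gammavariate(shape, scale)  # wait for the next event
    return counts


rng = random.Random(0)
counts = simulate_segment(beta=1.0, sigma2=0.1 ** 2, n_days=28, rng=rng)
cumulative = list(accumulate(counts))  # the cumulative sum the model is fitted to
print(cumulative[-1])
```

With *β* = 1 and a standard deviation of 0.1 days, the simulated subject has close to one event per day, so the cumulative sum ends near the segment length.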

### 2.6 Real data experiment

We used a real dataset from a previous study [9]. Twenty subjects received *Bifidobacterium longum* capsules in the experiment. Eleven subjects were first administered placebo capsules (*C* = 0) and the remaining subjects were first administered the target probiotic capsules (*C* = 1). (*d*1, *d*2, *d*3, *d*4) = (29, 43, 71, 85) was used in this dataset.

### 2.7 Bayesian beta regression of responder probability on the microbial relative abundances

We conducted regression analysis using the beta regression model [14]. Let *r**i* be the response probability of the *i*-th subject. The Bayesian beta regression model represents *r**i* using the standardized relative abundances of the bacteria in the *i*-th subject shortly before the start of capsule administration, which is denoted by **m***i*. **m***i* is a *D*-dimensional vector, where *D* is the number of different bacteria. The generative process is as follows: ![Formula][26] where *φ* is the precision parameter obtained by reparameterizing the parameters of the beta distribution; *λ* is the regularization parameter; **b** = (*b*1, …, *b**D*)T is the regression parameter vector; logit^−1(·) is the inverse-logit function; and Beta(*a, b*) denotes the beta distribution with the shape parameters *a* and *b*. Because the domain of definition of the beta distribution does not include zero or one, we added 10^−5 to *r**i* when *r**i* was zero and subtracted 10^−5 from *r**i* when *r**i* was one. The same method as in Section 2.3 was used for the parameter estimation.
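The reparameterization can be made concrete with a short Python sketch. It assumes the common mean-precision form of beta regression, in which the mean is logit^−1(**b**·**m***i*) and the shape parameters are mean × *φ* and (1 − mean) × *φ*. Whether the authors' model has exactly this form, and how *λ* enters the prior, cannot be read from the text, so the function names and that form are assumptions.

```python
import math


def inv_logit(z):
    """Numerically stable inverse-logit function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def clamp_open_unit(r, eps=1e-5):
    """Move exact zeros and ones into the open interval (0, 1)."""
    if r <= 0.0:
        return eps
    if r >= 1.0:
        return 1.0 - eps
    return r


def beta_shape_params(coefs, features, phi):
    """Shape parameters (a, b) under the assumed mean-precision form."""
    mean = inv_logit(sum(b * m for b, m in zip(coefs, features)))
    return mean * phi, (1.0 - mean) * phi


def beta_log_density(r, a, b):
    """Log density of Beta(a, b) at r, with 0 < r < 1."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return (a - 1.0) * math.log(r) + (b - 1.0) * math.log(1.0 - r) + log_norm


a, b = beta_shape_params([0.8, -0.3], [1.2, 0.5], phi=10.0)
r = clamp_open_unit(1.0)  # a responder probability of exactly one
print(beta_log_density(r, a, b))
```

The clamp mirrors the adjustment described above: without it, the log density of a subject with *r**i* equal to zero or one would be undefined.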

### 2.8 Microbiome data

The 16S rRNA sequence data were obtained from the DDBJ DRA (DRA006874). QIIME2 (version 2019.10) was used for the 16S rRNA gene analysis [15]. In the analytical pipeline, sequence data were processed using the DADA2 pipeline for quality filtering and denoising (options: --p-trunc-len-f 150 --p-trunc-len-r 190 --p-max-ee-f 3.0 --p-max-ee-r 3.0) [16]. The filtered output sequences were assigned to taxa by using the “qiime feature-classifier classify-sklearn” command with the default parameters. Silva SSU Ref Nr 99 (version 132) was used as the reference database for taxonomy assignment [17]. We used only those taxa that had non-zero abundance in at least 15 subjects for the regression analysis.
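The prevalence filter in the last step is simple enough to show directly. The sketch below assumes the abundances are held in a dictionary that maps each taxon to one relative abundance per subject. The data layout, the function name, and the toy values are illustrative only.

```python
def filter_prevalent_taxa(abundance, min_subjects=15):
    """Keep taxa with non-zero abundance in at least min_subjects subjects.

    abundance maps a taxon name to a list of relative abundances,
    one entry per subject.
    """
    return {
        taxon: values
        for taxon, values in abundance.items()
        if sum(1 for v in values if v > 0) >= min_subjects
    }


# Toy table for 20 subjects (values are made up for illustration).
table = {
    "taxon_A": [0.10] * 15 + [0.0] * 5,   # present in 15 subjects: kept
    "taxon_B": [0.02] * 14 + [0.0] * 6,   # present in 14 subjects: dropped
}
print(sorted(filter_prevalent_taxa(table)))  # ['taxon_A']
```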

## 3 Results

### 3.1 Performance evaluation with synthetic datasets

We evaluated the performance of the proposed model under various conditions using synthetic datasets (*cf*. Section 2.5). To verify the accuracy of the estimation of *η*, we compared the estimated and true values (Supplementary Figure S1). In the case of ![Graphic][27], the proposed model could accurately estimate *η*. Here, ![Graphic][28], that is, a standard deviation *σ*(*v*) of 0.1 days, means that the one-sigma error of the interval between defecation events is within ±2.4 hours (0.1 × 24 hours; *cf*. Section 2.5).

We also verified the accuracy of the estimation of *µ* and *ν*. Figure 2 shows the sum of the probabilities for each true *µ* and *ν* to evaluate the uncertainty of the estimation. The diagonal elements in Fig. 2d–f, which show the results for *ν*, are high in the experiments of ![Graphic][29]. However, as can be observed in Fig. 2a–c, which show the results for *µ*, the proposed model tends to overestimate the *µ* value.

Figure 2: The heat map of the true *µ, ν* and *µ, ν* estimated by the proposed model for each synthetic dataset, where the number of observation points is the same as that of the real dataset. The *x*- and *y*-axes indicate the true *µ, ν* and estimated *µ, ν* values, respectively. Each column represents the sum of the probabilities of each estimate for the subject whose true value is that in the column. a, b, and c indicate the *µ* results when ![Graphic][30], and ![Graphic][31], respectively. d, e, and f indicate the *ν* results when ![Graphic][32], and ![Graphic][33], respectively.

To evaluate the performance improvement gained by considering the lag, we identified responders based on the posterior distributions. Here, we defined responders as the subjects with *η* > 0. Figure 3 shows the ROC curves of the proposed model and the base model. As the base model, we use the proposed model with *µ*max = *ν*max = 0 (*cf*. Section 2.2), which does not consider the lag of the effect period. The proposed model outperformed the base model, which demonstrates the effectiveness of considering the lag of the effect period when a lag exists. Figure 3 also shows the performance for each threshold of the posterior probability of *η* > 0. In the case of ![Graphic][34], identification with a threshold of 0.95 showed a low false positive rate.

Figure 3: The ROC curves for identifying responders based on the estimated posterior distributions of *η* in the synthetic datasets of *N* = 85 and (*d*1, *d*2, *d*3, *d*4) = (29, 43, 71, 85) using the proposed model (*µ*max = *ν*max = 5) and the base model (*µ*max = *ν*max = 0). The *x*- and *y*-axes indicate the false positive and true positive rates, respectively. The blue and orange dashed lines indicate the results for *µ*max = *ν*max = 5 and *µ*max = *ν*max = 0, respectively. The red circle, blue triangle, and green square indicate the performance of responder identification when the posterior probability of *η* > 0 is thresholded at 0.5, 0.7, and 0.95, respectively, with *µ*max = *ν*max = 5.

### 3.2 Responder evaluation using a real dataset

We conducted an experiment using a real dataset (*cf*. Section 2.6). To evaluate the effect of the target probiotic on each subject, we visualized the estimated posterior distribution of *η* (Fig. 4). Subjects MO04, MO05, MO10, and MO16 exhibited high values of *η*, whereas subjects MO06 and MO18 exhibited low values of *η*. The 95% Bayesian credible intervals of subjects MO02, MO04, MO05, MO06, MO08, MO10, MO12, MO13, MO16, MO18, and MO23 do not include zero. The posterior distributions of *µ* and *ν* are shown in Supplementary Figure S2. We can see that the estimated values of *µ* and *ν* vary from person to person.

Figure 4: Estimated posterior distributions of *η* for each subject. The *x*- and *y*-axes indicate the subjects and *η* values, respectively. The bar shows the median of the posterior distribution. The error bars represent the 2.5% and 97.5% percentiles.

We also examined the estimated probability that each subject was a responder (Fig. 5). We counted the number of samples that satisfied *η >* 0 and computed the ratio of the count to the number of all samples as the posterior probability. The probabilities of subjects MO02, MO04, MO05, MO08, MO09, MO10, and MO16 exceed 0.95. In a previous study, subjects MO04, MO05, and MO10 were also reported to be responders, whereas subject MO16 was reported to be a non-responder [9]. Supplementary Figure S3 shows the posterior distributions of *η* estimated by the base model, which did not consider the lag (*µ*max = *ν*max = 0). The Bayesian credible interval of subject MO16 also did not include zero, but the median was estimated to be lower than that of the proposed model. The results for subjects MO22 and MO24 show large differences between the proposed and the base models. The median values of the base model were larger than those of the proposed model, and their identification of responders based on 95% Bayesian credible intervals led to different conclusions (Fig. 5 and Supplementary Figure S4).
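The counting step described above is easy to reproduce from a list of posterior draws of *η*. The following Python sketch computes the posterior probability of *η* > 0, a percentile-based 95% credible interval, and the 0.95 decision rule. The linear interpolation used for the percentiles and the function names are illustrative choices, and the draws are made-up numbers rather than results from the study.

```python
import math


def responder_probability(eta_samples):
    """Fraction of posterior draws with eta > 0."""
    return sum(1 for s in eta_samples if s > 0) / len(eta_samples)


def percentile(sorted_values, q):
    """Percentile with linear interpolation; q lies in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = int(math.floor(pos))
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] * (1.0 - frac) + sorted_values[upper] * frac


def credible_interval(eta_samples, level=0.95):
    """Equal-tailed credible interval from posterior draws."""
    ordered = sorted(eta_samples)
    tail = (1.0 - level) / 2.0
    return percentile(ordered, tail), percentile(ordered, 1.0 - tail)


def is_responder(eta_samples, threshold=0.95):
    """Decision rule: the posterior probability of eta > 0 exceeds the threshold."""
    return responder_probability(eta_samples) > threshold


draws = [0.4, 0.1, 0.3, -0.05, 0.2, 0.5, 0.25, 0.35]  # made-up draws
print(responder_probability(draws))  # 0.875
print(is_responder(draws))           # False
```

In this toy example seven of the eight draws are positive, so the probability is 0.875 and the subject would not be called a responder at the 0.95 threshold.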

Figure 5: Probability that each subject is a responder based on the posterior distribution. The *x*- and *y*-axes indicate the subjects and probabilities of *η* > 0, respectively. The horizontal line indicates a probability of 0.95.

To verify the consistency between the posterior distributions and the dataset, we evaluated the improvement in the defecation frequency using the DFI score (*cf*. Section 2.4). Figure 6 shows the DFI scores of the subjects for *µ* = 0 … 5 and *ν* = 0 … 5. For subject MO24, who was identified as a responder by the base model, the DFI score at *µ* = 0 and *ν* = 0 was 0.18, whereas the DFI scores for *µ* ≠ 0 and *ν* ≠ 0 were less than 0. That is, subject MO24 did not show an improvement in the defecation frequency if a lag of the effect period existed, and the proposed model reflected this characteristic of subject MO24. We also examined the fit of the predictive distribution to the dataset (Fig. 7). We observed that the use of the cumulative sum enabled the consideration of the uncertainty caused by uneven defecation frequencies. For example, the defecation frequencies of MO04 and MO12 were comparable, but the uncertainty was estimated to be larger for MO04 because of the uneven defecation frequencies.

Figure 6: DFI scores (*cf*. Section 2.4) of all subjects. The title of each panel indicates the subject and the result of the responder identification by the proposed and base models. The left and right circles indicate the proposed and base model results, respectively. The filled circle indicates that the subject is identified as a responder, and the open circle indicates that the subject is not. The *x*- and *y*-axes indicate *ν* and *µ* of the score parameters, respectively. Each value indicates the score. A darker color indicates a higher score.

Figure 7: Data used and predictive distributions. The *x*- and *y*-axes indicate the day and cumulative sum of the defecation frequencies, respectively. The blue line and the blue area indicate real data and the 2.5% and 97.5% percentiles of predictive distributions, respectively. The red and black areas indicate the intake periods of the target probiotic and placebo, respectively.

To investigate the relationship between the response to probiotics and gut microbiota, we performed Bayesian beta regression of the responder probability on the microbial abundance features before the target probiotic periods. Figure 8 shows the posterior distribution of the regression parameters. As in the previous study, the negative effect of *Agathobacter* was estimated. The 95% Bayesian credible intervals for all regression parameters included zero.

Figure 8: Posterior distributions of the coefficients of Bayesian beta regression. The *x*- and *y*-axes indicate bacteria and regression coefficients, respectively. The bar shows the median of the posterior distribution. The error bars represent the 2.5% and 97.5% percentiles.

## 4 Discussion

In this study, we proposed a model that estimates the improvement in defecation frequencies from cross-over trial datasets while considering the lag of the effect period. Using synthetic datasets, we verified that the proposed model could identify responders better than the base model. In the real dataset experiments, we identified seven responders based on the probability of *η* > 0. The base model, which assumed that the lag of the effect period did not exist, identified subjects MO22 and MO24 as responders, in addition to the responders identified by the proposed model. Subjects MO22 and MO24 did not have high DFI scores when *µ* ≠ 0 and *ν* ≠ 0 (Fig. 6), and the proposed model reflected these observations. This suggests that the proposed model can eliminate uncertain responders, that is, subjects whose apparent response depends on the assumed lag between the intake period and the effect period. In the regression analysis of the responder probabilities on the microbial relative abundances before target probiotic intake, we found that *Agathobacter* had a negative effect. These results are consistent with those of the previous study. However, we could not conclude any microbial effects based on the 95% Bayesian credible intervals.

We used the same lag of the start/end day for the placebo and the target probiotic periods. However, these two types of lags may differ because of their sources. That is, while the lag in the target probiotic period is likely to be caused by physical factors (digestion and changes in physical conditions), the lag in the placebo period is likely to be caused by cognitive factors [18]. Therefore, introducing different lag parameters for the placebo and target probiotic periods may enable a better estimation. However, adding these parameters can render the estimation computationally expensive.

We used uniform distributions with fixed hyperparameters as prior distributions of *µ* and *ν*. There are several other options for the prior distribution. Setting prior distributions based on the literature could enable a more accurate estimation of the parameters. In addition, a prior with covariance between *µ* and *ν* could reflect the consistency of *µ* and *ν* within a subject, if such a covariance can be assumed.

*µ* and *ν* play key roles in the proposed model. In the synthetic data experiments (Section 3.1), the estimation performance of *µ* and *ν* was not very accurate. However, the proposed model is effective for estimating responders, as seen in the synthetic data experiments (Fig. 3). This is because considering all cases of (*µ, ν*) contributes to the detection of responders when there is lag in the effect period. However, *µ* and *ν* are not always useful for all datasets. *µ* and *ν* are not necessary for the long-term datasets because the number of lag days is small relative to the number of days in the trial. Indeed, the difference between the base model and the proposed model is smaller for synthetic long-term datasets containing observations made under similar conditions (Supplementary Figure S5 and Supplementary Figure S6). Nevertheless, lag consideration is still important because, in most cases, the experiments will be conducted in a short period of time due to cost considerations.

There is a limitation to determining responders based only on defecation frequencies, which is suggested to be unreliable by the U.S. Department of Health and Human Services Food and Drug Administration [19]. According to them, the identification of responders needs to be evaluated based not only on the defecation frequency but also on abdominal pain intensity. That is, deterministic estimation of responders may lead to wrong conclusions. We believe that the responder estimation based on the posterior distribution of *η* conducted in this study enables us to consider the uncertainty of the estimation and contribute to solving this problem.

## Data Availability

https://www.medrxiv.org/content/10.1101/2020.03.23.20041400v2.full-text


## Author contributions

**Shion Hosoda:** Conceptualization, Methodology, Software, Investigation, Validation, Visualization, Writing - Original Draft. **Yuichiro Nishimoto:** Software, Investigation, Writing - Review & Draft. **Yohsuke Yamauchi:** Software, Investigation, Writing - Review & Draft. **Takuji Yamada:** Investigation, Writing - Review & Draft. **Michiaki Hamada:** Investigation, Supervision, Writing - Review & Draft.

## Data availability

Stan and Python source codes are available at [https://github.com/shion-h/LagBasedResponderIdentifier](https://github.com/shion-h/LagBasedResponderIdentifier).

## Funding

This work was supported by JSPS/MEXT KAKENHI (Grant Number JP19J20117 to SH, JP20H00624, JP19H01152, and JP18KT0016 to MH).

## Supplementary data

Supplementary data, including Figures S1, S2, S3, S4, and S5, are available.

## Declarations

The randomized controlled trial whose dataset was used in this study was conducted with the approval of the clinical trial ethics review committee of Chiyoda Paramedical Care Clinic.

## Acknowledgements

Computations were partially performed with the NIG supercomputer at ROIS National Institute of Genetics and the supercomputer at Human Genome Center, the Institute of Medical Science, the University of Tokyo.

*   Received March 14, 2022.
*   Revision received March 14, 2022.
*   Accepted March 15, 2022.


*   © 2022, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/)

## References

1.  [1]. Anna Chmielewska and  Hania Szajewska. Systematic review of randomised controlled trials: Probiotics for functional constipation. World Journal of Gastroenterology : WJG, 16(1):69–75, January 2010.
    
    
2.  [2]. Katarzyna Wojtyniak and  Hania Szajewska. Systematic review: Probiotics for functional constipation in children. European Journal of Pediatrics, 176(9):1155–1162, September 2017.
    
    
3.  Eirini Dimidi, Stephanos Christodoulides, Konstantinos C. Fragkos, S. Mark Scott, and Kevin Whelan. The effect of probiotics on functional constipation in adults: A systematic review and meta-analysis of randomized controlled trials. The American Journal of Clinical Nutrition, 100(4):1075–1084, October 2014.

4.  Petia Kovatcheva-Datchary, Anne Nilsson, Rozita Akrami, Ying Shiuan Lee, Filipe De Vadder, Tulika Arora, Anna Hallen, Eric Martens, Inger Björck, and Fredrik Bäckhed. Dietary Fiber-Induced Improvement in Glucose Metabolism Is Associated with Increased Abundance of Prevotella. Cell Metabolism, 22(6):971–982, December 2015.

5.  Tal Korem, David Zeevi, Niv Zmora, Omer Weissbrod, Noam Bar, Maya Lotan-Pompan, Tali Avnit-Sagi, Noa Kosower, Gal Malka, Michal Rein, Jotham Suez, Ben Z. Goldberg, Adina Weinberger, Avraham A. Levy, Eran Elinav, and Eran Segal. Bread Affects Clinical Parameters and Induces Gut Microbiome-Associated Personal Glycemic Responses. Cell Metabolism, 25(6):1243–1253.e5, June 2017.

6.  Nicole J. Kellow, Melinda T. Coughlan, and Christopher M. Reid. Metabolic benefits of dietary prebiotics in human subjects: A systematic review of randomised controlled trials. British Journal of Nutrition, 111(7):1147–1161, April 2014.

7.  Farnaz Abdollahi, Emily D. Case Lazarro, Molly Listenberger, Robert V. Kenyon, Mark Kovic, Ross A. Bogey, Donald Hedeker, Borko D. Jovanovic, and James L. Patton. Error Augmentation Enhancing Arm Recovery in Individuals With Chronic Stroke: A Randomized Crossover Design. Neurorehabilitation and Neural Repair, 28(2):120–128, February 2014.

8.  Brian Budgell and Barbara Polus. The Effects of Thoracic Manipulation on Heart Rate Variability: A Controlled Crossover Trial. Journal of Manipulative and Physiological Therapeutics, 29(8):603–610, October 2006.

9.  Y. Nakamura, S. Suzuki, S. Murakami, K. Higashi, N. Watarai, Y. Nishimoto, J. Umetsu, C. Ishii, Y. Ito, Y. Mori, M. Kohno, T. Yamada, and S. Fukuda. Metabologenomics identified fecal biomarkers for bowel movement regulation by Bifidobacterium longum capsules: An RCT, June 2020.

10. Francesco Asnicar, Emily R. Leeming, Eirini Dimidi, Mohsen Mazidi, Paul W. Franks, Haya Al Khatib, Ana M. Valdes, Richard Davies, Elco Bakker, Lucy Francis, Andrew Chan, Rachel Gibson, George Hadjigeorgiou, Jonathan Wolf, Timothy D. Spector, Nicola Segata, and Sarah E. Berry. Blue poo: Impact of gut transit time on the gut microbiome using a novel marker. Gut, 70(9):1665–1674, September 2021.

11. Niall G. Vine, Winston D. Leukes, and Horst Kaiser. In vitro growth characteristics of five candidate aquaculture probiotics and two fish pathogens grown in fish intestinal mucus. FEMS Microbiology Letters, 231(1):145–152, February 2004.

12. Abigail J. Johnson, Pajau Vangay, Gabriel A. Al-Ghalith, Benjamin M. Hillmann, Tonya L. Ward, Robin R. Shields-Cutler, Austin D. Kim, Anna Konstantinovna Shmagel, Arzang N. Syed, Jens Walter, Ravi Menon, Katie Koecher, and Dan Knights. Daily Sampling Reveals Personalized Diet-Microbiome Associations in Humans. Cell Host & Microbe, 25(6):789–802.e5, June 2019.

13. Matthew D. Hoffman and Andrew Gelman. The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15(47):1593–1623, 2014.

14. Silvia Ferrari and Francisco Cribari-Neto. Beta Regression for Modelling Rates and Proportions. Journal of Applied Statistics, 31(7):799–815, August 2004.

15. Evan Bolyen, Jai Ram Rideout, Matthew R. Dillon, Nicholas A. Bokulich, Christian C. Abnet, Gabriel A. Al-Ghalith, Harriet Alexander, Eric J. Alm, Manimozhiyan Arumugam, Francesco Asnicar, Yang Bai, Jordan E. Bisanz, Kyle Bittinger, Asker Brejnrod, Colin J. Brislawn, C. Titus Brown, Benjamin J. Callahan, Andrés Mauricio Caraballo-Rodríguez, John Chase, Emily K. Cope, Ricardo Da Silva, Christian Diener, Pieter C. Dorrestein, Gavin M. Douglas, Daniel M. Durall, Claire Duvallet, Christian F. Edwardson, Madeleine Ernst, Mehrbod Estaki, Jennifer Fouquier, Julia M. Gauglitz, Sean M. Gibbons, Deanna L. Gibson, Antonio Gonzalez, Kestrel Gorlick, Jiarong Guo, Benjamin Hillmann, Susan Holmes, Hannes Holste, Curtis Huttenhower, Gavin A. Huttley, Stefan Janssen, Alan K. Jarmusch, Lingjing Jiang, Benjamin D. Kaehler, Kyo Bin Kang, Christopher R. Keefe, Paul Keim, Scott T. Kelley, Dan Knights, Irina Koester, Tomasz Kosciolek, Jorden Kreps, Morgan G. I. Langille, Joslynn Lee, Ruth Ley, Yong-Xin Liu, Erikka Loftfield, Catherine Lozupone, Massoud Maher, Clarisse Marotz, Bryan D. Martin, Daniel McDonald, Lauren J. McIver, Alexey V. Melnik, Jessica L. Metcalf, Sydney C. Morgan, Jamie T. Morton, Ahmad Turan Naimey, Jose A. Navas-Molina, Louis Felix Nothias, Stephanie B. Orchanian, Talima Pearson, Samuel L. Peoples, Daniel Petras, Mary Lai Preuss, Elmar Pruesse, Lasse Buur Rasmussen, Adam Rivers, Michael S. Robeson, Patrick Rosenthal, Nicola Segata, Michael Shaffer, Arron Shiffer, Rashmi Sinha, Se Jin Song, John R. Spear, Austin D. Swafford, Luke R. Thompson, Pedro J. Torres, Pauline Trinh, Anupriya Tripathi, Peter J. Turnbaugh, Sabah Ul-Hasan, Justin J. J. van der Hooft, Fernando Vargas, Yoshiki Vázquez-Baeza, Emily Vogtmann, Max von Hippel, William Walters, Yunhu Wan, Mingxun Wang, Jonathan Warren, Kyle C. Weber, Charles H. D. Williamson, Amy D. Willis, Zhenjiang Zech Xu, Jesse R. Zaneveld, Yilong Zhang, Qiyun Zhu, Rob Knight, and J. Gregory Caporaso. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nature Biotechnology, 37(8):852–857, August 2019.

16. Benjamin J. Callahan, Paul J. McMurdie, Michael J. Rosen, Andrew W. Han, Amy Jo A. Johnson, and Susan P. Holmes. DADA2: High-resolution sample inference from Illumina amplicon data. Nature Methods, 13(7):581–583, July 2016.

17. Christian Quast, Elmar Pruesse, Pelin Yilmaz, Jan Gerken, Timmy Schweer, Pablo Yarza, Jörg Peplies, and Frank Oliver Glöckner. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Research, 41(D1):D590–D596, January 2013.

18. Donald D. Price, Damien G. Finniss, and Fabrizio Benedetti. A Comprehensive Review of the Placebo Effect: Recent Advances and Current Thought. Annual Review of Psychology, 59(1):565–590, 2008.

19. US Food and Drug Administration, et al. Guidance for industry: Irritable bowel syndrome – clinical evaluation of products for treatment, 2010.

 [1]: /embed/inline-graphic-1.gif
 [2]: /embed/inline-graphic-2.gif
 [3]: /embed/graphic-2.gif
 [4]: /embed/graphic-3.gif
 [5]: /embed/graphic-4.gif
 [6]: /embed/inline-graphic-3.gif
 [7]: /embed/inline-graphic-4.gif
 [8]: /embed/graphic-5.gif
 [9]: /embed/graphic-6.gif
 [10]: /embed/graphic-7.gif
 [11]: /embed/graphic-8.gif
 [12]: /embed/inline-graphic-5.gif
 [13]: /embed/graphic-9.gif
 [14]: /embed/graphic-10.gif
 [15]: /embed/graphic-11.gif
 [16]: /embed/inline-graphic-6.gif
 [17]: /embed/graphic-12.gif
 [18]: /embed/inline-graphic-7.gif
 [19]: /embed/graphic-13.gif
 [20]: /embed/inline-graphic-8.gif
 [21]: /embed/graphic-14.gif
 [22]: /embed/inline-graphic-9.gif
 [23]: /embed/inline-graphic-10.gif
 [24]: /embed/inline-graphic-11.gif
 [25]: /embed/inline-graphic-12.gif
 [26]: /embed/graphic-15.gif
 [27]: /embed/inline-graphic-13.gif
 [28]: /embed/inline-graphic-14.gif
 [29]: /embed/inline-graphic-15.gif
 [30]: F2/embed/inline-graphic-16.gif
 [31]: F2/embed/inline-graphic-17.gif
 [32]: F2/embed/inline-graphic-18.gif
 [33]: F2/embed/inline-graphic-19.gif
 [34]: /embed/inline-graphic-20.gif