Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data

Liu, Wenting; Li, Huiqiong; Tang, Anmin; Cui, Zixin

doi:10.3390/math11163469

Open AccessArticle

Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data

Yunnan Key Laboratory of Statistical Modeling and Data Analysis, Yunnan University, Kunming 650091, China

^*

Author to whom correspondence should be addressed.

Mathematics 2023, 11(16), 3469; https://doi.org/10.3390/math11163469

Submission received: 5 July 2023 / Revised: 5 August 2023 / Accepted: 8 August 2023 / Published: 10 August 2023

(This article belongs to the Special Issue Bayesian Statistical Analysis of Big Data and Complex Data)

Download

Browse Figures

Versions Notes

Abstract

:

This paper focuses on a joint model to analyze longitudinal proportional and survival data. We utilize a logit transformation on the longitudinal proportional data and employ a partially linear mixed-effect model. With this model, we estimate the unknown function of time using the B-splines technique. Additionally, we introduce a centered Dirichlet process mixture model (CDPMM) to capture the random effects, allowing for a flexible distribution. The survival data are assumed using a Cox proportional hazard model, and the sharing random effects joint model is developed for the two types of data. We develop a Bayesian Lasso (BLasso) approach that combines the Gibbs sampler and the Metropolis–Hastings algorithm. The proposed method allows for the estimation of unknown parameters and the selection of significant covariates simultaneously. We evaluate the performance of our proposed methods through simulation studies and also provide an illustration of our methodologies using an example from the MA.5 research experiment.

Keywords:

longitudinal proportional data; survival data; joint model; Bayesian variable selection; B-splines; CDPMM method

MSC:

62N02

1. Introduction

The joint analysis of longitudinal and survival data has gained widespread application in clinical studies on cancer and HIV/AIDS, where the primary endpoints typically involve time-to-event outcomes such as disease-free and overall survival. Notably, following the seminal work by Faucett and Thomas [1] and Wulfsohn and Tsiatis [2], the standard joint model has been extensively investigated. Researchers have extensively discussed the advantages of joint models [3,4,5,6,7,8]. However, certain patients with a compromised quality of life (QOL) may opt to discontinue their participation in the clinical trials due to disease recurrence, or they may experience mortality. In this case, the absence of QOL measures resulting from the withdrawal of patients provides informative insights into the trade-off between intensive treatment and poor QOL. To establish strong evidence, we conducted joint modeling of longitudinal life measures and survival data to investigate their relationship. For the longitudinal quality of life and survival data, Henderson et al. [9] and Zeng and Cai [10] considered the use of shared-normal-distribution random effects to jointly analyze the relationship between longitudinal QOL and survival time. Tang et al. [11] considered a novel semiparametric joint model for multivariate longitudinal and survival data to analyze data from the International Breast Cancer Study. Longitudinal quality-of-life measurement data can be linearly converted into longitudinal proportional data whose value range is in the unit interval (0, 1) [12]. Song and Tan [12] emphasized that disregarding the constraint of having values between 0 and 1 could lead to erroneous interpretations. For the longitudinal component, there are two methods to deal with it. The first method applied the classic linear mixed model to the longitudinal proportional data after logit transformation [13], and the second method directly used the simplex distribution to model the longitudinal proportional data [14,15]. The models established using the two methods both used the EM algorithm and the Laplace approximation to estimate the unknown parameters. In order to be more flexible and practical, this paper will use a partial linear mixed-effect model for the logit transformed longitudinal proportional data and use the B-splines method to model the unknown function in the model. Meanwhile, to enhance the feasibility of our proposed model, we use the CDPMM method to model random effects.

In addition, variable selection in the joint model is also considered. In traditional regression models, variable selection methods include forward selection, backward elimination, stepwise selection, and the use of information criteria such as the Akaike information criterion (AIC). However, these approaches can be computationally expensive and unstable when dealing with complex models that have a large number of covariates. To address this issue, penalized likelihood methods have been proposed, with one popular method being the Lasso of Tibshirani [16]. The Lasso estimates linear regression coefficients by applying a constraint on the

L_{1}

norm of the least squares. Tibshirani [16] proposed that Lasso estimates can be interpreted as posterior norm estimates when the regression parameters have independent and identically Laplacian priors. Park and Casella [17] extended this idea under the Bayesian framework and introduced the Bayesian Lasso (BLasso) variable selection method. They used a double exponential prior for the regression coefficients and a gamma distribution for the shrinkage parameter. The BLasso method has been successfully applied to various models, including linear regression [18], semiparametric structural equation models [19], and joint models of longitudinal and survival data [11]. Building on this work, our paper extends the BLasso variable selection method to the joint model of longitudinal proportional data and survival data. We propose an approach called BLasso, which aims to estimate unknown parameters while also identifying the significant effects of crucial covariates.

The rest of this paper is organized as follows. In Section 2, the joint model of longitudinal proportional and survival data is introduced. In Section 3, the Bayesian estimations of the joint model are proposed. In Section 4, three numerical simulations are presented to evaluate the performance of the proposed methods. In Section 5, we utilize the proposed approach to analyze the MA.5 research experiment’s data. We then provide some concluding remarks in Section 6. For more technical information, please refer to Appendix A.

2. Model and Notation

Consider a dataset consisting of n individuals. Let

y_{i j}

be a longitudinal proportional measurement for the i-th individual (

i = 1, 2, \dots, n

) at observation time point

t_{i j}

for

j = 1, 2, \dots, n_{i}

, and

y_{i j} \in (0, 1)

, where

n_{i}

represents the number of observations of individual i. We assume that

y_{i j}^{*}

is the logit transformation of

y_{i j}

and

y_{i j}^{*} = logit (y_{i j}) = \log (\frac{1 - y_{i j}}{y_{i j}})

. Furthermore,

T_{i}^{*}

and

C_{i}

are the true survival time and censoring time, respectively. Additionally, we have the true survival time

T_{i}^{*}

and the censoring time

C_{i}

for each individual i. Let

T_{i} = min (T_{i}^{*}, C_{i})

denote the corresponding observed event time. Let

δ_{i} = 1 (T_{i}^{*} \leq C_{i})

denote the failure indicator, where 1

(\cdot)

is an indicator function.

We denote

y^{*} = \{y_{1}^{*}, y_{2}^{*}, \dots, y_{n}^{*}\}

, where

y_{i}^{*} = \{y_{i 1}^{*}, y_{i 2}^{*}, \dots, y_{i n_{i}}^{*}\}

. Let

T = \{T_{1}, T_{2}, \dots, T_{n}\}

and

Δ = \{δ_{1}, δ_{2}, \dots, δ_{n}\}

. The random effects

b = {b_{1}, b_{2}, \dots, b_{n}}

are time-independent and underlie both the longitudinal and survival processes for the i-th individual. Given the random effects

b_{i}

, we assume that

y_{i j}^{*}

follows a partially linear mixed-effect model.

\begin{matrix} y_{i j}^{*} | b_{i} = X_{i j}^{⊤} β + g (t_{i j}) + Z_{i j}^{⊤} b_{i} + ε_{i j}, \end{matrix}

(1)

where

X_{i j}

and

Z_{i j}

represent the time-independent design vectors of fixed and random effects associated with

y_{i j}^{*}

, respectively;

β

is a

p_{1} \times 1

vector of fixed effects’ regression parameters;

b_{i}

is a

q \times 1

random effects vector;

g (t)

is a twice-continuous differentiable unknown function; and

ε_{i j}

is a white noise process with variance

σ^{2}

. Additionally, we assume that

ε_{i j}

’s are independent of

b_{i}

. To facilitate the feasibility of our proposed model, instead of the traditional normality assumption, which may be violated in some applications [20], we specify the random effects using a Dirichlet process (DP) mixture of normals.

For event time

T_{i}

, given random effects

b_{i}

, we assume that

T_{i}

follows the hazard model:

\begin{matrix} λ_{i} (t | b_{i}) = λ_{0} (t) \exp (W_{i}^{⊤} γ + ϕ^{⊤} b_{i}), \end{matrix}

(2)

where the known fixed effects’ design matrix

W_{i}

connects the unknown

p_{2} \times 1

parameter vector

γ

to

λ_{i} (t | b_{i})

. Additionally, the unknown

q \times 1

parameter vector

ϕ

links

b_{i}

to

λ_{i} (t | b_{i})

. Lastly, the basic hazard function

λ_{0} (t)

remains unknown.

From the above discussion, it is suggested to link models (1) and (2) through shared random effects, called a shared random effects joint model (JMSRE). The parameter

ϕ

in model JMSRE reflects the correlation between transformed longitudinal proportional data and survival data, given random effects. When

ϕ = 0_{q \times 1}

, it means that the longitudinal index is not necessarily related to the event time; i.e., longitudinal proportional data and survival data can be modeled separately. So in this case, joint modeling is not necessary, and longitudinal indicators can be ignored for modeling survival data.

Further, to make Bayesian inference on

β

based on model (1), we approximate

g (t)

through a B-splines method:

\begin{matrix} g (t) \approx B_{1} (t) φ_{1} + B_{2} (t) φ_{2} + \dots + B_{L} (t) φ_{L} = B^{⊤} (t) φ, \end{matrix}

where

L = d + K + 1

, d is the degree of B-splines, K is the number of knots,

φ = {(φ_{1}, φ_{2}, \dots, φ_{L})}^{⊤}

is an

L \times 1

unknown coefficient vector, and

B (t) = ((B_{1} (t), B_{2} (t),

\dots, B_{L} (t) {))}^{⊤}

.

We denote

θ_{y} = \{β, φ, σ^{2}\}

as the unknown parameters associated with model (1) and

θ_{T} = \{γ, ϕ\}

as the unknown parameters associated with model (2). Thus, given

(θ_{y}, θ_{T}, b)

, the joint likelihood function of

(y^{*}, T, Δ)

can be written as

p (y^{*}, T, Δ | θ_{y}, θ_{T}, b) = \prod_{i = 1}^{n} p (y_{i}^{*} | b_{i}; θ_{y}) p (T, Δ | b_{i}; θ_{T}),

(3)

where

p (y_{i}^{*} | b_{i}; θ_{y}) = \prod_{j = 1}^{n_{i}} \frac{1}{\sqrt{2 π σ^{2}}} \exp \{- \frac{{(y_{i j}^{*} - X_{i j}^{⊤} β - B^{⊤} (t_{i j}) φ - Z_{i j}^{⊤} b_{i})}^{2}}{2 σ^{2}}\},

p (T, Δ | b_{i}; θ_{T}) = \prod_{i = 1}^{n} {[\frac{\exp (W_{i}^{⊤} γ + ϕ^{⊤} b_{i})}{\sum_{j \in R_{i}} \exp (W_{j}^{⊤} γ + ϕ^{⊤} b_{j})}]}^{δ_{i}}, R_{i} = \{j : T_{j} \geq T_{i}\} .

3. Bayesian Estimation of Joint Model

3.1. Prior Specification

In order to develop Bayesian inference on the considered models, it is necessary to specify the prior distributions for

σ^{2}, φ, β, γ and ϕ

. For conjugation, we consider the following priors for

σ^{2}, φ

:

\frac{1}{σ^{2}} \sim Γ (a_{0}, b_{0}), φ \sim N_{L} (0, H_{φ}^{0}),

(4)

where

a_{0}

,

b_{0}

, and

H_{φ}^{0}

are pre-given hyperparameters.

Γ (a_{0}, b_{0})

denotes the Gamma distribution with parameter

a_{0}

and the shape parameter

b_{0}

. We can set the prior distribution to a non-informative prior distribution, which just needs to have a large variance. Thus, we consider

a_{0} = 1

,

b_{0} = 1

and

H_{φ}^{0} = 100 I_{4}

in the paper.

As stated by Tang et al. [21], the random effects

b_{i}

can be modeled using a Dirichlet process (DP) mixture of normals. Specifically, we assume that

b_{i}

values are independently and identically distributed according to a mixture distribution, where the mixture components are drawn from a DP with a base distribution

P

that has unknown parameters

(μ_{g}, Ω_{g})

. To address the challenges of performing Bayesian estimation on the regression parameters

β

and the dispersion parameter

σ^{2}

in the model (1), one common approach is to use a Dirichlet process (DP) prior to approximate the unknown form of

P

. The DP prior is specified as

P \sim DP (τ F_{0})

, where

F_{0}

is a base distribution that serves as a starting point for constructing the nonparametric distribution, and

τ

is a weight that represents the researcher’s certainty of

F_{0}

being the distribution of

P

. Sethuraman [22] demonstrated that the DP prior

DP (τ F_{0})

can be represented using a stick-breaking prior. However, this representation has some limitations. It leads to a non-zero mean of random effects [23], which may not be desirable in certain cases. Additionally, it results in a discrete probability distribution for random effects [20], which may not accurately capture the underlying continuous distribution. The discrete Dirichlet processes proposed by Ishwaran and Zarepour [24] and Yang et al. [23] are commonly known as discrete Dirichlet processes. However, these methods may not be suitable for continuous underlying densities of random effects. Furthermore, violating the assumption of a zero mean on the random effects can lead to non-identifiability in the random effects model. Additionally, the computational complexity of the discrete DP methods with a stick-breaking prior for random effects can be high for complex models. To tackle the mentioned challenges, Tang et al. [21] proposed a truncated-approximate centered Dirichlet process mixture model (CDPMM).

In order to address these challenges, we also adopt the CDPMM approach [21] in the model (1). This method allows us to specify the prior distribution of

b_{i}

as follows:

b_{i} \overset{i . i . d .}{\sim} \sum_{g = 1}^{G} π_{g} N_{q} (μ_{g}, Ω_{g}) with μ_{g} = μ_{g}^{*} - \sum_{g = 1}^{G} π_{g} μ_{g}^{*} and (μ_{g}^{*}, Ω_{g}) \overset{i . i . d .}{\sim} F_{0},

where

1 \leq G < \infty

, and

π_{g}

is a random probability weight chosen to be independent of

(μ_{g}^{*}, Ω_{g})

such that

0 \leq π_{g} \leq 1

and

\sum_{g = 1}^{\infty} π_{g} = 1

. To ease the computational intensity, we consider

G = 25

, and

π_{g}

is given by the following stick-breaking procedure, as proposed by Ishwaran and Zarepour (2000) [24]:

π_{1} = ϑ_{1} and π_{g} = ϑ_{g} \prod_{ι = 1}^{g - 1} (1 - ϑ_{ι}) for g = 2, \dots, G,

(5)

Let

ϑ_{g}

be independent and identically distributed (i.i.d.) random variables following a Beta distribution with parameters

(1, τ)

for

g = 1, 2, \dots, G - 1

, and let

ϑ_{G} = 1

. This implies that the sum of all

π_{g}

values is equal to 1. The prior distribution for the unknown parameter

τ

is a Gamma distribution with hyperparameters

a_{1}

and

a_{2}

[25]. Here, we take the hyperparameters

a_{1}

and

a_{2}

to be 25 and 5, respectively.

An efficient and flexible method for solving the DP prior specified above is to represent

b_{i}

in terms of a latent variable

L_{i} \in {1, 2, \dots, G}

, which records each

b_{i}

’s cluster membership and conveys its parametric value to the distribution of

b_{i}

. Let

L = {L_{1}, L_{2}, \dots, L_{n}}

,

π = {π_{1}, π_{2}, \dots, π_{G}}

,

μ^{*} = {μ_{1}^{*}, μ_{2}^{*}, \dots, μ_{G}^{*}}

and

Ω = {Ω_{1}, Ω_{2}, \dots, Ω_{G}}

, where

Ω_{g} = diag (ω_{g 1}, ω_{g 2}, \dots, ω_{g q})

. These variables can be reformulated as follows:

L_{i} | π \overset{i . i . d}{\sim} \sum_{g = 1}^{G} π_{g} δ_{g} (\cdot) and (π, μ^{*}, Ω) \sim f_{1} (π) f_{2} (μ^{*}) f_{3} (Ω),

where

δ_{g} (\cdot)

denotes a discrete probability measure concentrated at g,

f_{1} (π)

is specified by the stick-breaking prior as given in Equation (5),

f_{2} (μ^{*}) = \prod_{g = 1}^{G} f_{2} (μ_{g}^{*})

, and

f_{3} (Ω) = \prod_{g = 1}^{G} \prod_{j = 1}^{q} f_{3} (ω_{g_{j}})

, where

f_{1} (π)

,

f_{2} (μ_{g}^{*})

, and

f_{3} (ω_{g_{j}})

, respectively, represent the probability density functions of the random variables

π

,

μ_{g}^{*}

, and

ω_{g_{j}}

. Here,

μ_{g}^{*}

and

ω_{g_{j}}

can be specified by

μ_{g}^{*} | ξ, Ψ \overset{i . i . d}{\sim} N_{q} (ξ, Ψ), ξ | ξ^{0}, Ψ^{0} \sim N_{q} (ξ^{0}, Ψ^{0}), ψ_{j}^{- 1} | c_{1}, c_{2} \sim Γ (c_{1}, c_{2}) for j = 1, 2, \dots, q,

(6)

ω_{g_{j}}^{- 1} | ω_{j}^{a}, ϖ_{j} \sim Γ (ω_{j}^{a}, ϖ_{j}) and ϖ_{j} | ϖ_{j}^{a}, ϖ_{j}^{b} \sim Γ (ϖ_{j}^{a}, ϖ_{j}^{b}),

respectively. Let

Ψ = diag (ψ_{1}, ψ_{2}, \dots, ψ_{q})

,

Γ (c_{1}, c_{2})

denote the Gamma distribution with parameters

c_{1}

and

c_{2}

, and

ξ^{0}, Ψ^{0}, c_{1}, c_{2}, ω_{j}^{a}, ϖ_{j}^{a}

and

ϖ_{j}^{b}

are prespecified hyperparameters [20]. The following values are used in the paper:

ξ^{0} = 0_{q \times 1}

,

Ψ^{0} = I_{q}

,

c_{1} = 11

,

c_{2} = 2.5

,

ω_{j}^{a} = 3

,

ϖ_{j}^{a} = n

, and

ϖ_{j}^{b} = 10

. Given the values of

L_{i}

,

μ^{*}

, and

Ω

, we can sample

b_{i}

from

N_{q} (μ_{L_{i}}, Ω_{L_{i}})

with

μ_{L_{i}} = μ_{L_{i}}^{*} - Σ_{g = 1}^{G} μ_{g}^{*}

.

Here, we will mainly introduce the variable selection principle of the BLasso method [17,19] for the proposed joint model JMSRE. We need to identify not only the important variables in models (1) and (2) but also whether the parameter

ϕ

is

0_{q \times 1}

. Our proposed BLasso method accomplishes this. In general, the prior distribution of the regression parameters is set to a multivariate normal distribution. Based on the concept of Bayesian Lasso inference [17], we adopt hierarchical priors for

β

,

γ

, and

ϕ

as follows:

\begin{matrix} β | H_{β} \sim N_{p_{1}} (0, H_{β}), with H_{β} = diag (h_{β_{1}}^{2}, h_{β_{2}}^{2}, \dots, h_{β_{p_{1}}}^{2}), \\ f (h_{β_{1}}^{2}, h_{β_{2}}^{2}, \dots, h_{β_{p_{1}}}^{2}) = \prod_{j = 1}^{p_{1}} \frac{ϑ_{β_{j}}^{2}}{2} \exp (- \frac{ϑ_{β_{j}}^{2}}{2} h_{β_{j}}^{2}), \end{matrix}

(7)

\begin{matrix} γ | H_{γ} \sim N_{p_{2}} (0, H_{γ}), with H_{γ} = diag (h_{γ_{1}}^{2}, h_{γ_{2}}^{2}, \dots, h_{γ_{p_{2}}}^{2}), \\ f (h_{γ_{1}}^{2}, h_{γ_{2}}^{2}, \dots, h_{γ_{p_{2}}}^{2}) = \prod_{j = 1}^{p_{2}} \frac{ϑ_{γ_{j}}^{2}}{2} \exp (- \frac{ϑ_{γ_{j}}^{2}}{2} h_{γ_{j}}^{2}), \end{matrix}

(8)

\begin{matrix} ϕ | H_{ϕ} \sim N_{q} (0, H_{ϕ}), with H_{ϕ} = diag (h_{ϕ_{1}}^{2}, h_{ϕ_{2}}^{2}, \dots, h_{ϕ_{q}}^{2}), \\ f (h_{ϕ_{1}}^{2}, h_{ϕ_{2}}^{2}, \dots, h_{ϕ_{q}}^{2}) = \prod_{j = 1}^{q} \frac{ϑ_{ϕ_{j}}^{2}}{2} \exp (- \frac{ϑ_{ϕ_{j}}^{2}}{2} h_{ϕ_{j}}^{2}), \end{matrix}

(9)

where

ϑ_{β} = \{ϑ_{β_{1}}, ϑ_{β_{2}}, \dots, ϑ_{β_{p_{1}}}\}

,

ϑ_{γ} = \{ϑ_{γ_{1}}, ϑ_{γ_{2}}, \dots, ϑ_{γ_{p_{2}}}\}

, and

ϑ_{ϕ} = \{ϑ_{ϕ_{1}}, ϑ_{ϕ_{2}}, \dots, ϑ_{ϕ_{q}}\}

are the regularization parameters that control the tail decay. In particular, to better control the effect of tail decay, this paper sets different regularization parameters for different components of the same parameter. Inspired by Park and Casella [17], we further consider the following super-priorities for these tuning parameters:

\begin{matrix} ϑ_{β_{j}}^{2} \sim Γ (a_{ϑ_{β}}, b_{ϑ_{β}}), j = 1, 2, \dots, p_{1}, \end{matrix}

(10)

\begin{matrix} ϑ_{γ_{j}}^{2} \sim Γ (a_{ϑ_{γ}}, b_{ϑ_{γ}}), j = 1, 2, \dots, p_{2}, \end{matrix}

(11)

\begin{matrix} ϑ_{ϕ_{j}}^{2} \sim Γ (a_{ϑ_{ϕ}}, b_{ϑ_{ϕ}}), j = 1, 2, \dots, q . \end{matrix}

(12)

3.2. Bayesian Analysis of Joint Model

To obtain Bayesian estimates of the unknown parameters

β, φ, σ^{2}, b, γ

, and

ϕ

, we use a hybrid algorithm that combines the block Gibbs sampler and the Metropolis–Hastings algorithm. This algorithm iteratively draws samples for these parameters.

(A) Conditional distribution of

β

.

According to Equations (3) and (7), the conditional posterior distribution

p (β | φ, σ^{2}, b, y^{*})

is given by

\begin{matrix} p (β | φ, σ^{2}, b, y^{*}) \propto \exp \{- \frac{1}{2} [\sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} {(y_{i j}^{*} - X_{i j}^{⊤} β - B^{⊤} (t_{i j}) φ - Z_{i j}^{⊤} b_{i})}^{2} + β^{⊤} H_{β}^{- 1} β]\}, \end{matrix}

which yields

\begin{matrix} β | φ, σ^{2}, b, y^{*} \sim N_{p_{1}} (A_{β}, V_{β}), \end{matrix}

(13)

where

V_{β}^{- 1} = \sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} X_{i j} X_{i j}^{⊤} + H_{β}^{- 1}

,

A_{β} = V_{β} (\sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} X_{i j} (y_{i j}^{*} - Z_{i j}^{⊤} b_{i} - B^{⊤} (t_{i j}) φ))

.

(B) Conditional distribution of

φ

.

According to Equation (3) and the prior of

φ

in Equation (4), the conditional distribution

p (φ | β, σ^{2}, b, y^{*})

is given by

\begin{matrix} p (φ | β, σ^{2}, b, y^{*}) \propto \exp \{- \frac{1}{2} [\sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} {(y_{i j}^{*} - X_{i j}^{⊤} β - B^{⊤} (t_{i j}) φ - Z_{i j}^{⊤} b_{i})}^{2} + φ^{⊤} {(H_{φ}^{0})}^{- 1} φ]\}, \end{matrix}

which yields

\begin{matrix} φ | β, σ^{2}, b, y^{*} \sim N_{L} (A_{φ}, V_{φ}), \end{matrix}

(14)

where

V_{φ}^{- 1} = \sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} B (t_{i j}) B {(t_{i j})}^{⊤} + {(H_{φ}^{0})}^{- 1}

,

A_{φ} = V_{φ} (\sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} \frac{1}{σ^{2}} B (t_{i j}) (y_{i j}^{*} - X_{i j}^{⊤} β - Z_{i j}^{⊤} b_{i}))

.

(C) Conditional distribution of

\frac{1}{σ^{2}}

.

According to Equation (3) and the prior of

σ^{2}

in Equation (4), the conditional distribution

p (\frac{1}{σ^{2}} | β, φ, b, y^{*})

is given by

\begin{matrix} p (\frac{1}{σ^{2}} | β, φ, b, y^{*}) & \propto \exp \{- \frac{1}{σ^{2}} (\frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n_{i}} {(y_{i j}^{*} - X_{i j}^{⊤} β - B^{⊤} (t_{i j}) φ - Z_{i j}^{⊤} b_{i})}^{2} + b_{0})\} \\ \times {(\frac{1}{σ^{2}})}^{\frac{1}{2} \sum_{i = 1}^{n} n_{i} + a_{0} - 1}, \end{matrix}

which yields

\begin{matrix} \frac{1}{σ^{2}} | β, φ, b, y^{*} \sim Γ (\frac{1}{2} \sum_{i = 1}^{n} n_{i} + a_{0}, \frac{1}{2} {(y_{i j}^{*} - X_{i j}^{⊤} β - B^{⊤} (t_{i j}) φ - Z_{i j}^{⊤} b_{i})}^{2} + b_{0}) . \end{matrix}

(15)

(D) Conditional distribution of

b_{i}

.

For reasons of space, the sampling of

b_{i}, i = 1, 2, \dots, n

follows the steps in Appendix A, which can also be seen in Tang et al. [21].

(E) Conditional distribution of

γ

.

It follows from Equation (3) and (8) that the conditional distribution

p (γ | b, ϕ, T, Δ)

is proportional to

\begin{matrix} \prod_{i = 1}^{n} {[\frac{\exp (W_{i}^{⊤} γ + ϕ^{⊤} b_{i})}{\sum_{j \in R_{i}} \exp (W_{j}^{⊤} γ + ϕ^{⊤} b_{j})}]}^{δ_{i}} \exp \{- \frac{1}{2} γ^{T} H_{γ}^{- 1} γ\}, \end{matrix}

(16)

which is not a familiar distribution. Therefore, the well-known Metropolis–Hastings (MH) algorithm is adopted to simulate observations from the conditional distribution given above, which is implemented as follows. Given the current value

γ^{(m)}

, new candidates

γ

are generated from

N_{p_{2}} (γ^{(m)}, σ_{γ}^{2} Σ_{γ})

, where

Σ_{γ} = {(- \partial^{2} {(\ln (p (γ | b, ϕ, T, Δ)) / \partial γ \partial γ^{⊤} |}_{γ = γ^{(m)}})}^{- 1}

. The new

γ^{(m)}

is accepted with probability

\begin{matrix} \min \{1, \frac{p (γ | b, ϕ, T, Δ)}{p (γ^{(m)} | b, ϕ, T, Δ)}\}, \end{matrix}

where

\begin{matrix} Σ_{γ} = {(\sum_{i = 1}^{n} \frac{\sum_{j \in R_{i}} \exp (\cdot) W_{j} W_{j}^{⊤} \sum_{j \in R_{i}} \exp (\cdot) + \sum_{j \in R_{i}} \exp (\cdot) W_{j} \sum_{j \in R_{i}} \exp (\cdot) W_{j}^{⊤}}{{(\sum_{j \in R_{i}} \exp (\cdot))}^{2}} + H_{γ}^{- 1})}^{- 1} . \end{matrix}

(17)

with

\exp (\cdot) = \exp (W_{j}^{⊤} γ^{(m)} + ϕ^{⊤} b_{j}) .

The variance coefficient

σ_{γ}^{2}

can be adjusted to achieve an average acceptance rate of approximately 0.25 or higher.

(F) Conditional distribution of

ϕ

.

From Equations (3) and (9), the conditional distribution

p (ϕ | γ, b, T, Δ)

is proportional to

\begin{matrix} \prod_{i = 1}^{n} {[\frac{\exp (W_{i}^{⊤} γ + ϕ^{⊤} b_{i})}{\sum_{j \in R_{i}} \exp (W_{j}^{⊤} γ + ϕ^{⊤} b_{j})}]}^{δ_{i}} \exp \{- \frac{1}{2} ϕ^{T} H_{ϕ}^{- 1} ϕ\}, \end{matrix}

(18)

which is not a familiar distribution. Similar to above (E), given the current value

ϕ^{(m)}

, new candidates

ϕ

are generated from

N_{q} (ϕ^{(m)}, σ_{ϕ}^{2} Σ_{ϕ})

, where

Σ_{ϕ} = {(- \partial^{2} {(\ln (p (ϕ | γ, b, T, Δ)) / \partial ϕ \partial ϕ^{⊤} |}_{ϕ = ϕ^{(m)}})}^{- 1}

. The new

ϕ^{(m)}

is accepted with probability

\begin{matrix} \min \{1, \frac{p (ϕ | γ, b, T, Δ)}{p (ϕ^{(m)} | γ, b, T, Δ)}\}, \end{matrix}

where

\begin{matrix} Σ_{ϕ} = {(\sum_{i = 1}^{n} \frac{\sum_{j \in R_{i}} \exp (\cdot) b_{j} b_{j}^{⊤} \sum_{j \in R_{i}} \exp (\cdot) + \sum_{j \in R_{i}} \exp (\cdot) b_{j} \sum_{j \in R_{i}} \exp (\cdot) b_{j}^{⊤}}{{(\sum_{j \in R_{i}} \exp (\cdot))}^{2}} + H_{ϕ}^{- 1})}^{- 1}, \end{matrix}

(19)

and

\exp (\cdot) = \exp (W_{j}^{⊤} γ + {ϕ^{(m)}}^{⊤} b_{j}) .

The variance coefficient

σ_{ϕ}^{2}

can be adjusted to achieve an average acceptance rate of approximately 0.25 or higher.

Using the above iterative process, we can obtain a series of sample

{(β^{(m)}, φ^{(m)}, {σ^{2}}^{(m)}

,

b_{i}^{(m)}, γ^{(m)}, ϕ^{(m)}) : m = 1, 2, \dots, M}

. Then, Bayesian estimates of

β

,

φ

,

σ^{2}

,

b_{i}

,

γ

and

ϕ

can be obtained using

\begin{matrix} \hat{β} = \frac{1}{M} \sum_{m = 1}^{M} β^{(m)}, \hat{φ} = \frac{1}{M} \sum_{m = 1}^{M} φ^{(m)}, \hat{σ^{2}} = \frac{1}{M} \sum_{m = 1}^{M} {σ^{2}}^{(m)}, \end{matrix}

\begin{matrix} {\hat{b}}_{i} = \frac{1}{M} \sum_{m = 1}^{M} b_{i}^{(m)} \hat{γ} = \frac{1}{M} \sum_{m = 1}^{M} γ^{(m)}, \hat{ϕ} = \frac{1}{M} \sum_{m = 1}^{M} ϕ^{(m)} . \end{matrix}

Similarly, the consistent estimates of the posterior covariance matrices of

var (β | y^{*}, X, Z)

,

var (φ | y^{*}, X, Z)

,

var (σ^{2} | y^{*}, X, Z)

,

var (γ | W, T, Δ)

, and

var (ϕ | y^{*}, X, Z, W, T, Δ)

can be obtained via the sample covariance matrices. For example,

\begin{matrix} \hat{var} (β | y^{*}, X, Z) = \frac{1}{M - 1} \sum_{m = 1}^{M} (β^{(m)} - \hat{β}) {(β^{(m)} - \hat{β})}^{⊤} . \end{matrix}

Therefore, the variance of the corresponding parameter can be obtained by considering the diagonal elements of the sample covariance matrix of the random sample sequence.

4. Simulation Studies

In this section, we perform three simulation studies to examine the finite performance of the previously mentioned methods.

The model used in these studies was the one defined in models (1) and (2), involving a total of 200 individuals. The specific details of the model are as follows:

y_{i j}^{*} | b_{i} = X_{1 i j} β_{1} + X_{2 i j} β_{2} + X_{3 i j} β_{3} + X_{4 i j} β_{4} + X_{5 i j} β_{5} + X_{6 i j} β_{6} + g (t_{i j}) + b_{i} + ε_{i j},

(20)

λ_{i} (t | b_{i}) = λ_{0} (t) \exp (W_{1 i} γ_{1} + W_{2 i} γ_{2} + W_{3 i} γ_{3} + W_{4 i} γ_{4} + ϕ b_{i}) .

(21)

In model (1),

Z_{i j}

can be either one-dimensional or multi-dimensional. However, in the following simulation study,

Z_{i j}

was set to be one-dimensional. In order to perform variable selection on

X_{i j}

and

W_{i j}

,

X_{i j}

and

W_{i j}

were set to be multi-dimensional in the simulation study. The data were generated as follows: observation time

t_{i j}

was randomly generated between 0 and 3. The covariates

X_{1 i j}

and

X_{6 i j}

followed a Bernoulli distribution with success probabilities of 0.5 and 0.3, respectively. The covariates

X_{2 i j}, X_{3 i j}, X_{4 i j}

, and

X_{5 i j}

were generated from a multivariate normal distribution

N_{4} (0, Σ)

with mean vector 0 and covariance matrix

Σ

. The covariance matrix

Σ

is a symmetric positive definite matrix with diagonal elements of 1 and all other elements of 0.5. The random error

ε_{i j}

was generated from a normal distribution with mean 0 and variance

σ^{2} = 0 . 6^{2}

. We define

W_{i} = {(W_{i 1}, W_{i 2}, W_{i 3}, W_{i 4})}^{⊤} = {(X_{3 i 1}, X_{4 i 1}, X_{5 i 1}, X_{6 i 1})}^{⊤}

. The baseline hazard function

λ_{0} (t) = 0.7

and

ϕ = 0.6

. The censoring time

C_{i}

was generated from the uniform distribution

U [0, 3]

, and

T_{i}^{*}

was generated from the exponential distribution with mean

1 / λ_{i} (t | b_{i})

,

T_{i} = min (T_{i}^{*}, C_{i})

. Our main objective is to utilize the proposed approaches to identify insignificant covariates and estimate non-zero coefficients. Bayesian results were obtained from 200 replications.

To demonstrate the accuracy and flexibility of our proposed method, we conducted three simulation studies. These simulations aimed to estimate parameters of interest, identify unimportant variables, and capture the features of the unknown function

g (t)

and random effects

b_{i}

. The true values of unknown parameters

β

and

γ

were set to be the same in Simulation I and Simulation II, and the parameter’s true values included 0. The true values of unknown parameters

β

and

γ

in Simulation III are all non-zero. The settings of the unknown function

g (t)

and random effects

b_{i}

are different between the three simulation studies. The unknown function

g (t)

setting includes both nonlinear and linear. The random effect

b_{i}

was set to follow a mixed normal distribution with unimodal, bimodal, and trimodal distributions, respectively. By conducting these simulation studies, we can showcase the effectiveness and versatility of our method.

Simulation I

$\begin{matrix} β = {(β_{1}, \dots, β_{6})}^{⊤} = {(1, 0, 0, - 0.5, 0.5, - 1)}^{⊤}, γ = {(γ_{1}, \dots, γ_{4})}^{⊤} = {(0, 1, - 0.5, 0)}^{⊤}, \end{matrix}$

$\begin{matrix} g (t) = \sin (\frac{3}{4} π t), b_{i} \overset{i . i . d}{\sim} 0.6 N (- 0.8, 0 . 1^{2}) + 0.4 N (1.2, 0 . 5^{2}), \end{matrix}$
Simulation II

$\begin{matrix} β = {(β_{1}, \dots, β_{6})}^{⊤} = {(1, 0, 0, - 0.5, 0.5, - 1)}^{⊤}, γ = {(γ_{1}, \dots, γ_{4})}^{⊤} = {(0, 1, - 0.5, 0)}^{⊤}, \end{matrix}$

$\begin{matrix} g (t) = t, b_{i} \overset{i . i . d}{\sim} 0.4 N (0, 0 . 3^{2}) + 0.3 N (- 1.5, 0 . 1^{2}) + 0.3 N (1.5, 0 . 1^{2}), \end{matrix}$
Simulation III

$\begin{matrix} β = {(β_{1}, \dots, β_{6})}^{⊤} = {(1, 0.5, - 0.5, - 0.5, 0.5, - 1)}^{⊤}, γ = {(γ_{1}, \dots, γ_{4})}^{⊤} = {(- 0.5, 1, - 0.5, 1)}^{⊤}, \end{matrix}$

$\begin{matrix} g (t) = t^{2}, b_{i} \overset{i . i . d}{\sim} N (0, 0 . 8^{2}) . \end{matrix}$

We utilized the proposed semiparametric Bayesian procedure to simultaneously estimate unknown parameters and identify significant covariates in each of the three simulation studies. The mean censoring rates for the survival times in these studies were 44%, 45%, and 37%. The prior hyperparameters were set as follows:

a_{ϑ_{β}} = a_{ϑ_{γ}} = a_{ϑ_{ϕ}} = 1

,

b_{ϑ_{γ}} = b_{ϑ_{β}} = b_{ϑ_{ϕ}} = 0.1

. These hyperparameters correspond to the hyperpriors for the adjustment coefficients in Equations (10)–(12). We set

a_{0} = 1

,

b_{0} = 1

, and

H_{φ}^{0} = 100 I_{4}

, which correspond to the prior parameters of

σ^{2}

and

φ

. We set the degree of B-splines

d = 3

, the number of knots

K = 4

, and

G = 25

.

To assess the convergence of the proposed algorithm, we computed the estimated potential scale reduction (EPSR) values for the parameters. Additionally, we also need to test the convergence of the unknown function fitted using the B-splines method. Figure 1 indicates that the EPSR values remained consistently below 1.2 after around 3000 iterations in all three simulation studies. Consequently, we collected 3000 observations (

M = 3000

) to calculate the Bayesian estimates of the parameters after 3000 iterations in order to produce Bayesian results for each of the 200 replications. For comparison, we also applied Gaussian priors as the prior distribution of random effects. The purpose of these simulations is to compare the semi-parametric approach based on the CDPMM prior with the parametric approach based on the Gaussian prior from a Bayesian perspective. Results obtained from three simulation studies were reported in Table 1, Table 2 and Table 3, which include five measures: “Median”, “Bias”, “SD”, “RMS”, and “F0”. “Median” represents the median of the estimates from 200 replications. “Bias” indicates the difference between the true value and the mean of the estimates from 200 replications. “SD” indicates the standard deviation of the estimates from 200 replications. “RMS” is the root mean square between the estimates from 200 replications and their true values. “F0” indicates the proportion of parameters identified as zero in 200 replications, considering a parameter to be identified as zero if its 95% confidence interval contains zero.

The results from Table 1, Table 2 and Table 3 suggest that the Bayesian estimates of the parameters are reasonably accurate. One can see that in all simulations, the proposed CDPMM prior performed better in both parameter estimation and inferential characteristics. This is indicated by the fact that the bias (Bias) values of the results based on the CDPMM prior method are all less than 0.10, and the root mean square (RMS) value and standard deviation (SD) value are both less than 0.20. Furthermore, the BLasso method was able to correctly identify the important covariates in most cases, regardless of the prior inputs of parameters. This is supported by the fact that the F0 values corresponding to the important covariates were less than 10%, indicating a high level of significance. On the other hand, the F0 values corresponding to the unimportant covariates were more than 90%, indicating a lack of significance. The recovery performance of the proposed method for the unknown function

g (t)

can be measured using the RMSE (the root mean square error), which is expressed as

\begin{matrix} RMSE (g^{(r)}) = \sqrt{\frac{1}{300} \sum_{l = 1}^{300} {(g (u_{l}) - {\hat{g}}^{(r)} (u_{l}))}^{2}}, r = 1, 2, \dots, 200, \end{matrix}

(22)

where

{\hat{g}}^{(r)} (t) = B^{⊤} (t) {\hat{φ}}^{(r)}

,

{\hat{φ}}^{(r)}

represents the Bayesian estimated value of the parameter vector

φ

in the r-th replication. Similar to the RMSE of the unknown function, we also calculate the RMSE of the random effects. Figure 2 plots the estimated curve and estimated density of the unknown function

\hat{g} (t)

and the random effects

b_{i}

of the replication based on different priors. The mean of the RMSE of the unknown function and the random effects is in the middle of the 200 replications and is compared against the true curves and true density in three simulation studies, respectively.

Upon inspection of Figure 2, it is evident that the Bayesian B-splines method proposed in this paper is flexible enough to accurately fit the true curve of the unknown function

g (t)

. Additionally, the CDPMM prior proposed demonstrates sufficient flexibility compared to the Gaussian prior to capture the general shapes of the three distribution assumptions considered for

b_{i}

. The results presented in Table 4, based on 200 replications in three simulation studies under the CDPMM prior and Gaussian prior, further support the robustness of the CDPMM method. The estimated means and standard deviations (SDs) of the random effects

b_{i}

closely align with their corresponding true values. Moreover, the 25%, 50%, and 75% quantiles of the RMSE of the unknown function and the random effects are sufficiently small, indicating the effectiveness of the CDPMM approach in estimating random effects.

All these findings show that, compared with the Gaussian prior method, our CDPMM prior method makes the Bayesian B-spline curve flexible enough to accurately fit the real curve of nonlinear data. Additionally, the Bayesian procedure effectively captures the true information of

b_{i}

, regardless of their true distributions and forms. Furthermore, BLasso has a high probability of correctly identifying the true model.

5. An Example

In this section, we apply the method proposed in the previous sections to the MA.5 research experiment conducted by the Clinical Trial Group of the National Cancer Institute of Canada. The data pertain to 716 women with early-stage breast cancer before menopause. A total of 356 patients were randomly selected to receive cyclophosphamide, epirubicin, and fluorouracil (CEF) adjuvant chemotherapy as the experimental group. The remaining 360 patients received cyclophosphamide, methotrexate, and fluorouracil (CMF) adjuvant chemotherapy as the control group of the trial. In clinical trials, visits were made before the start of treatment, during each of the six treatment cycles, and every three months after treatment. At each visit, medical history and physical examination were conducted, and the Breast Cancer Questionnaire (BCQ) is used to assess the patient’s QOL. The dataset consists of a total of 7807 observations. By the end of the study, 366 patients had died, resulting in a censoring rate of approximately 49%. For a detailed study of these data, please refer to Song et al. [26] and Levine et al. [27]. We linearly convert the evaluated BCQ score into a unit interval

(0, 1)

, and the longitudinal data constrained to the interval

(0, 1)

are the longitudinal proportional data of interest. The trial focuses on the recurrence-free survival time (RFS), which is the duration between randomization and disease recurrence. Different treatment options, age, and the number of tumor-positive lymph nodes may directly affect RFS and the patient’s QOL. We fitted the MA.5 research experiment dataset to the following model:

\begin{matrix} y_{i j}^{*} | b_{i} = β_{1} {EM}_{i} + β_{2} {{NODE}_{_} POS}_{i} + β_{3} {AGE}_{i} + g (t_{i j}) + b_{i} + ε_{i j}, \end{matrix}

(23)

\begin{matrix} λ_{i} (t | b_{i}) = λ_{0} (t) \exp (γ_{1} {EM}_{i} + γ_{2} {{NODE}_{_} POS}_{i} + γ_{3} {AGE}_{i} + ϕ b_{i}), \end{matrix}

(24)

where variable

y_{i j}^{*}

represents the BCQ score after applying the logit function transformation.

{EM}_{i}

is a two-class treatment index, where

{EM}_{i} = 1

indicates that the i-th patient underwent CEF treatment, and

{EM}_{i} = 0

indicates that the i-th patient underwent CMF treatment. Age and the number of lymph node metastases are binary variables. Patients who are 40 years old or younger are classified as belonging to the younger group, denoted as

AGE = 1

. Patients who are older than 40 years old belong to the elderly group, denoted as

AGE = 0

. When the number of lymph node metastases is 0–3,

NODE_POS = 0

; otherwise, it is 1. The term

g (t)

in Equation (23) represents an unknown function related to the observation time t.

The unknown function

g (t)

is estimated using a cubic B-spline function, and the domain of the cubic B-spline function is

[\min (t_{i j}), \max (t_{i j})]

. The prior distributions and values of all hyperparameters in the case study are the same as those set in the simulation study above. Based on the above settings, we calculated EPSR values for all parameters. The results indicate that after approximately 3000 iterations, all EPSR values are less than 1.2. Therefore, we use the 3000 iterations after the 3000th iteration to calculate the Bayesian estimation. The results of the example analysis are shown in Table 5 based on two different prior methods.

From Table 5, the following observations can be made. (i) The parameter estimation based on the CDPMM prior proposed in this paper has a smaller standard deviation (SD) and a shorter confidence interval than that based on the Gaussian prior. This suggests that the approach proposed in this paper is more effective. (ii) Under the CDPMM prior, the risk ratio of randomly receiving CEF and CMF treatment is

HR = \exp (γ_{1}) = 71.106 %

, implying that patients who randomly receive CEF chemotherapy have a lower risk. (iii) The credible interval

(0.176, 0.312)

for

β_{1}

does not include 0, indicating that different adjuvant chemotherapy regimens have a significant impact on patients’ QOL. Additionally, it suggests that CEF chemotherapy is more toxic than CMF chemotherapy. (iv) The risk ratio for the number of lymph node metastases being greater than or equal to four compared to less than four is calculated as

HR = \exp (γ_{2}) = 210.644 %

. This implies that patients with a higher number of lymph node metastases have a greater risk of breast cancer recurrence and a shorter RFS. (v) The regression coefficient

β_{2}

for lymph node metastasis numbers greater than or equal to 4 is 0.304, and its credible interval does not include 0, indicating high significance. This suggests that patients with a higher number of lymph nodes experience a lower QOL, which aligns with clinical experience; (vi) The risk ratio between the young group and the old group is

HR = \exp (γ_{3}) = 187.386 %

, implying that the risk of breast cancer recurrence is higher and the RFS is shorter in the young group; (vii) The credible interval

(0.180, 0.375)

for

β_{3} = 0.269

does not contain 0, indicating that age has a significant impact on where variable

y_{i j}^{*}

represents the BCQ score after applying the logit function transformation. This suggests that the quality of life for the elderly group is better than that of the young group. (viii) The value of

ϕ

is 0.269, and the credible interval for

ϕ

is

(0.036, 0.519)

, which does not include 0. This indicates that

ϕ

is significantly different from 0, suggesting a significant correlation between the longitudinal proportional data and survival data. Therefore, the JMSRE model proposed in this paper is applicable and reasonable for analyzing the MA.5 research experiment’s data.

6. Concluding Remarks

In this paper, a semiparametric joint model is proposed for longitudinal proportional data and survival data. The model does not assume the normality of random effects and does not require the specification of an unknown function influencing longitudinal responses. The proposed model offers several advantages. Firstly, it improves the flexibility of jointly modeling longitudinal proportional data and survival data. Secondly, the proposed B-splines method effectively captures different unknown functions in a flexible manner. Thirdly, compared to a Gaussian prior, the proposed CDPMM method accurately captures the unimodal, bimodal, and multimodal features of random effects. Lastly, the computational burden is not heavy, with the replication in the simulation study taking approximately 4 min and the breast cancer dataset taking about 78 min to run.

Our simulation studies and example analysis demonstrate that the Bayesian estimation approach proposed based on the joint model is accurate and robust. The use of Bayesian B-splines allows for a more flexible estimation of the unknown function curve, enabling it to capture the true characteristics of the unknown function more effectively. Additionally, compared with the Gaussian prior method, the CDPMM method effectively captures the true information of

b_{i}

. Furthermore, the BLasso method has a high probability of correctly identifying the true model. In comparison to the method proposed by Song et al. [26] for jointly modeling longitudinal proportional data and survival data, the joint model proposed in this paper offers greater flexibility.

The joint model of longitudinal proportional data and survival data proposed in this paper still has many unsolved problems, and we need to address the following issues in the future: (i) It does not impose any constraints on the form of the basic hazard function. (ii) We should consider more complex spline models, such as automatically selecting nodes to enhance the performance of the proposed model. (iii) We should also explore a joint model for the variable longitudinal proportional outcome and the multivariate survival outcome.

Author Contributions

Methodology, W.L., A.T. and Z.C.; Writing—review & editing, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

The research was partially supported by grants from the Natural Science Foundation of China [Grant Number 12261102], a grant from key project of the Yunnan Province Foundation, China [Grant Number 202001BB050049, 202201BF070001-004, 202301AS070044].

Data Availability Statement

The real data that are used to illustrate the proposed methods may be available from the corresponding author upon considerable request. The data are not publicly available due to ethical restrictions and privacy.

Acknowledgments

The authors wish to thank the Editor-in-Chief, the Associate Editor and two reviewers for their many helpful and insightful comments and suggestions that greatly improved the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Conditional Distribution of b_i

Let

θ_{b_{i}}

denote the unknown parameters associated with the distribution of

b_{i}

for

i = 1, 2, \dots, n

. The parameters

θ_{b_{i}}

can be iteratively drawn using the following steps.

Step (a) The conditional distribution of

ξ

given

(μ^{*}, Ψ, b)

is a normal distribution given by

\begin{matrix} ξ | μ^{*}, Ψ, b \sim N_{q} (A, B), \end{matrix}

(A1)

where

B = {(G Ψ^{- 1} + {(Ψ^{0})}^{- 1})}^{- 1}

and

A = B ({(Ψ^{0})}^{- 1} ξ^{0} + Ψ^{- 1} \sum_{g = 1}^{G} μ_{g}^{*})

.

Step (b) For

j = 1, 2, \dots, q

, the diagonal elements of

Ψ

is conditionally distributed as

\begin{matrix} ψ_{j}^{- 1} | μ^{*}, ξ \sim Γ (c_{1} + \frac{G}{2}, c_{2} + \frac{1}{2} \sum_{g = 1}^{G} {(μ_{g_{j}}^{*} - ξ_{j})}^{2}), \end{matrix}

(A2)

where

μ_{g_{j}}^{*}

is the jth element of

μ_{g}^{*}

and

ξ_{j}

is the jth element of

ξ

.

Step (c) For

j = 1, 2, \dots, q

,

ϖ_{j} | Ω

is conditionally distributed as

\begin{matrix} ϖ_{j} | Ω \sim Γ (ϖ_{j}^{a}, ϖ_{j}^{b} + \sum_{g = 1}^{G} ω_{g_{j}}^{- 1}), \end{matrix}

(A3)

where

ω_{g_{j}}

is the jth diagonal element of

Ω_{g}

.

Step (d) According to Ishwaran and Zarepour [24], the conditional distribution of

τ

can be determined based on the given

π

:

τ | π \sim Γ (a_{1} + G - 1, a_{2} - \sum_{g = 1}^{G - 1} \log (1 - ν_{g}^{*})),

where

ν_{g}^{*}

represents a randomly sampled weight from the beta distribution.

Step (e) Given L and

τ

, the conditional distribution of

π

can be obtained by following generalized Dirichlet distribution:

\begin{matrix} π | L, τ \sim Dir (a_{1}^{*}, b_{1}^{*}, a_{2}^{*}, b_{2}^{*}, \dots, a_{G - 1}^{*}, b_{G - 1}^{*}) . \end{matrix}

(A4)

where

a_{g}^{*} = 1 + d_{g}

and

b_{g}^{*} = τ + \sum_{ι = g + 1}^{G} d_{ι}

for

g = 1, 2, \dots, G - 1

. Here,

d_{g}

represents the number of

L_{i}^{'} s

values that are equal to g, and

ν_{g}^{*}

is generated autonomously from a Beta distribution characterized by the parameters

(a_{g}^{*}, b_{g}^{*})

. Then, the values

π_{1}, π_{2}, \dots, π_{G}

are derived using the following formula:

\begin{matrix} π_{1} = ν_{1}^{*}, π_{G} = 1 - \sum_{g = 1}^{G - 1} π_{g}, and π_{g} = \prod_{ι = 1}^{g - 1} (1 - ν_{ι}^{*}) ν_{g}^{*}, for g \neq 1 or G . \end{matrix}

(A5)

Step (f) Let

L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}

represent the d distinct values of

{L_{1}, L_{2}, \dots, L_{n}}

(i.e., the unique number of “clusters”). For

g = 1, 2 \dots, G

, the conditional distribution of

μ_{g}^{*}

is as follows:

\begin{matrix} μ_{g}^{*} | ξ, Ψ \sim N_{q} (ξ, Ψ) for g \notin {L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}}, \end{matrix}

(A6)

\begin{matrix} μ_{g}^{*} | ξ, Ψ, Ω, L, b \sim N_{q} (E_{g}, F_{g}) for g \in {L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}}, \end{matrix}

(A7)

where

F_{g}

is defined as

{(Ψ^{- 1} + Σ_{{i : L_{i} = g}} Ω_{i}^{- 1})}^{- 1}

, and

E_{g}

is defined as

F_{g} (Ψ^{- 1} ξ + Σ_{{i : L_{i} = g}} Ω_{i}^{- 1} b_{i})

for

g \in {L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}}

. Given

μ_{g}^{*}

,

μ_{g} = μ_{g}^{*} - Σ_{g = 1}^{G} π_{g} μ_{g}^{*}

,

μ^{*} = {μ_{1}^{*}, μ_{2}, \dots, μ_{G}^{*}}

, and

μ = {μ_{1}, μ_{2}, \dots, μ_{G}}

.

Step (g) Given a value g, for

j = 1, 2, \dots, q

, the conditional distribution of the diagonal elements of

Ω_{g}

is as follows:

\begin{matrix} ω_{g_{j}} \sim Γ (ω_{j}^{a}, ϖ_{j}) for g \notin {L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}}, \end{matrix}

(A8)

\begin{matrix} ω_{g_{j}} \sim Γ (\frac{d_{g}}{2} + ω_{j}^{a}, ϖ_{j} + \sum_{{i : L_{i} = g}} \frac{1}{2} {(b_{i_{j}} - μ_{g_{j}})}^{2}) for g \in {L_{1}^{*}, L_{2}^{*}, \dots, L_{d}^{*}}, \end{matrix}

(A9)

where

b_{i_{j}}

represents the jth element of vector

b_{i}

, while

μ_{g_{j}}

denotes the jth element of vector

μ_{g}

. Additionally, given the value of

ω_{g_{j}}

, we can construct the diagonal matrix

Ω_{g} = diag (ω_{g_{1}}, ω_{g_{2}}, \dots, ω_{g_{q}})

. Finally, the set

Ω

consists of matrices

{Ω_{1}, Ω_{2}, \dots, Ω_{G}}

.

Step (h) Given

π, μ, Ω, b

, the conditional distribution of

L_{i}

is obtained by

\begin{matrix} L_{i} | π, μ, Ω, b \overset{i . i . d}{\sim} Multinomial (π_{i g}^{*}, g = 1, 2, \dots, G), \end{matrix}

(A10)

the value of

π_{i g}^{*}

is directly proportional to

π_{g} p (b_{i} | μ_{g}, Ω_{g})

, where

b_{i} | μ_{g}, Ω_{g} \sim N_{q} (μ_{g}, Ω_{g})

. The values of

π_{g}

(

g = 1, 2, \dots, G

) are randomly selected from step (e). When given

L_{i}

,

μ

, and

Ω

, the prior distribution of

b_{i}

follows a normal distribution

N_{q} (μ_{L_{i}}, Ω_{L_{i}})

, where

μ_{L_{i}}

and

Ω_{L_{i}}

represent the

L_{i}

elements of the sets

μ

and

Ω

, respectively.

Step (i) The conditional distribution

p (b_{i} | β, φ, σ^{2}, γ, ϕ, y^{*}, T, Δ)

cannot be directly derived using Gibbs sampling for

i = 1, 2, \dots, n

as it is non-standard. Specifically, it can be expressed as follows:

\begin{matrix} p (b_{i} | β, φ, σ^{2}, γ, ϕ, y^{*}, T, Δ) \propto p (b_{i} | μ_{L_{i}}, Ω_{L_{i}}) p (y_{i}^{*} | b_{i}; θ_{y}) p (T, Δ | b; θ_{T}) . \end{matrix}

(A11)

The Metropolis–Hastings algorithm, which is employed to sample

b_{i}

, is implemented in the following manner. During the mth iteration, a new candidate

b_{i}

is drawn from a normal distribution

N_{q} (b_{i}^{(m)}, σ_{b}^{2} Σ_{b_{i}})

, where

b_{i}^{(m)}

represents the current value,

Σ_{b_{i}} = {(Ω_{L_{i}}^{- 1} + Ξ_{i})}^{- 1}

and

Ξ_{i} = - \partial^{2} {(\ln (p (y_{i}^{*} | b_{i}; θ_{y}) p (T, Δ | b; θ_{T})) / \partial b_{i} \partial b_{i}^{⊤} |}_{b_{i} = b_{i}^{(m)}}

. The new

b_{i}

is accepted with probability

\begin{matrix} min \{1, \frac{p (b_{i} | μ_{L_{i}}, Ω_{L_{i}}) p (y_{i}^{*} | b_{i}; θ_{y}) p (T, Δ | b; θ_{T})}{p (b_{i}^{(m)} | μ_{L_{i}}, Ω_{L_{i}}) p (y_{i}^{*} | b_{i}^{(m)}; θ_{y}) p (T, Δ | b_{i}^{(m)}, b_{- i}; θ_{T})}\}, \end{matrix}

(A12)

The remaining random effects, denoted as

b_{- i}

, represent the random effects of all individuals except the ith individual. The value of the variance

σ_{b}^{2}

can be adjusted to ensure that the average acceptance rate is about 0.25 or higher.

References

Faucett, C.L.; Thomas, D.C. Simultaneously modelling censored survival data and repeatedly measured covariates: A gibbs sampling approach. Stat. Med. 1996, 15, 1663–1685. [Google Scholar] [CrossRef]
Wulfsohn, M.S.; Tsiatis, A.A. A Joint model for survival and longitudinal data measured with error. Biometrics 1997, 53, 330–339. [Google Scholar] [CrossRef] [PubMed]
Schluchter, M.D. Methods for the analysis of informatively censored longitudinal data. Stat. Med. 1992, 11, 1861–1870. [Google Scholar] [CrossRef] [PubMed]
Tsiatis, A.A.; Degruttola, V.; Wulfsohn, M.S. Modeling the relationship of survival to longitudinal data measured with error: Applications to survival and CD4 counts in patients with AIDS. J. Am. Stat. Assoc. 1995, 90, 27–37. [Google Scholar] [CrossRef]
Tsiatis, A.A.; Davidian, M. Joint modeling of longitudinal and time-to-event data: An overview. Stat. Sin. 2004, 14, 809–834. [Google Scholar] [CrossRef]
Yu, M.; Law, N.J.; Taylor, J.M.G.; Sandler, H.M. Joint longitudinal-survival-cure models and their application to prostate cancer. Stat. Sin. 2004, 14, 835–862. [Google Scholar] [CrossRef]
Diggle, P.J.; Sousa, I.; Chetwynd, A.G. Joint modelling of repeated measurements and time-to-event outcomes: The fourth armitage lecture. Stat. Med. 2008, 27, 2981–2998. [Google Scholar] [CrossRef] [PubMed]
Lawrence Gould, A.; Boye, M.E.; Crowther, M.J.; Ibrahim, J.G.; Quartey, G.; Micallef, S.; Bois, F.Y. Joint modeling of survival and longitudinal non-survival data: Current methods and issues. Stat. Med. 2015, 34, 2181–2195. [Google Scholar] [CrossRef]
Henderson, R.; Diggle, P.; Dobson, A. Joint modelling of longitudinal measurements and event time data. Biostatistics 2000, 1, 465–480. [Google Scholar] [CrossRef]
Zeng, D.; Cai, J. Simultaneous modelling of survival and longitudinal data with an application to repeated quality of life measures. Lifetime Data Anal. 2005, 11, 151–174. [Google Scholar] [CrossRef]
Tang, A.; Zhao, X.; Tang, N. Bayesian variable selection and estimation in semiparametric joint models of multivariate longitudinal and survival data. Biom. J. 2017, 59, 57–78. [Google Scholar] [CrossRef]
Song, P.X.; Tan, M. Marginal models for longitudinal continuous proportional data. Biometrics 2000, 56, 496–502. [Google Scholar] [CrossRef] [PubMed]
Lesaffre, E.; Rizopoulos, D.; Tsonaka, R. The Logistic Transform for Bounded Outcome Scores. Biostatistics 2007, 8, 72–85. [Google Scholar] [CrossRef]
Barndorff-Nielsen, O.E.; Jørgensen, B. Some Parametric Models on the Simplex. Multivar. Anal. 1991, 39, 106–116. [Google Scholar] [CrossRef] [Green Version]
Qiu, Z.; Song, P.X.K. Simplex Mixed-Effects Models for Longitudinal Proportional Data. Scand. J. Stat. 2008, 35, 577–596. [Google Scholar] [CrossRef]
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. B 1996, 58, 267–288. [Google Scholar] [CrossRef]
Park, T.; Casella, G. The Bayesian Lasso. J. Am. Stat. Assoc. 2008, 103, 681–686. [Google Scholar] [CrossRef]
Hans, C. Bayesian lasso regression. Biometrika 2009, 96, 835–845. [Google Scholar] [CrossRef]
Guo, R.; Zhu, H.; Chow, S.M.; Ibrahim, J.G. Bayesian Lasso for semiparametric sructural equation models. Biometrics 2012, 68, 567–577. [Google Scholar] [CrossRef]
Ohlssen, D.I.; Sharples, L.D.; Spiegelhalter, D.J. Flexible random effects models using Bayesian semiparametric models: Applications to institutional comparisons. Stat. Med. 2007, 26, 2088–2112. [Google Scholar] [CrossRef]
Tang, A.; Duan, X.; Zhao, Y. Bayesian variable selection and estimation in semiparametric simplex mixed-effects models with longitudinal proportional data. Entropy 2022, 24, 1466. [Google Scholar] [CrossRef]
Sethuraman, J. A Constructive Definition of Dirichlet Priors. Stat. Sin. 1994, 4, 639–650. [Google Scholar]
Yang, M.; Dunson, D.B.; Baird, D. Semiparametric bayes hierarchical models with mean and variance constraints. Comput. Stat. Data Anal. 2010, 54, 2172–2186. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ishwaran, H.; Zarepour, M. Markov chain monte carlo in approximate dirichlet and beta two-parameter process hierarchical models. Biometrika 2000, 87, 371–390. [Google Scholar] [CrossRef]
Chow, S.; Tang, N.; Yuan, Y.; Song, X.; Zhu, H. Bayesian estimation of semiparametric nonlinear dynamic factor analysis models using the Dirichlet process prior. Br. J. Math. Stat. Psychol. 2011, 64, 69–106. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Song, H.; Peng, Y.; Tu, D. Jointly modeling longitudinal proportional data and survival times with an application to the quality of life data in a breast cancer trial. Lifetime Data Anal. 2017, 23, 183–206. [Google Scholar] [CrossRef] [PubMed]
Levine, M.N.; Bramwell, V.H.; Pritchard, K.I.; Norris, B.D.; Shepherd, L.E.; Abu-Zahra, H.; Findlay, B.; Warr, D.; Bowman, D.; Myles, J.; et al. Randomized trial of intensive cyclophosphamide, epirubicin, and fluorouracil chemotherapy compared with cyclophosphamide, methotrexate, and fluorouracil in premenopausal women with node-positive breast cancer. J. Clin. Oncol. 1998, 16, 2651–2658. [Google Scholar] [CrossRef]

Figure 1. EPSR values of all parameters against iteration numbers for a randomly selected replication in Simulation I (left panel), Simulation II (middle panel), and Simulation III (right panel). The colored lines represent the EPSR values for all parameters, and the red dashed lines determine the number of iterations when all parameters converge.

Figure 2. Estimation versus true values of unknown function

g (t)

(upper panels) and estimated versus true densities for random effects

b_{i}

(lower panels) based on the CDPMM prior (CDPMM) and Gaussian prior (GP) method in Simulation I (left panels), Simulation II (middle panels), and Simulation III (right panels).

Figure 2. Estimation versus true values of unknown function

g (t)

(upper panels) and estimated versus true densities for random effects

b_{i}

(lower panels) based on the CDPMM prior (CDPMM) and Gaussian prior (GP) method in Simulation I (left panels), Simulation II (middle panels), and Simulation III (right panels).

Table 1. Bayesian estimates of parameters based on the CDPMM prior and Gaussian prior in Simulation I.

Pra.	True	CDPMM Prior					Gaussian Prior
Pra.	True	Median	Bias	SD	RMS	F0 (%)	Median	Bias	Median	RMS	F0 (%)
$β_{1}$	1.00	0.993	−0.005	0.058	0.058	0.0	0.980	−0.016	0.124	0.125	0.0
$β_{2}$	0.00	0.001	0.002	0.036	0.036	99.0	−0.006	−0.010	0.086	0.087	95.5
$β_{3}$	0.00	−0.005	−0.002	0.033	0.033	99.0	−0.009	−0.005	0.090	0.090	95.0
$β_{4}$	−0.50	−0.493	0.005	0.041	0.041	0.0	−0.477	0.009	0.108	0.109	0.5
$β_{5}$	0.50	0.499	−0.001	0.037	0.037	0.0	0.509	0.008	0.104	0.104	0.5
$β_{6}$	−1.00	−0.985	0.013	0.064	0.066	0.0	−0.965	0.034	0.161	0.164	0.0
$γ_{1}$	0.00	0.010	0.010	0.119	0.120	95.5	−0.026	−0.028	0.122	0.125	95.5
$γ_{2}$	1.00	1.007	0.001	0.148	0.148	0.0	0.988	0.004	0.157	0.156	0.0
$γ_{3}$	−0.50	−0.478	0.018	0.137	0.138	5.0	−0.479	0.030	0.134	0.137	6.0
$γ_{4}$	0.00	0.010	−0.002	0.198	0.197	97.0	0.026	0.031	0.183	0.185	97.5
$ϕ$	0.60	0.610	0.006	0.106	0.106	0.0	0.598	0.032	0.121	0.125	0.0
$σ^{2}$	0.36	0.357	−0.002	0.018	0.018	–	0.363	0.004	0.021	0.022	–

Table 2. Bayesian estimates of parameters based on the CDPMM prior and Gaussian prior in Simulation II.

Pra.	True	CDPMM Prior					Gaussian Prior
Pra.	True	Median	Bias	SD	RMS	F0 (%)	Median	Bias	Median	RMS	F0 (%)
$β_{1}$	1.00	0.932	−0.076	0.091	0.118	0.0	0.893	−0.113	0.142	0.182	0.0
$β_{2}$	0.00	0.003	0.002	0.047	0.047	98.5	0.000	0.003	0.098	0.098	93.0
$β_{3}$	0.00	0.004	0.000	0.049	0.049	99.0	0.000	0.003	0.109	0.109	94.5
$β_{4}$	−0.50	−0.494	0.009	0.057	0.058	0.0	−0.484	0.012	0.117	0.117	0.5
$β_{5}$	0.50	0.483	−0.013	0.056	0.058	0.0	0.487	−0.018	0.112	0.113	1.0
$β_{6}$	−1.00	−1.014	−0.007	0.101	0.101	0.0	−1.001	0.012	0.191	0.191	0.0
$γ_{1}$	0.00	0.002	0.002	0.106	0.106	98.5	−0.002	0.005	0.131	0.131	94.5
$γ_{2}$	1.00	0.964	−0.027	0.144	0.146	0.0	0.999	0.003	0.142	0.142	0.0
$γ_{3}$	−0.50	−0.457	0.031	0.128	0.131	3.5	−0.501	−0.001	0.144	0.143	6.0
$γ_{4}$	0.00	−0.006	−0.014	0.174	0.174	98.0	0.002	0.008	0.226	0.225	96.5
$ϕ$	0.60	0.571	−0.028	0.086	0.091	0.0	0.596	0.005	0.101	0.100	0.0
$σ^{2}$	0.36	0.361	0.000	0.020	0.020	–	0.365	0.005	0.021	0.021	–

Table 3. Bayesian estimates of parameters based on the CDPMM prior and Gaussian prior in Simulation III.

Pra.	True	CDPMM Prior					Gaussian Prior
Pra.	True	Median	Bias	SD	RMS	F0 (%)	Median	Bias	Median	RMS	F0 (%)
$β_{1}$	1.00	0.990	−0.015	0.103	0.104	0.0	0.984	−0.013	0.102	0.102	0.0
$β_{2}$	0.50	0.487	−0.009	0.077	0.077	0.0	0.487	−0.013	0.083	0.084	0.0
$β_{3}$	−0.50	−0.488	0.011	0.085	0.085	0.0	−0.477	0.018	0.076	0.078	0.0
$β_{4}$	−0.50	−0.499	0.001	0.079	0.079	0.0	−0.507	−0.005	0.076	0.076	0.0
$β_{5}$	0.50	0.496	0.000	0.083	0.083	0.0	0.513	0.006	0.078	0.078	0.0
$β_{6}$	−1.00	−0.977	0.028	0.134	0.136	0.0	−0.976	0.024	0.133	0.135	0.0
$γ_{1}$	−0.50	−0.502	0.002	0.125	0.125	0.5	−0.479	0.022	0.124	0.126	2.5
$γ_{2}$	1.00	0.987	−0.008	0.141	0.141	0.0	0.983	−0.025	0.144	0.146	0.0
$γ_{3}$	−0.50	−0.492	0.008	0.127	0.127	2.5	−0.483	0.015	0.123	0.123	2.5
$γ_{4}$	1.00	0.997	−0.007	0.191	0.191	0.0	1.015	−0.001	0.205	0.205	0.0
$ϕ$	0.60	0.604	−0.002	0.145	0.145	1.0	0.626	0.021	0.157	0.158	0.5
$σ^{2}$	0.36	0.365	0.005	0.020	0.020	–	0.364	0.004	0.019	0.019	–

Table 4. Estimated mean and standard deviation for random effects and quantiles of RMSE for unknown functions and random effects based on the CDPMM prior (CDPMM) and Gaussian prior (GP) method in three simulation studies.

	Method	Est of Random Effects				Quantile of RMSE
	Method	Mean	Est Mean	SD	Est SD	25%	50%	75%
Simulation I	CDPMM	−0.011	−0.007	1.004	0.961	0.091	0.115	0.138
	GP	0.004	−0.036	1.052	0.903	0.130	0.159	0.198
Simulation II	CDPMM	−0.040	0.009	1.251	1.236	0.086	0.112	0.139
	GP	−0.151	−0.040	1.224	1.135	0.088	0.111	0.140
Simulation III	CDPMM	−0.001	−0.001	0.879	0.797	0.109	0.152	0.227
	GP	0.018	0.004	0.739	0.663	0.102	0.144	0.209

“Mean” denotes true empirical mean of the distribution; “Est mean” denotes mean of the posterior samples. “SD” denotes true empirical standard deviation of the distribution; “Est SD” denotes standard deviation of the posterior samples.

Table 5. Bayesian estimations of parameters based on the CDPMM prior and Gaussian prior in the MA.5 experimental research study.

Pra.	CDPMM Prior			Gaussian Prior
Pra.	Est	SD	IC	Est	SD	IC
$β_{1}$	0.239	0.035	(0.176, 0.312)	0.242	0.045	(0.163, 0.331)
$β_{2}$	0.304	0.041	(0.219, 0.377)	0.296	0.043	(0.220, 0.379)
$β_{3}$	0.269	0.049	(0.180, 0.375)	0.275	0.059	(0.166, 0.387)
$γ_{1}$	−0.341	0.150	(−0.625, −0.033)	−0.316	0.152	(−0.636, −0.048)
$γ_{2}$	0.745	0.133	(0.480, 1.013)	0.747	0.141	(0.472, 1.017)
$γ_{3}$	0.628	0.136	(0.355, 0.902)	0.611	0.154	(0.310, 0.934)
$ϕ$	0.269	0.126	(0.036, 0.519)	0.292	0.133	(0.020, 0.551)
$σ^{2}$	0.180	0.003	(0.174, 0.186)	0.180	0.003	(0.174, 0.186)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, W.; Li, H.; Tang, A.; Cui, Z. Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data. Mathematics 2023, 11, 3469. https://doi.org/10.3390/math11163469

AMA Style

Liu W, Li H, Tang A, Cui Z. Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data. Mathematics. 2023; 11(16):3469. https://doi.org/10.3390/math11163469

Chicago/Turabian Style

Liu, Wenting, Huiqiong Li, Anmin Tang, and Zixin Cui. 2023. "Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data" Mathematics 11, no. 16: 3469. https://doi.org/10.3390/math11163469

APA Style

Liu, W., Li, H., Tang, A., & Cui, Z. (2023). Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data. Mathematics, 11(16), 3469. https://doi.org/10.3390/math11163469

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data

Abstract

1. Introduction

2. Model and Notation

3. Bayesian Estimation of Joint Model

3.1. Prior Specification

3.2. Bayesian Analysis of Joint Model

4. Simulation Studies

5. An Example

6. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Conditional Distribution of b_i

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Bayesian Joint Modeling Analysis of Longitudinal Proportional and Survival Data

Abstract

1. Introduction

2. Model and Notation

3. Bayesian Estimation of Joint Model

3.1. Prior Specification

3.2. Bayesian Analysis of Joint Model

4. Simulation Studies

5. An Example

6. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Conditional Distribution of bi

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Appendix A. Conditional Distribution of b_i