Cross-Validated Functional Generalized Partially Linear Single-Functional Index Model

Rachdi, Mustapha; Alahiane, Mohamed; Ouassou, Idir; Alahiane, Abdelaziz; Hobbad, Lahoucine

doi:10.3390/math12172649

Open AccessArticle

Cross-Validated Functional Generalized Partially Linear Single-Functional Index Model

by

Mustapha Rachdi

^1,†

,

Mohamed Alahiane

^2,*,†

,

Idir Ouassou

^2,†,

Abdelaziz Alahiane

^3,† and

Lahoucine Hobbad

^2,†

¹

Laboratory AGEIS, Grenoble Alps University, UFR SHS, BP. 47, Cedex 09, 38040 Grenoble, France

²

Complex Systems Modeling Laboratory, National School of Applied Sciences, Cadi Ayyad University, Av. Abdelkrim Khattabi, BP. 575, Marrakesh 40000, Morocco

³

SmartICT Lab, ENSAO, Mohamed Premier University, Oujda 60000, Morocco

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2024, 12(17), 2649; https://doi.org/10.3390/math12172649

Submission received: 3 July 2024 / Revised: 5 August 2024 / Accepted: 20 August 2024 / Published: 26 August 2024

(This article belongs to the Special Issue Multivariate Statistical Analysis and Application)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we have introduced a functional approach for approximating nonparametric functions and coefficients in the presence of multivariate and functional predictors. By utilizing the Fisher scoring algorithm and the cross-validation technique, we derived the necessary components that allow us to explain scalar responses, including the functional index, the nonlinear regression operator, the single-index component, and the systematic component. This approach effectively addresses the curse of dimensionality and can be applied to the analysis of multivariate and functional random variables in a separable Hilbert space. We employed an iterative Fisher scoring procedure with normalized B-splines to estimate the parameters, and both the theoretical and practical evaluations demonstrated its favorable performance. The results indicate that the nonparametric functions, the coefficients, and the regression operators can be estimated accurately, and our method exhibits strong predictive capabilities when applied to real or simulated data.

Keywords:

single-index coefficient; cross-validation; asymptotic normality; Fisher scoring algorithm; functional data analysis (FDA); functional index; quasi-likelihood

MSC:

62R10

1. Introduction

Parametric regression models are a common tool in generalized linear models (GLMs), where the relationship between the average response and covariates is explored through a chosen link function, often with a canonical or predefined form, as noted by McCullagh [1] and Nelder [2]. However, in certain cases, this approach may not be suitable due to the lack of knowledge or the presence of a more intricate link function.

In order to address this challenge, various models have been developed, including nonparametric and semiparametric regression models. However, these models are often limited in their application due to the curse of dimensionality, which poses significant challenges when dealing with high-dimensional data.

Efforts have been made to overcome this limitation by employing two key approaches: (i) the approximation of the link functions and (ii) dimension reduction techniques. One approach that has been proposed is the generalized additive model (GAM), which was introduced by Hastie [3] and extensively discussed by Wood [4]. In the GAM, the nonparametric component is represented as a sum of univariate functions, allowing for the flexible modeling of the complex relationships. However, it is important to note that a potential limitation of the GAM is its inability to explicitly account for interactions between explanatory variables. This means that the model may not fully capture the effects of interactions, which could be important in certain contexts.

The single-index model (SIM) was introduced by Härdle et al. [5] and Hristache et al. [6] to address certain cases where dimensionality reduction and relaxation of restrictive parametric assumptions are needed. The SIM approach achieves this by transforming multiple covariates into a linear combination of the covariates. Building upon the SIM, Ait-Saïdi et al. [7] investigated the functional single-index model (FSIM), which extends the single-index model usefulness framework to incorporate functional predictors.

Longitudinal data scenarios involving discrete explanatory variables in the linear component were investigated by Liang et al. [8], and Chen et al. [9] extended the framework to develop the partially linear single-index models (PLSIMs). These models allow for the modeling of discrete variables within the single-index framework.

Partially generalized linear single-index models (PGLSIMs), introduced by Carroll et al. [10], employ kernel smoothing techniques to estimate the single-index link function. PGLSIMs offer increased flexibility and modeling capabilities. In a study by Wang et al. [11], a novel approach was proposed. This approach combines penalized spline smoothing of the quasi-likelihood with the Fisher scoring mechanism, resulting in improved theoretical robustness and suitability for PGLSIM modeling.

Overall, these models, including SIM, FSIM, PLSIM, and PGLSIM, provide valuable tools for addressing complex scenarios, accommodating functional predictors and discrete variables, and effectively handling high-dimensional data.

Several models have been developed to address the complexity of functional variables in the regression analysis. However, it is important to note that the models mentioned earlier may not fully capture the intricacies of the data when some covariates are of the functional kind. Researchers have dedicated their efforts to explore functional variables in the regression models, as evidenced by the works of Ramsay et al. [12] and Ferraty et al. [13].

Furthermore, the field has seen investigations into specific models such as semifunctional partial linear regression, as studied by Aneiros et al. [14], and partially linear modeling with multifunctional covariates, as explored by Aneiros et al. [15]. Various other works have contributed to our understanding of this topic, including studies on inference by Horváth et al. [16]; the introduction to this subject by Kokoszka et al. [17]; the spline approaches by Schumaker [18]; the functional principal component analysis (FPCA) by Cao et al. [19]; the lack-of-fit testing by Li et al. [20]; and the works by Ould-Saïd et al. [21], Laksaci et al. [22], and Ouassou et al. [23,24] on the regression analysis with functional covariates.

We can refer to specific studies concerned by different aspects of the regression models. For instance, Yu et al. [25] focus on the partially functional linear single-index regression model, while Yu et al. [26] provide a comprehensive review of the penalized spline-smoothing methodology for the partially linear single-index model (PLSIM), where the sub-regression function is assumed to be a spline function with a fixed number of knots. Regarding the FDA, Rachdi et al. [27] and Alahiane et al. [28] investigate partially linear generalized single-index models for functional data (PLGSIMF) using the B-spline expansion and quasi-likelihood function. In their studies, the functional model is assumed to be linear. On the other hand, Alahiane et al. [29] explore the high-dimensional case, where the functional model is nonlinear.

The projection pursuit regression, proposed by Friedman et al. [30], Hall [31], and Huber [32], introduces an additive model that operates on derived features rather than the original inputs. This approach aims to capture complex relationships between variables by projecting them onto a lower-dimensional space. In the context of FDA, Ferraty et al. [33] extended the projection pursuit regression to the functional framework. They achieved this by incorporating the cross-validation method in the Nadaraya–Watson estimation technique and the spline functions. This generalization allows for the extension of the FDA results to the case where both the covariates and the responses are curves.

In this paper, we present a novel model called cross-validated estimations for generalized partially linear single-functional index models (CVGPLSFIM). We therefore estimate: (i) the nonlinear regression operator comprising a functional index that is chosen by the cross-validation method; (ii) the nonlinear regression operator combining a spline approximation and the one-dimensional systematic component with unknown link; and (iii) the single-index function using an iterative algorithm based on smoothing by spline functions and the maximization of the quasi-likelihood function. We also provide the convergence rates of our different estimators of the different CVGPLSFIM parameters. In fact, our contributions include examining the performance of the single and functional index, the nonparametric regression operator, the systematic component, and the optimal direction. Through numerical calculations on simulated data and then on real data, we demonstrate that our model outperforms the models mentioned throughout this introduction.

The structure of this article is as follows. In Section 2.1, Section 2 and Section 3, we present our estimation methodology, discuss the asymptotic properties of the proposed estimators, and give an iterative algorithm that maximizes the quasi-likelihood function, allowing us to compute the estimators. Section 4 presents the results of a simulation study. Furthermore, in Section 5, we apply our methodology to a real dataset consisting of curves from the chemometrics field. The technical lemmas needed to prove Theorems 1–3 are provided in Appendix A. In order to save space, the detailed proofs of the various results have been compiled in an additional and available supplementary material.

2. Estimation Methodology

2.1. Preliminary Definitions

Let Y be a scalar response variable, Z be a functional random variable that is valued in

H

separable Hilbert space endowed with inner product

〈 f, g 〉 = \int_{I} f (t) g (t) d t

. Let

(X, Z) \in R^{d} \times H

be the predictor vector where

X = (X_{1}, X_{2}, \dots, X_{d})

and d are a fixed integer. For a fixed

(x, z) \in R^{d} \times H

, we assume that the conditional density function of the response Y given

(X, Z) = (x, z)

belongs to the following canonical exponential family

f_{Y | X = x, Z = z} (y) = exp (y ξ (x, z) - B (ξ (x, z)) + C (y)),

(1)

where B and C are two known functions that are defined from

R

into

R

, and

ξ : R^{d} \times H \to R

is the parameter in the generalized linear model that is linked to the dependent variable

μ (x, z) = I E [Y | X = x, Z = z] = B^{'} (ξ (x, z)) .

(2)

where

B^{'}

denotes the first derivative of the function B.

In what follows, we modelize the scalar response Y as a cross-validated generalized partially linear functional single-index model (CVGPLFSIM) by

g (μ (X, Z)) = η_{0} (α^{⊤} X) + R (〈 β, Z 〉) + ε .

(3)

where

α = {(α_{1}, \dots, α_{d})}^{⊤} \in R^{d}

is the d-dimensional single-index coefficient vector;

η_{0}

is the unknown single-index link function (the systematic nonlinear component), which will be assumed to be sufficiently smooth; and R is the nonlinear regression operator to be estimated, where

x^{⊤}

denotes the transpose vector of x and

β \in H

is the so-called the functional index, which is such that

{∥ β ∥}^{2} = 1

and

〈 β, Z 〉

is the functional single-index.

We suggest estimators for the unknown single-index vector

α

, the unknown systematic component

η_{0} (\cdot)

, the unknown functional index

β,

and the unknown nonlinear regression operator R. Then, we will derive their asymptotic distributions, and we also provide some illustrations of these models and their performances.

Remark 1.

For identifiability purposes, we guess that ${| | α | |}_{d} = 1$ and the first component of α is non-negative, i.e., $α_{1} > 0$ , where $| | \cdot {| |}_{d}$ denotes the Euclidean norm on $R^{d}$ .
In order to identify the function $η_{0} (\cdot)$ , we define its support as $[a, b]$ , where $a = inf α^{⊤} X$ and $b = sup α^{⊤} X$ .
In the definition of the real canonical link function g, we will assume that the functional random variable $Z = {Z (t), t \in [0, 1]}$ is valued in $H$ such that:

$E [Z] = 0, E (ε | X, Z) = 0 and var (ε | X, Z) = σ^{2} .$
If the conditional variance $var (Y | X = x, Z = z) = σ^{2} V (μ (x, z))$ where $V (\cdot)$ is an unknown positive function, then the estimation of the mean function $g (μ)$ may be obtained by replacing the log-likelihood $f_{Y | X = x, Z = z}$ given by (1) by the quasi-likelihood $Q (u, v)$ , which is given for any real numbers u and v by

$\frac{\partial Q (u, v)}{\partial u} = \frac{v - u}{σ^{2} V (u)} = \frac{v - u}{var (Y | X = x, Z = z)} .$

2.2. Methodology

Let

(X_{i}, Y_{i}, Z_{i})

for

i = 1, \dots, n

, be an independent and identically distributed (i.i.d.) n-sample of

(X, Y, Z)

and, for each

i = 1, \dots, n

,

g (μ (X_{i}, Z_{i})) = η_{0} (α^{⊤} X_{i}) + R (〈 β, Z_{i} 〉) + ε_{i} .

(4)

Let

{B_{j} (u), j = 1, \dots, N_{n}}

the B-spline basis functions of order r;

h = \frac{(b - a)}{J_{n} + 1}

is the distance between the neighbors knots, where

J_{n}

denote the number of interior knots in

[a, b]

, as in Wang et al. [11], Rachdi et al. [27], Alahiane et al. [28] and Alahiane et al. [29].

Let

S_{n}

be the space of polynomial splines on

[a, b]

of order

r \geq 1

,

v \in N^{*}

and

e \in (0, 1]

such that

p = v + e > 1.5

. We denote by

H (p)

the collection of functions g that are defined on

[a, b]

whose v-th order derivative,

g^{(v)}

, exists and satisfies the following e-th order Lipschitz condition

|g^{(v)} (m_{1}) - g^{(v)} (m_{2})| \leq C {|m_{1} - m_{2}|}^{e}, for all a \leq m_{1}, m_{2} \leq b .

Using the method by De Boor [34], we can approximate

η,

which is assumed in

H (p)

, by the function

\tilde{η} \in S_{n} .

So, we can write

\tilde{η} (u) = {\tilde{γ}}^{⊤} B (u)

, where

B (u)

is the spline basis and

\tilde{γ} \in R^{N_{n}}

is the spline coefficient vector.

We introduce a new knot sequence

t_{0} < t_{1} < \dots < t_{k + 1}

in the range of R. Then, there exists

N^{'} = k + r + 1

functions in the B-splines basis, which are normalized and of order r, such that

g (\cdot) \approx δ^{⊤} B_{1} (.) where B_{1} (\cdot) = {(B_{11} (\cdot), B_{12} (\cdot), \dots, B_{1 N^{'}} (\cdot))}^{⊤}

and

δ \in R^{N^{'}} .

Using the setting

W_{i} = 〈 β, Z_{i} 〉,

the mean function estimator

\hat{m} (x, z)

is therefore obtained by assessing the parameter

θ = {(α^{⊤}, γ^{⊤}, δ^{⊤})}^{⊤}

and inverting the subsequent equation

g (μ (X_{i}, Z_{i})) = {\hat{γ}}^{⊤} B ({\hat{α}}^{⊤} x) + {\hat{δ}}^{⊤} B_{1} (W_{i}) .

We may notice that the parameter

θ = {(α^{⊤}, γ^{⊤}, δ^{⊤})}^{⊤}

is deduced by maximizing the following requirement of quasi-likelihood

\begin{matrix} \hat{θ} = {({\hat{α}}^{⊤}, {\hat{γ}}^{⊤}, {\hat{δ}}^{⊤})}^{⊤} = \underset{θ = (α, γ, δ) \in R^{d} \times R^{N_{n}} \times R^{N^{'}}}{arg max} L (θ), \end{matrix}

(5)

where

L (θ) : = L (α, γ, δ) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} (m_{i}), Y_{i})

and

m_{i} : = γ^{⊤} B (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i})

.

In order to overcome the constraints

∥ α ∥ = 1

and

α_{1} > 0

that are imposed on the d-dimensional index

α

, we adopt a reparametrization approach that is inspired by the methodology employed by Yu [26]

α (τ) = {(\sqrt{1 - {∥ τ ∥}^{2}}, τ^{⊤})}^{⊤} for τ \in R^{d - 1} .

The true value

τ_{0}

of

τ

must be

∥τ_{0}∥ \leq 1

. Subsequently, we assume that

∥τ_{0}∥ < 1

.

The Jacobian matrix of

α \to α (τ)

of dimension

d \times (d - 1)

is

J (τ) = (\begin{matrix} - \frac{1}{\sqrt{1 - {∥ τ ∥}^{2}}} τ^{⊤} \\ I_{(d - 1) \times (d - 1)} \end{matrix})

Denote

l = 1, 2

,

q_{l} (m, y) = \frac{\partial^{l}}{\partial m^{l}} Q (m, y)

and

ρ_{l} = \frac{1}{σ^{2} V (g^{- 1} (m))} {[\frac{d}{d m} (g^{- 1} (m))]}^{l} .

So,

q_{1} (m, y) = (y - m) ρ_{1} (m) and q_{2} (m, y) = (y - m) ρ_{1}^{'} (m) - ρ_{2} (m) .

The score vector is

S (θ_{τ}) = \frac{\partial l}{\partial θ_{τ}} (θ_{τ}) = \frac{1}{n} \sum_{i = 1}^{n} q_{1} (m_{i}, Y_{i}) ξ_{i} (τ, γ, δ),

where

ξ_{i} (τ, γ, δ) = (\begin{matrix} γ^{⊤} B^{'} (α^{⊤} (τ) X_{i}) J^{⊤} (τ) X_{i} \\ B (α^{⊤} (τ) X_{i}) \\ B_{1} (W_{i}) \end{matrix}) .

The conditional expectation of the Hessian matrix given X and Z is

H (θ_{τ}) = - \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{i}) ξ_{i} (τ, γ, δ) ξ_{i}^{⊤} (τ, γ, δ) .

The Fisher scoring update equations

θ_{τ}^{(k + 1)} = θ_{τ}^{(k)} - {[H (θ_{τ}^{(k)})]}^{- 1} S (θ_{τ}^{(k)})

becomes

\begin{matrix} θ_{τ}^{(k + 1)} & = & θ_{τ}^{(k)} + {[\sum_{i = 1}^{n} ρ_{2} (m_{i}^{(k)}) ξ_{i} (τ^{(k)}, γ^{(k)}, δ^{(k)}) ξ_{i}^{⊤} (τ^{(k)}, γ^{(k)}, δ^{(k)})]}^{- 1} \\ \times [\sum_{i = 1}^{n} (Y_{i} - μ_{i}^{(k)}) ρ_{1} (m_{i}^{(k)}) ξ_{i} (τ^{(k)}, γ^{(k)}, δ^{(k)})], \end{matrix}

where

m_{i}^{(k)} = γ^{(k) ⊤} B (α^{(k) ⊤} (τ^{(k)}) X_{i}) + δ^{(k) ⊤} B_{1} (W_{i})

and

μ_{i}^{(k)} = g^{- 1} (m_{i}^{(k)})

, for

1 \leq i \leq n

.

So, we obtain

\begin{matrix} \hat{η} (t) & = & {\hat{γ}}^{⊤} B (t) = γ^{(k) ⊤} B (t), \\ {\hat{m}}_{i} & = & {\hat{γ}}^{⊤} B (α^{⊤} (\hat{τ}) X_{i}) + {\hat{δ}}^{⊤} B_{1} (W_{i}) = γ^{(k) ⊤} B (α^{⊤} (τ^{k})) X_{i} + δ^{(k) ⊤} B_{1} (W_{i}), \\ \hat{R} (Z_{i}) & = & {\hat{δ}}^{⊤} B_{1} (W_{i}) = δ^{(k) ⊤} B_{1} (W_{i}), \end{matrix}

where

\hat{α} = α (τ^{(k)})

is the estimator of the single-index coefficient vector

α

and

{\hat{μ}}_{i} = g^{- 1} ({\hat{m}}_{i})

.

Through the utilization of this procedure, we will employ the cross-validation method (see the Section 2.3) to determine the optimal direction

β (\cdot)

. Additionally, we will estimate in Section 2.4 the various components of our CVGPLSFIM model using a plug-in approach.

2.3. Cross-Validated Estimation of the Functional Index

We use the cross-validation principle based on the leave-one-out statistical sample

{(X_{j}, Z_{j}), j = 1, \dots, n, j \neq i}

to estimate the functional index

β

by:

\hat{β} = \underset{β \in Ξ}{arg min} C V (β),

where

C V (β) = \frac{1}{n} \sum_{i = 1}^{n} {(Y_{i} - g^{- 1} ({\hat{m}}_{i}^{(- i)}))}^{2} I_{Z_{i} \in G},

(6)

with

{\hat{m}}_{i}^{(- i)} = {\hat{γ}}^{⊤} B (α^{⊤} (\hat{τ}) X_{i}^{(- i)}) + {\hat{δ}}^{⊤} B_{1} (W_{i}^{(- i)})

;

G

is a subset of

H

introduced for usual technical boundedness reasons, and

Ξ = Ξ_{n}

with

Ξ \subset H ⋂ \{{β : ∥ β ∥}^{2} = 1\}

is constructed in a similar way as in Ait-Saïdi et al. [7]. Specifically:

Each direction $β \in Ξ_{n}$ is obtained from an $l_{n}$ -dimensional space generated by the B-spline basis functions $\{e_{1}, \dots, e_{l_{n}}\}$ . Therefore, we focus on directions

$β (.) = \sum_{k = 1}^{l_{n}} λ_{k} e_{k}, where (λ_{1}, \dots, λ_{l_{n}}) \in V .$

(7)
The set of coefficients vectors $V$ in (7) is obtained by the following procedure:
–
Step 1: For each $(b_{1}, \dots, b_{l_{n}}) \in C^{l_{n}}$ , where $C = \{c_{1}, \dots, c_{K}\} \subset R^{K}$ denotes a set of K “seed-coefficients”, we construct the initial functional direction $β_{init} (\cdot) = \sum_{k = 1}^{l_{n}} b_{k} e_{k} (\cdot) .$
–
Step 2: For each $β_{init}$ selected in Step 1 that verifies the condition $β_{init} (t_{0}) > 0$ , we construct $(λ_{1}, \dots, λ_{l_{n}}) = (b_{1}, \dots, b_{l_{n}}) / {〈 β_{init}, β_{init} 〉}^{1 / 2} .$
–
Step 3: Construct $V$ as the set of vectors $(b_{1}, \dots, b_{l_{n}})$ obtained in Step 2. Therefore, the final set of eligible functional direction is

$Ξ_{n} = \{β (\cdot) = \sum_{k = 1}^{l_{n}} λ_{k} e_{k} (\cdot) such that (λ_{1}, \dots, λ_{l_{n}}) \in V\} .$

Using the method by De Boor [34], if

β_{0}

is sufficiently smooth, then it is well approximated by some function in the

l_{n}

-dimensional space, which is generated by the B-spline basis. From the construction (see Step 2), each

β \in Ξ_{n}

satisfies

〈 β, β 〉 = 1

and

β (t_{0}) > 0

. So, the identifiability of the CVGPLFSIM model is guaranteed. Like in Ait-Saïdi et al. [7], we consider the cubic B-spline functions and

V = \{- 1, 0, 1\}

.

2.4. The CVGPLFSIM Model

By plug-in, first, the functional index

\hat{β}

in the model is

g (μ (X_{i}, Z_{i})) = η (α^{⊤} X_{i}) + R (W_{i}) for i = 1, \dots, n,

(8)

where

W_{i} = 〈 \hat{β}, Z_{i} 〉

denotes the functional index component. We seek a function

η \in S_{n}

along with a value of

α

that minimizes the following quasi-likelihood function

L (η, α) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{η (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}) .

(9)

By denoting

θ = (α^{⊤}, γ^{⊤}, δ^{⊤})

, the maximization problem (9) is equivalent to find a value

θ

maximizing

l (θ) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{γ^{⊤} B (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}),

(10)

where

g (μ (X_{i}, Z_{i})) = γ^{⊤} B (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i}), for i = 1, \dots, n .

(11)

The mean function estimator

\hat{μ}

is given by the evaluation of the parameters

\hat{θ} = {({\hat{α}}^{⊤}, {\hat{γ}}^{⊤}, {\hat{δ}}^{⊤})}^{⊤}

and inverting Equation (11). In fact,

\hat{θ} = {({\hat{α}}^{⊤}, {\hat{γ}}^{⊤}, {\hat{δ}}^{⊤})}^{⊤}

is determined by maximizing the following quasi-likelihood

\hat{θ} = {({\hat{α}}^{⊤}, {\hat{γ}}^{⊤}, {\hat{δ}}^{⊤})}^{⊤} = \underset{θ = {(α^{⊤}, γ^{⊤}, δ^{⊤})}^{⊤} \in R^{d} \times R^{N_{n}} \times R^{N^{'}}}{arg max} l (θ),

(12)

where

l (θ) : = l (α, γ, δ) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{γ^{⊤} B (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{m_{i}\}, Y_{i}),

with

m_{i} = γ^{⊤} B (U_{i}) + δ^{⊤} B_{1} (W_{i}), where U_{i} = α^{⊤} X_{i}

,

α_{0}, γ_{0}, δ_{0}

, and

η_{0} (\cdot)

denote the true values, respectively, of

α, γ, δ

, and

η (\cdot)

. So, the spline estimator of

η_{0} (\cdot)

is

\hat{η} (\cdot) = {\hat{γ}}^{⊤} B (\cdot)

.

Let

R (τ) = (\begin{matrix} J (τ) & 0 \\ 0 & I_{N^{'} \times N^{'}} \end{matrix})

be the Jacobian matrix of

{(α^{⊤} (τ), δ^{⊤})}^{⊤}

and

(\tilde{α}, \tilde{δ}) = \underset{{∥ α ∥}_{d} = 1, δ \in R^{N^{'}}}{arg max} \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{\tilde{η} (α^{⊤} X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}) .

(13)

Then,

(\tilde{τ}, \tilde{δ}) = \underset{τ \in R^{d - 1}, δ \in R^{N^{'}}}{arg max} \tilde{l} (τ, δ),

(14)

where

\tilde{l} (τ, δ) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{\tilde{η} (α^{⊤} (τ) X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i})

and

η (.)

was replaced by

\tilde{η} (.) .

We define

{\tilde{θ}}_{τ} = {({\tilde{τ}}^{⊤}, {\tilde{γ}}^{⊤}, {\tilde{δ}}^{⊤})}^{⊤}

such that

{({\tilde{τ}}^{⊤}, {\tilde{γ}}^{⊤}, {\tilde{δ}}^{⊤})}^{⊤} = \underset{τ \in R^{d - 1}, γ \in R^{N_{n}}, δ \in R^{N^{'}}}{arg max} \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{γ^{⊤} B (α^{⊤} (τ) X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}) .

(15)

Then,

l (θ_{τ})

becomes

l (θ_{τ}) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{γ^{⊤} B (α^{⊤} (τ) X_{i}) + δ^{⊤} B_{1} (W_{i})\}, Y_{i}) = \frac{1}{n} \sum_{i = 1}^{n} Q (g^{- 1} \{m_{i}\}, Y_{i}) .

The score vector is given by

\begin{matrix} S (θ_{τ}) = \frac{\partial l}{\partial θ_{τ}} (θ) |_{θ = θ_{τ}} = \frac{1}{n} \sum_{i = 1}^{n} q_{1} (m_{i}, Y_{i}) ξ_{i} (τ, γ, δ), \end{matrix}

(16)

where

ξ_{i} (τ, γ, δ) = (\begin{matrix} γ^{⊤} B^{'} (α^{⊤} (τ) X_{i}) J^{⊤} (τ) X_{i} \\ B (α^{⊤} (τ) X_{i}) \\ B_{1} (W_{i}) \end{matrix}) .

Then, the Hessian matrix of the quasi-likelihood function is

H (θ_{τ}) = \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{i}) ξ_{i} (τ, γ, δ) ξ_{i}^{⊤} (τ, γ, δ) .

We have

{\tilde{θ}}_{τ} = {({\tilde{τ}}^{⊤}, {\tilde{γ}}^{⊤}, {\tilde{δ}}^{⊤})}^{⊤} = \underset{θ_{τ} = (τ, γ, δ) \in R^{d - 1} \times R^{N} \times R^{N^{'}}}{arg max} l (θ_{τ}) .

Then, the Fisher scoring update equations become

\begin{matrix} θ_{τ}^{(k + 1)} & = & θ_{τ}^{(k)} + {[H (θ_{τ}^{(k)})]}^{- 1} S (θ_{τ}^{(k)}) \\ = & θ_{τ}^{(k)} + {[\sum_{i = 1}^{n} ρ_{2} (m_{i}^{(k)}) ξ_{i} (τ^{(k)}, γ^{(k)}, δ^{(k)}) ξ_{i}^{⊤} (τ^{(k)}, γ^{(k)}, δ^{(k)})]}^{- 1} \\ \times [\sum_{i = 1}^{n} (Y_{i} - μ_{i}^{(k)}) ρ_{1} (m_{i}^{(k)}) ξ_{i} (τ^{(k)}, γ^{(k)}, δ^{(k)})], \end{matrix}

(17)

where for

1 \leq i \leq n,

\begin{matrix} m_{i}^{(k)} & = & γ^{(k) ⊤} B (α^{(k) ⊤} (τ^{(k)}) X_{i}) + δ^{(k) ⊤} B_{1} (W_{i}), \\ μ_{i}^{(k)} & = & g^{- 1} (m_{i}^{(k)}), \\ \hat{η} (t) & = & {\hat{γ}}^{⊤} B (t) \approx γ^{k) ⊤} B (t) = \sum_{j = 1}^{N_{n}} γ_{j}^{(k)} B_{j} (t), \hat{R} (\cdot) = \sum_{j = 1}^{N^{'}} δ_{j}^{(k)} B_{1, j} (.), \\ {\hat{m}}_{i} & = & {\hat{γ}}^{⊤} B (α^{⊤} (\hat{τ}) X_{i}) + δ^{(k) ⊤} B_{1} (W_{i}) \approx \sum_{j = 1}^{N_{n}} γ_{j}^{(k)} B_{j} (α^{⊤} {(τ^{k})}^{} X_{i}) + \sum_{j = 1}^{N^{'}} δ_{j}^{(k)} B_{1, j} (W_{i}) . \end{matrix}

Then,

{\hat{μ}}_{i} = g^{- 1} ({\hat{m}}_{i})

and

\hat{α} = α (τ^{(k)})

is the estimator of the single-index coefficient vector of the CVGPLSFIM model. The statistic

\hat{β}

is the estimated functional single-index, and

\hat{R}

is the estimated nonlinear regression operator R obtained by the CVGPLSFIM model.

3. Main Asymptotic Properties

In this section, we establish the asymptotic properties of the estimators of (i) the nonparametric components, (ii) the parametric components, (iii) the unique index, (iv) the nonlinear regression operator, and (v) the convergence of the estimators of the univariate components. These properties are established under a set of specific assumptions.

3.1. Assumptions

Let

φ

,

φ_{1}

, and

φ_{2}

be measurable functions on

[a, b]

. We define the empirical inner product and its corresponding norm as follows:

{〈φ_{1}, φ_{2}〉}_{n} = \frac{1}{n} \sum_{i = 1}^{n} φ_{1} (U_{i}) φ_{2} (U_{i}) and {∥ φ ∥}_{n}^{2} = \frac{1}{n} \sum_{i = 1}^{n} φ^{2} (U_{i}), where U_{i} = α^{⊤} X_{i} .

If

φ

,

φ_{1}

, and

φ_{2}

are

L^{2}

-integrable, we define the theoretical inner product and its corresponding norm as follows:

(φ_{1}, φ_{2}〉 = E [φ_{1} (U) φ_{2} (U)] and {∥ φ ∥}_{2}^{2} = E [φ^{2} (U)] = \int_{a}^{b} φ^{2} (u) f (u) d u .

Let

ε = Y - g^{- 1} (m_{0} (T))

, where

T = {(X^{⊤}, {\bar{W}}^{⊤})}^{⊤} and \bar{W} = B_{1} (W)

. We assume that

(C1): The single-index link function $η_{0} (\cdot) \in H (p),$ where $H (p)$ is defined as above.
(C2): For all $m \in R$ and for all y in the range of the response variable Y, we have for $k = 1, 2$ that

$q_{2} (m, y) < 0 and c_{q} < |q_{2}^{k} (m, y)| < C_{q},$

for some positive constants $c_{q}$ and $C_{q} .$
(C3): The $ν$ -th order partial derivative of the joint density function of X satisfies the Lipschitz condition of order $κ (κ \in (0, 1]) .$
The marginal density function of $α^{⊤} X$ is continuous and bounded away from zero and supported within $[a, b]$ .
(C4): For any vector $τ$ , there exists positive constants $c_{τ}$ and $C_{τ}$ such that

$c_{τ} I_{t \times t} \leq E [(\begin{matrix} 1 \\ T \end{matrix}) {(\begin{matrix} 1 \\ T \end{matrix})}^{⊤} | α^{⊤} (τ) X = α^{⊤} (τ) x] \leq C_{τ} I_{t \times t},$

where $t = 1 + d + N_{n} + N^{'}$ and $T = {(X^{⊤}, {\bar{W}}^{⊤})}^{⊤} .$
(C5): The number $N_{n}$ of knots satisfies $n^{\frac{1}{2 (p + 1)}} ≪ N_{n} ≪ n^{\frac{1}{8}} (p > 3) .$
(C6): The fourth-order moment of the random variable Z is finite, i.e., ${E ∥ Z (.) ∥}^{4} \leq C$ , where C denotes a generic positive constant.
(C7): The covariance function $K (t, s) = Cov (Z (t), Z (s))$ is positive definite.
(C8): For some finite positive constants $C_{ρ}$ , $C_{ρ}^{*}$ , and $M_{0}$ ,

$|ρ_{1} (m_{0})| \leq C_{ρ} and |ρ_{1} (m) - ρ_{1} (m_{0})| \leq C_{ρ}^{*} |m - m_{0}| for all |m - m_{0}| \leq M_{0} .$
(C9): For some finite positive constants $C_{g}$ , $C_{g}^{*}$ , and $M_{1}$ , the link function g in the model (3) satisfies:
$|\frac{d}{d m} g (m) |_{m = m_{0}}| \leq C_{g}$ and, for all $|m - m_{0}| \leq M_{1}$ ,

$|\frac{d}{d m} g^{- 1} (m) - \frac{d}{d m} g^{- 1} (m) |_{m = m_{0}}| \leq C_{g}^{*} |m - m_{0}| .$
(C10): There exists a positive constant $C_{0}$ such that $E (ε^{2} | U_{τ, 0}) \leq C_{0} .$
(C11): We assume that all the random variables $〈 β, Z 〉$ for all $β \in H$ have values on a set $C$ , where $C$ is a compact subset of $R .$
(C12): The nonlinear regression operator $R (\cdot) \in H (p) .$

Comments on the Assumptions: The smoothness condition in (C1) describes that the single-index function

η_{0} (\cdot)

can be approximated by functions in the B-spline space with a normalized basis. On the other hand, condition (C2) ensures the uniqueness of the solution, where condition (C3) is a smoothness condition on the joint and marginal density functions of

α^{⊤} X

and X. Condition (C5) allows us to obtain the rate of growth of the dimension of the spline spaces relative to the sample size. Conditions (C6) and (C7) are required for the covariate function Z, whereas conditions (C4) and (C8)–(C10) are technical hypotheses that will be needed for the results’s proofs. Conditions (C11) and (C12) are smoothness conditions of the nonlinear operator regression R.

3.2. The Consistency Study

3.2.1. Technical Lemmas

In this subsection, we present the needed lemmas to prove Theorems 2 and 3, for which the proofs take into account the behaviors of all the components involved in the model (3).

Lemma 1.

Under assumptions (C1)–(C4) and (C6)–(C8), we have

\sqrt{n} (\begin{matrix} \tilde{τ} - τ_{0} \\ \tilde{δ} - δ_{0} \end{matrix}) \overset{D}{⟶} N (0, A^{- 1} Σ_{1} A^{- 1}),

where

Σ_{1}

and A will be defined below in the Appendix for more details; the symbol

\overset{D}{⟶}

denotes the convergence in distribution and

\bar{W} = B_{1} (W)

,

A = (\begin{matrix} A_{11} & A_{12} \\ A_{12}^{⊤} & A_{22} \end{matrix})

with

\begin{matrix} A_{11} & = & I E [ρ_{2} (m_{0} (T)) {η_{0}^{'} (U_{τ, 0})}^{2} J^{⊤} (τ_{0}) X {\bar{W}}^{⊤} J (τ_{0})], \\ A_{22} & = & I E [ρ_{2} (m_{0} (T)) \bar{W} {\bar{W}}^{⊤}], \\ A_{12} & = & I E [q_{1}^{2} (m_{0} (T), Y) . (\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) X \\ \bar{W} \end{matrix}) {(\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) X \\ \bar{W} \end{matrix})}^{⊤}] \end{matrix}

By applying the

δ

-method, we obtain the following lemma.

Lemma 2.

Under assumptions (C1)–(C4) and (C6)–(C8), we have

\sqrt{n} (\begin{matrix} α (\tilde{τ}) - α (τ_{0}) \\ \tilde{δ} - δ_{0} \end{matrix}) \overset{D}{⟶} N (0, R (τ_{0}) A^{- 1} Σ_{1} A^{- 1} R^{⊤} (τ_{0})),

where

R (τ) = (\begin{matrix} J (τ) & 0 \\ 0 & I_{N^{'} \times N^{'}} \end{matrix}),

and

Σ_{1} = I E [q_{1}^{2} (m_{0} (T)) (\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) X \\ \bar{W} \end{matrix}) {(\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) X \\ \bar{W} \end{matrix})}^{⊤}] .

Furthermore,

α (\tilde{τ}) - α (τ_{0}) = o_{P} (\frac{1}{\sqrt{n}})

and

\tilde{δ} - δ_{0} = o_{P} (\frac{1}{\sqrt{n}})

.

Lemma 3.

Under assumptions (C1)–(C5), we have

∥ \hat{θ} - \tilde{θ} ∥ = O_{P} \{\sqrt{N_{n}} (h^{p} + \frac{1}{\sqrt{n h}})\} .

(18)

where

N_{n}

is the number of B-splines basis functions of order r.

The proofs of the previous results are supported by the following lemmas.

3.2.2. Convergence of the Estimated Univariate Components

So, for the nonlinear regression operator R behavior, we have the following theorem:

Theorem 1.

Under assumptions (C1)–(C8) and (C11)–(C12), we have

∥ \hat{R} {- R ∥}_{2} = O_{P} \{\sqrt{N_{n}} (\frac{1}{\sqrt{n} h} + h^{p})\} .

The proof of Theorem 1 will be given in the Supplementary Materials.

3.2.3. Estimation of the Systematic Component Function

Theorem 2.

Under assumptions (C1)–(C7), we have

{∥\hat{η} - η_{0}∥}_{2} = O_{P} \{\sqrt{N_{n}} (\frac{1}{\sqrt{n} h} + h^{p})\} .

and

{∥\hat{η} - η_{0}∥}_{n} = O_{P} \{\sqrt{N_{n}} (\frac{1}{\sqrt{n} h} + h^{p})\} .

The proof of Theorem 2 will be given in the Supplementary Materials. This proof takes into account the components of our model, which makes it different from the results obtained by Alahiane et al. [28,29].

3.2.4. Estimation of the Parametric Components

The next theorem shows that the maximum quasi-likelihood estimator is root-n-consistent and is asymptotically normal, although the convergence rate of the nonparametric component

\hat{η}

is slower than root-n. Before enouncing the theorem, let us denote

\begin{matrix} Υ (u_{τ, 0}) = \frac{E [X ρ_{2} (m_{0} (T)) | U_{τ, 0} = u_{τ, 0}]}{E [ρ_{2} (m_{0} (T)) | U_{τ, 0} = u_{τ, 0}]} & , & Γ (u_{τ, 0}) = \frac{E [W ρ_{2} (m_{0} (T)) | U_{τ, 0} = u_{τ, 0}]}{E [ρ_{2} (m_{0} (T)) | U_{τ, 0} = u_{τ, 0}]}, \\ Φ (x) = Φ (U_{τ, 0}, x) = x - Υ (u_{τ, 0}) & and & Ψ (w) = Ψ (U_{τ, 0}, w) = w - Γ (u_{τ, 0}) . \end{matrix}

Theorem 3.

Under assumptions (C1)–(C10), the constrained quasi-likelihood estimators

\hat{α}

and

\hat{δ}

with

∥ \hat{α} ∥_{d} = 1

are jointly asymptotically normally distributed, i.e.,

\sqrt{n} (\begin{matrix} \hat{α} - α_{0} \\ \hat{δ} - δ_{0} \end{matrix}) \overset{D}{⟶} N (0, R (τ_{0}) D^{- 1} R^{⊤} (τ_{0})),

where

\overset{D}{\to}

denotes the convergence in distribution,

D = E [ρ_{2} (m_{0} (T)) (\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) Φ (X) \\ Ψ (\bar{W}) \end{matrix}) {(\begin{matrix} η_{0}^{'} (U_{τ, 0}) J^{⊤} (τ_{0}) Φ (X) \\ Ψ (\bar{W}) \end{matrix})}^{⊤}],

and

α (\hat{τ}) - α (τ_{0}) = O_{P} (\frac{1}{\sqrt{n}}), \hat{δ} - δ_{0} = O_{P} (\frac{1}{\sqrt{n}}) .

where

O_{P}

denotes the Bachmann–Landau notation “in probability”.

Proof of Theorem 3.

The proof of Theorem 3 is given in the Supplementary Materials. This proof takes into account all the components involved in the model (3), which makes it different from the results obtained by Alahiane et al. [28,29]. □

4. A Simulation Study

We aim to show the performance of various estimators of the parameters

τ

,

γ

,

δ

, the nonparametric function

η

, the functional index

β

, and the nonlinear regression operator R of the model (3) through numerical simulations under both the Gaussian and the logistic cases. The conditional density of Y given

X = x, Z = z

is described by Equation (1).

We believe that the model is given by the following equation:

g (μ (X_{i}, Z_{i})) = sin \{\frac{π (α^{⊤} X_{i} - A)}{B - A}\} + R (〈 β, Z_{i} 〉) + ε_{i}, for i = 1, \dots, n .

(19)

The responses

Y_{i}

are simulated according to the Equation (19);

X_{i}

are taken uniformly over the interval

[- 0.5, 0.5]

, whereas the errors

ε_{i} \sim N (0, 0.025)

. Moreover, we take the following coefficients:

α = \frac{1}{\sqrt{3}} {(1, 1, 1)}^{⊤}, A = \frac{\sqrt{3}}{2} - \frac{1.645}{\sqrt{12}} and B = \frac{\sqrt{3}}{2} + \frac{1.645}{\sqrt{12}} .

The functional real variable

Z_{i} (\cdot)

is taken as

Z (t) = a cos (2 π t) + b sin (4 π t) + 2 c (t - 0.25) (t - 0.5), t \in [- 1, 1]

, where

a \sim U (0, 1)

,

b \sim U (0, 1)

and

c \sim U (0, 1)

. A set of 1000 independent curves is generated according to the following model:

Z_{i} (t) = a_{i} cos (2 π t) + b_{i} sin (4 π t) + 2 c_{i} (t - 0.25) (t - 0.5), t \in [- 1, 1]

, where

a_{i}

,

b_{i}

, and

c_{i}

are uniformly distributed on

[0, 1]

, respectively. Curves are discretized over a very fine mesh of one thousand equispaced points; the set of curves is stored in the matrix

Z = [Z_{i} (t_{j})],

i = 1, \dots, 1000, j = 1, \dots, 1000

. A random selection of 30 of these functional data is plotted in Figure 1.

We consider the functional index

β (t) = \frac{1}{\sqrt{2}} [sin (\frac{3}{2} π t) + sin (\frac{1}{2} π t)]

and the regression operator

R (u) = {\{\frac{1}{\sqrt{2}} \int_{- 1}^{1} [sin (\frac{3}{2} π t) + sin (\frac{1}{2} π t)] Z (t) d t\}}^{3}

.

We set directions, and we use the first to the fourth eigenfunctions of the variance operator of Z; note that the cumulative explained variances of the corresponding linear principal components are

71.3 %

,

87.5 %

,

98.8 %

, and

99.6 %

.

In the first step of our algorithm, we base our simulation experiments on samples of 300 couples

(Z_{i}, Y_{i})

that we randomly extracted; we use the first 200 couples as the training set and the remaining 100 couples as the test set. We do not use all the columns of

Z

, only a selection. At each step, we use cubic splines, and the number of knots is set to six.

Concerning the nonparametric functional estimator, the semimetric used is the standard distance

L_{2}

between the curves.

In the second step, the nodes are selected according to the formula

C n^{\frac{1}{2 r}} log (n),

where

C \in [0.3, 1]

(see [11]).

We choose

C = 0.6

, and we perform replications of 3000 samples of sizes

n = 500

.

Through the plug-in process (the second step), we estimate the model parameters (8) using the GPLFSIM algorithm as described previously.

g (μ (X_{i}, Z_{i})) = η (α^{⊤} X_{i}) + R (〈 \hat{β}, Z_{i} 〉) for i = 1, \dots, n .

(20)

Then, the computed bias, the standard deviation (SD), and the mean squared error (MSE) with respect to the parameter

τ

, the parameter

γ

, and the parameter

δ

are summarized in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7.

We present below in Figure 2 and Figure 3 the functional index and nonlinear regression operator obtained by the GPLFSIM algorithm in both cases: the Gaussian and the logistic cases. We chose the two distributions (Gaussian and logistic) because of their popularity and because of the use of activation functions (sigmoid functions) in neural networks, especially when the input data are high-dimensional.

In order to compute the bias, SD, and MSE, we recorded 3000 replications of the CVGPLFSIM algorithm in the Gaussian case and in the logistic case with

n = 500

as follows (see Table 1, Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7).

It is obvious to remark that the quality of the estimators is illustrated through these simulations, as the method works quite well. The bias, SD, and MSE are generally reasonably low. The parametric and the nonparametric components, the single-index

α

, the functional index

β (\cdot)

, and the nonlinear regression operator R of Y over

X, Z

are calculated by the procedure described above. The two tables therefore indicate the consistency of

\hat{α}

and

\hat{δ}

as the bias, SD, and MSE decrease as the sample size increases.

We have developed our algorithm in both cases, the identity link function and the logistic link function. The simulations show that the CVPGPLSFIM algorithm works well in both cases. We present below in Figure 4 the single index estimated by the model in both cases, the Gaussian and logistic cases.

We observe that the single-index estimated by our model fits well with the single-index.

We present below in Figure 5 the systematic component

η

estimated by the model in both cases: the Gaussian and the logistic cases.

We consider the root of the averaged squared error criterion (see [35]) in both the Gaussian and the logistic cases:

{RASE}_{1} = {(\frac{1}{n} \sum_{i = 1}^{n} {(\hat{η} (u_{i}) - η (u_{i}))}^{2})}^{\frac{1}{2}} and {RASE}_{2} = {(\frac{1}{n} \sum_{i = 1}^{n} {(\hat{R} (u_{i}) - R (u_{i}))}^{2})}^{\frac{1}{2}}

Table 8 and Table 9 summarize the samples means, medians, and variances of the

{RASE}_{1}

and

{RASE}_{2}

with different sample sizes in both the Gaussian and logistic cases.

We conclude that as the sample size n increases from 500 to

1000,

the sample mean, median and variance of

{RASE}_{1}

decreases.

We conclude that when the sample size n increases from 500 to

1000,

the sample mean, the median, and the variance of

{RASE}_{2}

decreases.

5. Application for Tecator Data

In this section, we employ the CVGPLFSIM model to analyze the Tecator data, which is a well-known dataset in the field of FDA. The dataset can be obtained from the following link: http://lib.stat.cmu.edu/datasets/tecator (accessed on 1 March 2024). It consists of 215 finely chopped meat samples, each associated with its corresponding fat content (

Y_{i}

for

i = 1, \dots, 215

), near-infrared absorbance spectra (

Z_{i}

for

i = 1, \dots, 215

) measured at 100 wavelengths ranging from 850 to 1050 nm, as well as the protein content

X_{1, i}

and moisture content

X_{2, i}

of the meat samples. For more comprehensive information and insights, we refer readers to Ferraty et al. [13]. We aim to forecast the fat content of the finely chopped meat samples. Figure 6 shows a sample of the absorbance curves.

In order to evaluate the effectiveness of the model (3), we employ a random splitting of the sample into two subsets: a training subset, denoted as

I_{1}

, consisting of 160 observations, and a test subset, denoted as

I_{2}

, consisting of 55 observations. The purpose of the training subset is to estimate the model parameters, while the test subset is used to assess the accuracy of the predictors. We utilize the mean square error of prediction (MSEP), as defined in Aneiros et al. [14] and given by

MSEP = \frac{1}{55} \sum_{i \in I_{2}} {(Y_{i} - {\hat{Y}}_{i})}^{2} / {var}_{I_{2}} (Y_{i}) .

where

{\hat{Y}}_{i}

represents the predicted value based on the training subset and

{var}_{I_{2}} (Y_{i})

denotes the variance of the response variables in the test subset. This indicator allows us to assess the accuracy of our predictions with respect to the variability in the test dataset.

The performance comparison of the CVGPLFSIM model with other models is presented in Table 10 and Table 11. Based on the obtained results, we can infer that the CVGPLFSIM model demonstrates competitiveness and effectiveness in analyzing the given dataset.

Table 10 and Table 11 show the performance of the CVGPLFSIM model by comparing it with other models. The CVGPLFSIM model is a competitive one for such data.

Moreover, Figure 7 shows the nonparametric estimator of the function

η

, in both the Gaussian and the logistic cases.

Figure 8 and Figure 9 show the estimator functional index

\hat{β} (\cdot)

and the estimator of the nonlinear regression operator

\hat{R}

.

Figure 10 illustrates the difference between the fat content and its estimation from the model for both the Gaussian and the logistic cases.

We can see that our model fits well the content of fatness “215 pieces of meat”.

Possible extensions to our manuscript include hypothesis testing and confidence bands using the bootstrap method, the extension of the activation function (sigmoid functions) in the context of neural networks (deep learning) when the input data are high-dimensional, and functional projection pursuit regression for GFPLSIM models

g (μ (X, Z)) = η (α^{⊤} X) + r (Z)

, where

r (Z) = \sum_{i = 1}^{m} 〈 β_{i}, Z 〉

-type models take advantage of the information required from the various revealing directions.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math12172649/s1. In the supplementary material, we present the appendix and proofs of the various results obtained in this paper. References [7,10,14,34,36,37] are cited in Supplementary Materials.

Author Contributions

Conceptualization, M.R., M.A., I.O., A.A. and L.H.; Methodology, M.R., M.A., I.O., A.A. and L.H.; Software, M.R., M.A., I.O., A.A. and L.H.; Validation, M.R., M.A., I.O., A.A. and L.H.; Formal analysis, M.R., M.A., I.O., A.A. and L.H.; Investigation, M.R., M.A., I.O., A.A. and L.H.; Resources, M.R., M.A., I.O., A.A. and L.H.; Data curation, M.R., M.A., I.O., A.A. and L.H.; Writing—original draft, M.R., M.A., I.O., A.A. and L.H.; Writing—review & editing, M.R., M.A., I.O., A.A. and L.H.; Visualization, M.R., M.A., I.O., A.A. and L.H.; Supervision, M.R., M.A. and I.O.; Project administration, M.R., M.A. and I.O.; Funding acquisition, M.A. and I.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

As in Ferraty et al. [13], Aneiros et al. [14], and Alahiane et al. [28,29], to value our theoretical results, we consider the public spectrometric data widely used in FDA and that are available at the website http://lib.stat.cmu.edu/datasets/tecator, accessed on 1 March 2024.

Acknowledgments

The authors thank the editor and the reviewers for their helpful and constructive comments.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Appendix A

We will present technical lemmas that will be used for proving Theorems 2 and 3. In what follows, for all probability measures Q, we define

L^{2} (Q) = \{f such that Q f^{2} = \int f^{2} d Q < \infty\} .

Let

F

be a subclass of

L^{2} (Q)

. For all

f \in F, ∥ f ∥ = {(\int f^{2} d Q)}^{\frac{1}{2}}

.

Denote

N (δ, F, L^{2} (Q))

as the

δ

-covering number of

F

, i.e., the smallest value of N for which there exist functions

f_{1}, f_{2}, \dots, f_{N}

(which are not necessarily in

F

), such that for each

f \in F

, there exists

j \in {1, \dots, N}

,

∥ f - f_{j} ∥ < δ

or

F \subset ⋃_{j = 1}^{N} B (f_{j}, δ)

.

For two functions l and u, a bracketing

[l, u]

is the set of functions f such that

l \leq f \leq u

,

[l, u] = {f such that l \leq f \leq u} .

The

δ

-covering number with bracketing

N_{[]} (δ, F, L^{2} (Q))

is defined as the smallest value of N, necessary to cover the whole

F,

for which there exist pairs of functions

\{[f_{j}^{L}, f_{j}^{U}] for j = 1, \dots, N\}

with

∥f_{j}^{U} - f_{j}^{L}∥ \leq δ

, such that for each

f \in F

, there is a

j \in {1, \dots, N}

such that

f_{j}^{L} \leq f \leq f_{j}^{U}

(

f_{j}^{U}

and

f_{j}^{L}

are not necessary belonging to

F

).

The

δ

-entropy with bracketing is

log N_{[]} (δ, F, L^{2} (Q))

. The uniform entropy integral

J_{[]} (δ, F, L^{2} (Q))

is defined as

J_{[]} (δ, F, L^{2} (Q)) = \int_{0}^{δ} {\{1 + log N_{[]} (κ, F, L^{2} (Q))\}}^{\frac{1}{2}} d κ .

Let

Q_{n}

be the empirical measure of Q, i.e.,

Q_{n} = \frac{1}{n} \sum_{i = 1}^{n} δ_{X_{i}} (\cdot)

such that

Q_{n} f = E^{Q_{n}} [f] = \int f d Q_{n} = \frac{1}{n} \sum_{i = 1}^{n} \int f δ_{X_{i}} = \frac{1}{n} \sum_{i = 1}^{n} f (X_{i}) .

Denote

G_{n} = \sqrt{n} (Q_{n} - Q)

as the standardized empirical process indexed by

F

and

{∥G_{n}∥}_{F} = \sum_{f \in F} |G_{n} f|

for any measurable class of functions

F .

For all

f \in F

, we have

Q f = E^{Q} [f (X)] = \int f d Q

, and

\begin{matrix} G_{n} f & = & \sqrt{n} (Q_{n} f - Q f) = \frac{1}{\sqrt{n}} \sum_{i = 1}^{n} (f (X_{i}) - E [f (X)]) . \end{matrix}

Lemma A1

(Lemma 3.4.2. in Van Der Van et al. [38]). Let

M_{0} > 0

and

F

be uniformly bounded class of measurable functions such that

\{\begin{matrix} for all f \in {F, ∥ f ∥}_{\infty} < M_{0}, \\ Q f^{2} < δ^{2} . \end{matrix}

Then

E^{Q} [{∥G_{n}∥}_{F}] \leq c_{0} J_{[]} (δ, F, L^{2} (Q)) \{1 + \frac{J_{[]} (δ, F, L^{2} (Q))}{δ^{2} \sqrt{n}} M_{0}\},

where

c_{0}

is a finite constant not dependent on n.

Lemma A2

(Lemma A.1. in Huang [39]). For any

λ > 0

, let

Θ_{n} = {η (α_{0}^{⊤} x)

such that

∥ δ - δ_{0} ∥ \leq λ, η \in S_{n}, {∥η - η_{0}∥}_{2} \leq λ} .

Then, for any

ϵ \leq λ

log N_{[]} (λ, Θ_{n}, L^{2} (P)) \leq C N_{n} log (\frac{λ}{ϵ}) .

Lemma A3

(Lemma A.2. in Wang et al. [40] and Lemma A.4. in Xue et al. [41]). Under assumptions (C1)–(C5), we have

A_{n} = sup_{η_{1}, η_{2} \in S_{n}} |\frac{{〈η_{1}, η_{2}〉}_{n} - 〈η_{1}, η_{2}〉}{{∥η_{1}∥}_{2} {∥η_{2}∥}_{2}}| = O_{a . c o .} \{\sqrt{\frac{log n}{n h}}\},

where

O_{a . c o .}

denotes the “O” Lanadau symbol for the almost-complete convergence.

Let

D_{n, θ} = (\begin{matrix} γ^{⊤} B^{'} (α^{⊤} (τ) X_{i}) J^{⊤} (τ) & 0 & 0 \\ 0 & I & 0 \\ 0 & 0 & B (α^{⊤} (τ) X_{i}) \end{matrix})

; we denote

T_{i} = (X_{i}^{⊤}, {\bar{W}}_{i}^{⊤}),

W_{n, θ} = \frac{1}{n} \sum_{i = 1}^{n} D_{i, θ} (\begin{matrix} T_{i} \\ 1 \end{matrix}) {(\begin{matrix} T_{i} \\ 1 \end{matrix})}^{⊤} D_{i, θ}^{⊤}, and W_{θ} = \frac{1}{n} \sum_{i = 1}^{n} E [D_{i, θ} (\begin{matrix} T_{i} \\ T \end{matrix}) {(\begin{matrix} T_{i} \\ 1 \end{matrix})}^{⊤} D_{i, θ}^{⊤}] .

Then, we have the following lemma.

Lemma A4

(Lemma A.3 in the Supplementary Material of Wang et al. [40]). Under assumptions (C1)–(C8), there exists

C > 0

such that

\underset{θ}{error} {∥W_{θ}^{- 1}∥}_{2} \leq C \sqrt{N_{n}} a . c o . and \underset{θ}{error} {∥W_{n, θ}^{- 1}∥}_{2} \leq C \sqrt{N_{n}} a . c o .,

where

{∥ M ∥}_{2} = sup_{x \neq 0} \frac{∥ M x ∥}{∥ x ∥} = \underset{∥ x ∥ = 1}{error} ∥ M x ∥ .

In what follows, we will give lemmas that allow us to prove Theorem 3. The lemmas and theorem proofs will be developed in the Supplementary Materials. The proofs of these lemmas are different from those of Alahiane et al. [28,29] as they take into account the components of our new model.

Lemma A5.

Under conditions (C1)–(C8), we have

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{0 i}) \{\hat{η} (U_{τ, 0 i}) - η_{0} (U_{τ, 0 i})\} η_{0}^{'} (U_{τ, 0 i}) J^{⊤} (τ_{0}) Φ (X_{i}) = O_{P} (\frac{1}{\sqrt{n}}), \end{matrix}

(A1)

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{0 i}) η_{0}^{'} (U_{τ, 0 i}) Φ (X_{i}) Υ^{⊤} (U_{τ, 0 i}) J (τ_{0}) (\hat{τ} - τ_{0}) = O_{P} (\frac{1}{\sqrt{n}}), \end{matrix}

(A2)

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{0 i}) η_{0}^{'} (U_{τ, 0 i}) Φ (X_{i}) Γ^{⊤} (U_{τ, 0 i}) J (τ_{0}) (\hat{δ} - δ_{0}) = O_{P} (\frac{1}{\sqrt{n}}) . \end{matrix}

(A3)

Lemma A6.

Under conditions (C1)–(C8), we have

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{o i}) \{\hat{η} (U_{τ, o i}) - η_{0} (U_{τ, o i})\} Ψ (T_{i}) = O_{P} (\frac{1}{\sqrt{n}}), \end{matrix}

(A4)

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{o i}) η_{0}^{'} (U_{τ, o i}) Ψ (T_{i}) Υ^{⊤} (U_{τ, o i}) J (τ_{0}) (\hat{τ} - τ_{0}) = O_{P} (\frac{1}{\sqrt{n}}), \end{matrix}

(A5)

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} ρ_{2} (m_{o i}) Ψ (U_{τ, o i}, Z_{i}) Γ^{⊤} (U_{τ, o i}) (\hat{δ} - δ_{0}) = O_{P} (\frac{1}{\sqrt{n}}) . \end{matrix}

(A6)

References

McCullagh, P.; Nelder, J. Generalized Linear Models; Routledge: London, UK, 1989; Volume 37. [Google Scholar]
Nelder, J.; Wedderburn, R. Generalized linear models. J. R. Stat. Soc. Ser. (Stat. Soc.) 1972, 135, 370–384. [Google Scholar] [CrossRef]
Hastie, T.; Tibshirani, R. Generalized Additive Models; Chapman & Hall/CRC: New York, NY, USA, 1990; Volume 43. [Google Scholar]
Wood, S. Generalized Additive Models. An Introduction with R; CRC/Taylor & Francis: New York, NY, USA, 2017. [Google Scholar]
Hardle, W.; Hall, P.; Ichimura, H. Optimal smoothing in single index models. Ann. Stat. 1993, 21, 157–178. [Google Scholar] [CrossRef]
Hristache, M.; Juditsky, A.; Spokoiny, V. Direct estimation of the index coefficient in a single-index model. Ann. Stat. 2001, 29, 595–623. [Google Scholar] [CrossRef]
Ait-Saïdi, A.; Ferraty, F.; Kassa, R.; Vieu, P. Cross-validated estimations in the single-functional index model. Stat. J. Theor. Appl. Stat. 2008, 42, 475–494. [Google Scholar] [CrossRef]
Liang, H.; Wang, N. Partially linear single index measurement error models. Stat. Sin. 2005, 15, 99–116. [Google Scholar]
Chen, J.; Li, D.; Liang, H.; Wang, S. Semiparametric GEE analysis in partially linear single-index models for longitudinal data. Ann. Stat. 2015, 43, 1682–1715. [Google Scholar] [CrossRef]
Carroll, R.; Fan, J.; Gijbels, I.; Wand, M. Generalized partially linear single-index models. J. Am. Stat. Assoc. 1997, 92, 477–489. [Google Scholar] [CrossRef]
Wang, L.; Cao, G. Efficient estimation for generalized partially linear single index models. Bernoulli 2018, 24, 1101–1127. [Google Scholar] [CrossRef]
Ramsay, J.; Silverman, B. Functional Data Analysis; Springer: New York, NY, USA, 2005. [Google Scholar]
Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis: Theory and Practice; Springer: New York, NY, USA, 2006; Volume 76. [Google Scholar]
Aneiros-Pérez, G.; Vieu, P. Semi-functional partial linear regression. Stat. Probab. Lett. 2006, 76, 1102–1110. [Google Scholar] [CrossRef]
Aneiros, G.; Vieu, P. Partial linear modelling with multi-functional covariates. Comput. Stat. 2015, 30, 647–671. [Google Scholar] [CrossRef]
Horváth, L.; Kokoszka, P. Inference for Functional Data with Applications; Springer Science and Business Media: New York, NY, USA, 2012; Volume 200. [Google Scholar]
Kokoszka, P.; Reimherr, M. Introduction to Functional Data Analysis; CRC Press: Boca Raton, FL, USA, 2021. [Google Scholar]
Schumaker, L. Spline Functions: Basic Theory; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
Cao, R.; Du, J.; Zhou, J.; Xie, T. FPCA-based estimation for generalized functional partially linear models. Stat. Pap. 2020, 61, 2715–2735. [Google Scholar] [CrossRef]
Li, C.; Lu, M. A lack-of-fit test for generalized linear models via single-index techniques. Comput. Stat. 2018, 33, 731–756. [Google Scholar] [CrossRef]
Ould Said, E.; Ouassou, I.; M, R. Functional Statistics and Applications: Selected Papers from MICPS-2013 (Contributions to Statistics) Hardcover; Springer: New York, NY, USA, 2015. [Google Scholar]
Laksaci, A.; Kaid, Z.; Alahiane, M.; Ouassou, I.; Rachdi, M. Non parametric estimations of the conditional density and mode when the regressor and the response are curves. Commun. -Stat.-Theory Methods 2023, 52, 4659–4674. [Google Scholar] [CrossRef]
Ouassou, I.; Rachdi, M. Regression operator estimation by delta sequences method for functional data and its applications. AStA Adv. Stat. Anal. 2010, 96, 451–465. [Google Scholar] [CrossRef]
Ouassou, I.; Rachdi, M. Stein type estimation of the regression operator for functional data. Adv. Appl. Stat. Sci. 2010, 1, 233–250. [Google Scholar]
Yu, P.; Du, J.; Zhang, Z. Single index partially functional linear regression model. Stat. Pap. 2020, 61, 1107–1123. [Google Scholar] [CrossRef]
Yu, Y.; Ruppert, D. Penalized spline estimation for partially linear single index models. J. Am. Stat. Assoc. 2002, 97, 1042–1054. [Google Scholar] [CrossRef]
Rachdi, M.; Alahiane, M.; Ouassou, I.; Vieu, P. Generalized functional partially linear single index models. In Functional and High Dimensional Statistics and Related Fields; Springer: New York, NY, USA, 2020; pp. 221–228. [Google Scholar]
Alahiane, M.; Ouassou, I.; Rachdi, M.; Vieu, P. Partially Linear Generalized single Index Models for Functional Data PLGSIMF. Stats Funct. Data Anal. FDA 2021, 4, 793–813. [Google Scholar] [CrossRef]
Alahiane, M.; Ouassou, I.; Rachdi, M.; Vieu, P. High-Dimensional Statistics: Non-Parametric Generalized Functional Partially Linear Single Index Model. Math. Adv. Stat. Theory Methodol. Appl. Data Anal. 2022, 10, 2704. [Google Scholar] [CrossRef]
Friedman, J.; Stuetzle, W. Projection pursuit regression. J. Am. Stat. Assoc. 1981, 76, 817–823. [Google Scholar] [CrossRef]
Hall, P. On projection pursuit regression. Ann. Stat. 1989, 17, 573–588. [Google Scholar] [CrossRef]
Huber, P. Projection pursuit. Ann. Stat. 1985, 13, 435–475. [Google Scholar] [CrossRef]
Ferraty, F.; Goia, A.; Salinelli, E.; Vieu, P. Functional projection pursuit regression. Test 2013, 22, 293–320. [Google Scholar] [CrossRef]
De Boor, C. A Practical Guide to Splines; Springer: New York, NY, USA, 2001; Volume 27. [Google Scholar]
Lai, P.; Tian, Y.; Lian, H. Estimation and variable selection for generalised partially linear single-index models. J. Nonparametr. Stat. 2014, 26, 171–185. [Google Scholar] [CrossRef]
Pollard, D. Asymptotics for least absolute deviation regression estimators. Econom. Theory 1991, 7, 186–199. [Google Scholar] [CrossRef]
Stone, C. The dimensionality reduction principle for generalized additive models. Ann. Stat. 1986, 14, 590–606. [Google Scholar] [CrossRef]
Van Der Vaart, A.W.; Wellner, J. Weak Convergence; Springer: New York, NY, USA, 1996. [Google Scholar]
Huang, J. Efficient estimation of the partly linear additive Cox model. Ann. Stat. 1999, 27, 1536–1563. [Google Scholar] [CrossRef]
Wang, L.; Yang, L. Spline estimation of single index models. Stat. Sin. 2009, 19, 765–783. [Google Scholar]
Xue, L.; Yang, L. Additive coefficient modeling via polynomial spline. Stat. Sin. 2006, 16, 1423–1446. [Google Scholar]

Figure 1. A random selection of 30 simulated curves of Z.

Figure 2. The Gaussian case: estimators of

\hat{β} (\cdot)

(the left plot) and estimators of the nonlinear regression operator

\hat{R} (u)

(the right plot), where u stands for

〈 \hat{β}, Z 〉

.

Figure 2. The Gaussian case: estimators of

\hat{β} (\cdot)

(the left plot) and estimators of the nonlinear regression operator

\hat{R} (u)

(the right plot), where u stands for

〈 \hat{β}, Z 〉

.

Figure 3. The logistic case: estimators of

\hat{β} (\cdot)

(the left plot) and estimators of the nonlinear regression operator

\hat{R} (u)

(the right plot), where u stands for

〈 \hat{β}, Z 〉

.

Figure 3. The logistic case: estimators of

\hat{β} (\cdot)

(the left plot) and estimators of the nonlinear regression operator

\hat{R} (u)

(the right plot), where u stands for

〈 \hat{β}, Z 〉

.

Figure 4. On the left plot: single-index

α

versus predicted single-index

\hat{α}

: Gaussian case; on the right plot: single-index

α

versus predicted single-index

\hat{α}

: logistic case.

Figure 4. On the left plot: single-index

α

versus predicted single-index

\hat{α}

: Gaussian case; on the right plot: single-index

α

versus predicted single-index

\hat{α}

: logistic case.

Figure 5. The function

η

versus its estimator

\hat{η}

for the Gaussian case (left plot), and the function

η

versus its estimator

\hat{η}

for the logistic case (right plot).

Figure 5. The function

η

versus its estimator

\hat{η}

for the Gaussian case (left plot), and the function

η

versus its estimator

\hat{η}

for the logistic case (right plot).

Figure 6. Sample of 100 absorbance curves

Z

.

Figure 6. Sample of 100 absorbance curves

Z

.

Figure 7. The estimator

\hat{η} (\cdot)

in the Gaussian case (left plot), and the estimator

\hat{η} (\cdot)

in the logistic case (right plot).

Figure 7. The estimator

\hat{η} (\cdot)

in the Gaussian case (left plot), and the estimator

\hat{η} (\cdot)

in the logistic case (right plot).

Figure 8. The Gaussian case: the estimator

\hat{β}

(left plot) and the estimator

\hat{R}

(right plot).

Figure 8. The Gaussian case: the estimator

\hat{β}

(left plot) and the estimator

\hat{R}

(right plot).

Figure 9. The logistic case: the estimator

\hat{β}

(left plot) and the estimator

\hat{R}

(right plot).

Figure 9. The logistic case: the estimator

\hat{β}

(left plot) and the estimator

\hat{R}

(right plot).

Figure 10. The content of fatness and its estimation: the Gaussian case (left plot) and the logistic case (right plot).

Table 1. Bias, SD, and MSE according to the parameter

τ

for CVGPLFSIM with the identity link function and

n = 500

in both the Gaussian and the logistic cases.

Table 1. Bias, SD, and MSE according to the parameter

τ

for CVGPLFSIM with the identity link function and

n = 500

in both the Gaussian and the logistic cases.

	Gaussian Case ¹		Logistic Case ²
Sample Size	$τ_{1}$	$τ_{2}$	$τ_{1}$	$τ_{2}$
Bias	0.0013	−0.0012	0.0027	−0.0061
SD	0.0012	0.0034	0.0031	0.0201
MSE	3.13 $\times 10^{- 6}$	1.3 $\times 10^{- 5}$	4.45 $\times 10^{- 5}$	4.14 $\times 10^{- 4}$

Note: This table summarizes the bias, SD, and MSE of

τ

with sample size

n = 500

. ¹

τ

for the Gaussian case. ²

τ

for the logistic case.

Table 2. Bias, SD, and MSE evolutions with respect to the parameter

γ

variation for CVPGPLSFIM with the identity link function and

n = 500

.

Table 2. Bias, SD, and MSE evolutions with respect to the parameter

γ

variation for CVPGPLSFIM with the identity link function and

n = 500

.

	$γ_{1}$	$γ_{2}$	$γ_{3}$	$γ_{4}$	$γ_{5}$
Bias	−0.0026	0.0142	−0.0231	0.0242	−0.0037
SD	0.0102	0.0165	0.0141	0.0140	0.0071
MSE	1.1080 $\times 10^{- 4}$	4.7389 $\times 10^{- 4}$	7.3242 $\times 10^{- 4}$	7.8164 $\times 10^{- 4}$	6.4100 $\times 10^{- 5}$

Table 3. Bias, SD, and MSE evolutions with respect to the parameter

γ

variation for CVGPLFSIM with the identity link function and

n = 500

.

Table 3. Bias, SD, and MSE evolutions with respect to the parameter

γ

variation for CVGPLFSIM with the identity link function and

n = 500

.

	$γ_{6}$	$γ_{7}$	$γ_{8}$	$γ_{9}$	$γ_{10}$
Bias	0.0032	0.0023	−0.0021	0.0012	−0.0045
SD	0.0033	0.0041	0.0014	0.0027	0.0042
MSE	2.113 $\times 10^{- 5}$	2.21 $\times 10^{- 5}$	6.37 $\times 10^{- 6}$	8.73 $\times 10^{- 6}$	3.789 $\times 10^{- 5}$

Table 4. Bias, SD, and MSE evolutions with respect to the parameter

δ

variation for CVGPLFSIM with the identity link function and

n = 500

.

Table 4. Bias, SD, and MSE evolutions with respect to the parameter

δ

variation for CVGPLFSIM with the identity link function and

n = 500

.

	$δ_{1}$	$δ_{2}$	$δ_{3}$	$δ_{4}$	$δ_{5}$
Bias	0.0009	0.0037	−0.0045	0.0082	−0.0035
SD	0.0036	0.0012	0.0071	0.0038	0.0091
MSE	1.377 $\times 10^{- 5}$	1.513 $\times 10^{- 5}$	7.066 $\times 10^{- 5}$	8.168 $\times 10^{- 5}$	2.450 $\times 10^{- 5}$

Table 5. Bias, SD, and MSE evolutions with respect to the parameter

γ