Local Influence for the Thin-Plate Spline Generalized Linear Model

Ibacache-Pulgar, Germán; Pacheco, Pablo; Nicolis, Orietta; Uribe-Opazo, Miguel Angel

doi:10.3390/axioms13060346

Open AccessArticle

Local Influence for the Thin-Plate Spline Generalized Linear Model

¹

Institute of Statistics, Universidad de Valparaíso, Av. Gran Bretaña 1111, Valparaíso 2360102, Chile

²

Centro de Estudios Atmosféricos y Cambio Climático (CEACC), Universidad de Valparaíso, Valparaíso 2360102, Chile

³

Dirección de Educación Virtual, Universidad de Playa Ancha, Avenida Guillermo González de Hontaneda 855, Playa Ancha, Valparaíso 2360072, Chile

⁴

Facultad de Ingenieria, Universidad Andres Bello, Calle Quillota 980, Viña del Mar 2520000, Chile

⁵

Centro de Ciências Exatas e Tecnológicas, Western Paraná State University (UNIOESTE), Cascavel 85819-110, Paraná, Brazil

^*

Author to whom correspondence should be addressed.

Axioms 2024, 13(6), 346; https://doi.org/10.3390/axioms13060346

Submission received: 19 April 2024 / Revised: 12 May 2024 / Accepted: 20 May 2024 / Published: 23 May 2024

(This article belongs to the Special Issue Mathematical Models and Simulations, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

Thin-Plate Spline Generalized Linear Models (TPS-GLMs) are an extension of Semiparametric Generalized Linear Models (SGLMs), because they allow a smoothing spline to be extended to two or more dimensions. This class of models allows modeling a set of data in which it is desired to incorporate the non-linear joint effects of some covariates to explain the variability of a certain variable of interest. In the spatial context, these models are quite useful, since they allow the effects of locations to be included, both in trend and dispersion, using a smooth surface. In this work, we extend the local influence technique for the TPS-GLM model in order to evaluate the sensitivity of the maximum penalized likelihood estimators against small perturbations in the model and data. We fit our model through a joint iterative process based on Fisher Scoring and weighted backfitting algorithms. In addition, we obtained the normal curvature for the case-weight perturbation and response variable additive perturbation schemes, in order to detect influential observations on the model fit. Finally, two data sets from different areas (agronomy and environment) were used to illustrate the methodology proposed here.

Keywords:

exponential family; smoothing spline; penalized likelihood function; weighted back-fitting algorithm; diagnostics measures

MSC:

62P12; 62J20; 62G05

1. Introduction

Thin-Plate Spline Generalized Linear Models (TPS-GLMs) represent an extension of semiparametric generalized linear models (SGLMs) by enabling the application of smoothing splines in multiple dimensions. These models have the same characteristics of the generalized linear model (GLM), as described by McCullagh and Nelder [1]. Like GLMs, TPS-GLMs can assume a variety of distribution families for the response variable. They also allow for a non-linear relationship between the response variable’s mean and the linear predictor via a link function, and they account for non constant variance in the data. Furthermore, the TPS-GLM allow modeling non-linear joint interaction effects due to some covariates, as well as the effects of coordinates in spatial data, making them a useful tool to model dynamic pattern in different scientific areas, such as environment, agronomy, ecology, and so on. Some of the main works related to thin-plate spline technique are Duchon [2,3], Bookstein [4], and Chen et al. [5], while in the context of statistical modeling, Wahba [6], Green and Silverman [7], Wood [8], and Moraga et al. [9], can be mentioned, among others.

However, it is well known that diagnostic analysis is a fundamental process in all statistical modeling for any data set. This analysis allows us to validate the assumptions established about the model in question and identify discrepant observations, and eventually influential ones on the fit of the model. One of the main diagnostic techniques used in GLM and SGLM is local influence. In general, the idea of the local influence technique introduced by Cook [10] is to evaluate the sensitivity of the MLEs when small perturbations are introduced in the assumptions of the model or in the data, both in the response variable and in the explanatory variables. This technique has the advantage, regarding the case elimination technique, that it is not necessary to calculate the estimates of the parameters for each case excluded. In our case, we are interested in developing the local influence technique in the TPS-GLM, in order to detect observations that may have a disproportionate influence on the estimators of both the parametric (regression coefficient) and non-parametric (surface) part of the linear predictor. Such influence may be due, for example, to the fact that each experimental unit contributes differently to the model or that our variable of interest is exposed to a certain modification. In the context of GLM and SGLM, there is empirical evidence that the maximum likelihood estimators (MLEs) and maximum penalized likelihood estimators (PMLEs) are sensitive to this type of situation, and therefore we believe that this sensitivity is also present in the estimators of the TPS-GLM, in particular, in the surface estimator.

Various studies have expanded upon the technique of local influence within different parametric models. Thomas and Cook [11] applied Cook’s method of local influence [10] to generalized linear models to assess the impact of minor data perturbations. Ouwens and Beger [12] obtained the normal curvature under a generalized linear model in order to identify influential subjects and/or individual observations. Zhu and Lee [13] developed the local influence technique for incomplete data, and extended such results to generalized linear mixed models (see also Zhu and Lee [14] for further details). Espinheira et al. [15] extended the local influence analysis to beta regression models considering various perturbation scenarios. Rocha and Simas [16] and Ferrari et al. [17] derived the normal curvature considering a beta regression model whose dispersion parameter varies according to the effect of some covariates. Ferreira and Paula [18] developed the local influence approach to partially linear Skew Normal models under different perturbation schemes, and Emami [19] evaluated the sensitivity of Liu penalized least squares estimators using local influence analysis. Most recently, Liu et al. [20] have reported the implementation of influence diagnostics in AR time series models with Skew Normal (SK) distributions.

Within a semiparametric framework, Thomas [21] developed diagnostics for local influence to assess the sensitivity of estimates for the smoothing parameter, which were determined using the cross-validation criterion. Zhu and Lee [14] and Ibacache-Pulgar and Paula [22] introduced measures of local influence to analyze the sensitivity of maximum penalized likelihood estimates in normal and partially linear Student-t models, respectively. Ibacache-Pulgar et al. [23,24] explored local influence curvature within elliptical semiparametric mixed models and symmetric semiparametric additive models. Subsequently, ref. [25] and Ibacache-Pulgar and Reyes [26] further extended local influence measures to normal and elliptical partially varying-coefficient models, respectively. Ibacache-Pulgar et al. [27] developed the local influence method within the context of semiparametric additive beta regression models. Meanwhile, Cavieres et al. [28] calculated the normal curvature to assess the sensitivity of estimators in a thin-plate spline model that incorporates skew normal random errors. Jeldes et al. [29] applied the partially coefficient-varying model with symmetric random errors to air pollution data from the cities of Santiago, Chile, and Lima, Peru. In this context, they carried out an application of the local influence technique to detect influential observations in the model fit. Saavedra-Nievas et al. [30] extended the local influence technique for the spatio-temporal linear model under normal distribution and with separable covariance. Recently, Sánchez et al. [31] obtained the normal curvature for the varying-coefficient quantile regression model under log-symmetric distributions, and presented an interesting application of such results to an environmental pollution data set.

In this work, we extend the local influence approach in Thin-Plate Spline Generalized Linear Model.

The contents are organized as follows: Section 2 introduces the thin-plate spline generalized linear model. Section 3 details the method for obtaining maximum penalized likelihood estimators and discusses some statistical inferential results. In Section 4, we provide a detailed description of the local influence method and derives normal curvatures for various perturbation schemes. In Section 5, the methodology is illustrated using two datasets. The paper concludes with some final observations in Section 6.

2. The Thin-Plate Spline Generalized Linear Model (TPS-GLM)

In this section, we present the TPS-GLM and the penalized function to carry out the process of estimating the parameters.

2.1. Statistical Model

Let

{y_{i} ∣ i = 1, \dots, n}

be a data set where each response variable

y_{i}

follows a distribution from the exponential family with the following density function:

\begin{matrix} f_{y} (y_{i}; θ_{i}, ϕ) = exp [\frac{y_{i} θ_{i} - ψ (θ_{i})}{a_{i} (ϕ)} + c (y_{i}, ϕ)], \end{matrix}

where

θ_{i}

is the canonical form of the location parameter and depends on the mean

μ_{i}

. The term

a_{i} (ϕ)

represents a known function of the unknown dispersion parameter

ϕ

(or a vector of unknown dispersion parameters). The function c depends on both the dispersion parameter and the responses, while

ψ

is a known function, such that the mean and variance of

y_{i}

are given by:

μ_{i} = E (y_{i}) = \partial ψ (θ_{i}) / \partial θ_{i}

and

Var (y_{i}) = a_{i} (ϕ) V_{i}

, with

V_{i} = V (μ_{i}) = \partial^{2} ψ (θ_{i}) / \partial θ_{i}^{2}

, respectively. The TPS-GLM is defined by Equation (1) and the following systematic component:

\begin{matrix} g (μ_{i}) = η_{i} = w_{i}^{⊤} α + f (t_{i}), \end{matrix}

(1)

where

w_{i}

is a

(p \times 1)

vector of covariables,

α = {(α_{1}, \dots, α_{p})}^{⊤}

corresponds to the vector of regression coefficients,

f (\cdot)

is unknown smooth arbitrary surface, and

t_{i}

is a two-dimensional covariates vector. To write the model given by Equation (1) in a matrix form, first consider the one-to-one transformation of the vector

f

suggested by Green and Silverman [7], stated as

\begin{matrix} f = (\begin{matrix} f (t_{1}) \\ ⋮ \\ f (t_{n}) \end{matrix}) = E δ + T^{T} a, \end{matrix}

where

a

is a

3 \times 1

vector with components

a_{i}

,

δ

is a

n \times 1

vector with components

δ_{i}

,

E

is a

(n \times n)

matrix whose elements are given by

E_{i j} = \frac{1}{16 π} ∥ t_{i} - t_{j} ∥^{2} \log {∥ t_{i} - t_{j} ∥}^{2}

, with

E_{i i} = 0

for each i, and

T

is a

(3 \times n)

matrix defined as

\begin{matrix} T = (\begin{matrix} 1 & 1 & \dots & 1 \\ t_{1} & t_{2} & \dots & t_{n} \end{matrix}) . \end{matrix}

Thus, the Model (1) can be written in a matrix form as

\begin{matrix} η = X β + E δ, \end{matrix}

where the regression matrix is structured as

X = {(\begin{matrix} T^{T} & W \end{matrix})}^{T}

, with

W = {(\begin{matrix} w_{1}^{⊤}, & \dots, & w_{n}^{⊤} \end{matrix})}^{T}

, and the vector of regression coefficients as

β = {(\begin{matrix} a^{T} & α \end{matrix})}^{T} = (\begin{matrix} β_{1}, & \dots, & β_{p + 3} \end{matrix})

, where

β_{j} = a_{j}

(

j = 1, 2, 3

) and

β_{j} = α_{j - 3}

(

j = 4, \dots, p + 3

); see [9]. Note that this matrix representation of the linear predictor allows us to treat the TPS-GLM as a semiparametric generalized linear model, in which the term

X β

represents the parametric component and

E δ

the nonparametric component. One of the advantages of the TPS-GLM, apart from being able to model both discrete and continuous variables that belong to the exponential family, is its flexibility to model the non-linear joint effect of covariates through the surface f present in the linear predictor

η

. In the context of spatial data, this models allows the effect of coordinates to the incorporated into the modeling process. It is important to note that when the surface f is not present in the linear predictor

η

, the model reduces to the classical generalized linear model. However, if the vector

t

reduces to a scalar,

t

, the model reduces to the semiparametric generalized linear model discussed, for instance, by Green and Silverman [7].

2.2. Penalized Function

Under the TPS-GLM, we have that

θ = {(β^{⊤}, δ^{⊤}, ϕ)}^{⊤} \subseteq R^{p^{*}}

, with

p^{*} = (p + 3) + n + 1

parameters. Then, the log-likelihood function is given by

L (θ) = \sum_{i = 1}^{n} L_{i} (θ),

(2)

where

\begin{matrix} L_{i} (θ) = [\frac{y_{i} θ_{i} - ψ (θ_{i})}{a_{i} (ϕ)} + c (y_{i}, ϕ)] . \end{matrix}

To ensure the identifiability of the parameter vector

α

, we assume that f belongs to the function space where all partial derivatives of total order m reside within the Hilbert space

L^{2} [E^{d}]

, the space of square-integrable functions on Euclidean d-space. Incorporating a penalty function over f, we have that the penalized log-likelihood function can be expressed as (see, for instance, Green and Silverman [7])

\begin{matrix} L_{p} (θ, λ_{f}) & = & L (θ) + λ_{f}^{*} J_{m}^{d} (f), \end{matrix}

(3)

where

J_{m}^{d} (f)

is a penalty functional measuring the wiggliness of f, and

λ_{f}^{*} (λ_{f})

is a constant that depends on the smoothing parameter

λ_{f} \geq 0

. In general, a measure of the curvature of f corresponds to its squared norm,

∥ f ∥

, defined as

\begin{matrix} J_{m}^{d} (f) = ∥ f ∥ = \sum_{υ_{1} + \dots + υ_{d} = m} \frac{m!}{υ_{1}! \dots υ_{d}!} \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} {(\frac{\partial^{m} f}{\partial t_{1}^{α_{1}} \dots \partial t_{d}^{α_{d}}})}^{2} \prod_{j = 1}^{d} d t_{j} . \end{matrix}

For simplicity, in this work, we will consider the case in which

d = 2, m = 2

and

g = g (t_{1}, t_{2})

. Consequently, the penalty function

J_{2}^{2} (f)

is expressed in the form

\begin{matrix} J_{2}^{2} (f) & = & \int \int_{R^{2}} \{{(\frac{\partial^{2} f}{\partial t_{1}^{2}})}^{2} + 2 {(\frac{\partial^{2} f}{\partial t_{1} \partial t_{2}})}^{2} + {(\frac{\partial^{2} f}{\partial t_{2}^{2}})}^{2}\} d t_{1} d t_{2}, \end{matrix}

and measures the rapid variation in f and the departure from local linearity. In this case, the estimation of f leads to a natural thin-plate spline. According to Green and Silverman [7], we may express the penalty functional as

J_{2}^{2} (f) = δ^{T} E δ

. Then, if we consider

λ_{f}^{*} = - λ_{f} / 2

, the penalized log-likelihood function (3) can be expressed as

\begin{matrix} L_{p} (θ, λ_{f}) & = & L (θ) - \frac{λ_{f}}{2} δ^{T} E δ . \end{matrix}

(4)

The first term in the right-hand side of Equation (4) measures the goodness-of-fit, while the second terms penalizes the roughness of f with a fixed parameter

λ_{f}

. Selecting appropriate parameters is crucial in the estimation process, as they determine the balance between the goodness-of-fit and the smoothness (or regularity) of the estimated function. It is important to emphasize that selecting appropriate parameters is crucial in the estimation process because they control the trade-off between goodness-of-fit and the smoothness (or regularity) of the estimated function. In this work, the smoothing parameter is selected through the Akaike Criterion (AIC) based on the penalized log-likelihood function given in Equation (3). More details of the method are given in Section 3.7.

3. Estimation and Inference

In this section, we discuss the problem of estimating the parameters under the TPS-GLM. Specifically, we derive a weighted iterative process based on the backfitting algorithm and estimate the variance–covariance matrix of our estimator from the penalized Fisher information matrix (see Green [32] and Green and Silverman [7]). A brief discussion of the smoothing parameter selection is also presented.

3.1. Penalized Score Function

First, we are going to assume that the function

L_{p} (θ, λ_{f})

is regular in the sense that it admits first and second partial derivatives with respect to the elements of the parameter vector

θ

. To obtain the score function for

β

, we must calculate

\partial L_{p_{i}} (θ, λ_{f}) / \partial β_{j}

for i ∈

{1, \dots, n}

and j ∈

{1, \dots, p + 2}

. After performing some partial derivative operations, we have that the score function for

β

can be written in matrix as follows:

\begin{matrix} U_{p}^{β} (θ) = \frac{\partial L_{p} (θ, λ_{f})}{\partial β} & = & X^{⊤} \tilde{T} (y - μ), \end{matrix}

where

X

is an (

n \times 3 + p

) matrix whose ith row is

x_{i}^{⊤}

,

\tilde{T} = diag [{(a_{i} (ϕ))}^{- 1} (\frac{\partial μ_{i}}{\partial η_{i}}) V_{i}^{- 1}]

is a (

n \times 3 + p

) matrix, with

V_{i} = V (μ_{i}) = \partial^{2} ψ (θ_{i}) / \partial θ_{i}^{2}

the variance function,

a_{i} (ϕ)

a function of

ϕ

,

y = {(y_{1}, . . ., y_{n})}^{⊤}

and

μ = {(μ_{1}, . . ., μ_{n})}^{⊤}

are (

n \times 1

) vectors.

Conversely, to derive the score function for

δ

, we need to compute

\partial L_{p_{i}} (θ, λ_{f}) / \partial δ_{ℓ}

for i ∈

{1, \dots, n}

and ℓ ∈

{1, \dots, n}

. Again, after some algebraic operations, the score function for

δ

can be written in matrix as follows:

\begin{matrix} U_{p}^{δ} (θ) = \frac{\partial L_{p} (θ, λ_{f})}{\partial δ} & = & E^{⊤} \tilde{T} (y - μ) - λ_{f} E δ, \end{matrix}

where the matrix

E

is defined in Section 2.1. Finally, the score function for

ϕ

is given by

\begin{matrix} U_{p}^{ϕ} (θ) = \frac{\partial L_{p} (θ, λ_{f})}{\partial ϕ} & = & - \sum_{i = 1}^{n} {(a_{i} (ϕ))}^{- 2} {y_{i} θ_{i} - ψ (θ_{i})} + \sum_{i = 1}^{n} c^{'} (y_{i}, ϕ), \end{matrix}

with

c^{'} (y_{i}, ϕ) = \partial c (y_{i}, ϕ) / \partial ϕ

, for i ∈

{1, \dots, n}

. Thus, the vector of penalized score functions of

θ

can be expressed compactly as

\begin{matrix} U_{p} (θ) = (\begin{matrix} U_{p}^{β} (θ) \\ U_{p}^{δ} (θ) \\ U_{p}^{ϕ} (θ) \end{matrix}) . \end{matrix}

Note that if the model under consideration only considers the parametric component in the linear predictor, that is, the nonparametric component is omitted, the expressions of the remaining score functions are reduced to those obtained under the classical generalized linear model.

3.2. Penalized Hessian Matrix

To obtain the penalized Hessian matrix, we must compute the second-derivate of

L_{p} (θ, λ_{f})

with respect to each element of

θ

, that is,

\partial^{2} L_{p} (θ, λ_{f}) / \partial θ_{j^{*}} θ_{ℓ^{*}}

, for

j^{*}, ℓ^{*}

∈

{1, \dots, p^{*}}

. After some algebraic operations, we have that the diagonal elements (block matrices) of the Hessian matrix are given by

\begin{matrix} L_{p}^{β β} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial β \partial α^{⊤}} & = & - X^{⊤} M^{*} X, \\ L_{p}^{δ δ} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial δ \partial δ^{⊤}} & = & - E^{⊤} M^{*} E - λ_{f} E and \\ L_{p}^{ϕ ϕ} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial ϕ^{2}} & = & \sum_{i = 1}^{n} 2 {(a_{i} (ϕ))}^{- 3} (y_{i} θ_{i} - ψ (θ_{i})) + \sum_{i = 1}^{n} c^{″} (y_{i}, ϕ)), \end{matrix}

where

M^{*} = {diag}_{1 \leq i \leq n} [{(a_{i} (ϕ))}^{- 1} {(\partial μ_{i} / \partial η_{i})}^{2} V_{i}^{- 1}]

and

c^{″} (y_{i}, ϕ) = \partial^{2} c (y_{i}, ϕ) / \partial ϕ^{2}

, for

1 \leq i \leq n

. The elements outside the main diagonal of the Hessian matrix take the form

\begin{matrix} L_{p}^{β δ} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial β \partial δ^{⊤}} & = & - X^{⊤} M^{*} E, \\ L_{p}^{β_{j} ϕ} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial α_{j} \partial ϕ} & = & - \sum_{i = 1}^{n} {(a_{i} (ϕ))}^{- 2} \{(y_{i} - μ_{i}) V_{i}^{- 1} \frac{\partial μ_{i}}{\partial η_{i}} x_{i j}\} and \\ L_{p}^{δ_{ℓ} ϕ} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial δ_{ℓ} \partial ϕ} & = & - \sum_{i = 1}^{n} {(a_{i} (ϕ))}^{- 2} \{(y_{i} - μ_{i}) V_{i}^{- 1} \frac{\partial μ_{i}}{\partial η_{i}} e_{i ℓ}\}, \end{matrix}

where

x_{i j}

denotes the

(i, j)

th element of the matrix

X

and

e_{i j}

denotes the

(i, ℓ)

th element of the matrix

E

, for i ∈

{1, \dots, n}

, j ∈

{1, \dots, p + 2}

and ℓ ∈

{1, \dots, n}

. Thus, the penalized Hessian matrix can be represented as

\begin{matrix} L_{p} (θ) = (\begin{matrix} L_{p}^{β β} & L_{p}^{β δ} & L_{p}^{β ϕ} \\ L_{p}^{β δ^{⊤}} & L_{p}^{δ δ} & L_{p}^{δ ϕ} \\ L_{p}^{β ϕ^{⊤}} & L_{p}^{δ ϕ^{⊤}} & L_{p}^{ϕ ϕ} \end{matrix}) . \end{matrix}

It is noteworthy that this matrix simplifies to the Hessian matrix used in generalized linear models when the nonparametric component is absent. The primary application of this matrix lies in the normal curvature, which is essential for developing the local influence technique. This will be discussed in the following section.

3.3. Penalized Expected Information Matrix

By taking the expectation of the matrix

- L_{p} (θ)

, we derive the penalized expected information matrix, which is of dimension

(p^{*} \times p^{*})

, as follows:

\begin{matrix} J_{p} (θ) & = & - E [\frac{\partial^{2} L_{p} (θ, λ)}{\partial θ \partial θ^{T}}] . \end{matrix}

This matrix assumes the following diagonal structure in blocks:

\begin{matrix} J_{p} (θ) & = & (\begin{matrix} J_{p}^{β δ} (θ) & 0 \\ 0 & J_{p}^{ϕ ϕ} (θ) \end{matrix}), \end{matrix}

where

\begin{matrix} J_{p}^{β δ} (θ) & = & (\begin{matrix} X^{⊤} M^{*} X & X^{⊤} M^{*} E \\ E^{⊤} M^{*} X & E^{⊤} M^{*} E + λ_{f} E \end{matrix}) \end{matrix}

and

\begin{matrix} J_{p}^{ϕ ϕ} (θ) & = & \sum_{i = 1}^{n} - 2 {(a_{i} (ϕ))}^{- 3} (μ_{i} θ_{i} - ψ (θ_{i})) - \sum_{i = 1}^{n} E (c^{″} (y_{i}, ϕ)), \end{matrix}

with

c^{″} (y_{i}, ϕ) = \partial^{2} c (y_{i}, ϕ) / \partial ϕ^{2}

for i ∈

{1, \dots, n}

.

3.4. Derivation of the Iterative Process

The value of

θ

that maximizes

L_{p} (θ, λ_{f})

, called maximum penalized likelihood estimate (MPLE) and denoted by

\hat{θ}

, is carried out by solving the corresponding estimation equations. Let

θ = {(\begin{matrix} θ_{1}^{⊤} & θ_{2} \end{matrix})}^{⊤}

, where

θ_{1} = {(\begin{matrix} β^{⊤} & δ^{⊤} \end{matrix})}^{⊤}

and

θ_{2} = ϕ

. In addition, consider the partition of the score function vector

U_{p} (θ) = {(\begin{matrix} U_{p}^{1^{⊤}} (θ) & U_{p}^{2^{⊤}} (θ) \end{matrix})}^{⊤}

, where

U_{p}^{1^{⊤}} (θ) = (\begin{matrix} U_{p}^{β^{⊤}} (θ) & U_{p}^{δ^{⊤}} (θ) \end{matrix})

and

U_{p}^{2} (θ) = U_{p}^{ϕ} (θ)

. In order to estimate

θ

based on penalized likelihood function given by Equation (4), we have to solve the equations

\begin{matrix} \{\begin{matrix} U_{p}^{1} (θ) & = & 0 \\ U_{p}^{2} (θ) & = & 0 . \end{matrix} \end{matrix}

These estimating equations are nonlinear, and necessitate an iterative approach for their solution. An alternative frequently proposed in the context of generalized linear models is the Fisher scoring algorithm (Nelder and Wedderburn, [33]), considering the fact that in some situations the matrix

- L_{p} (θ)

can be non-positive definite. Then, the algorithm for estimating

θ_{1}

, with

ϕ

fixed, is given by

θ_{1}^{new} = θ_{1}^{old} + {(J_{p}^{β δ} {(θ)}^{- 1})}^{old} U_{p}^{1^{old}} (θ),

which is equivalent to solving the matrix equation

\begin{matrix} (\begin{matrix} I & S_{β}^{old} E \\ S_{δ}^{old} X & I \end{matrix}) (\begin{matrix} β^{new} \\ δ^{new} \end{matrix}) = (\begin{matrix} S_{0}^{old} z^{old} \\ S_{1}^{old} z^{old} \end{matrix}), \end{matrix}

(5)

where

z^{old} = (y - μ^{old}) + η^{old}

, with

S_{ϑ}^{old}

defined as

\begin{matrix} S_{ϑ} & = & \{\begin{matrix} {(X^{⊤} M^{*^{old}} X)}^{- 1} X^{⊤} M^{*^{old}} & ϑ = β \\ {(E^{⊤} M^{*^{old}} E + λ_{f} E)}^{- 1} E^{⊤} M^{*^{old}} & ϑ = δ . \end{matrix} \end{matrix}

Consequently, the weighted back-fitting (Gauss–Seidel) iterations for simultaneously updating

β

and

δ

are given by

\begin{matrix} β^{new} & = & S_{β}^{old} (z^{old} - E δ^{old}), \end{matrix}

(6)

\begin{matrix} δ^{new} & = & S_{δ}^{old} (z^{old} - X β^{old}), \end{matrix}

(7)

It is crucial to note that the system of Equations (5) is consistent, and the back-fitting algorithm converges to a solution for any initial values, provided that the weight matrix

M^{*}

is symmetric and positive definite. Additionally, if the parametric component

w_{i}^{⊤} β

is absent in the linear predictor, the estimator of

δ

is given by:

\begin{matrix} δ^{new} & = & S_{δ}^{old} z^{old} . \end{matrix}

The MPLE of the dispersion parameter,

{\hat{θ}}_{2} = \hat{ϕ}

, can be determined through the following iterative procedure:

\begin{matrix} θ_{2}^{new} & = & θ_{2}^{old} - {(J_{p}^{ϕ ϕ} {(θ)}^{- 1})}^{old} U_{p}^{2^{old}} (θ) . \end{matrix}

Summarizing, each iteration of the Fisher scoring algorithm updates

β

and

δ

using Equations (6) and (7), and evaluating matrices

S_{ϑ}

and

M^{*}

at the MPLE of

θ

obtained in the previous iteration, that is,

θ^{old}

, until convergence is obtained. The joint iterative process that resolves

U_{p} (θ) = 0

is presented below.

3.5. Estimation of Surface

To obtain the MPLE of f, we must consider its one-to-one representation given in Equation (2) and MPLE obtained from the iterative process described above. Indeed, we have that

\hat{f}

can be obtained as

\begin{matrix} \hat{f} = E \hat{δ} + T^{T} \hat{a},, \end{matrix}

where

\hat{δ}

and

\hat{a}

are the MPLE of

δ

and

\hat{a}

, respectively. Note that vector

\hat{a}

corresponds to the first three elements of vector

\hat{β}

. Consequently,

\hat{f}

is a natural thin-plate spline. Details of the conditions that guarantee this result are given, for example, in Green and Silverman [7] and Wood [34].

3.6. Approximate Standard Errors

In this study, we propose approximating the variance–covariance matrix of

\hat{θ}

by using the inverse of the penalized Fisher information matrix. Specifically, we have that

\begin{matrix} \hat{Cov} (\hat{θ}) & \approx & J_{p}^{- 1} (θ) |_{\hat{θ}} . \end{matrix}

If we are interested in drawing inferences for

β

, the approximate variance–covariance matrix can be estimated by using the corresponding block-diagonal matrix obtained from

J_{p}^{- 1} (θ)

, similarly for

f

and

ϕ

.

3.7. On Degrees of Freedom and Smoothing Parameter

For the TPS-GLM, the degree of freedom (

d f

) associated with the smooth surface is given by (see, for instance Green and Silverman [7])

\begin{matrix} d f (λ_{f}) = tr (E^{⊤} S_{δ}), \end{matrix}

which approximately represents the number of effective parameters used in the modeling process to estimate the smooth surface f.

Regarding the selection of the smoothing parameter, we propose to use the Akaike Information Criterion (AIC) (see, for instance, [24,35]), defined as

\begin{matrix} A I C (λ_{f}) = - 2 L_{p} (θ, λ_{f}) |_{\hat{θ}} + 2 [1 + p + d f (λ_{f})], \end{matrix}

where

L_{p} (θ, λ_{f})

denote the penalized likelihood function evaluated at MPLE of

θ

, and p denote the number of parameters in

β

. As usual, the idea is to select the value of

λ_{f}

that minimizes

A I C (λ_{f})

.

4. Local Influence

In this section, we extend the local influence technique to evaluate the sensitivity of the MPLE under the TPS-GLM. Specifically, we present some theoretical aspects of the method and, subsequently, we derive the normal curvature for three perturbation schemes.

4.1. Local Influence Analysis

Consider

ω = {(ω_{1}, \dots, ω_{n})}^{⊤}

, an

n \times 1

vector of perturbations restricted to some open subset

Ω \subset R^{n}

. Let

L_{p} (θ, λ_{f} ∣ ω)

denote the logarithm of the perturbed penalized likelihood function. Assume there exists a vector of non-perturbation

ω_{0} \in Ω

, such that

L_{p} (θ, λ_{f} ∣ ω_{0}) = L_{p} (θ, λ_{f})

. To evaluate the influence of small perturbations on the MPL estimate

\hat{θ}

, we can consider the penalized likelihood displacement given by:

2 [L_{p} (\hat{θ}, λ_{f}) - L_{p} ({\hat{θ}}_{ω}, λ_{f})] \geq 0,

where

{\hat{θ}}_{ω}

is the MPL estimate under

L_{p} (θ, λ_{f} ∣ ω)

. The measure LD(

ω

) is useful for assessing the distance between

\hat{θ}

and

{\hat{θ}}_{ω}

. Cook [10] suggested examining the local behavior of LD(

ω

) around

ω_{0}

. The procedure involves selecting a unit direction

d \in Ω

, with

∥ d ∥ = 1

, and then plotting LD(

ω_{0} + a d

) against a, where

a \in R

. This plot, called the lifted line, can be characterized by considering the normal curvature

C_{d} (θ)

around

a = 0

. The suggestion is to assume the direction

d = d_{max}

corresponding to the largest curvature

C_{d_{max}} (θ)

. The index plot of

d_{max}

can identify those cases that, under small perturbations, have a significant potential influence on LD(

ω

). According to Cook [10], the normal curvature in the unit direction

d

is expressed as

C_{d} (θ) = - 2 {d^{⊤} Δ_{p}^{⊤} L_{p}^{- 1} Δ_{p} d},

with

\begin{matrix} L_{p} = \frac{\partial^{2} L_{p} (θ, λ_{f})}{\partial θ \partial θ^{⊤}} |_{θ = \hat{θ}} and Δ_{p} = \frac{\partial^{2} L_{p} (θ, λ_{f} | ω)}{\partial θ \partial ω^{⊤}} |_{θ = \hat{θ}, ω = ω_{0}} . \end{matrix}

Note that

- L_{p}

represents the penalized observed information matrix evaluated at

\hat{θ}

(see Section 3.2), and

Δ_{p}

is the penalized perturbation matrix evaluated at

\hat{θ}

and

ω_{0}

. It is essential to highlight that

C_{d} (θ)

denotes the local influence on the estimate

\hat{θ}

after perturbing the model or data. Escobar and Meeker [36] suggested examining the normal curvature in the direction

d = e_{i}

, where

e_{i}

is an

n \times 1

vector with a one at the ith position and zeros elsewhere. Consequently, the normal curvature, referred to as the total local influence of the ith case, takes the form

C_{e_{i}} (θ) = 2 | c_{i i} |

for

i \in {1, \dots, n}

, where

c_{i i}

is the ith principal diagonal element of the matrix

C = Δ_{p}^{⊤} L_{p}^{- 1} Δ_{p}

.

4.2. Derivation of the Normal Curvature

Typically, the perturbation schemes used in the analysis of local influence are determined by the structure of the model under consideration, as discussed by Billor and Loynes [37]. These schemes can generally be divided into two main categories: perturbations to the model (to examine changes in assumptions) or perturbations to the data. For instance, we might consider perturbing the response variable or the explanatory variables. The motivation for employing these perturbation schemes often includes addressing issues such as the presence of outliers or the occurrence of measurement errors in the data. Subsequently, we will present the formulas for the matrix

Δ_{p}

for various perturbation schemes.

Consider the weights assigned to the observations in the penalized log-likelihood function, given by:

\begin{matrix} L_{p} (θ, λ_{f} | ω) & = & L (θ | ω) - \sum_{i = 1}^{n} \frac{λ_{f}}{2} δ^{⊤} E δ, \end{matrix}

where

L (θ | ω) = \sum_{i = 1}^{n} ω_{i} L_{i} (θ)

,

ω = {(ω_{1}, \dots, ω_{n})}^{⊤}

is the vector of weights, with

0 \leq ω_{i} \leq 1

. In this case, the vector of no perturbation is given by

ω_{0} = 1_{(n \times 1)}

. Differentiating

L_{p} (θ, λ_{f} | ω)

with respect to the elements of

θ

and

ω^{T}

, we have that the matrix

Δ_{p}

takes the form

Δ_{p} = (\begin{matrix} X^{⊤} D_{τ} \\ E^{⊤} D_{τ} \\ {\hat{u}}^{⊤} \end{matrix}),

where the matrix

D_{τ} = {diag}_{1 \leq i \leq n} (τ_{i})

and

u = {(u_{1}, \dots, u_{n})}^{⊤}

, with

τ_{i} = {(a_{i} (ϕ))}^{- 1} (y_{i} - \partial ψ (h (η_{i})) / \partial h (η_{i})) \partial h (η_{i}) / \partial η_{i}

,

h (η_{i}) = ψ^{' - 1} (η_{i})

,

ψ^{' - 1} (\cdot)

denotes the inverse function of

ψ^{'} (\cdot)

,

u_{i} = - {(a_{i} (ϕ))}^{- 2} (y_{i} h (η_{i}) - ψ (h (η_{i})) + c^{'} (y_{i}, ϕ) e_{i n}^{⊤}

, and

e_{i n}

a vector with 1 at the ith position and zero elsewhere.

To perturb the response variable values, we consider

y_{i ω} = y_{i} + ω_{i}

for

i \in {1, \dots, n}

, where

ω = {(ω_{1}, \dots, ω_{n})}^{⊤}

is the vector of perturbations. The vector of no perturbation is

ω = {(0, \dots, 0)}^{⊤}

. The perturbed penalized log-likelihood function is constructed from Equation (3) with

y_{i}

replaced by

y_{i ω}

, as follows:

\begin{matrix} L_{p} (θ, λ_{g} | ω) & = & L (θ | ω) - \sum_{i = 1}^{n} \frac{λ_{f}}{2} δ^{⊤} E δ, \end{matrix}

where

L (\cdot)

is defined in Equation (2) with

y_{i ω}

replacing

y_{i}

. By differentiating

L_{p} (θ, λ ∣ ω)

with respect to the elements of

θ

and

ω_{i}

, and after some algebraic manipulation, we obtain:

Δ_{p} = (\begin{matrix} X^{⊤} D_{c} \\ E^{⊤} D_{c} \\ {\hat{d}}^{⊤} \end{matrix}),

where the matrix

D_{c} = {diag}_{1 \leq i \leq n} (c_{i})

and

d = {(d_{1}, \dots, d_{n})}^{⊤}

, with

c_{i} = \partial h (η_{i}) / \partial η_{i}

and

d_{i} = - {(a_{i} (ϕ))}^{- 2} (h (η_{i}) e_{i n}^{⊤} + c^{'} (y_{i ω}, ϕ) / \partial ω_{i})

, with

e_{i n}

denoting a vector with 1 at the ith position and zero elsewhere.

5. Applications

In this section, we show the applicability of the TPS-GLM and the local influence method with two real data applications. The model estimation and diagnostics have been implemented using MatLab 9.13.0 (R2022b) software [38] (the developed code is available on request by the authors).

5.1. Wypych Data

The first dataset we use to illustrate the applicability of the TPS-GLM consists of 83 sample points within a 46.6-hectare agricultural area in Wypych, located at latitude 24°50′24″ S and longitude 53°36′36″ W, with an average altitude ranging from 589 to 660 m. The data were collected during the 2006/2007 agricultural year in the western region of Paraná State, Brazil (see [39], Appendix 4). The soil is classified as Dystroferric Red Latosol with a clayey texture. The region’s climate is mesothermal, super-humid temperate, classified as Cfa according to (Köeppen), with a mean annual temperature of 21 °C. The 83 georeferenced points were determined by a regular grid of

75 \times 75

m using a global positioning system (GPS). The collected variables were as follows:

Soya: average of soybean yield (t/ha).
Height: average height (cm)of plants at the end of the production process.
Pods: average number of pods.
Lat: latitude (UTM).
Long: longitude (UTM).

The original objective was to investigate the spatial variability of soybean yield (Soya) in the studied area based on the covariates: average plant height, average number of pods per plant, latitude, and longitude. Figure 1 shows the scatterplots between the response variable Soya and the explanatory variables Height and Pods. In addition, the plot of the response variable against the coordinates is shown. Clearly, from Figure 1a,b, it can be seen that the explanatory variables Height (X2) and Pods (X3) are linearly related to the response variable Soya (Z). The spatial effect given by the coordinates (X,Y) will be incorporated into the model through a smooth surface.

5.1.1. Fitting the TPS-GLM

Based on the above analysis, we propose the TPS-GLM, introduced in Section 2, to model the trends present in the Wypych data. Specifically, we are going to assume that the response variable Soya belongs to the Gaussian family, and that the link function is the identity. Therefore, the model is expressed as follows:

g (μ_{i}) = μ_{i} = β_{0} + β_{1} {Height}_{i} + β_{2} {Pods}_{i} + f ({Lat}_{i}, {Long}_{i}) i \in {1, \dots, 83},

where

β = {(β_{0}, β_{1}, β_{2})}^{⊤}

correspond to the regression coefficients associated with the parametric component of the model, and

f (\cdot)

is a smooth surface. Table 1 lists the MPLE of

β

. The respective asymptotic standard errors are presented in parentheses.

The value of the smoothing parameter

λ_{f}

was selected in such a way that the AIC criterion was minimized. The adjusted determinant coefficients (R²(Adj)) are evaluated for assessing the goodness-of-fit of the two models. It is important to note that our model have a lower AIC and an higher R²(Adj), compared to the multiple regression model that does not consider the spatial effect. Figure 2a shows the QQ-plot for the standardized residuals, whose adjustment to the Gaussian TPS-GLM seems to be reasonable. However, the presence of some atypical observations is observed in one of the tails of the distribution. Figure 2b displays the scatter plot between the observed values, Soya, and their estimated values,

\hat{Soya}

. Considering the trend of the points, we conclude that the estimates are good, since they generate consistent adjusted values of the response variable.

5.1.2. Diagnostic Analysis

To identify potentially influential observations on the MPLE under the fitted Gaussian TPS-GLM for the Wypych data, we present several index plots of

B_{i} = B_{e_{i}} (γ)

for

γ = β, δ

. Figure 3 shows the index plot

B_{i}

for the case-weight perturbation scheme under the fitted model. Figure 3a reveal that the observations

# 6

,

# 61

,

# 69

and

# 71

are more influential on

\hat{β}

, whereas the observations

# 6

,

# 66

,

# 61

and

# 38

are more influential on

\hat{δ}

; see Figure 3b. When we perturb the response variable additively, we have that the observations

# 80

,

# 32

,

# 75

and

# 88

are more influential on

\hat{β}

; see Figure 4a. Regarding

\hat{δ}

, observations

# 3

,

# 42

and

# 80

appear as slightly influential as seem in Figure 4b.

We conclude that the maximum penalized likelihood estimates (MPLE) of the regression coefficients and the smooth surface exhibit sensitivity to modifications made to the data or the model. This analysis has shown that observations identified as influential for the parametric component do not necessarily exert influence on the non-parametric component, and vice versa. For instance, under the case-weight perturbation scheme, observations

# 69

and

# 71

were detected as influential for the parametric component, but not for the nonparametric component.

5.1.3. Confirmatory Analysis

Table 2 displays the relative changes experienced by each element in the vector of regression coefficients. In this analysis, we only consider the three most influential observations under the case-weight perturbation scheme. As can be seen in this table, observations

# 6

,

# 61

and

# 69

generate significant changes in the estimates. Still, no relevant inferential changes were noted. However, the AIC and &R²(Adj) present some differences once the above observations are dropped.

On the other hand, Table 3 shows the relative changes in the vector of regression coefficients under the additive perturbation scheme of the response variable. Here, we consider the four most influential observations. As can be seen in the table, observations

# 32

,

# 69

,

# 75

and

# 80

generate important relative changes in the estimates of the parametric component of the model. However, no significant inferential changes were observed. About the AIC and &R²(Adj), there are not evident differences.

5.2. Ozone Concentration Data

For our analysis, we utilize data from a study examining the relationship between atmospheric ozone concentration (O3) and various meteorological variables in the Los Angeles Basin for a sample of 330 days in 1976. The data were initially presented by Breiman and Friedman [40], and are available for download from various public repositories. Although the dataset includes several variables, in this application, we will consider only three explanatory variables, which are detailed in the following.

O3: daily maximum one-hour average ozone concentration in Upland, CA, measured in parts per million (ppm).
Temp: Sandburg Air Base temperature, in Celsius.
Vis: visibility, in miles.
Day: calendar day.

Figure 5 contains the dispersion graphs between the outcome variable (log(O3)) and each one of the explanatory variables Temp, Vis and Day.

Figure 5a shows a curved surface in the relationship between the variable log(O3) and the joint effect of the explanatory variables Temp and Day, whereas the relationship between log(O3) and the joint effect of the explanatory variable Vis and Day shows less curve; see Figure 5b. This graphical analysis recommends the inclusion in the model of a nonparametric component, specifically a surface, that can explain the relationship between log(O3) and the combined effect of the explanatory variables Temp and Day. For simplicity, in this work, we will include the effect of the explanatory variable Vis in a linear form. To begin our analysis, we are going to consider the fit of a GLM assuming that the variable of interest O3 is Poisson distributed with mean

μ_{i}

and logarithmic link function. Different structures of the linear predictor for the explanatory variables Vis, Temp, and Day will be considered (see Table 4).

For Model I, we consider only the individual effects of the explanatory variables Vis, Temp and Day. Note that all these effects were incorporated in a linear form in the systematic component of the model. For Model II, we consider the inclusion of a nonparametric term to model the nonlinear effects of the explanatory variable Day; see Ibacache et al. [41]. Model III considers a systematic component that contains the individual effects of the explanatory variables Vis, Temp and Day, in addition to the incorporation of the interaction effect between the explanatory variables Temp and Day. Here, the interaction effect is introduced linearly in the model. Model IV corresponds to a TPS-GLM where the joint effect of the Temp and Day explanatory variables is included nonlinearly by using smooth surface. Table 5 contains the ML and MPL estimates associated with the parametric component for the four fitted models.

It is important to note that both the individual and interaction effects are statistically significant, as the corresponding p-values (not shown here) are less than 0.05. Additionally, the estimates of

β_{0}

are similar across the four models, whereas the estimates of

β_{1}

vary considerably, particularly in Model IV. Concerning the associated standard errors, all the estimators exhibit small values. The last two rows of Table 5 display the Akaike Information Criterion (AIC) and

R^{2}

values, respectively. It is evident that the TPS-GLM, with AIC

(λ_{f}) = 1777.705

, provides the best fit to the Ozone data, followed by Model II with an AIC of 1806.837. This is corroborated by the QQ-plots in Figure 6, specifically Figure 6b,d. Furthermore, the

R^{2}

value associated with our model is higher than those of Models I, II, and III. The smoothing parameter

λ_{f}

was chosen such that the effective degrees of freedom were approximately 7. Figure 7 illustrates the 3D plot of the adjusted log(O3) against the explanatory variables Temp and Day, showing an adequate fit of the TPS-GLM.

5.2.1. Diagnostic Analysis

To identify potentially influential observations on the MPL estimators under the fitted TPS-GLM for the Ozone data, we present some index plots of

B_{i} = B_{e_{i}} (γ)

, for

γ = β, δ

. Figure 8 shows the index plot of

B_{i}

for the case-weight perturbation scheme under the fitted model. In Figure 8a,b, note that observations

# 167

,

# 220

,

# 168

, and

# 177

are more influential on

\hat{β}

and

\hat{δ}

, respectively. By perturbing the response variable additively, it becomes clear that observations

# 125

,

# 175

,

# 218

,

# 219

, and

# 221

are more influential on

\hat{β}

and

\hat{δ}

; see Figure 9a and Figure 9b, respectively.

From the local influence analysis, we conclude that the MPLE of the regression coefficients and the smooth surface are sensitive to perturbations in the data or the model. Furthermore, this analysis revealed that observations identified as influential for the parametric component are also influential for the nonparametric component, and vice versa. For example, under the case-weight perturbation scheme, observations

# 167

,

# 220

,

# 168

, and

# 177

were found to be influential for both the parametric and nonparametric components.

5.2.2. Confirmatory Analysis

To investigate the impact on model inference when influential potentially observations detected in the diagnostic analysis are removed, we present the relative changes (RCs) in the MPL estimate of

β_{j}

for

j \in {1, 2}

after removing the influential observations from the dataset (%). The RC is defined as

{RC}_{ξ} = |\frac{\hat{ξ} - {\hat{ξ}}_{(I)}}{\hat{ξ}}| \times 100 %

, where

{\hat{ξ}}_{(I)}

denotes the MPL estimate of

ξ

, with

ξ = β_{j}

, after the corresponding observation(s) are removed according to set I. Table 6 presents the RCs in the regression coefficient estimates after removing the observations identified as potentially influential for the parametric component of the model.

6. Concluding Remarks and Future Research

In this work, we study some aspects of the Thin-Plate Spline Generalized Linear Models. Specifically, we derive an iterative process to estimate the parameters and the Fisher information matrix to approximate, through its inverse, the variance–covariance matrix of the estimators. In addition, we extended the local influence method, obtaining closed expressions for the Hessian and perturbation matrices under cases-weight perturbation and additive perturbation of the response variable. We performed a statistical data analysis with two real data sets of the agronomic and environmental area. The study showed the advantage of incorporating a smooth surface to model the joint effect of a pair of explanatory variables or the spatial effect determined by the coordinates. In both applications, it was observed that the adjusted values of the response variable were consistent. In addition, it was observed that our model presented a better fit to model the soybean yield and ozone concentration data, compared to some classic parametric and semiparametric models, respectively. In our analysis, it was found that those observations detected as potentially influential generated important changes in the estimates, but not significant inferential changes. In addition, our study confirms the need to develop the local influence method to evaluate the sensitivity of maximum penalized likelihood estimators and thus determine those observations that can exert an excessive influence on both the parametric and non-parametric components, or on both.

As future work, we propose to incorporate a correlation component in the model and extend the local influence technique to other perturbation schemes, mainly on the non-parametric component of the model.

Author Contributions

Conceptualization, G.I.-P., P.P., M.A.U.-O. and O.N.; methodology, G.I.-P. and O.N.; software, G.I.-P. and P.P.; validation, G.I.-P. and P.P.; formal analysis, P.P.; investigation, G.I.-P., P.P., O.N. and M.A.U.-O.; data curation, P.P. and M.A.U.-O.; writing—original draft preparation, G.I.-P. and P.P.; writing—review and editing, O.N. and M.A.U.-O.; supervision, G.I.-P. and O.N.; project administration, O.N.; funding acquisition, O.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ANID-Fondecyt grant number 1201478.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data cannot be shared openly but are available on request from authors.

Acknowledgments

The third author acknowledges the support of ANID-Fondecyt 1201478.

Conflicts of Interest

The authors declare no conflicts of interest.

References

McCullagh, P.; Nelder, J.A. Generalized Linear Models, 2nd ed.; Chapman and Hall: London, UK, 1989. [Google Scholar]
Duchon, J. Interpolation des fonctions de deux variables suivant le principe de la flexion des plaques minces. RAIRO Anal. Numér. 1976, 10, 5–12. [Google Scholar] [CrossRef]
Duchon, J. Splines minimizing rotation-invariant semi-norms in Sobolev spaces. Lect. Notes Math. 1977, 57, 85–100. [Google Scholar]
Bookstein, F.L. Principal warps: Thin-plate splines and decomposition of deformations. IEEE Trans. Pattern Anal. Mach. Intell. 1989, 11, 567–585. [Google Scholar] [CrossRef]
Chen, C.; Li, Y.; Yan, C.; Dai, H.; Liu, G. A Thin Plate Spline-Based Feature-Preserving Method for Reducing Elevation Points Derived from LiDAR. Remote Sens. 2015, 7, 11344–11371. [Google Scholar] [CrossRef]
Wahba, G. Spline Models for Observational Data; SIAM: Philadelphia, PA, USA, 1990. [Google Scholar]
Green, P.J.; Silverman, B.W. Nonparametric Regression and Generalized Linear Models; Chapman and Hall: Boca Raton, FL, USA, 1994. [Google Scholar]
Wood, S.N. Thin plate regression splines. J. R. Stat. Soc. Ser. B (Methodol.) 2003, 65, 95–114. [Google Scholar] [CrossRef]
Moraga, M.S.; Ibacache-Pulgar, G.; Nicolis, O. On an elliptical thin-plate spline partially varying-coefficient model. Chil. J. Stat. 2021, 12, 205–228. [Google Scholar]
Cook, R.D. Assessment of Local Influence. J. R. Stat. Soc. Ser. B (Methodol.) 1986, 48, 133–169. [Google Scholar] [CrossRef]
Thomas, W.; Cook, R.D. Assessing influence on regression coefficients in generalized linear models. Biometrika 1989, 76, 741–749. [Google Scholar] [CrossRef]
Ouwens, M.N.M.; Tan, F.E.S.; Berger, M.P.F. Local influence to detect influential data structures for generalized linear mixed models. Biometrics 2001, 57, 1166–1172. [Google Scholar] [CrossRef]
Zhu, H.; Lee, S. Local influence for incomplete-data models. J. R. Stat. Soc. Ser. B 2001, 63, 111–126. [Google Scholar] [CrossRef]
Zhu, H.; Lee, S. Local influence for generalized linear mixed models. Can. J. Stat. 2003, 31, 293–309. [Google Scholar] [CrossRef]
Espinheira, P.L.; Ferrari, P.L.; Cribari-Neto, F. Influence diagnostics in beta regression. Comput. Stat. Data Anal. 2008, 52, 4417–4431. [Google Scholar] [CrossRef]
Rocha, A.; Simas, A. Influence diagnostics in a general class of beta regression models. TEST 2001, 20, 95–119. [Google Scholar] [CrossRef]
Ferrari, S.; Spinheira, P.; Cribari-Neto, F. Diagnostic tools in beta regression with varying dispersion. Stat. Neerl. 2011, 65, 337–351. [Google Scholar] [CrossRef]
Ferreira, C.S.; Paula, G.A. Estimation and diagnostic for skew-normal partially linear models. J. Appl. Stat. 2017, 44, 3033–3053. [Google Scholar] [CrossRef]
Emami, H. Local influence for Liu estimators in semiparametric linear models. Stat. Pap. 2018, 59, 529–544. [Google Scholar] [CrossRef]
Liu, Y.; Mao, G.; Leiva, V.; Liu, S.; Tapia, A. Diagnostic Analytics for an Autoregressive Model under the Skew-Normal Distribution. Mathematics 2020, 8, 693. [Google Scholar] [CrossRef]
Thomas, W. Influence diagnostics for the cross-validated smoothing parameter in spline smoothing. J. Am. Stat. Assoc. 1991, 9, 693–698. [Google Scholar] [CrossRef]
Ibacache, G.; Paula, G.A. Local Influence for student-t partially linear models. Comput. Stat. Data Anal. 2011, 55, 1462–1478. [Google Scholar] [CrossRef]
Ibacache-Pulgar, G.; Paula, G.A.; Galea, M. Influence diagnostics for elliptical semiparametric mixed models. Stat. Model. 2012, 12, 165–193. [Google Scholar] [CrossRef]
Ibacache, G.; Paula, G.A.; Cysneiros, F. Semiparametric additive models under symmetric distributions. Test 2013, 22, 103–121. [Google Scholar] [CrossRef]
Zhang, J.; Zhang, X.; Ma, H.; Zhiya, C. Local influence analysis of varying coefficient linear model. J. Interdiscip. Math. 2015, 3, 293–306. [Google Scholar] [CrossRef]
Ibacache-Pulgar, G.; Reyes, S. Local influence for elliptical partially varying coefficient model. Stat. Model. 2018, 18, 149–174. [Google Scholar] [CrossRef]
Ibacache-Pulgar, G.; Figueroa-Zuñiga, J.; Marchant, C. Semiparametric additive beta regression models: Inference and local influence diagnostics. REVSTAT-Stat. J. 2019, 19, 255–274. [Google Scholar]
Cavieres, J.; Ibacache-Pulgar, G.; Contreras-Reyes, J. Thin plate spline model under skew-normal random errors: Estimation and diagnostic analysis for spatial data. J. Stat. Comput. Simul. 2023, 93, 25–45. [Google Scholar] [CrossRef]
Jeldes, N.; Ibacache-Pulgar, G.; Marchant, C.; López-Gonzales, J.L. Modeling Air Pollution Using Partially Varying Coefficient Models with Heavy Tails. Mathematics 2022, 10, 3677. [Google Scholar] [CrossRef]
Saavedra-Nievas, J.C.; Nicolis, O.; Galea, M.; Ibacache-Pulgar, G. Influence diagnostics in Gaussian spatial—Temporal linear models with separable covariance. Environ. Ecol. Stat. 2023, 30, 131–155. [Google Scholar] [CrossRef]
Sánchez, L.; Ibacache-Pulgar, G.; Marchant, C.; Riquelme, M. Modeling Environmental Pollution Using Varying-Coefficients Quantile Regression Models under Log-Symmetric Distributions. Axioms 2023, 12, 976. [Google Scholar] [CrossRef]
Green, P.J. Penalized Likelihood for General Semi-Parametric Regression Models. Int. Stat. Rev. 1987, 55, 245–259. [Google Scholar] [CrossRef]
Nelder, J.A.; Wedderburn, R.W.M. Generalized Linear Models. J. R. Stat. Soc. Ser. A (Gen.) 1972, 135, 370–384. [Google Scholar] [CrossRef]
Wood, S.N. Generalized Additive Models: An Introduction with R, 2nd ed.; Chapman and Hall/CRC: Boca Raton, FL, USA, 2017. [Google Scholar]
Akaike, H. Information theory as an extension of the maximum likelihood principle. In Proceedings of the Second International Symposium on Information Theory; Petrov, B.N., Csaki, F., Eds.; Academiai Kiado: Budapest, Hungary, 1973. [Google Scholar]
Escobar, L.A.; Meeker, W.Q. Assessing Influence in Regression Analysis with Censored Data. Biometrics 1992, 48, 507–528. [Google Scholar] [CrossRef] [PubMed]
Billor, N.; Loynes, R.M. Local influence: A new approach. Comm. Statist. Theory Meth. 1993, 22, 1595–1611. [Google Scholar] [CrossRef]
MathWorks Inc. MATLAB Version: 9.13.0 (R2022b); The MathWorks Inc.: Natick, MA, USA, 2022; Available online: https://www.mathworks.com (accessed on 10 October 2022).
Uribe-Opazo, M.A.; Borssoi, J.A.; Galea, M. Influence diagnostics in Gaussian spatial linear models. J. Appl. Stat. 2012, 3, 615–630. [Google Scholar] [CrossRef]
Breiman, L.; Friedman, J.H. Estimating optimal transformations for multiple regression and correlation. J. Am. Stat. Assoc. 1985, 80, 580–598. [Google Scholar] [CrossRef]
Ibacache-Pulgar, G.; Lira, V.; Villegas, C. Assessing Influence on Partially Varying-coefficient Generalized Linear Model. REVSTAT-Stat. J. 2022. Available online: https://revstat.ine.pt/index.php/REVSTAT/article/view/507 (accessed on 10 October 2022).

Figure 1. Scatter plots: Soya versus Height (a), Soya versus Pods (b), and Soya versus coordinates in UTM (c).

Figure 2. QQ-plots of the standardized residuals for the TPS-GLM with its confidence interval (dashed lines) (a) and scatterplot between Soya and

\hat{Soya}

(b), under model fitted to Wypych data.

Figure 2. QQ-plots of the standardized residuals for the TPS-GLM with its confidence interval (dashed lines) (a) and scatterplot between Soya and

\hat{Soya}

(b), under model fitted to Wypych data.

Figure 3. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering case-weight perturbation.

Figure 3. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering case-weight perturbation.

Figure 4. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering response variable additive perturbation.

Figure 4. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering response variable additive perturbation.

Figure 5. 3D plots between the response variable and the explanatory variables: logarithm of ozone data versus temperature and day variables (a), and logarithm of ozone data versus visibility and day variables (b).

Figure 6. QQ-plot of the standardized residuals for the models described in Table 5: Model I (a), Model II (b), Model III (c) and Model IV (d).

Figure 7. 3D plot between

\hat{\log (μ)}

and explanatory variables Temp and Day.

Figure 7. 3D plot between

\hat{\log (μ)}

and explanatory variables Temp and Day.

Figure 8. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering case-weight perturbation under model fitted to Ozone data.

Figure 8. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering case-weight perturbation under model fitted to Ozone data.

Figure 9. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering response variable additive perturbation.

Figure 9. Index plots of

B_{i}

for assessing local influence on

\hat{β}

(a) and

\hat{δ}

(b), considering response variable additive perturbation.

Table 1. MPLEs with their standard errors (within parenthesis), AIC and R²(Adj).

	Model
Parameters	Gaussian Linear	TPS-GLM
$β_{0}$	1.1921 (0.672)	0.497 (0.751)
$β_{1}$	0.0116 (0.0128)	0.032 (0.015)
$β_{2}$	0.0339 (0.0079)	0.030 (0.008)
AIC	149.99	139.9992
R²(Adj)	0.168	0.315

Table 2. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

in cases-weight perturbation under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

Table 2. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

in cases-weight perturbation under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

	Parameters and Relatives Changes
Dropped Obs.	$β_{0}$	$β_{1}$	$β_{2}$	${RC}_{β_{0}}$	${RC}_{β_{1}}$	${RC}_{β_{2}}$	AIC	R²(Adj)
6	1.696	0.009	0.023	122.59	63.93	27.52	125.50	0.267
61	0.987	0.022	0.027	29.47	15.19	12.23	131.96	0.356
69	0.432	0.035	0.028	43.33	35.04	10.85	138.07	0.326
6-61	1.996	0.002	0.022	161.89	92.06	27.65	116.20	0.218
6-69	1.481	0.014	0.023	94.35	45.61	26.09	124.52	0.268
61-69	0.704	0.029	0.028	7.59	9.58	10.93	131.03	0.358
6-61-69	1.868	0.005	0.023	145.16	81.03	26.90	115.83	0.310

Table 3. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

in response variable perturbation under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

Table 3. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

in response variable perturbation under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

	Parameters and Relatives Changes
Dropped Obs.	$β_{0}$	$β_{1}$	$β_{2}$	${RC}_{β_{0}}$	${RC}_{β_{1}}$	${RC}_{β_{2}}$	AIC	R²(Adj)
32	0.701	0.027	0.029	8.000	4.62	5.23	138.73	0.319
69	0.432	0.034	0.027	43.33	29.0	12.22	138.07	0.326
75	0.760	0.028	0.027	0.24	6.22	12.65	139.44	0.307
80	0.699	0.028	0.029	8.21	7.86	8.16	139.01	0.311
32-69	0.382	0.035	0.030	49.87	32.82	4.00	136.81	0.33
32-75	0.700	0.027	0.030	8.17	4.12	4.90	138.11	0.319
32-80	0.621	0.028	0.031	18.49	6.34	0.16	137.63	0.316
69-75	0.430	0.035	0.028	43.61	34.96	10.96	137.53	0.318
69-80	0.333	0.036	0.029	56.33	38.32	5.39	136.99	0.322
75-80	0.695	0.028	0.029	8.85	7.25	7.90	138.39	0.302
32-69-75	0.381	0.035	0.030	49.98	32.33	3.74	136.22	0.322
32-75-80	0.621	0.028	0.031	18.54	5.23	0.96	136.93	0.308
69-75-80	0.333	0.036	0.029	56.26	37.40	5.09	136.42	0.314
32-69-75-80	0.271	0.035	0.032	64.47	30.15	3.22	134.96	0.320

Table 4. Four structures of the linear predictor for the explanatory variables Vis, Temp, and Day, assuming that the response variable log(O3) follows a

POISSON (μ_{i})

distribution.

Table 4. Four structures of the linear predictor for the explanatory variables Vis, Temp, and Day, assuming that the response variable log(O3) follows a

POISSON (μ_{i})

distribution.

Model	$g (μ_{i}) = \log (μ_{i})$
I	$β_{0} + β_{1} {Vis}_{i} + β_{2} {Temp}_{i} + β_{3} {Day}_{i}$
II	$β_{0} + β_{1} {Vis}_{i} + β_{2} {Temp}_{i} + f ({Day}_{i})$
III	$β_{0} + β_{1} {Vis}_{i} + β_{2} {Temp}_{i} + β_{3} {Day}_{i} + β_{4} {Temp}_{i} \times {Day}_{i}$
IV	$β_{0} + β_{1} {Vis}_{i} + f ({Temp}_{i}, {Day}_{i})$

Table 5. AIC, R²(Adj), ML and MPL estimates for all four fitted models to the Ozone data.

Parameters	I	II	III	IV
$β_{0}$	0.577 (0.104)	0.478 (0.142)	0.787 (0.198)	2.507 (0.040)
$β_{1}$	−0.002 (0.0003)	−0.002 (0.0003)	−0.002 (0.0003)	−0.002 (0.0003)
$β_{2}$	0.035 (0.001)	0.033 (0.002)	0.032 (0.003)	-
$β_{3}$	−0.001 (0.002)	-	−0.002 (0.001)	-
$β_{4}$	-	-	0.00002 (0.00002)	-
AIC	1887.312	1806.837	1887.757	1789.92
R²(Adj)	0.673	0.715	0.670	0.728

Table 6. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

Table 6. Relative changes (RCs) (in %) in the MPL estimates of

β_{j}

under the TPS-GLM. The last two columns indicate the AIC and R²(Adj) of the model with dropped observations.

Dropped Obs.	$β_{0}$	$β_{1}$	${RC}_{β_{0}}$	${RC}_{β_{1}}$	AIC	R²(Adj)
167	2.513	−0.002	0.231	0.378	1777.07	0.737
175	2.506	−0.002	0.051	1.673	1784.58	0.724
219	2.540	−0.002	1.334	7.105	1784.44	0.725
220	2.507	−0.002	0.012	0.263	1777.87	0.728
167-175	2.511	−0.002	0.169	1.052	1771.76	0.734
167-219	2.538	−0.002	1.242	6.853	1771.58	0.735
167-220	2.511	−0.002	0.179	0.884	1765.08	0.738
175-219	2.538	−0.002	1.248	7.368	1779.10	0.722
175-220	2.504	−0.002	0.104	0.684	1772.55	0.725
219-220	2.507	−0.002	0.007	0.289	1772.807	0.725
167-175-219	2.536	−0.002	1.155	7.136	1766.269	0.732
167-175-220	2.512	−0.002	0.215	3.415	1759.79	0.735
175-219-220	2.504	−0.002	0.098	0.678	1767.484	0.721
167-175-219-220	2.534	−0.002	1.072	7.800	1754.761	0.731

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ibacache-Pulgar, G.; Pacheco, P.; Nicolis, O.; Uribe-Opazo, M.A. Local Influence for the Thin-Plate Spline Generalized Linear Model. Axioms 2024, 13, 346. https://doi.org/10.3390/axioms13060346

AMA Style

Ibacache-Pulgar G, Pacheco P, Nicolis O, Uribe-Opazo MA. Local Influence for the Thin-Plate Spline Generalized Linear Model. Axioms. 2024; 13(6):346. https://doi.org/10.3390/axioms13060346

Chicago/Turabian Style

Ibacache-Pulgar, Germán, Pablo Pacheco, Orietta Nicolis, and Miguel Angel Uribe-Opazo. 2024. "Local Influence for the Thin-Plate Spline Generalized Linear Model" Axioms 13, no. 6: 346. https://doi.org/10.3390/axioms13060346

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Local Influence for the Thin-Plate Spline Generalized Linear Model

Abstract

1. Introduction

2. The Thin-Plate Spline Generalized Linear Model (TPS-GLM)

2.1. Statistical Model

2.2. Penalized Function

3. Estimation and Inference

3.1. Penalized Score Function

3.2. Penalized Hessian Matrix

3.3. Penalized Expected Information Matrix

3.4. Derivation of the Iterative Process

3.5. Estimation of Surface

3.6. Approximate Standard Errors

3.7. On Degrees of Freedom and Smoothing Parameter

4. Local Influence

4.1. Local Influence Analysis

4.2. Derivation of the Normal Curvature

5. Applications

5.1. Wypych Data

5.1.1. Fitting the TPS-GLM

5.1.2. Diagnostic Analysis

5.1.3. Confirmatory Analysis

5.2. Ozone Concentration Data

5.2.1. Diagnostic Analysis

5.2.2. Confirmatory Analysis

6. Concluding Remarks and Future Research

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI