Next Article in Journal
Analysis of the Dynamic Response as a Basis for the Efficient Protection of Large Structure Health Using Controllable Frequency-Controlled Drives
Next Article in Special Issue
Non-Parametric Non-Inferiority Assessment in a Three-Arm Trial with Non-Ignorable Missing Data
Previous Article in Journal
Low-Temperature Industrial Waste Heat (IWH) Recovery Using a New Design for Fast-Charging Thermal Energy Storage Units
Previous Article in Special Issue
Application of an Empirical Best Linear Unbiased Prediction Fay–Herriot (EBLUP-FH) Multivariate Method with Cluster Information to Estimate Average Household Expenditure
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation Curve of Mixed Spline Truncated and Fourier Series Estimator for Geographically Weighted Nonparametric Regression

1
Department of Statistics, Institut Teknologi Sepuluh Nopember, Surabaya 60111, Indonesia
2
Department of Statistics, Faculty of Mathematics and Natural Sciences, Universitas Halu Oleo, Kendari 93132, Indonesia
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(1), 152; https://doi.org/10.3390/math11010152
Submission received: 29 November 2022 / Revised: 18 December 2022 / Accepted: 23 December 2022 / Published: 28 December 2022
(This article belongs to the Special Issue Statistical Methods in Data Science and Applications)

Abstract

:
Geographically Weighted Regression (GWR) is the development of multiple linear regression models used in spatial data. The assumption of spatial heterogeneity results in each location having different characteristics and allows the relationships between the response variable and each predictor variable to be unknown, hence nonparametric regression becomes one of the alternatives that can be used. In addition, regression functions are not always the same between predictor variables. This study aims to use the Geographically Weighted Nonparametric Regression (GWNR) model with a mixed estimator of truncated spline and Fourier series. Both estimators are expected to overcome unknown data patterns in spatial data. The mixed GWNR model estimator is then determined using the Weighted Maximum Likelihood Estimator (WMLE) technique. The estimator’s characteristics are then determined. The results of the study found that the estimator of the mixed GWNR model is an estimator that is not biased and linear to the response variable y.
MSC:
62J05; 62J10; 62J12; 62J20

1. Introduction

Regression analysis is a statistical method used to determine the relationship between response variables and one or more predictor variables [1]. Regression is divided into three types: parametric regression, nonparametric regression, and semiparametric regression. Parametric regression is used when the shape of the regression curve is known, whether linear, quadratic, cubic or otherwise. Whereas, nonparametric regression is used when the shape of the regression curve is unknown. As for semiparametric regression, it is a combination of parametric and nonparametric regression.
Nonparametric regression is a method used to model regression curves of unknown shape [2]. This method is a more flexible approach because the data is expected to look for the estimation form of the regression curve itself without being influenced by the researcher’s subjectivity factor [3]. Some of the estimators used in nonparametric regression include spline estimators, Fourier series, kernels, and local polynomials. Each estimator has its characteristics to approach unknown regression functions. Research on nonparametric regression has been widely conducted by single estimators [4,5] and mixed estimators [6,7]. However, its application is still limited to nonspatial data. In fact, there are many problems related to spatial data. Spatial data is data that contains size and location information [8]. Methods used in spatial data analysis include Spatial Autocorrelation, Spatial Error Model, Geographically weighted regression, and others.
According to [8], Geographically Weighted Regression (GWR) is a statistical method that can analyze spatial heterogeneity. Spatial heterogeneity is one of the same predictor variables exerting unequal influences on different locations within a study site. The GWR model generates an estimator of model parameters that are local to each point or location where the data is observed. Research on GWR by [9,10] shows that the GWR model is better than the global model that can overcome spatial heterogeneity. It was further developed on a nonparametric GWR with a single estimator [11,12,13]. In addition, the relationship between response variables and some predictor variables can vary [14]. Therefore, a nonparametric GWR model with mixed truncated spline and Fourier series would be developed. The truncated spline estimator in GWR is expected to overcome the changing curve pattern at certain sub intervals [11]. In contrast, the Fourier series estimator is expected to model the repeating data pattern [6]. This study aims to create a Geographically Weighted Nonparametric Regression (GWNR) model with a mixed truncated spline and Fourier series estimator, to determine the parameter estimate of the GWNR model with the Weighted Maximum Likelihood Estimator method, and evaluate the properties the mixed GWNR model.
The following discussion in this paper is divided into three main topics. Section 2 discusses the GWNR Model and method estimation of WMLE. Section 3 presents the estimation parameter model GWNR; unbiased and linear estimator properties; and data application. Section 4 is the conclusions.

2. Materials and Methods

2.1. Geographically Weighted Nonparametric Regression (GWNR)

The GWNR model is a development of GWR in nonparametric regression. Provided paired data ( x 1 i , , x P i , z 1 i , , z Q i , y i ) and assumed relationships between predictor variables ( x 1 i , , x P i , z 1 i , , z Q i ) with response variables ( y i ) following a multivariable regression model [11] are as follows:
y i = μ ( x 1 i , , x P i , z 1 i , , z Q i ) + ε i ,   i = 1 , 2 , , n
where, y i is a response variable, μ ( x 1 i , , x P i , z 1 i , , z Q i ) is a regression curve of unknown shape, P is the predictor variables with a truncated spline function, Q is the predictor variables approached with Fourier series functions, and n is a number of observations and is assumed to be additive. If the function μ ( x 1 i , , x P i , z 1 i , , z Q i ) is approached with truncated spline functions and Fourier series, then Equation (1) can be written:
y i = μ ( x 1 i , , x P i , z 1 i , , z Q i ) + ε i
y i = p = 1 P f p ( x p i ) + q = 1 Q g q ( z q i ) + ε i ,   i = 1 , 2 , , n
where,
p = 1 P f p ( x p i ) = f 1 ( x 1 i ) + + f P ( x P i ) = X 1 β 1 ( u i , v i ) + + X P β P ( u i , v i ) = [ X 1 X P ] [ β 1 ( u i , v i ) β P ( u i , v i ) ] = X β ( u i , v i )
is a truncated spline component with P predictor variables and
q = 1 Q g q ( z q i ) = g 1 ( z 1 i ) + + g Q ( z Q i ) = Z 1 a 1 ( u i , v i ) + + Z Q a Q ( u i , v i ) = [ Z 1 Z Q ] [ a 1 ( u i , v i ) a Q ( u i , v i ) ] = Z a ( u i , v i )
is a Fourier series component with Q other predictor variables. By matrix notation, it can be written with:
y = X β ( u i , v i ) + Z a ( u i , v i ) + ε
where:
y = [ y 1 y n ] ,   ε = [ ε 1 ε n ]
X = [ X 1 X P ]
Z = [ Z 1 Z Q ]
β ( u i , v i ) = [ β 1 T ( u i , v i ) β P T ( u i , v i ) ] T
a ( u i , v i ) = [ a 1 T ( u i , v i ) a Q T ( u i , v i ) ] T
( u i , v i ) = ( longitude ,   latitude ) ,   i = 1 , 2 , , n  

2.2. Weighted Maximum Likelihood Estimator (WMLE)

Maximum likelihood estimation from the parameters μ and σ 2 with a distributed n-sized sample y i N ( μ , σ 2 ) , i = 1 , 2 , , n can be written with:
f ( y 1 , y 2 , , y n ) = i = 1 n 1 2 π σ 2 exp ( 1 2 ( y i μ σ ) ) 2
so that the likelihood function for y i , i = 1 , 2 , , n is
L ( μ , σ 2 | y i ) = ( 2 π ) n 2 ( σ 2 ) n 2 exp ( 1 2 σ 2 i = 1 n ( y i μ ) 2 )
Next, by multiplying by the weighting matrix W = d i a g ( w 1 i * , w 2 i * , , w n i * ) , the log likelihood function is obtained in terms of weighted function w i i * as [15]:
ln L * ( μ , σ 2 ( u i * , v i * ) | y i ) = i = 1 n w i i * ln ( 1 2 π σ 2 ( u i * , v i * ) exp ( 1 2 ( y i μ σ ( u i * , v i * ) ) 2 ) ) ,   i = 1 , 2 , , n
The role of weights in the GWNR model is very important because the weighting values will represent the location of observational data from one another. One method that can be used is the Gaussian Kernel function [8]. Equation (4) estimates the GWNR parameter with the WMLE method.
Furthermore, steps are given in estimating the parameters of a mixed GWNR model with the WMLE Method as follows:
  • Defining a mixed GWNR model
  • Assuming distribution ε
  • Determining the distribution of y
  • Forming a likelihood function
  • Forming a weighted likelihood function
  • Specifying the first partial derivative of the likelihood function against the mixed GWNR model parameter
  • Getting an estimate of mixed GWNR model parameters.

3. Results

3.1. Parameter Estimation

Estimation of parameters on GWNR models with mixed estimators uses Weighted Maximum Likelihood Estimator (WMLE). The WMLE method is obtained by knowing the distribution of the response variable y i , i = 1 , 2 , , n in advance. Then, it is determined by the weighting matrix for each location to i , i = 1 , 2 , , n . The weighting used is a Fixed Gaussian Kernel function. Next, it is given the form of a distribution of the GWNR model that is presented on Lemma 1.
Lemma 1. 
Given the GWNR model in Equation (2), with  ε i , i = 1 , 2 , , n  is normally distributed with mean equal to zero and variance  σ 2 ( u i , v i ) , hence  y i , i = 1 , 2 , , n  is normally distributed with mean
β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + m + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z q i ) )
and variance,  σ 2 ( u i , v i ) .
where:
  • P: number of spline components
  • M: polynomial degree of spline
  • R: number of knot points
  • Q: number of Fourier components
  • H: number of oscillation parameters.
Lemma 1 has been proven in Appendix A.
Theorem 1. 
If given a model on Equation (2) with  ε i , i = 1 , 2 , , n  normally distributed with mean zero and variance  σ 2 ( u i , v i )  and the weighted likelihood function given to (4), by the MLE method, an estimator is obtained  β ^ ( u i , v i )  and  a ^ ( u i , v i )  as follows:
β ^ ( u i , v i ) = A ( t , h ) y  
a ^ ( u i , v i ) = B ( t , h ) y
where:
A ( t , h ) = R ( X T W ( u i , v i ) X ) 1 [ X T W ( u i , v i ) X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ] Β ( t , h ) = S ( Z T W ( u i , v i ) Z ) 1 [ Z T W ( u i , v i ) Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ]
  • t = knot point for spline component
  • h = oscillation parameter component.
Proof of Theorem 1. 
Is given to Appendix B. □

3.2. Unbiased and Linear Estimator Properties

Lemma 2. 
If  β ^ ( u i , v i ) is a truncated spline component parameter estimator of the GWNR model with a mixed estimator approach that follows Equation (4), so  β ^ ( u i , v i ) is an unbiased estimator and belongs to the class of linear estimators in observation y.
Furthermore, it can be seen in Appendix C which is the proof of Lemma 2.
Lemma 3. 
If  a ^ ( u i , v i ) is a Fourier series component parameter estimator of the GWNR model with a mixed estimator approach that follows Equation (4), so  a ^ ( u i , v i ) is an unbiased estimator and belongs to the class of linear estimators in observation y.
Lemma 3 is the last proven lemma and is described in Appendix D.
Lemma 4. 
If  β ^ ( u i , v i )  and  a ^ ( u i , v i )  are given by Theorem 1, hence the estimator for  f ^ g ^  and  μ ^  is hence given by:
f ^ = X β ^ ( u i , v i )  
g ^ = Z a ^ ( u i , v i )
so that the following is obtained:
μ ^ = X β ^ ( u i , v i ) + Z a ^ ( u i , v i ) = C ( t , h ) y
Proof.  
Next, to determine the function estimator f ^ , g ^ and μ ^ ( u i , v i ) are described as follows. Based on Theorem 1, it can be substituted β ^ ( u i , v i ) and a ^ ( u i , v i ) so that it is obtained:
f ^ = X β ^ ( u i , v i ) = X A ( t , h ) y
and
g ^ = Z a ^ ( u i , v i ) = Z B ( t , h ) y
As a result, obtained estimator μ ^ be
μ ^ = f ^ + g ^ = X β ^ ( u i , v i ) + Z a ^ ( u i , v i ) = X A ( t , h ) y + Z B ( t , h ) y = C ( t , h ) y
where C ( t , h ) = X A ( t , h ) + Z B ( t , h ) is a hat matrix containing knot points t and the parameter of the h oscillation on the mixed GWNR model with the approach of truncated spline and Fourier series estimator. □

3.3. Data Application

The data used are secondary data from [16,17,18,19,20,21] with research variables, percentage of the poor population ( y ) , CPI ( x 1 ) , TPT ( x 2 ) , longitude and latitude coordinates ( u i , v i ) , and as many as 81 districts/cities on Sulawesi Island.
The following steps for applying the estimated parameters of the GWNR model to poverty data on Sulawesi Island in 2020 are as follows:
  • Making a scatter plot between the variables x 1 and y, as well as x 2 and y
  • Defining the initial model
  • Selecting optimum knots and oscillation parameters
  • Estimating parameters of global model with the OLS method based on the initial model formed
  • Testing assumptions of spatial heterogeneity on residual values on global models
  • Determining the weighting matrix
  • Estimating parameters of the GWNR model with the WMLE method
  • Choosing the best model based on MSE and R2
  • Making conclusions
Based on the above step, the variable x 1 as a component of the Fourier series and the variable x 2 as the spline component are obtained based on the scatter plot shown. Furthermore, the test of spatial assumptions is obtained that the assumption of spatial heterogeneity is met, so the global model is less suitable for use because the residual properties are not homogeneous. One alternative model that can be used is the GWNR model. Use of this GWNR model is expected to overcome heteroskedasticity by generating a local model for each location. Here are some local models generated:
y ^ k e n = 47.59 0.5 x 1 0.17 x 2 + 0.43 cos ( x 1 ) + 0.22 ( x 2 8.55 )
y ^ m k s = 49.9 0.58 x 1 0.07 x 2 + 0.08 cos ( x 1 ) + 0.47 ( x 2 8.55 )
y ^ m a n = 44.19 0.46 x 1 0.15 x 2 + 0.11 cos ( x 1 ) + 0.19 ( x 2 8.55 )
y ^ p a l = 40.79 0.47 x 1 + 0.3 x 2 0.48 cos ( x 1 ) 0.27 ( x 2 8.55 )
y ^ g o r = 44.12 0.3 x 1 2.25 x 2 0.05 cos ( x 1 ) + 2.38 ( x 2 8.55 )
y ^ m a j = 39.88 0.43 x 1 + 0.04 x 2 + 0.02 cos ( x 1 ) + 0.22 ( x 2 8.55 )
where:
  • yken = estimated poverty percentage for Kendari City
  • ymks = estimated poverty percentage for Makassar City
  • yman = estimated poverty percentage for Manado City
  • ypal = estimated poverty percentage for Palu City
  • ygor = estimated poverty percentage for Gorontalo City
  • ymaj = estimated poverty percentage for Mamuju City.
GWNR mixed with oscillation parameters k =1 and linear spline t =1 resulted in MSE and R2 values of 3.65 and 74.65 per cent, respectively. Based on several local models above, it shows that poverty in Sulawesi Island is influenced by HDI and TPT, where the increasing HDI will result in a decrease in the percentage of poverty. Conversely, an increase in TPT will increase the percentage of poverty.

4. Conclusions

Estimation of GWNR using the truncated spline and Fourier series was successfully formulated. It was found that:
  • The GWNR model using a mixed estimator of truncated spline and Fourier series is
    y = X β ( u i , v i ) + Z a ( u i , v i ) + ε
    Where f = X β ( u i , v i ) is a truncated spline component, g = Z a ( u i , v i ) is a component of a Fourier series, and ε is a residual component.
  • Estimators of GWNR are β ^ ( u i , v i ) = A ( t , h ) y , a ^ ( u i , v i ) = B ( t , h ) y , and μ ^ = C ( t , h ) y . The estimator is an unbiased and linear estimator to observe the response variable.

Author Contributions

Conceptualization: L.L., I.N.B. and V.R.; methodology, L.L. and I.N.B.; writing-original draft preparation, L.L.; writing—review and editing, L.L., I.N.B. and V.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Deputi Bidang Penguatan Riset dan Pengembangan, Ministry of Research and Technology/National Research and Innovation Agency (Kemenristek), the Republic of Indonesia, via grant Penelitian Disertasi Doktor (PDD) in 2022.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

From the Equation (2), it is assumed that ε i N ( 0 , σ 2 ( u i , v i ) ) so that the probability function ε 1 , ε 2 , , ε n written with
f ( ε 1 , ε 2 , , ε n ) = i = 1 n { 1 2 π σ 2 ( u i , v i ) exp ( 1 2 σ 2 ( u i , v i ) ε i 2 ) }
Therefore, the likelihood function for ε i , i = 1 , 2 , , n is as follows:
L ( β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) | ε ) = i = 1 n [ 1 2 π σ 2 ( u i , v i ) exp [ 1 2 σ 2 ( u i , v i ) [ y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ] 2 ] ]
Due to the fact that ε i N ( 0 , σ 2 ( u i , v i ) ) , therefore
E ( y ) = E ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) + ε i ) = E ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) + E ( ε i ) = β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) )
and
v a r ( y ) = v a r ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) + ε i ) = v a r ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) + var ( ε i ) = σ 2 ( u i , v i )
Consequently,
y i N ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) , σ 2 ( u i , v i ) )

Appendix B

Equation (A1) obtained the likelihood function of y i , i = 1 , 2 , , n and at its location is
L ( β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) | ε ) = i = 1 n f ( y i | β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) ) = i = 1 n [ 1 2 π σ 2 ( u i , v i ) exp [ 1 2 σ 2 ( u i , v i ) [ y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ] 2 ] ] = ( 2 π ) n 2 ( σ 2 ( u i , v i ) ) n 2 exp [ 1 2 σ 2 ( u i , v i ) [ y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ] 2 ]
The geographical location factor is the weighting factor in the GWR model, so Equation (A2) is given a weighting w i ( j ) to obtain the local model of GWNR, then a natural logarithm operation is performed as follows:
L * ( β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) | y ) = i = 1 n ( f ( y i | β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) ) ) w i ( j ) = i = 1 n [ 1 2 π σ 2 ( u i , v i ) exp [ 1 2 σ 2 ( u i , v i ) [ y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ] 2 ] ] w i ( j )
ln ( L * ( β ( u i , v i ) , a ( u i , v i ) , σ 2 ( u i , v i ) | y ) ) = = i = 1 n w i j ln ( 1 2 π σ 2 ( u i , v i ) exp [ 1 2 σ 2 ( u i , v i ) [ y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ] 2 ] ) = i = 1 n w i ( j ) ( 1 2 ) ln ( 2 π ) i = 1 n w i ( j ) ( 1 2 ) ln ( σ 2 ( u i , v i ) ) 1 2 σ 2 ( u i , v i ) i = 1 n w i ( j ) ( y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ) 2 = i = 1 n w i ( j ) ( 1 2 ) ln ( 2 π ) i = 1 n w i ( j ) ( 1 2 ) ln ( σ 2 ( u i , v i ) ) 1 2 σ 2 ( u i , v i ) T *
where:
T * = i = 1 n w i ( j ) ( y i ( β 0 ( u i , v i ) + p = 1 P m = 1 M β m p ( u i , v i ) x p i m + p = 1 P r = 1 R β ( r + M ) p ( u i , v i ) ( x p i t r p ) + M + q = 1 Q ( γ q ( u i , v i ) z q i + 1 2 θ 0 q ( u i , v i ) + h = 1 H θ h q ( u i , v i ) cos ( h z i ) ) ) ) 2 = ( y X β ( u i , v i ) Z a ( u i , v i ) ) T W ( u i , v i ) ( y X β ( u i , v i ) Z a ( u i , v i ) )
Parameter estimations of β ^ ( u i , v i ) and a ^ ( u i , v i ) are obtained by maximizing ln L * on Equation (A3). Estimator β ^ ( u i , v i ) is obtained by deriving the Equation (A4) against β ^ ( u i , v i ) , which then equates to zero so that it is obtained:
ln L * β ( u i , v i ) = ( i = 1 n w i ( j ) ( 1 2 ) ln ( 2 π ) i = 1 n w i ( j ) ( 1 2 ) ln ( σ 2 ( u i , v i ) ) 1 2 σ 2 ( u i , v i ) T * ) β ( u i , v i ) 0 = ( y X β ( u i , v i ) Z a ( u i , v i ) ) T W ( u i , v i ) ( y X β ( u i , v i ) Z a ( u i , v i ) ) β ( u i , v i ) 0 = 2 X T W ( u i , v i ) y + 2 X T W ( u i , v i ) Z a ( u i , v i ) + 2 X T W ( u i , v i ) X β ( u i , v i ) β ^ ( u i , v i ) = ( X T W ( u i , v i ) X ) 1 ( X T W ( u i , v i ) y X T W ( u i , v i ) Z a ( u i , v i ) )
Therefore, the estimation of the parameters β ( u i , v i ) is
β ^ ( u i , v i ) = ( X T W ( u i , v i ) X ) 1 ( X T W ( u i , v i ) y X T W ( u i , v i ) Z a ^ ( u i , v i ) )
Furthermore, parameter estimation is carried out for a ( u i , v i ) . To obtain an estimator a ^ ( u i , v i ) and then derive Equation (A4) against a ( u i , v i ) which then equates to zero, is the following is thus obtained:
ln L * a ( u i , v i ) = ( y X β ( u i , v i ) Z a ( u i , v i ) ) T W ( u i , v i ) ( y X β ( u i , v i ) Z a ( u i , v i ) ) a ( u i , v i ) 0 = 2 Z T W ( u i , v i ) y + 2 Z T W ( u i , v i ) X β ( u i , v i ) + 2 Z T W ( u i , v i ) Z a ( u i , v i ) a ^ ( u i , v i ) = ( Z T W ( u i , v i ) Z ) 1 ( Z T W ( u i , v i ) y Z T W ( u i , v i ) X β ( u i , v i ) )
Therefore, the estimation of the parameters a ( u i , v i ) is
a ^ ( u i , v i ) = ( Z T W ( u i , v i ) Z ) 1 ( Z T W ( u i , v i ) y Z T W ( u i , v i ) X β ^ ( u i , v i ) )
Estimator β ^ ( u i , v i ) in Equation (A5) still contains an estimator a ^ ( u i , v i ) . Like wise, estimators a ^ ( u i , v i ) in Equation (A6) still contain an estimator β ^ ( u i , v i ) . In order to obtain a free form of an estimator, it is necessary to make a substitution. To obtain an estimator β ^ ( u i , v i ) , which is free from a ^ ( u i , v i ) , it is then substituted Equation (A6) into Equation (A5) as follows:
β ^ ( u i , v i ) = ( X T W ( u i , v i ) X ) 1 ( X T W ( u i , v i ) y X T W ( u i , v i ) Z a ^ ( u i , v i ) ) = ( X T W ( u i , v i ) X ) 1 ( X T W ( u i , v i ) y X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 ( Z T W ( u i , v i ) y Z T W ( u i , v i ) X β ^ ( u i , v i ) ) ) = ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) y ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) y + ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ^ ( u i , v i ) )  
Then, they are merged in the same field containing β ^ ( u i , v i ) , so that the following is obtained:
β ^ ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ^ ( u i , v i ) ) = ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) y ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) y β ^ ( u i , v i ) = R [ ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ] y
where:
R = [ I ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ^ ( u i , v i ) ) ] 1
Therefore, the following is obtained:
β ^ ( u i , v i ) = A ( t , h ) y
with
A ( t , h ) = R [ ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ]
To obtain an estimator a ^ ( u i , v i ) which is free from β ^ ( u i , v i ) , it is substituted Equation (A5) to Equation (A6) as follows:
a ^ ( u i , v i ) = ( Z T W ( u i , v i ) Z ) 1 ( Z T W ( u i , v i ) y Z T W ( u i , v i ) X β ^ ( u i , v i ) ) = ( Z T W ( u i , v i ) Z ) 1 ( X T W ( u i , v i ) y Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 ( X T W ( u i , v i ) y X T W ( u i , v i ) Z a ^ ( u i , v i ) ) ) = ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) y ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) y + ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ^ ( u i , v i ) )
Then, it is merged in the same field containing a ^ ( u i , v i ) , so that the following is obtained:
a ^ ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ^ ( u i , v i ) ) = ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) y ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) y a ^ ( u i , v i ) = S [ ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ] y
where:
S = [ I ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ^ ( u i , v i ) ) ] 1
Therefore, the following is obtained:
a ^ ( u i , v i ) = B ( t , h ) y
with
B ( t , h ) = S [ ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ]

Appendix C

The unbiased nature of the parameter β ^ ( u i , v i ) can be indicated by:
E ( β ^ ( u i , v i ) ) = E ( A ( t , h ) y ) = E ( R [ ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ] y ) = ( R [ ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ] ) E ( y ) = ( R [ ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ] ) ( X β ( u i , v i ) + Z a ( u i , v i ) ) = R ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) X β ( u i , v i ) + R ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ( u i , v i ) R ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ( u i , v i ) R ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) Z a ( u i , v i ) = R β ( u i , v i ) R ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ( u i , v i ) = [ I ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ] R β ( u i , v i ) = [ I ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ] [ I ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ] 1 β ( u i , v i ) = β ( u i , v i )
Since E ( β ^ ( u i , v i ) ) = β ( u i , v i ) , it can be said that β ^ ( u i , v i ) is an unbiased estimator for β ( u i , v i ) . Next, it can be written that β ^ ( u i , v i ) = A ( t , h ) y , and it is clearly seen that the estimator β ^ ( u i , v i ) is a linear estimator in observation y.

Appendix D

The unbiased nature of the parameter a ^ ( u i , v i ) can be indicated by:
E ( a ^ ( u i , v i ) ) = E ( B ( t , h ) y ) = E ( S [ ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ] y ) = ( S [ ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ] ) E ( y ) = ( S [ ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) ] ) ( X β ( u i , v i ) + Z a ( u i , v i ) ) = S ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X β ( u i , v i ) + S ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) Z a ( u i , v i ) S ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) X β ( u i , v i ) S ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ( u i , v i ) = S a ( u i , v i ) S ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Z a ( u i , v i ) = [ I ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Ζ ] S a ( u i , v i ) = [ I ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Ζ ] = [ I ( Z T W ( u i , v i ) Z ) 1 Z T W ( u i , v i ) X ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) Ζ ] 1 a ( u i , v i ) = a ( u i , v i )
Due to the fact that E ( a ^ ( u i , v i ) ) = a ( u i , v i ) , then it can be said that a ^ ( u i , v i ) is an unbiased estimator for a ( u i , v i ) . Next, it can be written that a ^ ( u i , v i ) = B ( t , h ) y and hence it is clearly seen that the estimator a ^ ( u i , v i ) is linear in observation y.

References

  1. Draper, N.R.; Smith, H. Applied Regression Analysis, 3rd ed.; John Wiley & Sons Inc.: Hoboken, NJ, USA, 2014; pp. 1–716. [Google Scholar] [CrossRef]
  2. Budiantara, I.N. The combination of spline and kernel estimator for nonparametric regression and its properties. Appl. Math. Sci. 2015, 9, 6083–6094. [Google Scholar] [CrossRef]
  3. Cheng, M.; Paige, R.L.; Sun, S.; Yan, K. Variance reduction for kernel estimators in clustered/longitudinal data analysis. J. Stat. Plan. Inference 2010, 140, 1389–1397. [Google Scholar] [CrossRef]
  4. Octavanny, M.A.D.; Budiantara, I.N.; Ratnasari, V. Pemodelan faktor-faktor yang memengaruhi provinsi jawa timur menggunakan pendekatan regresi semiparametrik spline. J. Sains Seni ITS 2017, 6, 1–7. [Google Scholar]
  5. Hidayat, M.F.; Achmad, R.F.A.; Solimun. Estimation of truncated spline function in non-parametric path analysis based on weighted least square (WLS). IOP Conf. Ser. Mater. Sci. Eng. 2019, 546, 5–10. [Google Scholar] [CrossRef]
  6. Sudiarsa, W.; Budiantara, I.N.; Suhartono, S.; Purnami, S.W. Combined estimator fourier series and spline truncated in multivariable nonparametric regression. Appl. Math. Sci. 2015, 9, 4997–5010. [Google Scholar] [CrossRef]
  7. Mariati, N.P.A.M.; Budiantara, I.N.; Ratnasari, V. Combination estimation of smoothing spline and fourier series in nonparametric regression. J. Math. 2020, 2020, 4712531. [Google Scholar] [CrossRef]
  8. Fotheringham, A.S.; Brundson, C.; Charlton, M. Geographically Weighted Regression, the Analysis of Spatially Varying Relationships; John Wiley & Sons Ltd.: Chichester, UK, 2002. [Google Scholar]
  9. Tang, J.; Gao, F.; Liu, F.; Zhang, W.; Qi, Y. Understanding spatio-temporal characteristics of urban travel demand based on the combination of GWR and GLM. Sustainabily 2019, 11, 5525. [Google Scholar] [CrossRef] [Green Version]
  10. Dziauddin, M.F. Estimating land value uplift around light rail transit stations in Greater Kuala Lumpur: An empirical study based on geographically weighted regression (GWR). Res. Transp. Econ. 2019, 74, 10–20. [Google Scholar] [CrossRef]
  11. Sifriyani; Kartiko, S.H.; Budiantara, I.N.; Gunardi. Development of nonparametric geographically weighted regression using truncated spline approach. Songklanakarin J. Sci. Technol. 2018, 40, 909–920. [Google Scholar]
  12. Fitri, N.; Sifriyani, S.; Yuniarti, D. Nonparametric geographically weighted regression dengan pendekatan spline truncated. Pros. Semin. Nas. Mat. Dan Stat. 2019, 98–105. [Google Scholar]
  13. Sifriyani, S. Simultaneous hypothesis testing of multivariable nonparametric spline regression in the GWR model. Int. J. Stat. Probab. 2019, 8, 32. [Google Scholar] [CrossRef]
  14. Nurcahayani, H.; Budiantara, I.N.; Zain, I. The curve estimation of combined truncated spline and fourier series estimator for multiresponse nonparametric regression. Mathematics 2021, 9, 1141. [Google Scholar] [CrossRef]
  15. Akbarov, A.; Wu, S. Waranty claim forecasting based on weighted maximum likelihood estimator. Qual. Reliab. Eng. Int. 2012, 28, 663–669. [Google Scholar] [CrossRef]
  16. BPS. Provinsi Sulawesi Tenggara dalam Angka 2021; UD. Resky Bersama: Kendari, Indonesia, 2021. [Google Scholar]
  17. BPS. Provinsi Sulawesi Selatan dalam Angka 2021; BPS Press: Makassar, Indonesia, 2021. [Google Scholar]
  18. BPS. Provinsi Sulawesi Utara dalam Angka 2021; Perum Percetakan NRI: Manado, Indonesia, 2021. [Google Scholar]
  19. BPS. Provinsi Sulawesi Tengah dalam Angka 2021; UD. Rio: Palu, Indonesia, 2021. [Google Scholar]
  20. BPS. Provinsi Sulawesi Barat dalam Angka 2021; Erlangga: Mamuju, Indonesia, 2021. [Google Scholar]
  21. BPS. Provinsi Gorontalo dalam Angka 2021; CV. Rifaldi: Gorontalo, Indonesia, 2021. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Laome, L.; Budiantara, I.N.; Ratnasari, V. Estimation Curve of Mixed Spline Truncated and Fourier Series Estimator for Geographically Weighted Nonparametric Regression. Mathematics 2023, 11, 152. https://doi.org/10.3390/math11010152

AMA Style

Laome L, Budiantara IN, Ratnasari V. Estimation Curve of Mixed Spline Truncated and Fourier Series Estimator for Geographically Weighted Nonparametric Regression. Mathematics. 2023; 11(1):152. https://doi.org/10.3390/math11010152

Chicago/Turabian Style

Laome, Lilis, I Nyoman Budiantara, and Vita Ratnasari. 2023. "Estimation Curve of Mixed Spline Truncated and Fourier Series Estimator for Geographically Weighted Nonparametric Regression" Mathematics 11, no. 1: 152. https://doi.org/10.3390/math11010152

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop