Next Article in Journal
Electrode Modified with Carboxylated Multi-Walled Carbon Nanotubes and Electropolymerized Pyrogallol Red for the Determination of Eugenol
Previous Article in Journal
Stability of Metal–Organic Frameworks: Recent Advances and Future Trends
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Modified Estimator of Finite Population Variance under Stratified Random Sampling †

1
Department of Statistics, Ahmadu Bello University, Zaria 810107, Nigeria
2
Department of Statistics, Binyaminu Usman Polytechnic, Hadejia 731101, Nigeria
*
Author to whom correspondence should be addressed.
Presented at the 4th International Electronic Conference on Applied Sciences, 27 October–10 November 2023; Available online: https://asec2023.sciforum.net/.
Eng. Proc. 2023, 56(1), 177; https://doi.org/10.3390/ASEC2023-16308
Published: 21 November 2023
(This article belongs to the Proceedings of The 4th International Electronic Conference on Applied Sciences)

Abstract

:
This paper proposes a generalized estimator of finite population variance using the auxiliary information under stratified random sampling. The expressions for bias and mean square error equations of the proposed estimator up to the first degree of approximation are derived. The theoretical efficiency conditions under which the proposed estimator is better than some existing estimators are obtained. The performances of the existing and proposed estimators were assessed using three real datasets based on the criteria of minimum mean square error and supreme percentage relative efficiency. Evidence from the study showed that the proposed estimator performed better and was more efficient than some existing estimators considered.

1. Introduction

Researchers have been considering the use of auxiliary information in various forms to construct more accurate estimators for population parameters in experimental surveys. This approach has been shown to significantly improve the accuracy of the population mean estimation, and it has been explored by several researchers, including [1,2,3]. The field of survey sampling has seen a significant amount of research devoted to developing estimators of population variance to measure variations that exist in real-world scenarios. These variations can arise in various industries, such as manufacturing, pharmaceuticals, agriculture, and biological experiments. Researchers like [4,5,6,7,8,9,10,11,12,13,14] have worked extensively in this area. In real-life situations, population units of variables of interest can have diverse features, which makes the population units heterogeneous. In such cases, estimators of finite population variance, based on simple random sampling, cannot be applied. Therefore, the stratification approach is used to provide precise results when population units are heterogeneous [5,15,16,17].
Most ratio-based and product-based estimators are not efficient in estimating population characteristics when there is a negative correlation between the study and auxiliary variables for ratio-based estimators or a positive correlation for product-based estimators. To address this issue under stratified random sampling, a new generalized ratio-product cum regression-type estimator is proposed. The generalized ratio-product cum regression-type estimator proposed in this study is flexible in providing better efficient estimates than existing estimators when the data exhibits either a negative or positive correlation between the study and auxiliary variables.
Following the introduction is Section 2, which contains the notations and the literature review, while Section 3 presents the methodology of the study. Section 4 presents the efficiency conditions of the proposed methodology. Section 5 discusses the results while the conclusion and recommendations are presented in Section 6.

2. Notations and Some Existing Estimators

Consider a finite population U = U 1 ,   , U N of size N which is divided into L non-overlapping group or strata, with each stratum containing N h ( h = 1,2 , , L ) units, whose union gives N units, i.e., h = 1 G N h = N .   A sample using simple random sample of size n h is drawn without replacement (SRSWOR) from the strata population N h such that   h = 1 G n h = n . Let ( y h i , x h i ) be the value of the study variable Y and the auxiliary variable X on i t h unit   U i ,   i = 1 , N . Let   Y ¯ h , and X ¯ h be population means corresponding to the sample means y ¯ h   and   x ¯ h , respectively, in each stratum. Let s y ( h ) 2 and s x ( h ) 2 be the sample mean squares for the study and auxiliary variables for the h t h strata, respectively, and let S y ( h ) 2 and S x ( h ) 2 be the population mean squares for the study and auxiliary variables for the h t h strata, respectively. Let W h = N h N be the proportion of the population units falling in the h t h stratum. Let the correlation coefficient between the study variable and the auxiliary variable for the h t h strata be   ρ y x ( h ) . Also, let C y ( h ) = S y ( h ) / Y ¯ h and C x ( h ) = S x ( h ) / X ¯ h be the stratum coefficients of variation of the study variable Y and the auxiliary variable X. Let β 2 ( y ) = ψ 40 and β 2 ( x ) = ψ 04   be the coefficients of kurtosis of y and x for the h t h strata, respectively.
β 2 ( y ) = ψ 40 ( h ) = μ 40 ( h ) μ 20 ( h ) 2 ,   β 2 ( x ) = ψ 04 ( h ) = μ 04 ( h ) μ 02 ( h ) 2 ,   ψ 22 ( h ) = μ 22 ( h ) μ 20 ( h ) μ 02 ( h ) ,   Y ¯ h = 1 N h i = 1 N h Y h i ,  
X ¯ h = 1 N h i = 1 N h X h i ,   y ¯ h = 1 n h i = 1 n h y h i ,   x ¯ h = 1 n h i = 1 n h x h i ,   s y h 2 = 1 n h 1 i = 1 n h y h i y ¯ h 2 ,  
s x h 2 = 1 n h 1 i = 1 n h x h i x ¯ h 2 ,   S Y h 2 = 1 N h 1 i = 1 N h Y h i Y ¯ h 2 ,   S X h 2 = 1 N h 1 i = 1 N h X h i X ¯ h 2 ,   γ h = 1 n h 1 N h ,   μ r s ( h ) = 1 N h j = 1 N h Y h j Y ¯ h r X h j X ¯ h s .
The usual unbiased population variance estimator under stratified random sampling can be used to estimate the population variance and is given as:
s y s t 2 = h = 1 L W h 2 γ h s y h 2
V a r ( s y s t 2 ) = h = 1 L W h 4 γ h 3 S y h 4 ( ψ 40 ( h ) 1 )
The usual ratio stratified estimator for estimating population variance proposed by [5] is given as:
T r ( s t ) = h = 1 G W h 2 γ h ( s y h 2 S X h 2 s x h 2 )
B i a s ( T r ( s t ) ) = h = 1 G W h 2 γ h 2 S y h 2 [ ( ψ 40 ( h ) 1 ) ( θ 22 ( h ) 1 ) ]
M S E ( T r ( s t ) ) h = 1 G W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) 2 ( θ 22 ( h ) 1 ) ]
The usual regression stratified estimator for estimating population variance proposed by [5] is given as:
S ^ r e g ( s t ) 2 = h = 1 L W h 2 γ h [ s y ( h ) 2 + b h ( S x ( h ) 2 s x ( h ) 2 ) ]
V a r ( S ^ r e g ( s t ) 2 ) = h = 1 L W h 4 γ h 3 S y h 4 ( ψ 40 ( h ) 1 ) ( 1 ρ h 2 )
Also, [15] proposed a difference-type estimator of population variance under stratified random sampling as follows:
S ^ D ( s t ) 2 = h = 1 G W h 2 γ h [ k 1 h s y h 2 + k 2 h ( S x h 2 s x h 2 ) ]
B i a s ( S ^ D ( s t ) 2 ) min h = 1 G W h 2 γ h 2 S y h 2 { ( ψ 04 ( h ) 1 ) γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] + ( ψ 04 ( h ) 1 ) 1 }
M S E ( S ^ D ( s t ) 2 ) min h = 1 G W h 4 γ h 3 S y h 4 { ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] + ( ψ 04 ( h ) 1 ) }
where
k 1 h ( o p t ) = ( ψ 04 ( h ) 1 ) γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] + ( ψ 04 ( h ) 1 )
k 2 h ( o p t ) = S y h 2 ( θ 22 ( h ) 1 ) S x h 2 { γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] + ( ψ 04 ( h ) 1 ) }
The ratio-type and product-type estimators of finite population variance under stratified random sampling proposed by [17] is as follows:
S ^ S R ( s t ) 2 = h = 1 L W h 2 γ h s y h 2 exp ( S x h 2 s x h 2 S x h 2 + s x h 2 )
S ^ S P ( s t ) 2 = h = 1 L W h 2 γ h s y h 2 exp ( s x h 2 S x h 2 S x h 2 + s x h 2 )
B i a s ( S ^ S R ( s t ) 2 ) = h = 1 L W h 2 γ h 2 S y h 2 [ 3 8 ( ψ 04 ( h ) 1 ) 1 2 ( θ 22 ( h ) 1 ) ]
B i a s ( S ^ S P ( s t ) 2 ) = h = 1 L W h 2 γ h 2 S y h 2 [ 1 2 ( θ 22 ( h ) 1 ) 1 8 ( ψ 04 ( h ) 1 ) ]
M S E ( S ^ S R ( s t ) 2 ) = h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) ]
M S E ( S ^ S P ( s t ) 2 ) = h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) + ( θ 22 ( h ) 1 ) ]
An improved population variance estimator under stratified random sampling proposed by [16] is given as follows:
S ^ S G ( s t ) 2 = h = 1 G W h 2 γ h [ ω 1 h s y h 2 + ω 2 h ( S x h 2 s x h 2 ) ] exp ( S x h 2 s x h 2 S x h 2 + s x h 2 )
B i a s ( S ^ S G ( s t ) 2 ) h = 1 G W h 2 γ h 2 [ S y h 2 + ω 1 h S y h 2 { 1 + 3 8 ( ψ 40 ( h ) 1 ) 1 2 ( θ 22 ( h ) 1 ) } + 1 2 ω 2 h S x h 2 γ h ( ψ 04 ( h ) 1 ) ]
M S E ( S ^ S G ( s t ) 2 ) min h = 1 G W h 4 γ h 3 S y h 4 { 64 [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] γ h ( ψ 04 ( h ) 1 ) 3 16 γ h ( ψ 04 ( h ) 1 ) [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] 64 { γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] + ( ψ 40 ( h ) 1 ) } + ( ψ 04 ( h ) 1 ) }
where ω 1 h and ω 2 h are suitable chosen constants with optimum values:
ω 1 h ( o p t ) = γ h ( ψ 04 ( h ) 1 ) 8 [ 8 γ h ( ψ 04 ( h ) 1 ) γ h [ ( ψ 04 ( h ) 1 ) + ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] ]
ω 2 h ( o p t ) = S y h 2 8 S x h 2 [ ( ψ 40 ( h ) 1 ) ( 4 ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 3 ) + 4 ( θ 22 ( h ) 1 ) 2 [ ( ψ 04 ( h ) 1 ) + ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] ]

3. The Proposed Estimator

Motivated by [14] estimators and having studied some estimators under stratified random sampling, the following generalized estimator is proposed to estimate the population variance of the variable of interest by utilizing the auxiliary information.
G ( s t ) = h = 1 L W h 2 λ h { τ 1 ( h ) s y ( h ) 2 [ 1 2 ( S x ( h ) 2 s x ( h ) 2 + s x ( h ) 2 S x ( h ) 2 ) ] + τ 2 ( h ) ( S x ( h ) 2 s x ( h ) 2 ) } exp { S x ( h ) 2 s x ( h ) 2 S x ( h ) 2 + s x ( h ) 2 }
where τ i ( h )   ( i = 1,2 ) are unknown constants whose values to be determined later, and σ is a suitably chosen constant.
In order to find the bias and MSE of the estimators, we define the following relative error terms:
e 0 ( h ) = s y ( h ) 2 S y ( h ) 2 S y ( h ) 2   and   e 1 ( h ) = s x ( h ) 2 S x ( h ) 2 S x ( h ) 2 ,   where   s y ( h ) 2 = S y ( h ) 2 ( 1 + e 0 ( h ) )   and   s x ( h ) 2 = S x ( h ) 2 ( 1 + e 1 ( h ) )
such that
E ( e 0 ( h ) ) = E ( e 1 ( h ) ) = 0 ,   E ( e 0 ( h ) 2 ) = γ h ( ψ 40 ( h ) 1 ) ,   E ( e 1 ( h ) 2 ) = γ h ( ψ 04 ( h ) 1 )   and   E ( e 0 ( h ) e 1 ( h ) ) = γ h ( ψ 22 ( h ) 1 ) .
Expressing the estimator G i ( s t ) in terms of e i   ( i = 0 ,   1 ) and by taking the expectation, the bias of the proposed generalized estimator, up to the first order of approximation, is given as:
B i a s ( G ( s t ) ) = h = 1 G W h 2 λ h 1 [ ( τ 1 ( h ) 1 ) S y ( h ) 2 + τ 1 ( h ) S y ( h ) 2 ( 7 8 λ h ( ψ 04 ( h ) 1 ) 1 2 λ h ( ψ 22 ( h ) 1 ) ) + τ 2 ( h ) 2 S x ( h ) 2 λ h ( ψ 04 ( h ) 1 ) ]
The mean square error of the proposed estimator, up to the first order of approximation, is given as:
M S E ( G ( s t ) ) = h = 1 G W h 4 λ h 2 { ( τ 1 ( h ) 1 ) 2 S y ( h ) 4 + τ 1 ( h ) 2 S y ( h ) 4 λ h [ ( ψ 40 ( h ) 1 ) 2 ( ψ 22 ( h ) 1 ) + 2 ( ψ 04 ( h ) 1 ) ] + τ 2 ( h ) 2 S x ( h ) 4 λ h ( ψ 04 ( h ) 1 ) τ 1 ( h ) S y ( h ) 4 λ h ( 7 4 ( ψ 04 ( h ) 1 ) ( ψ 22 ( h ) 1 ) ) 2 τ 1 ( h ) τ 2 ( h ) S y ( h ) 2 S x ( h ) 2 λ h ( ( ψ 22 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ) τ 2 ( h ) S y ( h ) 2 S x ( h ) 2 λ h ( ψ 04 ( h ) 1 ) }
Minimizing Equation (26) with respect to τ 1 ( h ) and τ 2 ( h ) , we obtained the optimum values of τ 1 ( h ) and τ 2 ( h ) as:
τ 1 ( h ) ( o p t ) = 1 + 3 8 λ h ( ψ 04 ( h ) 1 ) 1 + λ h ( ψ 04 ( h ) 1 ) + λ h ( ( ψ 40 ( h ) 1 ) ( ψ 22 ( h ) 1 ) 2 ( ψ 04 ( h ) 1 ) )
and
τ 2 ( h ) ( o p t ) = S y ( h ) 2 S x ( h ) 2 { 1 2 τ 1 ( h ) ( o p t ) ( 1 ( ψ 22 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ) }
Substituting the optimum values of τ 1 ( h ) and τ 2 ( h ) into Equation (26), we obtained the minimum mean square error of G i ( s t ) as:
M S E ( G ( s t ) ) min = h = 1 G W h 4 λ h 2 S y ( h ) 4 { ( 1 λ h ( ψ 04 ( h ) 1 ) 4 ) [ 1 + 3 8 λ h ( ψ 04 ( h ) 1 ) ] 2 [ 1 + λ h ( ψ 04 ( h ) 1 ) + λ h ( ( ψ 40 ( h ) 1 ) ( ψ 22 ( h ) 1 ) 2 ( ψ 04 ( h ) 1 ) ) ] }

4. Efficiency Comparisons

In this section, we obtain the theoretical efficiency conditions for the proposed estimator by comparing the mean square error (MSE) equation of the proposed estimator with the MSE equations of some existing estimators. Thus, conditions under which the proposed estimator is more efficient are given below:
  • Comparing the proposed estimator’s MSE with that of the usual variance estimator, we have:
V a r ( s ^ y s t 2 ) M S E ( G ( s t ) ) min > 0
Substituting the variances of the usual variance estimator and proposed estimator into Equation (30) results in:
h = 1 L W h 4 n h 3 S y h 4 ( ψ 40 ( h ) 1 ) h = 1 G W h 4 n h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (31), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 L W h 4 n h 3 S y h 4 ( ψ 40 ( h ) 1 )
where B ( h ) = ( 1 γ h ( ψ 04 ( h ) 1 ) 4 ) , C ( h ) = 1 + 3 8 λ h ( ψ 04 ( h ) 1 ) and D ( h ) = 1 + λ h ( ψ 04 ( h ) 1 ) + λ h ( ( ψ 40 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ( ψ 04 ( h ) 1 ) )
Since condition (32) is satisfied, the proposed estimators are more efficient than sample variance.
  • Comparing the proposed estimator’s MSE with that of the usual ratio estimator defined by [5], we have:
M S E ( T r ( s t ) ) M S E ( G ( s t ) ) min > 0
Substituting the MSE expressions of the usual ratio estimator and proposed estimator into Equation (33) results in:
h = 1 G W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) 2 ( θ 22 ( h ) 1 ) ] h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (34), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 G W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) 2 ( θ 22 ( h ) 1 ) ]
Since condition (35) is satisfied, the proposed estimators are more efficient than [5]’s stratified regression estimator.
  • Comparing the proposed estimator’s MSE with that of the usual stratified regression estimator, we have:
V a r ( S ^ r e g ( s t ) 2 ) M S E ( G ( s t ) ) min > 0
Substituting the MSE expressions of the usual ratio estimator and proposed estimator into Equation (36) results in:
h = 1 L W h 4 n h 3 S y h 4 ( ψ 40 ( h ) 1 ) ( 1 ρ h 2 ) h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (37), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 L W h 4 S y h 4 n h 3 ( ψ 40 ( h ) 1 ) ( 1 ρ h 2 )
Since condition (38) is satisfied, the proposed estimators are more efficient than [5]’s stratified regression estimator.
  • Comparing the proposed estimator’s MSE with that of the stratified estimator [15], we have:
M S E ( S ^ D ( s t ) 2 ) min M S E ( G i ( s t ) ) min > 0
Substituting the MSE expressions of [15]’s estimator and the proposed estimator into Equation (39) results in:
h = 1 G W h 4 γ h 3 S y h 4 { ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] } h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (40), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 G W h 4 γ h 3 S y h 4 { ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 γ h [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ] }
Since condition (41) is satisfied, the proposed estimators are more efficient than [15]’s stratified estimator.
  • Comparing the proposed estimator’s MSE with that of [17]’s ratio-type and product-type estimators, respectively, we have:
M S E ( S ^ S R ( s t ) 2 ) M S E ( G ( s t ) ) min > 0
Substituting the MSE expressions of [17]’s ratio-type estimator and proposed estimator into Equation (42) results in:
h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) ] h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (43), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) ]
Similarly, for [17]’s product-type estimator, we have:
M S E ( S ^ S P ( s t ) 2 ) M S E ( G ( s t ) ) min > 0
Substituting the MSE expressions of [17]’s product-type estimator and the proposed estimator into Equation (45) results in:
h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) + ( θ 22 ( h ) 1 ) ] h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (46), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 L W h 4 γ h 3 S y h 4 [ ( ψ 40 ( h ) 1 ) + 1 4 ( ψ 04 ( h ) 1 ) + ( θ 22 ( h ) 1 ) ]
Since conditions (44) and (47) are satisfied, the proposed estimator is more efficient than [17]’s ratio-type and product-type stratified estimators, respectively.
  • Comparing the proposed estimator’s MSE with that of [16]’s variance estimator, we have:
M S E ( S ^ S G ( s t ) 2 ) M S E ( G ( s t ) ) min > 0
Substituting the MSE expressions of [16]’s estimator and proposed estimator into Equation (48) results in:
h = 1 G W h 4 γ h 3 S y h 4 { 64 K γ h ( ψ 04 ( h ) 1 ) 3 16 γ h ( ψ 04 ( h ) 1 ) K 64 γ h K + ( ψ 40 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) } h = 1 G W h 4 λ h 2 S y ( h ) 4 { B ( h ) C ( h ) 2 D ( h ) } > 0
Further simplifying Equation (49), we have:
h = 1 G W h 4 λ h 2 S y ( h ) 4 C ( h ) 2 D ( h ) > h = 1 G W h 4 λ h 2 S y ( h ) 4 B ( h ) h = 1 G W h 4 γ h 3 S y h 4 { 64 K γ h ( ψ 04 ( h ) 1 ) 3 16 γ h ( ψ 04 ( h ) 1 ) K 64 γ h K + ( ψ 40 ( h ) 1 ) + ( ψ 04 ( h ) 1 ) }
where K = [ ( ψ 40 ( h ) 1 ) ( ψ 04 ( h ) 1 ) ( θ 22 ( h ) 1 ) 2 ]
Since condition (50) is satisfied, the proposed estimator is more efficient than [16]’s variance estimator.

5. Empirical Results

To observe the performance of our proposed estimator with respect to other considered estimators, we use the following datasets for numerical comparisons, which were previously used by many authors in the literature (Table 1, Table 2, Table 3 and Table 4).
Table 5 and Table 6 showed the values of bias, mean square error (MSE), and percentage relative efficiency (PRE) of the proposed estimators and certain existing estimators considered using three datasets. Based on the results obtained from the three datasets, it can be observed that the proposed estimator has the minimum mean square error and the supreme percentage relative efficiency values, respectively, compared to the sample variance estimator [5], variance ratio and regression estimators [15], variance estimator [17], ratio-type and product-type exponential variance estimators, and [16] variance estimator. Therefore, based on the criteria of mean square error and percentage relative efficiency, the proposed estimator performed better and is more efficient than the existing estimators considered.

6. Conclusions

In this study, a new generalized estimator of finite population variance using the auxiliary information in stratified random sampling is proposed. The expressions for bias and mean square error equations of the proposed estimator up to first degree of approximation are derived. The theoretical efficiency conditions under which the proposed estimator is better than some existing estimators are obtained. The proposed estimator is thus empirically compared with certain existing estimators in the literature using real datasets. Evidence from the results revealed that the proposed estimator performs better than the existing estimators considered when compared using mean square error and percentage relative efficiency.

Author Contributions

This research was a collaborative effort involving all authors. Y.Z. took charge of designing the study, crafting the protocol, and analyzing the data. I.M. led the field study, oversaw literature searches, and generated the initial draft. The final manuscript was reviewed and approved by all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zakari, Y.; Muhammad, I.; Sani, N.M. Alternative ratio-product type estimator in simple random sampling. Commun. Phys. Sci. 2020, 5, 418–426. [Google Scholar]
  2. Muhammad, I.; Zakari, Y.; Audu, A. An alternative class of ratio-regression-type estimator under two-phase sampling scheme. CBN J. Appl. Stat. 2021, 12, 1–26. [Google Scholar] [CrossRef] [PubMed]
  3. Oyeyemi, G.M.; Muhammad, I.; Kareem, A.O. Combined Exponential-Type Estimators for Finite Population Mean in Two-Phase Sampling. Asian J. Probab. Stat. 2023, 21, 44–58. [Google Scholar] [CrossRef]
  4. Das, A.K.; Tripathi, T.P. Use of auxiliary information in estimating the finite population variance. Sankhya C 1978, 40, 139–148. [Google Scholar]
  5. Isaki, C.T. Variance estimation using auxiliary information. J. Am. Stat. Assoc. 1983, 78, 117–123. [Google Scholar] [CrossRef]
  6. Ahmed, M.S.; Walid, A.D.; Ahmed, A.O.H. Some estimators for finite population variance under two-phase sampling. Stat. Transit. 2003, 6, 143–150. [Google Scholar]
  7. Kadilar, C.; Cingi, H. Ratio estimators for the population variance in simple and stratified random sampling. Appl. Math. Comput. 2006, 173, 1047–1059. [Google Scholar] [CrossRef]
  8. Upadhyaya, L.N.; Singh, H.P.; Singh, S. A class of estimators for estimating the variance of the ratio estimator. J. Jpn. Stat. Soc. 2004, 34, 47–63. [Google Scholar] [CrossRef]
  9. Swain AK, P.C. Generalized estimator of finite population variance. J. Stat. Theory Appl. 2015, 14, 45–51. [Google Scholar] [CrossRef]
  10. Sumramani, J.; Kumarapandiyan, G. Generalized modified ratio type estimator for estimation of population variance. Sri-Lankan J. Appl. Stat. 2015, 16, 69–90. [Google Scholar] [CrossRef]
  11. Singh, R.; Malik, S. Improved estimation of population variance using information on auxiliary attribute in simple random sampling. Appl. Math. Comput. 2014, 235, 43–49. [Google Scholar] [CrossRef]
  12. Yadav, S.K.; Kadilar, C.; Shabbir, J. Improved family of estimators of population variance in simple random sampling. J. Stat. Theory Pract. 2015, 9, 219–226. [Google Scholar] [CrossRef]
  13. Zakari, Y.; Muili, J.O.; Tela, M.N.; Danchadi, N.S.; Audu, A. Use of Unknown Weight to Enhance Ratio-Type Estimator in Simple Random Sampling. Lapai J. Appl. Nat. Sci. 2020, 5, 74–81. [Google Scholar]
  14. Muhammad, I.; Zakari, Y.; Audu, A. Generalized estimators for finite population variance using measurable and affordable auxiliary character. Asian Res. J. Math. 2022, 18, 14–30. [Google Scholar] [CrossRef]
  15. Rao, T.J. On certail methods of improving ratio and regression estimators. Commun. Stat.-Theory Methods 1991, 20, 3325–3340. [Google Scholar] [CrossRef]
  16. Shabbir, J.; Gupta, S. On improvement in variance estimation using auxiliary information. Commun. Stat.-Theory Methods 2010, 36, 2177–2185. [Google Scholar] [CrossRef]
  17. Singh, H.P.; Upadhyaya, L.N.; Namjoshi, U.D. Estimation of finite population variance. Curr. Sci. 2009, 57, 1331–1334. [Google Scholar]
  18. Ahmad, S.; Hussain, S.; Shabbir, J.; Zahid, E.; Aamir, M.; Onyango, R. Improved Estimation of Finite Population Variance Using Dual Supplementary Information under Stratified Random Sampling. Math. Probl. Eng. 2022, 2022, 3813952. [Google Scholar] [CrossRef]
  19. Koyuncu, N. Improved Estimators of Finite Population Variance in Stratified Random Sampling. World Appl. Sci. J. 2013, 23, 130–137. [Google Scholar]
Table 1. Source and description of datasets.
Table 1. Source and description of datasets.
Data SetSourceYX
1[7]Level of apple production Rate of apple trees
2[18]Area under wheat in the region in 1974Area under wheat in the region in 1973
3[19]Leaf area for the newly developed strain of wheat Weight of leaves
Table 2. Parameters of Dataset 1.
Table 2. Parameters of Dataset 1.
Parameters N h n h S y h 2 S x h 2 β 2 ( y h ) β 2 ( x h ) λ 22 ( h )
Stratum 11910583,977.511.57158043.281.451.46
Stratum 23216456,563.38.1390141.563.091.74
Stratum 3147195,208.812.44946171.621.621.80
Stratum 4158437,923.510.3342672.221.902.02
Table 3. Parameters of Dataset 2.
Table 3. Parameters of Dataset 2.
Parameters N h n h C x h S y h 2 S x h 2 β 2 ( y h ) β 2 ( x h ) λ 22 ( h )
Stratum 1930.111892831,978.2511.57158042.92862.072.38
Stratum 21030.073375337,629.398.1390141.5111.421.42
Stratum 31540.119378406893.06712.44946172.202.422.28
Table 4. Parameters of Dataset 3.
Table 4. Parameters of Dataset 3.
Parameters N h n h C x h S y h 2 S x h 2 β 2 ( y h ) β 2 ( x h ) λ 22 ( h )
Stratum 11240.11189286.066460311.57158041.93945472.27482331.9123464
Stratum 21350.07337535.29152298.1390142.98192693.4369042.970998
Stratum 31450.11937846.496130112.44946172.34489862.89554962.5134376
Table 5. Bias estimates of the existing estimators and the proposed estimator.
Table 5. Bias estimates of the existing estimators and the proposed estimator.
Dataset 1Dataset 2Dataset 3
Sample Variance000
[5]’s Ratio Estimator83.79038−27.9890.01521998
[5]’s Regression000
[15]’s Estimator21539.9216941.31.338503
[17]’s Ratio10.84317−46.103−0.00092405
[17]’s Product51.26086110.3190.01799213
[16]’s76.34123154.8550.1906794
Proposed Estimator3.203016402.1320047−0.00200567
Table 6. MSE and PRE values of proposed and existing estimators with respect to S ^ y 2 .
Table 6. MSE and PRE values of proposed and existing estimators with respect to S ^ y 2 .
Dataset 1Dataset 2Dataset 3
MSEPREMSEPREMSEPRE
Sample Variance419,607.1100.00188,354.1100.000.004715021100.00
[5]’s Ratio Estimator220,271.5190.5021,269.09885.580.0009897653476.38
[5]’s Regression254,877.6164.6320,959.36898.660.0008908265529.29
[15]’s Estimator1,204,32934.84885,096.421.280.0036798567128.13
[17]’s Ratio298,307.4140.6662,669.71300.550.0010637556443.24
[17]’s Product762,244.155.05292,602.264.370.00812817758.01
[16]’s254,968164.5711,841.291590.660.002745902171.71
Proposed Estimator220,094.1190.659616.1441958.730.0001738692711.82
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zakari, Y.; Muhammad, I. Modified Estimator of Finite Population Variance under Stratified Random Sampling. Eng. Proc. 2023, 56, 177. https://doi.org/10.3390/ASEC2023-16308

AMA Style

Zakari Y, Muhammad I. Modified Estimator of Finite Population Variance under Stratified Random Sampling. Engineering Proceedings. 2023; 56(1):177. https://doi.org/10.3390/ASEC2023-16308

Chicago/Turabian Style

Zakari, Yahaya, and Isah Muhammad. 2023. "Modified Estimator of Finite Population Variance under Stratified Random Sampling" Engineering Proceedings 56, no. 1: 177. https://doi.org/10.3390/ASEC2023-16308

Article Metrics

Back to TopTop