1. Introduction
Datasets can consist of numerical or categorical variables in classical statistics. Numerous writers have developed a variety of estimators for calculating the countable population mean under classical statistics when additional data are present. In survey studies, supplementary information on a finite population is typically obtainable from census databases or from previous experience. Research on survey sampling has illustrated a vast range of techniques for employing supplementary data in order to improve the sampling design and also obtain progressively more accurate estimates [
1]. It is noteworthy that the ratio, regression, and product procedures are appealing when auxiliary information is presented [
2]. Therefore, many scholars have combined their efforts to increase the efficiency of these methods in the hopes of developing increasingly skilled estimators of measures of central tendency, measures of dispersion, and CDF, etc.
Neutrosophic statistics are used when there is some degree of indeterminacy in the data. When the data contain neutrosophy, this statistical methodology, which goes beyond the conventional approach, is used. According to [
3], neutrosophic statistics are especially helpful when data within the population or sample are unclear, indeterminate, or indefinite.
Ref. [
3] referred to a variety of neutrosophic values, including quantitative data that suggest that a given value might lie inside the interval range
even while the precise value is unknown. The neutrosophic observation is composed of
, where
. As a result, we developed a notation scheme for neutrosophic data representation that uses the interval type
, where
stands for the lower value and
for the upper value.
In the literature, the researchers have estimated the neutrosophic statistics and utilized information on one or more supplementary variables. In simple random sampling (SRS), [
4] created neutrosophic ratio-type estimation methods. Ref. [
5] developed generalized ratio and product-type estimation methods under neutrosophic ranked set sampling (RSS). Ref. [
6] created a two-phase process loss index utilizing the neutrosophic statistical interval technique. Refs. [
7,
8] developed generalized classes of neutrosophic ratio and exponential estimation methods. Ref. [
9] has suggested a generalized neutrosophic exponential robust ratio-type estimation method. Using neutrosophic robust regression, [
10] introduced a new family of Hartley–Ross-type estimation techniques for estimating the population parameter. Ref. [
11] suggested a neutrosophic predictive estimation method of countable population mean utilizing kernel regression.
Suppose we are interested in a population’s proportion of unclear or indeterminate
values. When utilizing sample data from a survey, users often need to estimate the population neutrosophic CDF. Alternatively, they may need to estimate the proportion of population units whose elements are not more than or equal to a specific number
. For instance, we could be curious to know how many filtration facilities have lower levels of arsenic in their water than zero or how much agricultural area has fewer consequences of pesticide poisoning than zero. Such a proportion is a certain value of the population’s neutrosophic CDF.
where
for
and
for
. Frequently, when conducting survey sampling, the research variable can only be measured for the units in a sample; thus, the traditional technique of the CDF is based only on the chosen sampling methodology and the sampling ratio of the population. Estimating
can be undertaken as follows:
Several authors have computed the CDF utilizing information on sole or various supplementary symmetric and asymmetric data. Firstly, [
12] created a technique for evaluating the countable population CDF. Ref. [
13] developed both traditional and predictive techniques for estimating the CDF using survey information. Ref. [
14] use the model-calibrated pseudo-empirical probability methodology to propose an estimation technique for the population CDF. Ref. [
15] examined the problem of CDF and quantiles estimators for a population utilizing extra information. Ref. [
16] develop a new class of estimation methods in order to estimate the CDF using extra information. Ref. [
17] proposed two new estimation methods that make use of the mean and ranks of the supplemental data to estimate the limited population CDF under SRS and StRS. Ref. [
18] created a novel type of exponential estimation technique for evaluating the population CDF with additional data in the form of the rank and average of the additional data under StRS. Ref. [
19] also created a novel class of estimation techniques for the population CDF utilizing dual additional information under StRS.
The calibration approach has been a priority as a field of study for survey sampling in recent years. The calibration approach modifies the initial design weights to improve the precision of estimation by utilizing extra data. The calibration approach uses modified weights to minimize the difference between modified and original weights while performing a set of conditions with supplementary variables. See the pioneers in this discipline for more details [
20]. Ref. [
21] created a calibrated technique for mean estimation. Ref. [
22] presented a calibration technique for computing the population parameter in StRS with a variety of calibrated conditions constructed with additional data. Ref. [
23] suggested calibrated mean estimation methods based on a stratified RSS method along with calibration variance of the estimator. Ref. [
24] creates a calibration technique for the population mean of the research information utilizing novel calibrated weights that use two additional sources of information under StRS. Ref. [
25] extended the study by utilizing the properties of linear moments. Refs. [
26,
27,
28] proposed a novel robust calibrated technique for computing the population parameter under StRS. Ref. [
29] proposed a calibration CDF estimator using robust measures under StRS. It is important to note that [
29] uses
mistakenly, instead of
in
and
. However, this is merely a typing error. The calibration technique proposed by Tracy et al. [
21] and utilized by many researchers, as discussed above, has not yet to receive much attention in terms of neutrosophic CDF estimation.
Research Gap of Neutrosophic Calibrated Estimation of CDF
All earlier studies on survey sampling have employed only certain, clear, and unambiguous symmetric and asymmetric data. These methods yield a single, clean result, which can occasionally be problematic because it has a probability of being overstated, inaccurate, or overlooked. However, under certain conditions, data are often of a neutrosophic type; this is the moment at which traditional classical methods are ineffective and a neutrosophic approach is used. Neutrosophic data include uncertain, partially unknown, inconsistent, incomplete, and other indeterminate data. Consequently, interval-valued neutrosophic numbers (INN) may be observed in data from populations or experiments. It had been thought that the actual data, which were unknown at the time of collection, belonged to that interval. In reality, there are more unclear facts available than certain information. As a result, more neutrosophic methods are required.
In life, various research data are available, and the collection of data is very costly, particularly when the data are unclear. As a result, using the outdated traditional methods to calculate the population’s actual value for ambiguous data will be costly and risky. When the research and additional information are of the neutrosophic type, there is no technique accessible that is able to resolve the issue using the calibration approach. We now switch to a new approach, known as neutrosophic calibrated estimation of CDF, which offers an entirely new viewpoint on survey techniques. Therefore, this paper proposes a neutrosophic calibrated estimator of CDF.
However, to our knowledge, no work has been undertaken so far on the type of calibrated estimators of neutrosophic CDF under StRS considered by [
21]. Thus, we are motivated to suggest the neutrosophic calibration estimators of CDF by adapting the idea of [
10]. Because all the authors, under conventional statistics, rely on certain, single-valued numbers to estimate the empirical CDF when additional variable is accessible. These forms of estimations offer biased outcomes. Finding the best estimate for the uncertain empirical CDF value with an optimal (lowest) MSE is our main objective.
The article’s remaining sections are arranged as follows: the adapted neutrosophic CDF calibration estimators are introduced in
Section 2. The suggested estimators are given in
Section 3. A simulation study is conducted in
Section 4, and, in
Section 5, the article comes to an end.
2. Adapted Neutrosophic Calibration Estimators of CDF Using Supplementary Information
Neutrosophic calibration-based estimate techniques involve neutrosophic calibrated or adjusted weights that are designed with the use of auxiliary data and are designed to minimize a specific measure of distance from the original stratum weights. Neutrosophic statistics are used to analyze datasets, or neutrosophic data, that have some uncertainty interval in them. By using this method, researchers may deal with inconsistent or incomplete data and draw more accurate conclusions from the sampled data. Assume that
and
are the research and supplementary neutrosophic variables related with
, which is a countable population of size
that is divided into
strata, with the
stratum incorporating
elements,
, and
. The weight of each stratum is defined as
. The traditional neutrosophic calibration estimator of CDF, under StRS is as follows:
where
is the sample neutrosophic CDF estimation for
in the
stratum. We consider the following terms, where
and
are the
sample observation of our neutrosophic study variable
and supplementary variable
, respectively.
and
are the sample specific value neutrosophic of our study variable
and additional variable
, respectively.
and
are the
stratum sample and population neutrosophic CDF of the study variable
, respectively.
and
are the
stratum sample and population neutrosophic CDF, respectively, of the additional variable
and
are the
stratum sample and population neutrosophic coefficient of variation (CV), respectively, of the additional variable
are wisely selected neutrosophic weights, determining the estimator’s shape. The traditional neutrosophic calibration estimator of CDF under StRS is
Adapted Family of Neutrosophic CDF Estimators
Taking inspiration from [
25], the adapted
family of estimators is as follows in Equation (1):
where
is the neutrosophic calibrated weight. Using the chi-square distance function we obtain the following:
and satisfy the calibrated constraints, as follows:
Hence, the function of Lagrange using Equations (2)–(5) with multipliers
,
and
, denoted by
, is given by
setting Equation (6)
, we obtain the following:
Substituting Equation (7) in Equations (3)–(5), we obtain the following:
When we solve the system of equations, the expressions of lambdas are as follows:
Substituting these values in Equations (7) and (1), we obtain the neutrosophic calibrated estimators of CDF, as given below:
where neutrosophic betas
and
are given by
where
The family members of
are provided in
Table 1.
The neutrosophic estimators are developed based on different values of , such as , where and are the sample neutrosophic mean, sample neutrosophic standard deviation, sample neutrosophic variance, sample neutrosophic CV, of the supplementary information of in the stratum, respectively.