Processing math: 97%
Research article Topical Sections

Estimation techniques utilizing dual auxiliary variables in stratified two-phase sampling

  • Received: 17 August 2024 Revised: 30 October 2024 Accepted: 01 November 2024 Published: 21 November 2024
  • MSC : 62D

  • In this research paper, an improved set of estimators for finding the finite population variance of a study variable under a stratified two-phase sampling design is introduced. These estimators rely on information about extreme values and the ranks of an auxiliary variable. We examined the properties of these estimators using first-order approximation, focusing on biases and mean squared errors (MSEs). Additionally, we conducted an extensive simulation study to evaluate their performance and validate our theoretical insights. Furthermore, in the application section, we employed some datasets to further assess the performances of our estimators as compared to other existing estimators. The results demonstrated that S2Q2 was the best-performing estimator, and significantly outperforms existing estimators, achieving a percent relative efficiency (PRE) in the exponential distribution as high as 385.467. The percent relative efficiency values were continuously higher than 100 in a variety of situations, with values as high as 353.129 in other distributions like the uniform and gamma. The suggested estimators are superior to the conventional estimators, as demonstrated by empirical assessments using datasets, where percent relative efficiency improvements ranged from 115.026 to 139.897. These results highlight the robustness and applicability of the proposed class of estimators in real-world sampling.

    Citation: Olayan Albalawi. Estimation techniques utilizing dual auxiliary variables in stratified two-phase sampling[J]. AIMS Mathematics, 2024, 9(11): 33139-33160. doi: 10.3934/math.20241582

    Related Papers:

    [1] Tolga Zaman, Cem Kadilar . Exponential ratio and product type estimators of the mean in stratified two-phase sampling. AIMS Mathematics, 2021, 6(5): 4265-4279. doi: 10.3934/math.2021252
    [2] Yasir Hassan, Muhammad Ismai, Will Murray, Muhammad Qaiser Shahbaz . Efficient estimation combining exponential and ln functions under two phase sampling. AIMS Mathematics, 2020, 5(6): 7605-7623. doi: 10.3934/math.2020486
    [3] Khazan Sher, Muhammad Ameeq, Sidra Naz, Basem A. Alkhaleel, Muhammad Muneeb Hassan, Olayan Albalawi . Developing and evaluating efficient estimators for finite population mean in two-phase sampling. AIMS Mathematics, 2025, 10(4): 8907-8925. doi: 10.3934/math.2025408
    [4] Khazan Sher, Muhammad Ameeq, Muhammad Muneeb Hassan, Basem A. Alkhaleel, Sidra Naz, Olyan Albalawi . Novel efficient estimators of finite population mean in stratified random sampling with application. AIMS Mathematics, 2025, 10(3): 5495-5531. doi: 10.3934/math.2025254
    [5] Saman Hanif Shahbaz, Aisha Fayomi, Muhammad Qaiser Shahbaz . Estimation of the general population parameter in single- and two-phase sampling. AIMS Mathematics, 2023, 8(7): 14951-14977. doi: 10.3934/math.2023763
    [6] Xuechen Liu, Muhammad Arslan . A general class of estimators on estimating population mean using the auxiliary proportions under simple and two phase sampling. AIMS Mathematics, 2021, 6(12): 13592-13607. doi: 10.3934/math.2021790
    [7] Hleil Alrweili, Fatimah A. Almulhim . Estimation of the finite population mean using extreme values and ranks of the auxiliary variable in two-phase sampling. AIMS Mathematics, 2025, 10(4): 8794-8817. doi: 10.3934/math.2025403
    [8] Mehreen Fatima, Saman Hanif Shahbaz, Muhammad Hanif, Muhammad Qaiser Shahbaz . A modified regression-cum-ratio estimator for finite population mean in presence of nonresponse using ranked set sampling. AIMS Mathematics, 2022, 7(4): 6478-6488. doi: 10.3934/math.2022361
    [9] Riffat Jabeen, Aamir Sanaullah, Muhammad Hanif, Azam Zaka . Two-exponential estimators for estimating population mean. AIMS Mathematics, 2021, 6(1): 737-753. doi: 10.3934/math.2021045
    [10] Sohaib Ahmad, Sardar Hussain, Javid Shabbir, Muhammad Aamir, M. El-Morshedy, Zubair Ahmad, Sharifah Alrajhi . Improved generalized class of estimators in estimating the finite population mean using two auxiliary variables under two-stage sampling. AIMS Mathematics, 2022, 7(6): 10609-10624. doi: 10.3934/math.2022592
  • In this research paper, an improved set of estimators for finding the finite population variance of a study variable under a stratified two-phase sampling design is introduced. These estimators rely on information about extreme values and the ranks of an auxiliary variable. We examined the properties of these estimators using first-order approximation, focusing on biases and mean squared errors (MSEs). Additionally, we conducted an extensive simulation study to evaluate their performance and validate our theoretical insights. Furthermore, in the application section, we employed some datasets to further assess the performances of our estimators as compared to other existing estimators. The results demonstrated that S2Q2 was the best-performing estimator, and significantly outperforms existing estimators, achieving a percent relative efficiency (PRE) in the exponential distribution as high as 385.467. The percent relative efficiency values were continuously higher than 100 in a variety of situations, with values as high as 353.129 in other distributions like the uniform and gamma. The suggested estimators are superior to the conventional estimators, as demonstrated by empirical assessments using datasets, where percent relative efficiency improvements ranged from 115.026 to 139.897. These results highlight the robustness and applicability of the proposed class of estimators in real-world sampling.



    It is standard procedure in sampling theory to include auxiliary variables with the study variable in order to improve design and increase the efficiency of the estimator by utilizing their relationship. Although information about auxiliary variables is sometimes unavailable in practical circumstances prior to conducting a survey, in such instances, a two-phase sampling procedure is preferable. Two steps are used in two-phase sampling, sometimes referred to as double sampling, to choose a sample from a population. Since two-phase sampling is an economical sampling strategy, it is frequently employed in sample surveys when supplementary data is not available ahead of time. A brief summary of two-phase sampling was initially introduced by [1]. Such works were not explored further after that, until the works of [2]. Due to its low-cost variable screening qualities, two-phase sampling has received a lot of interest in recent years. For estimation of finite population mean under two-phase sampling schemes, different estimators proposed by[3,4,5]. In order to estimate the finite population variance, different estimators suggested by [7,8]. For more information, see [9,10,11] and references therein.

    Since variation occurs naturally, estimating finite population variance is a serious problem. The utilization of auxiliary information to estimate the population variance was initially introduced by [12] and then expanded upon by [13]. Employing supplementary information in an informed strategy can improve estimators performance. In order to determine the population variance, [14] proposed exponential estimators based on ratios and products. By using the different transformations, [15,16,17] introduced some new estimators to improve the variance estimation. Under simple random sampling and stratified random sampling, different families of estimators obtained by [18,19,20]. For more details about different estimators and methods for estimating the finite population variances, we refer to [21,22,23].

    In the sample survey data, there may be unusual observations. When the sample contains outlier values, the results may be distorted. In this regard, several researchers have focused on outlier values and presented various methods to estimate population characteristics. The researchers in[24] used a linear transformation to obtain two estimators based on the auxiliary minimum and maximum observations. After that, these works were not investigated until [25]. The researchers employed numerous finite population mean estimators, as well as the concept of using extreme values in them. For calculating the finite population mean, [26,27] introduced different transformations methods to handle the outliers. [28] used stratified random sampling to improve the estimate of the limited population mean under extreme values. [29] provided effective estimators for estimating population variance using extreme value transformations. The work [30] proposed novel estimators that use extreme values to estimate population variance with the least mean squared errors (MSE). [31] proposed double exponential ratio estimators that use extreme values of the auxiliary variable to evaluate their effectiveness in estimating population variance. To improve estimator accuracy, [32] developed efficient estimators that leverage auxiliary variables under simple random sampling. For further information, readers can read [33,34].

    Several important considerations motivated the development of a new method to estimate the finite population variance:

    ● Traditional estimators for finite population variance frequently neglect extreme values (outliers) and rankings of auxiliary variables. Outliers are often considered challenging, resulting in skewed conclusions or inflated MSE. The inefficiency of stratified two-phase sampling designs highlights the need for a more efficient approach that addresses these problems.

    ● Existing estimators often struggle with stratified two-phase sampling due to its complicated data structure. These issues emphasize the need for more robust and efficient estimators.

    ● In most cases, two-phase sampling is more economical than one-phase sample, particularly when dealing with large populations. It lowers total expenditures by enabling researchers to gather preliminary data with a smaller sample before selecting a second sample.

    ● Two-phase sampling enables researchers to choose certain clusters or strata that reflect the whole population, it helps guarantee that varied sub-populations are effectively represented.

    ● Two-phase sampling is a useful method in a variety of research situations because it offers more accurate representation and control of variability along with cost savings, enhanced precision, and flexibility.

    In this article, our main objective is to properly utilize the information about the outlier values of the auxiliary variable, which are used as supplementary information to increase the accuracy of the proposed class of estimators. It is well known that outlier information is often removed from sample data, and therefore classical estimators generally decrease its significance as MSE increases. When there is a relationship between the two variables, the ranks of the auxiliary variable are linked to the study variable. Consequently, these rankings provide a useful tool to improve the accuracy of the estimators. We apply the transformations technique, motivated by [29,30,31,32], to provide a new class of estimators using the ranks of the auxiliary variable and the known information on the outlier values to estimate the finite population variance in two-phase stratified sampling. The new suggested method is particularly useful in economic surveys, public health examinations, and environmental evaluations, where similar sample strategies are often used. The new estimators are ideal for disciplines like market research and agricultural surveys that frequently meet extreme values, as they can efficiently include outlier information without distorting results.

    This article is organized as follows: In Section 2, we introduce the foundational concepts and notations. In Section 3, we discuss various established estimators. Our proposed class of estimators is detailed in Section 4. A theoretical comparison is presented in Section 5. In Section 6, we conduct simulations on six different artificial populations with varying probability distributions to evaluate the theoretical results discussed in Section 5. This section also provides numerical examples to validate our theoretical findings. Finally, in Section7, we offer a discussion of the results and suggestions for future research.

    Let us consider a finite population

    ϕ=(ϕ1,ϕ2,,ϕN)

    of size N units. This population is divided into L strata, each of which is Nh(h=1,2,,L), with the property that

    Lh=1Nh=N.

    Let yhi, xhi, and rhi be the values of the study variable (Y), the auxiliary variable (X), and the ranks of the auxiliary variable R in the hth stratum of the ith(i=1,2,,Nh) unit, respectively. We define the population variances for these variables in the hth stratum as

    S2yh=1Nh1Nhi=1(Yhi¯Yh)2,S2xh=1Nh1Nhi=1(Xhi¯Xh)2,S2rh=1Nh1Nhi=1(Rhi¯Rh)2,

    where

    ¯Yh=1NhNhi=1Yhi,¯Xh=1NhNhi=1Xhi

    and

    ¯Rh=1NhNhi=1Rhi

    denote the population means of the study variable (Y), auxiliary variable (X), and the ranks of the auxiliary variable (R) in the hth stratum that corresponding to the population means

    ˉY=1NhLh=1Wh¯Yh,ˉX=1NhLh=1Wh¯Xh,ˉR=1NhLh=1Wh¯Rh,

    respectively, where Wh is the stratum weight and defined by

    Wh=NhN.

    The population coefficients of variations in the hth stratum, are defined as

    Cyh=Syh¯Yh,Cxh=Sxh¯Xh

    and

    Crh=Srh¯Rh

    where Syh,Sxh, and Srh are the population standard deviations of (Y,X,R) in the hth stratum, respectively.

    Furthermore, define the population correlation coefficients between (Y,X), (Y,R), and (X,R) in the hth stratum as follows:

    ρyxh=SyxhSyhSxh,ρyrh=SyrhSyhSrh,ρxrh=SxrhSxhSrh,

    where

    Syxh=1Nh1Nhi=1(Yhi¯Yh)(Xhi¯Xh),Syrh=1Nh1Nhi=1(Yhi¯Yh)(Rhi¯Rh)

    and

    Sxrh=1Nh1Nhi=1(xhi¯Xh)(Rhi¯Rh),

    are the population co-variances, respectively.

    In this paper, we provide a set of estimators to estimate the finite population variance S2y of Y in the presence of the auxiliary variable X. The definition of the two-phase sampling scheme is:

    (1) A sample of size (´nh<Nh) from the first phase is chosen in order to estimate the population variance S2xh.

    (2) For the second phase, a sample size of (nh<´nh) is chosen in order to observe both y and x, respectively.

    We define the following concepts in order to calculate the biases and mean square errors for different estimators:

    ξ0h=(s2yhS2yhS2yh),   ξ1h=(s2xhS2xhS2xh),   ξ2h=(ˊs2xhS2xhS2xh),   ξ3h=(s2rhS2rhS2rh),   ξ4h=(ˊs2rhS2rhS2rh),

    such that

    E(ξih)=0

    for i=0,1,2,3,4.

    E(ξ20h)=ηhΔ400h,E(ξ21h)=ηhΔ040h,E(ξ22h)=ηhΔ040h,E(ξ23h)=ηhΔ004h,E(ξ24h)=ηhΔ004h,E(ξ0hξ1h)=ηhΔ220h,E(ξ0hξ2h)=ηhΔ220h,E(ξ0hξ3h)=ηhΔ202h,E(ξ0hξ4h)=ηhΔ202h,E(ξ1hξ2h)=ηhΔ040h,E(ξ1hξ3h)=ηhΔ022h,E(ξ1hξ4h)=ηhΔ022h,E(ξ2hξ3h)=ηhΔ022h,E(ξ2hξ4h)=ηhΔ022h,E(ξ3hξ4h)=ηhΔ004h,

    where

    Δ400h=(Δ400h1),  Δ040h=(Δ040h1),  Δ004h=(Δ004h1),  Δ220h=(Δ220h1),Δ202h=(Δ202h1),  Δ022h=(Δ022h1),  ηh=(1nh1Nh),  ηh=(1´nh1Nh),  ηh=(1nh1´nh).

    Also

    Δlqsh=φlqshφl/2200hφq/2020hφs/2002h,

    where

    φlqsh=Nhi=1(Yhi¯Yh)l(Xhi¯Xh)q(Rhi¯Rh)sNh1.

    Here,

    Δ400h=β2(yh),   Δ040h=β2(xh),andΔ004h=β2(rh)

    are the population coefficients of kurtosis.

    Next, we review the other estimators of the finite population variances while comparing them with the estimators in our proposed class.

    The variance of the usual estimator

    ¯yst=Lh=1Wh¯yh

    in stratified random sampling is defined as follows:

    Var(¯yst)=Lh=1ηhW2hS2yh=S2yst.

    The unbiased estimator ˆS2T1 of S2yst, is defined as

    ˆS2T1=Lh=1ηhW2hs2yh.

    The usual variance estimator of ˆS2T1 for population variance is given by

    Var(ˆS2T1)=Lh=1η3hW4hS4yhΔ400h. (3.1)

    A ratio estimator for population variance ˆS2T2, proposed by [13], is given by

    ˆS2T2=Lh=1ηhW2hs2yh(ˊs2xhs2xh). (3.2)

    The following equations represent the bias and MSE of ˆS2T2;

    Bias(ˆS2T2)Lh=1η2hW2hS2yh(Δ040hΔ220h) (3.3)

    and

    MSE(ˆS2T2)Lh=1W4hS4yh(η3hΔ400h+η3hΔ040h2η3hΔ220h). (3.4)

    According to [35], the linear regression estimator ˆS2T3, is defined as

    ˆS2T3=Lh=1ηhW2h[s2yh+b(s2yh,s2xh)(ˊs2xhs2xh)], (3.5)

    where

    b(s2yh,s2xh)=s2yhˆΔ220hs2xhˆΔ040h

    is the sample regression coefficient.

    The following equation represents a MSE of ˆS2T3;

    MSE(ˆS2T3)Lh=1S4yhW4hΔ400h(η3hη3hρ2yxh), (3.6)

    where

    ρyxh=Δ220hΔ400hΔ040h.

    An exponential ratio type estimator ˆS2T4, presented by [14], is defined as follows

    ˆS2T4=Lh=1ηhW2hs2yhexp(ˊs2xhs2xhˊs2xh+s2xh). (3.7)

    The following equations represent the bias and MSE of ˆS2T4;

    Bias(ˆS2T4)12Lh=1η2hW2hS2yh(3Δ040h4Δ220h) (3.8)

    and

    MSE(ˆS2T4)Lh=1W4hS4yh[η3hΔ400h+η3h(Δ040h4Δ220h)]. (3.9)

    By employing the kurtosis of an auxiliary variable, [15] proposed a ratio-type estimator ˆS2T5, is defined as

    ˆS2T5=Lh=1ηhW2hs2yh(ˊs2xh+Δ040hs2xh+Δ040h). (3.10)

    The following equations represent the bias and MSE of ˆS2T5;

    Bias(ˆS2T5)Lh=1η2hghW2hS2yh(ghΔ040hΔ220h) (3.11)

    and

    MSE(ˆS2T5)Lh=1W4hS4yh[η3hΔ400h+η3h(g2hΔ040h2ghΔ220h)], (3.12)

    where

    gh=S2xhS2xh+Δ040h.

    The classifications of some ratio estimators is given in [17], which are defined as

    ˆS2T6=Lh=1ηhW2hs2yh(ˊs2xh+Cxhs2xh+Cxh), (3.13)
    ˆS2T7=Lh=1ηhW2hs2yh(Δ040hˊs2xh+CxhΔ040hs2xh+Cxh) (3.14)

    and

    ˆS2T8=Lh=1ηhW2hs2yh(Cxhˊs2xh+Δ040hCxhs2xh+Δ040h). (3.15)

    The following equations represent the bias and MSE of ˆS2Ti;

    Bias(ˆS2Ti)Lh=1η2htihW2hS2yh(tihΔ040hΔ220h) (3.16)

    and

    MSE(ˆS2Ti)Lh=1W4hS4yh[η3hΔ400h+η3h(t2ihΔ040h2tihΔ220h)], (3.17)

    where

    t1h=S2xhS2xh+Cxh,  t2h=Δ040hS2xhΔ040hS2xh+Cxh,  and t3h=CxhS2xhCxhS2xh+Δ040h.

    In this section, we present an improved class of estimators inspired by prior works [30,31,32]. These estimators employ minimum and maximum values of auxiliary variables, along with their ranks, in two-phase sampling to estimate the variance of the finite population. The suggested estimator is defined as

    ˆS2Q=Lh=1ηhW2hs2yhexp[θ1h{γ1h(ˊs2xhs2xh)γ1h(ˊs2h+s2xh)+2γ2h}]exp[θ2h{γ3h(ˊs2rhs2rh)γ3h(ˊs2rh+s2rh)+2γ4h}], (4.1)

    where (θih,i=1,2) are known constants values either (1 or 2), and (γih,i=1,2,3,4) are the parameters of the auxiliary variables. The minimum and maximum values (outliers) of the auxiliary variable are denoted by (xmh,xMh), while the minimum and maximum values (outliers) of the ranks of the auxiliary variable are denoted by (Rmh,RMh). The known values of γ1h,γ2h are given in Table 1,

    γ3h=1

    and

    γ4h=RMRm.

    We can derive the various classes of the suggested estimator from (4.1), which are listed in Table 1.

    where

    δh=(ˊs2rhs2rhˊs2rh+s2rh+2(RMhRmh)).
    Table 1.  Some classes of the proposed estimator.
    Subsets of the proposed estimator ˆS2Q γ1h γ2h
    ˆS2Q1=Lh=1ηhW2hs2yhexp[θ1h{β2(xh)(ˊs2xhs2xh)β2(xh)(ˊs2xh+s2xh)+2(xMhxmh)}]exp[θ2hδh] β2(xh) xMhxmh
    ˆS2Q2=Lh=1ηhW2hs2yhexp[θ1h{cxh(ˊs2xhs2xh)cxh(ˊs2xh+s2xh)+2(xMhxmh)}]exp[θ2hδh] cxh xMhxmh
    ˆS2Q3=Lh=1ηhW2hs2yhexp[θ1h{(xMhxmh)(ˊs2xhs2xh)(xMhxmh)(ˊs2xh+s2xh)+2cxh}]exp[θ2hδh] xMhxmh cxh
    ˆS2Q4=Lh=1ηhW2hs2yhexp[θ1h{(xMhxmh)(ˊs2xhs2xh)(xMhxmh)(ˊs2xh+s2xh)2cxh}]exp[θ2δh] xMhxmh cxh
    ˆS2Q5=Lh=1ηhW2hs2yhexp[θ1h{(xMhxmh)(ˊs2xhs2xh)(xMhxmh)(ˊs2xh+s2xh)+2β2(xh)}]exp[θ2hδh] xMhxmh β2(xh)
    ˆS2Q6=Lh=1ηhW2hs2yhexp[θ1h{β2(xh)(ˊs2xhs2xh)β2(xh)(ˊs2xh+s2xh)+2(xMhxmh)}]exp[θ2hδh] β2(xh) xMhxmh
    ˆS2Q6=Lh=1ηhW2hs2yhexp[θ1h{(xMhxmh)(ˊs2xhs2xh)(xMhxmh)(ˊs2xh+s2xh)2β2(xh)}]exp[θ2δ] xMhxmh β2(xh)
    ˆS2Q8=Lh=1ηhW2hs2yhexp[θ1h{cxh(ˊs2xhs2xh)cxh(ˊs2xh+s2xh)+2(xMhxmh)}]exp[θ2hδh] cxh xMhxmh

     | Show Table
    DownLoad: CSV

    Now, we discuss the properties of the new proposed class of estimators, we rewrite (4.1) in terms of errors to get the bias and the MSE of ˆS2Q, i.e.,

    ˆS2Q=Lh=1ηhW2hS2yh(1+ξ0h)exp[b1h(ξ2hξ1h)2(1+b1h2(ξ1h+ξ2h))1]×exp[b2h(ξ4hξ3h)2(1+b2h2(ξ3h+ξ4h))1], (4.2)

    where

    θ1h=θ2h=1,   b1h=γ1hS2xhγ1hS2xh+γ2h,   and b2h=S2rhS2rh+γ4h.

    Applying the Taylor series to the first approximation order, we obtain

    ˆS2QLh=1ηhW2hS2yhLh=1ηhW2hS2yh[ξ0hb1h2(ξ1hξ2h)b2h2(ξ3hξ4h)+3b21h8ξ21hb21h8ξ22h+b22h8ξ23hb22h8ξ24hξ1h2ξ0hξ1h+b1h2ξ0hξ2hb2h2ξ0hξ3h+b2h2ξ0hξ4hb21h2ξ1hξ2h+b1hb2h4ξ1hξ3hb1hb2h4ξ1hξ4hb1hb2h4ξ2hξ3h+b1hb2h4ξ2hξ4hb22h2ξ3hξ4h]. (4.3)

    Using (4.3), the bias of ˆS2T is given by

    Bias(ˆS2Q)Lh=1η2hW2hS2yh[3b21h8Δ040h+3b22h8Δ004hb1h2Δ220hb2h2Δ202h+b1hb2h2Δ022h] Lh=1η2hW2hS2yh[3b21h8Δ040h+3b22h8Δ004hb1h2Δ220hb2h2Δ202h+b1hb2h2λ022h].

    After the simple simplifications, we get

    Bias(ˆS2Q)Lh=1η2hW2hS2yh[38(b21hΔ040h+b22hΔ004h)12(b1hΔ220h+b2hΔ202hb1hb2hΔ022h)], (4.4)

    where

    ηh=ηhηh.

    The Eq (4.3) is squared and the expected value is taken to obtain a first-order approximation of the MSE, which is represented by the following equation

    MSE(ˆS2Q)Lh=1η3hW4hS4yh[Δ400h+b21h4Δ040h+b22h4Δ004hb1hΔ220hb2hΔ202h+b1hb2h2Δ022h]Lh=1η3hW4hS4yh[b21h4Δ040h+b22h4Δ004hb1hΔ220hb2hΔ202h+b4hb5h2λ022h].

    After the simplification, we get

    MSE(ˆS2Q)Lh=1W4hS4yh[η3hΔ400h+η3h4(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)]. (4.5)

    The proposed class of estimators ˆS2Q is compared in this section to other existing estimators, including ˆS2T1,ˆS2T2,ˆS2T3,ˆS2T4,ˆS2T5, and ˆS2Ti.

    Condition (ⅰ): By (3.1) and (4.5)

    Var(ˆS2T1)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2Δ022h)<0.

    For

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)<0. (5.1)

    Similarly

    ηhηh<0,

    that is,

    η<η,
    Lh=1W4hS4yh(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)>0. (5.2)

    If condition (5.1) or (5.2) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2T1).

    Condition (ⅱ): By (3.4) and (4.5)

    MSE(ˆS2T2)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)[(4b21h)Δ040hb22hΔ004h+4(b1h2)Δ220h+4b2hΔ202h2b1hb2hΔ022h]>0.

    For

    η3hη3h<0,

    that is,

    η3h<η3h,
    Lh=1W4hS4yh[(4b21h)Δ040hb22hΔ004h+4(b1h2)Δ220h+4b2hΔ202h2b1hb2hΔ022h]>0. (5.3)

    Similarly,

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh[(4b21h)Δ040hb22hΔ004h+4(b1h2)Δ220h+4b2hΔ202h2b1hb2hΔ022h]<0. (5.4)

    If condition (5.3) or (5.4) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2T2).

    Condition (ⅲ): By (3.6) and (4.5)

    MSE(ˆS2T3)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)[ρ2yxh+14(b21hΔ040h+b2h2Δ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)]<0.

    For

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh[ρ2yxh+14(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)]<0. (5.5)

    Similarly,

    η3hη3h<0,

    that is,

    η3h<η3h,
    Lh=1W4hS4yh[ρ2yxh+14(b21hΔ040h+b22hΔ004h4b1hΔ220h4b2hΔ202h+2b1hb2hΔ022h)]>0. (5.6)

    If condition (5.5) or (5.6) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2T3).

    Condition (ⅳ): By (3.9) and (4.5)

    MSE(ˆS2T4)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)[(b21h1)Δ040h+b22hΔ004h4(b1h1)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0.

    For

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh[(b21h1)Δ040h+b22hΔ004h4(b1h1)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0. (5.7)

    Similarly,

    η3hη3h<0,

    that is,

    η3h<η3h,
    Lh=1W4hS4yh(η3hη3h)[(b21h1)Δ040h+b22hΔ004h4(b1h1)Δ220h4b2hΔ202h+2b1hb2hΔ022h]>0. (5.8)

    If condition (5.7) or (5.8) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2T4).

    Condition (ⅴ): By (3.12) and (4.5)

    MSE(ˆS2T5)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)[(b21h4g2h)Δ040h+b22hΔ004h4(b1hgh)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0.

    For

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh[(b21h4g2h)Δ040h+b22hΔ004h4(b1hgh)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0. (5.9)

    Similarly,

    η3hη3h<0,

    that is,

    η3h<η3h,
    Lh=1W4hS4yh[(b21h4g2h)Δ040h+b22hΔ004h4(b1hgh)Δ220h4b2hΔ202h+2b1hb2hΔ022h]>0. (5.10)

    If condition (5.9) or (5.10) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2T5).

    Condition (ⅵ): By (3.17) and (4.5)

    MSE(ˆS2Ti)>MSE(ˆS2Q),

    if

    Lh=1W4hS4yh(η3hη3h)[(b21h4t2ih)Δ040h+b22hΔ004h4(b1htih)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0.

    For

    η3hη3h>0,

    that is,

    η3h>η3h,
    Lh=1W4hS4yh[(b21h4t2ih)Δ040h+b22hΔ004h4(b1htih)Δ220h4b2hΔ202h+2b1hb2hΔ022h]<0. (5.11)

    Similarly,

    η3hη3h<0,

    that is,

    η3h<η3h,
    Lh=1W4hS4yh[(b21h4t2ih)Δ040h+b22hΔ004h4(b1htih)Δ220h4b2hΔ202h+2b1hb2hΔ022h]>0. (5.12)

    If condition (5.11) or (5.12) holds true, the suggested estimator ˆS2Q demonstrates higher efficiency in comparison to MSE(ˆS2Ti).

    In this part, we examine the performance of the proposed class of estimators as compared to other estimators using percent relative efficiency (PREs). This examination is carried out using both simulated and three separate real data sets.

    To confirm the theoretical results reported in Section 5, we use the methods proposed by [30,31,32] to undertake a simulation study. The goal is to evaluate the performance of the suggested class of estimators using the known minimum and maximum values of the auxiliary variable, as well as its ranks within the context of two-phase stratified sampling. The following probability distributions can possibly be used to artificially produce six distinct populations for the auxiliary variable X:

    ● Population 1: XExponential (1);

    ● Population 2: XExponential (3);

    ● Population 3: XUniform (1,3);

    ● Population 4: XUniform (1,2);

    ● Population 5: XGamma (1,4);

    ● Population 6: XGamma (2,5).

    The variable of interest, Y, is computed as

    Y=ryx×X+e,

    where

    ryx=0.80

    indicates the correlation coefficient between the study and the auxiliary variables, and eN(0,1) signifies the error term.

    To compute the PREs, we used the following algorithms in R:

    Step 1: We first use the various probability distributions mentioned above to generate a population of size 2000. In order to compute distinct values for the existing and suggested class of estimators, this population is split into two strata using stratified random sampling techniques.

    Step 2: To collect a first phase sample of size ´nh from a population of size Nh, use the simple random sampling without replacement {(SRSWOR)} technique.

    Step 3: Using the {SRSWOR} technique, obtain the second phase sample size nh from the first phase sample.

    Step 4: We calculate the population total and the extreme values of the auxiliary variables from the above steps.

    Step 5: For each population, we use SRSWOR approach to generate distinct sample sizes for each stratum. The sample sizes are specified as 20%,30%, and 40%.

    Step 6: Obtained the PREs values for each sample size using all of the estimators presented in this article. This step ensures that the relative efficiency of each estimator is evaluated across different sample sizes.

    Step 7: Steps 5 and 6 are then repeated 50,000 times to ensure the robustness of the results. The outcomes for artificial populations are presented in Table 2, which provides a comprehensive analysis of the estimators performance under simulated conditions. Step 8: Furthermore, obtain the MSEs and PREs for each estimator over all replications using the following formulas:

    MSE(ˆS2l)min=50000i=1(ˆS2lS2i)250000

    and

    PRE=V(ˆS2T1)MSE(ˆS2l)min×100,

    where l is one of T1,T2,T3,T4,T5,T6,T7,T8,TQk(k=1,2,,8).

    Table 2.  Percent relative efficiency (PRE) using the artificial populations.
    Estimator Exp (1) Exp (3) Uni (1,3) Uni (1,2) Gam (1,4) Gam (2,5)
    ˆS2T1 100 100 100 100 100 100
    ˆS2T2 110.789 109.760 113.025 115.247 120.126 118.587
    ˆS2T3 120.970 123.638 115.188 120.315 122.505 120.765
    ˆS2T4 126.486 127.946 118.344 123.200 124.425 123.078
    ˆS2T5 128.145 128.108 122.670 126.526 128.772 127.589
    ˆS2T6 135.980 129.964 125.520 128.789 131.405 1131.164
    ˆS2T7 135.112 129.123 125.345 128.002 131.408 1131.664
    ˆS2T8 136.304 130.245 126.156 130.328 134.528 133.589
    ˆS2Q1 194.668 180.356 150.712 160.139 147. 547 153.329
    ˆS2Q2 353.129 385.467 320.225 333.167 296.724 280.289
    ˆS2Q3 259.189 256.578 270.667 259.369 220.949 214.345
    ˆS2Q4 162.707 168.689 180.576 165.508 157.333 142.148
    ˆS2Q5 142.837 148.790 160.031 143.625 140.495 140.279
    ˆS2Q6 190.456 202.098 200.321 178.353 182.132 162.399
    ˆS2Q7 170.065 192.987 190.401 150.323 152.369 158.950
    ˆS2Q8 1966.825 210.876 210.677 220.688 200. 712 192.952

     | Show Table
    DownLoad: CSV

    To evaluate the effectiveness of the suggested estimators, we examine the PREs of several estimators on three real data sets. The data sets descriptions are given below, while the summary statistics are given in Tables 35.

    Table 3.  Summary statistics for data 1.
    Descriptive statistics
    N1=18 ¯X1=415 ¯Y1=85572 ¯R1=9.500 XM1=2055
    Xm1=24 RM1=18 Rm1=1 Sx1=52.675 Sy1=248216
    Sr1=5.338 Cx1=1.258 Cy1=2.901 Cr1=0.562 ρyx1=0.337
    ρyr1=0.304 ρxr1=0.709 Δ4001=3270 Δ0401=3345 Δ0041=1.692
    Δ2201=2398 Δ2021=1267 Δ0221=944 η1=0.144 η1=0.056
    N2=18 ¯X2=257 ¯Y2=19293.610 ¯R2=27.500 XM2=1674
    Xm2=52 Rm2=19 RM2=36 Sx2=365.696 Sy2=37979
    Sr2=5.338 Cx2=1.423 Cy2=1.969 Cr2=0.194 ρyx2=0.976
    ρyr2=0.565 ρxr2=0.786 Δ4002=2542 Δ0402=2388 Δ0042=1.622
    Δ2202=2246 Δ2022=739 Δ0222=988 η2=0.144 η2=0.056

     | Show Table
    DownLoad: CSV
    Table 4.  Summary statistics for data 2.
    Descriptive statistics
    N1=18 ¯X1=962 ¯Y1=162979 ¯R1=9.500 XM1=1530
    Xm1=388 RM1=36 Rm1=19 Sx1=308 Sy1=255887
    Sr1=5.338 Cx1=0.320 Cy1=1.571 Cr1=0.562 ρyx1=0.145
    ρyr1=0.135 ρxr1=0.802 Δ4001=2625 Δ0401=3237 Δ0041=1.692
    Δ2201=1568 Δ2021=1548 Δ0221=1298 η1=0.144 η1=0.056
    N2=18 ¯X2=1146 ¯Y2=134458 ¯R2=27.500 XM2=2370
    Xm2=58 Rm2=19 RM2=36 Sx2=469.931 Sy2=50236
    Sr2=5.338 Cx2=0.409 Cy2=0.374 Cr2=0.194 ρyx2=0.787
    ρyr2=0.657 ρxr2=889 Δ4002=2240 Δ0402=2558 Δ0042=1.622
    Δ2202=1807 Δ2022=2049 Δ0222=1200 η2=0.144 η2=0.056

     | Show Table
    DownLoad: CSV
    Table 5.  Summary statistics for data 3.
    Descriptive statistics
    N1=18 ¯X1=72.550 ¯Y1=27.490 ¯R1=9.500 XM1=95
    Xm1=28 RM1=18 Rm1=1 Sx1=10.580 Sy1=10.130
    Sr1=5.338 Cx1=0.155 Cy1=0.376 Cr1=0.562 ρyx1=0.337
    ρyr1=0.284 ρxr1=0.557 Δ4001=2.550 Δ0401=2.845 Δ0041=1.692
    Δ2201=3.158 Δ2021=4.544 Δ0221=4.542 η1=0.144 η1=0.056
    N2=18 ¯X2=60.870 ¯Y2=20.820 ¯R2=27.500 XM2=75
    Xm2=15 Rm2=19 RM2=36 Sx2=8.980 Sy2=12.750
    Sr2=5.338 Cx2=0.142 Cy2=0.269 Cr2=0.194 ρyx2=0.496
    ρyr2=0.297 ρxr2=0.756 Δ4002=4.142 Δ0402=3.934 Δ0042=1.622
    Δ2202=1.384 Δ2022=1.239 Δ0222=2.488 η2=0.144 η2=0.056

     | Show Table
    DownLoad: CSV

    Data 1. (Source: ([36]HY__HY, p.226]))

    Y: The employment levels recorded by the different departments for 2012, which represents the overall number of workers.

    X: The total number of factories that these departments officially registered in 2012, which gives information on industrial activity.

    R: The rankings assigned to each department based on the total number of factories they registered in 2012, offering a comparative view of industrial engagement across departments.

    Two distinct groups have been created from the data-set:

    Group 1: The Gujranwala, Rawalpindi, Sargodha, and Lahore divisions are included in this group; they all contribute to the examination of employment and industrial registration.

    Group 2: This group represents another aspect of the information for comparison analysis and is made up of the divisions of Bahawalpur, Faisalabad, Multan, Sahiwal, and Khan.

    Data 2. (Source: [36]HY__HY, p.135])

    Y: Represents the total number of students attended at educational institutions in 2012.

    X: Represents the overall number of government-funded schools in 2012.

    R: Represents the order of government-funded schools in 2012 according to the number of schools they had in that year.

    Two distinct groups have been generated from the data-set:

    Group 1: The Gujranwala, Rawalpindi, Sargodha, and Lahore divisions are included in this group; they all contribute to the examination of employment and industrial registration.

    Group 2: This group represents another aspect of the information for comparison analysis and is made up of the divisions of Bahawalpur, Faisalabad, Multan, Sahiwal, and Khan.

    Data 3. (Source:[37,p.24])

    Y: The expenses incurred on food by the family, directly related to their employment.

    X: The total weekly income earned by the family, reflecting their financial resources for that period.

    R: The ranking of families based on their weekly income, providing a comparative measure of their earnings.

    For efficiency comparisons, we use the following formula:

    PRE=V(ˆS2T1)MSE(ˆS2l)×100,

    where l is one of T1,T2,T3,T4,T5,T6,T7,T8,TQk(k=1,2,,8).

    Additionally, Table 6 presents a summary of the findings for real data-sets.

    Table 6.  Percent relative efficiency using empirical data-sets.
    Estimator Data 1 Data 2 Data 3
    ˆS2T1 100 100 100
    ˆS2T2 111.535 101.057 100.991
    ˆS2T3 113.964 109.260 109.302
    ˆS2T4 112.526 109.250 106.626
    ˆS2T5 111.735 102.009 102.023
    ˆS2T6 111.535 101.048 128.081
    ˆS2T7 111.535 101.047 128.089
    ˆS2T8 111.695 103.703 129.687
    ˆS2Q1 117.202 116.082 138.417
    ˆS2Q2 119.177 117.150 139.897
    ˆS2Q3 117.384 115.726 134.367
    ˆS2Q4 114.751 116.077 137.845
    ˆS2Q5 117.201 115.027 135.217
    ˆS2Q6 117.202 115.026 138.091
    ˆS2Q7 117.202 115.025 133.450
    ˆS2Q8 117.227 114.851 134.986

     | Show Table
    DownLoad: CSV

    A class of efficient estimators for estimating the finite population variance was introduced in this article. These estimators accounted for both the rankings and the auxiliary variable's extreme values. The theoretical prerequisites outlined in Section 5 show how the suggested class of estimators is more efficient than others, allowing for a comparison with those that already exist. To verify these limits, we conducted a simulation study and examined three empirical data sets. The outcomes, displayed in Table 2, demonstrate that the suggested class of estimators consistently performs better in terms of PREs than the other existing estimators. The theoretical results in Section 5 are further confirmed by the empirical data shown in Table 6. We draw the conclusion that, in comparison to the other estimators under consideration, the suggested class of estimators ˆS2Qi (i=1,2,3,,8,) exhibits superior efficiency based on both simulation and empirical data. Because it has the lowest MSE of these suggested estimators, ˆS2Q2 is particularly preferable.

    There are some advantages of this study in practical applications are given below:

    Improved accuracy and efficiency: Using extreme values and rankings of auxiliary variables, the novel approach improves precision and efficiency when calculating population variance. The suggested estimators outperform previous approaches, achieving PRE values of up to 385.467 in simulated experiments. This improved performance is especially valuable in real-world applications where survey data may contain outliers.

    Applicability in stratified two-phase sampling: The suggested approach is especially designed to support stratified two-phase sampling, which is usual in large-scale surveys when supplementary information may be unavailable until later stages. This makes the approach particularly useful in economic surveys, public health examinations, and environmental evaluations, where similar sample strategies are often used.

    Handling of outliers in practical contexts: The new estimators are ideal for disciplines like market research and agricultural surveys that frequently meet extreme values, as they can efficiently include outlier information without distorting results.

    For this study, a thorough benchmark analysis was conducted using the procedures listed below:

    Selection of competing methods:

    ● Find and choose the estimators that are employed in stratified two-phase sampling to estimate the finite population variance. Regression-based estimators, exponential ratio estimators, and conventional variance estimators like these are a few examples.

    ● For an extensive comparison basis, use the most widely utilized techniques from the review of the literature, such as those put out by Isaki (1983)[13], Bahl and Tuteja (1991) [14], and Upadhyaya and Singh (1999) [15].

    Performance metrics:

    PRE, which shows the improvement in effectiveness over a standard approach, should be the main tool used to assess how well various estimators perform.

    ● To examine estimators performance in entirety, take into account other measures, including bias, adaptability to outliers, and MSE.

    ● Analyze the computational efficiency of the suggested and existing estimators, particularly for large data sets.

    Simulation study and real life data sets

    ● The research encompassed practical stratification situations and included a variety of artificial populations with varying probability distributions, including exponential, uniform, and gamma. Comparing the results using practical problems variables, such as industrial activity and employment levels, provided valuable insights into practical application, while several replications guaranteed statistical robustness.

    ● According to the findings, the suggested estimators continuously performed better than conventional techniques in terms of PRE, with appreciable gains over a range of sample sizes and distributions. The investigation was made more detailed by the use of statistical tests to validate the significance of the according to efficiency increases. The advantages of the novel methodology were established by this thorough study, which also supported its consideration by proving its superiority over other methods.

    Moreover, we investigated the characteristics of the suggested efficient class of estimators using a two-phase stratified sampling technique. It is also conceivable to propose some novel estimators utilizing the non-response sampling approach, and our findings can be useful in determining the more efficient estimators with the lowest MSEs. It is also an appropriate topic for future investigation.

    The author declares no conflict of interest.



    [1] J. Neyman, Contribution to the theory of sampling human population. J. Amer. Stat. Assoc., 33 (1938), 101–116. https://doi.org/10.2307/2279117 doi: 10.2307/2279117
    [2] B. V. Sukhatme, Some ratio-type estimators in two-phase sampling, J. Amer. Stat. Assoc., 57 (1962), 628–632. https://doi.org/10.2307/2282400 doi: 10.2307/2282400
    [3] G. K. Vishwakarma, S. M. Zeeshan, Generalized ratio-cum-product estimator for finite population mean under two-phase sampling scheme, J. Mod. Appl. Stat. Meth., 19 (2021), 2–16. https://doi.org/10.22237/jmasm/1608553320 doi: 10.22237/jmasm/1608553320
    [4] T. Zaman, C. Kadilar, New class of exponential estimators for finite population mean in two-phase sampling, Commun. Stat., 50 (2021), 874–889. https://doi.org/10.1080/03610926.2019.1643480 doi: 10.1080/03610926.2019.1643480
    [5] A. Y.Erinola, R. V. K. Singh, A. Audu, T. James, Modified class of estimator for finite population mean under two-phase sampling using regression estimation approach, Asian. J. Prob. Stat., 4 (2021), 52–64. https://doi.org/10.9734/ajpas/2021/v14i430338 doi: 10.9734/ajpas/2021/v14i430338
    [6] M. N. Qureshi, M. U. Tariq, M. Hanif, Memory-type ratio and product estimators for population variance using exponentially weighted moving averages for time-scaled surveys, Commun. Stat. Simul. Comput., 53 (2024), 1484–1493. https://doi.org/10.1080/03610918.2022.2050390 doi: 10.1080/03610918.2022.2050390
    [7] A. Sanaullah, M. Hanif, A. Asghar, Generalized exponential estimators for population variance under two-phase sampling, Int. J. Appl. Comput. Math., 2 (2016), 75–84. https://doi.org/10.1007/s40819-015-0047-5 doi: 10.1007/s40819-015-0047-5
    [8] H. P. Singh, S. Singh, J. M. Kim, Efficient use of auxiliary variables in estimating finite population variance in two-phase sampling, Commun. Korean Stat. Soc., 17 (2010), 165–181. https://doi.org/10.5351/CKSS.2010.17.2.165 doi: 10.5351/CKSS.2010.17.2.165
    [9] M. A. Alomair, U. Daraz, Dual transformation of auxiliary variables by using outliers in stratified random sampling, Mathematics, 12 (2024), 2839. https://doi.org/10.3390/math12182839 doi: 10.3390/math12182839
    [10] U. Daraz, M. A. Alomair, O. Albalawi, A. S. Al Naim, New techniques for estimating finite population variance using ranks of auxiliary variable in two-stage sampling, Mathematics, 12 (2024), 2741. https://doi.org/10.3390/math12172741 doi: 10.3390/math12172741
    [11] M. Jabbar, Z. Javid, A. Zaheer, R. Zainab, Ratio type exponential estimator for the estimation of finite population variance under two-stage sampling, Res. J. Appl. Sci. Eng. Technol., 7 (2024), 4095–4099. https://doi.org/10.19026/rjaset.7.772 doi: 10.19026/rjaset.7.772
    [12] A. K. Das, T. P. Tripathi, Use of auxiliary information in estimating the finite population variance, Sankhya, 40 (1978), 39–148.
    [13] C. T. Isaki, Variance estimation using auxiliary information, J. Am. Stat. Assoc., 78 (1983), 117–123. https://doi.org/10.2307/2287117 doi: 10.2307/2287117
    [14] S. Bahl, R. Tuteja, Ratio and product type exponential estimators, J. Inf. Optim. Sci., 12 (1991), 159–164. https://doi.org/10.1080/02522667.1991.10699058 doi: 10.1080/02522667.1991.10699058
    [15] L. Upadhyaya, H. Singh, An estimator for population variance that utilizes the kurtosis of an auxiliary variable in sample surveys, Vikram Math. J., 19 (1999), 14–17.
    [16] V. Dubey, H. Sharma, On estimating population variance using auxiliary information, Stat. Transit. New Ser., 9 (2008), 7–18.
    [17] C. Kadilar, H. Cingi, Ratio estimators for the population variance in simple and stratified random sampling, Appl. Math. Comput., 173 (2006), 1047–1059. https://doi.org/10.1016/j.amc.2005.04.032 doi: 10.1016/j.amc.2005.04.032
    [18] H. Singh, P. Chandra, An alternative to ratio estimator of the population variance in sample surveys, J. Transp. Stat., 9 (2008), 89–103.
    [19] J. Shabbir, S. Gupta, Some estimators of finite population variance of stratified sample mean, Commun. Stat., 39 (2010), 3001–3008. https://doi.org/10.1080/03610920903170384 doi: 10.1080/03610920903170384
    [20] H. P. Singh, R. S. Solanki, A new procedure for variance estimation in simple random sampling using auxiliary information, Stat. Papers, 54 (2013), 479–497. https://doi.org/10.1007/s00362-012-0445-2 doi: 10.1007/s00362-012-0445-2
    [21] S. K. Yadav, C. Kadilar, J. Shabbir, S. Gupta, Improved family of estimators of population variance in simple random sampling, J. Stat. Theory Practice, 9 (2015), 219–226. https://doi.org/10.1080/15598608.2013.856359 doi: 10.1080/15598608.2013.856359
    [22] J. Shabbir, S. Gupta, Using rank of the auxiliary variable in estimating variance of the stratified sample mean, Int. J. Comput. Theor. Stat., 6 (2019), 171–181. http://doi.org/10.12785/IJCTS/060207 doi: 10.12785/IJCTS/060207
    [23] T. Zaman, H. Bulut, An efficient family of robust-type estimators for the population variance in simple and stratified random sampling, Commun. Stat., 52 (2023), 2610–2624. https://doi.org/10.1080/03610926.2021.1955388 doi: 10.1080/03610926.2021.1955388
    [24] S. Mohanty, J. Sahoo, A note on improving the ratio method of estimation through linear transformation using certain known population parameters, Sankhyā Indian J. Stat., 57 (1995), 93–102.
    [25] M. Khan, J. Shabbir, Some improved ratio, product, and regression estimators of finite population mean when using minimum and maximum values, Sci. World J., 2013 (2013), 431868. https://doi.org/10.1155/2013/431868 doi: 10.1155/2013/431868
    [26] G. S. Walia, H. Kaur, M. Sharma, Ratio type estimator of population mean through efficient linear transformation, Amer. J. Math. Stat., 5 (2015), 144–149. https://doi.org/10.5923/j.ajms.20150503.06 doi: 10.5923/j.ajms.20150503.06
    [27] M. Khan, Improvement in estimating the finite population mean under maximum and minimum values in double sampling scheme, J. Stat. Appl. Probab. Lett., 2 (2015), 115–121. https://doi.org/10.12785/jsapl/020203 doi: 10.12785/jsapl/020203
    [28] U. Daraz, J. Shabbir, H. Khan, Estimation of finite population mean by using minimum and maximum values in stratified random sampling, J. Mod. Appl. Stat. Methods, 17 (2018), 20. https://doi.org/10.22237/jmasm/1532007537 doi: 10.22237/jmasm/1532007537
    [29] U. Daraz, M. Khan, Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling, Res. Math. Stat., 8 (2021), 1899402. https://doi.org/10.1080/27658449.2021.1899402 doi: 10.1080/27658449.2021.1899402
    [30] U. Daraz, J. Wu, O. Albalawi, Double exponential ratio estimator of a finite population variance under extreme values in simple random sampling, Mathematics, 12 (2024), 1737. https://doi.org/10.3390/math12111737 doi: 10.3390/math12111737
    [31] U. Daraz, J. Wu, M. A. Alomair, L. A. Aldoghan, New classes of difference cum-ratio-type exponential estimators for a finite population variance in stratified random sampling, Heliyon, 10 (2024), e33402. https://doi.org/10.1016/j.heliyon.2024.e33402 doi: 10.1016/j.heliyon.2024.e33402
    [32] U. Daraz, M. A. Alomair, O. Albalawi, Variance estimation under some transformation for both symmetric and asymmetric data, Symmetry, 16 (2024), 957. https://doi.org/10.3390/sym16080957 doi: 10.3390/sym16080957
    [33] H. O. Cekim, H. Cingi, Some estimator types for population mean using linear transformation with the help of the minimum and maximum values of the auxiliary variable, Hacet. J. Math. Stat., 46 (2017), 685–694. https://doi.org/10.15672/hjms.201510114186 doi: 10.15672/hjms.201510114186
    [34] S. Chatterjee, A. S. Hadi, Regression analysis by example, John Wiley & Sons, Inc., 2013. https://doi.org/10.1002/0470055464
    [35] D. J. Watson, The estimation of leaf area in field crops, J. Agric. Sci., 27 (1937), 474–483. https://doi.org/10.1017/S002185960005173X doi: 10.1017/S002185960005173X
    [36] Bureau of Statistics, Punjab development statistics government of the Punjab, Lahore, Pakistan, 2013.
    [37] W. B. Cochran, Sampling techniques, John Wiley & Sons, Inc., 1963.
  • This article has been cited by:

    1. Umer Daraz, Jinbiao Wu, Della Agustiana, Walid Emam, Finite Population Variance Estimation Using Monte Carlo Simulation and Real Life Application, 2025, 17, 2073-8994, 84, 10.3390/sym17010084
    2. Khudhayr A. Rashedi, Alanazi Talal Abdulrahman, Tariq S. Alshammari, Khalid M. K. Alshammari, Usman Shahzad, Javid Shabbir, Tahir Mehmood, Ishfaq Ahmad, Robust Särndal-Type Mean Estimators with Re-Descending Coefficients, 2025, 14, 2075-1680, 261, 10.3390/axioms14040261
    3. Abdulaziz S. Alghamdi, Fatimah A. Almulhim, Optimizing Finite Population Mean Estimation Using Simulation and Empirical Data, 2025, 13, 2227-7390, 1635, 10.3390/math13101635
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(641) PDF downloads(28) Cited by(3)

Figures and Tables

Tables(6)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog