Loading [MathJax]/jax/output/SVG/jax.js
Research article Special Issues

Estimation of the finite population mean using extreme values and ranks of the auxiliary variable in two-phase sampling

  • This study aimed to improve the estimation of the mean of the dependent variable by incorporating the smallest and largest values and ranks of the independent variable. To achieve this, we introduce two new classes of estimators that offer enhanced accuracy compared with the existing approaches, as evaluated using the mean squared error (MSE) criterion. The key features of the proposed estimators are examined through a first-order approximation method, particularly focusing on the bias and mean squared error under two-phase sampling. In addition, their performance is assessed using simulated populations generated from six different distributions with varying parameter settings, along with three real datasets. Furthermore, the findings show that the new estimators achieve lower mean squared errors compared with existing methods.

    Citation: Hleil Alrweili, Fatimah A. Almulhim. Estimation of the finite population mean using extreme values and ranks of the auxiliary variable in two-phase sampling[J]. AIMS Mathematics, 2025, 10(4): 8794-8817. doi: 10.3934/math.2025403

    Related Papers:

    [1] Khazan Sher, Muhammad Ameeq, Sidra Naz, Basem A. Alkhaleel, Muhammad Muneeb Hassan, Olayan Albalawi . Developing and evaluating efficient estimators for finite population mean in two-phase sampling. AIMS Mathematics, 2025, 10(4): 8907-8925. doi: 10.3934/math.2025408
    [2] Sohaib Ahmad, Sardar Hussain, Muhammad Aamir, Faridoon Khan, Mohammed N Alshahrani, Mohammed Alqawba . Estimation of finite population mean using dual auxiliary variable for non-response using simple random sampling. AIMS Mathematics, 2022, 7(3): 4592-4613. doi: 10.3934/math.2022256
    [3] Amber Yousaf Dar, Nadia Saeed, Moustafa Omar Ahmed Abu-Shawiesh, Saman Hanif Shahbaz, Muhammad Qaiser Shahbaz . A new class of ratio type estimators in single- and two-phase sampling. AIMS Mathematics, 2022, 7(8): 14208-14226. doi: 10.3934/math.2022783
    [4] Olayan Albalawi . Estimation techniques utilizing dual auxiliary variables in stratified two-phase sampling. AIMS Mathematics, 2024, 9(11): 33139-33160. doi: 10.3934/math.20241582
    [5] Mehreen Fatima, Saman Hanif Shahbaz, Muhammad Hanif, Muhammad Qaiser Shahbaz . A modified regression-cum-ratio estimator for finite population mean in presence of nonresponse using ranked set sampling. AIMS Mathematics, 2022, 7(4): 6478-6488. doi: 10.3934/math.2022361
    [6] Saman Hanif Shahbaz, Aisha Fayomi, Muhammad Qaiser Shahbaz . Estimation of the general population parameter in single- and two-phase sampling. AIMS Mathematics, 2023, 8(7): 14951-14977. doi: 10.3934/math.2023763
    [7] Xuechen Liu, Muhammad Arslan . A general class of estimators on estimating population mean using the auxiliary proportions under simple and two phase sampling. AIMS Mathematics, 2021, 6(12): 13592-13607. doi: 10.3934/math.2021790
    [8] Tolga Zaman, Cem Kadilar . Exponential ratio and product type estimators of the mean in stratified two-phase sampling. AIMS Mathematics, 2021, 6(5): 4265-4279. doi: 10.3934/math.2021252
    [9] Sanaa Al-Marzouki, Christophe Chesneau, Sohail Akhtar, Jamal Abdul Nasir, Sohaib Ahmad, Sardar Hussain, Farrukh Jamal, Mohammed Elgarhy, M. El-Morshedy . Estimation of finite population mean under PPS in presence of maximum and minimum values. AIMS Mathematics, 2021, 6(5): 5397-5409. doi: 10.3934/math.2021318
    [10] Khazan Sher, Muhammad Ameeq, Muhammad Muneeb Hassan, Basem A. Alkhaleel, Sidra Naz, Olyan Albalawi . Novel efficient estimators of finite population mean in stratified random sampling with application. AIMS Mathematics, 2025, 10(3): 5495-5531. doi: 10.3934/math.2025254
  • This study aimed to improve the estimation of the mean of the dependent variable by incorporating the smallest and largest values and ranks of the independent variable. To achieve this, we introduce two new classes of estimators that offer enhanced accuracy compared with the existing approaches, as evaluated using the mean squared error (MSE) criterion. The key features of the proposed estimators are examined through a first-order approximation method, particularly focusing on the bias and mean squared error under two-phase sampling. In addition, their performance is assessed using simulated populations generated from six different distributions with varying parameter settings, along with three real datasets. Furthermore, the findings show that the new estimators achieve lower mean squared errors compared with existing methods.



    Supplementary information is a useful method in survey sampling for increasing estimators' precision. To enhance relative efficiency, methods such as ratio, regression, and product-type estimates utilize not only the data related to the primary study variables but also supplementary details from one or more auxiliary variables. Extensive research efforts have been dedicated to developing improved estimators for the overall parameters, including measures like the population mean, total, median, and various additional indicators. For more details about improved estimators and their properties, see [1,2,3] and references therein.

    The sampling theory uses auxiliary variables as well as the study variable in order to get a better design, and it is also used to fine-tune the cost-effectiveness of the estimators by utilizing the relationship between the variables. In such situations, the two-phase sampling method is preferred, especially when the overall population mean is unavailable prior to conducting a survey. Survey methods that use two distinct stages to select samples from a population are called two-phase sampling methods. A preliminary phase can be conducted more cost-effectively or efficiently before the main phase when the main phase is more costly. The basic concept of two-phase sampling was initially introduced by [4]. Recently, it has attracted considerable interest because of its cost-effectiveness in assessing variables. A few investigations that have been conducted on two-phase sampling include [5,6,7,8].

    The information obtained through a sample survey can produce unexpected results. When the sample contains extreme values, the mean estimator becomes susceptible to distortion, which may lead to biased results. Using the extreme values from known auxiliary variables, [9] first provided two estimators through the use of linear transformations. After this initial work, [10] suggested better estimators based on ratios, products, and regression to estimate population means more successfully by utilizing outlier values. In order to more accurately obtain the unknown population mean by considering extreme values, [11] introduced some estimators based on a two-phase strategy. A new set of transformations based on extreme values of auxiliary variables were provided by [12], which enhanced the estimation of the unknown population mean. Subsequently, [13] achieved significant improvements in increasing the precision of population mean estimations by introducing extreme values into a stratified random sampling technique. Using extreme value ideas as a foundation, [14,15] obtained a family of estimators intended to minimize the mean squared errors by using the population variance. In a recent work, [16,17,18] proposed various classes of highly efficient estimators by employing transformations on extreme values to accurately estimate the population variance. A ratio estimation of the population mean using auxiliary information under the optimal sampling design was proposed by [19]. Additional insights and related methodologies can be explored in [20,21,22].

    The motivation behind this study lies in the ongoing challenges of improving the accuracy and efficiency of population mean estimation, particularly in survey sampling. Traditional estimators often struggle with the presence of extreme values, such as the smallest and largest observations, which can distort the results. This issue is particularly critical in fields like economics, healthcare, and public health, where precise population estimates directly influence policy decisions and resource allocation.

    Sometimes, there is a temptation to remove extreme observations from the data-sets. However, the accuracy of estimators often declines when the mean squared error (MSE) is calculated in scenarios involving extreme values. The innovation of this work lies in the development of these new estimators, which are designed to improve the precision of population mean estimation while minimizing the MSE compared with existing methods. In this article, we introduce different improved estimators that employ both the smallest and largest observations of independent variables and their ranks in a double-phase sampling design to improve the precision of estimation. The new estimators are widely applicable, with potential uses in areas like economics, healthcare, manufacturing, retail, and transportation. They are particularly effective in scenarios demanding precise predictions. The following are some implications of our suggested estimators.

    The new estimators are widely applicable, with potential uses in areas like economics, healthcare, manufacturing, retail, and transportation. They are particularly effective in scenarios demanding precise predictions.

    1) Industry: In the industrial sector, our method supports predictive maintenance by monitoring equipment in manufacturing. For instance, in an automotive plant, it analyzes sensor data to forecast component failures, enabling timely repairs that reduce downtime and costs. This improves efficiency, lowers expenses, and ensures smoother production.

    2) Economics: The estimators significantly improve economic forecasting by providing more accurate predictions of indicators like inflation, unemployment, and gross domestic product (GDP) growth. This makes it possible for the public and commercial sectors to allocate resources more effectively and make better policy decisions.

    3) Environmental science The proposed estimators are important in environmental science, assisting in environmental forecasting and estimating pollution. They provide accurate predictions of ecological patterns, aiding in the projection of atmospheric shifts, pollution concentrations, and their effects on ecosystems.

    4) Transportation: Through traffic pattern analysis and delay prediction, our approach improves route optimization for delivery services in the transportation industry. As a result, delivery times and fuel consumption can be reduced by logistics businesses planning more effective routes.

    5) Public health: In public health, the estimators enhance epidemiological modeling and health outcome predictions. They enable more precise forecasting of disease spread, healthcare demands, and intervention planning. Organizations are better able to allocate resources and get ready for future healthcare demands due to the more accurate information.

    6) Retail: Our approach more precisely predicts product demand in retail, which enhances inventory management. This enables retailers to keep inventory levels balanced, preventing both excess stock and shortages. These examples show how adaptable our methodology is and how it has the ability to significantly advance both industry and healthcare.

    7) Medicine: Our approach can enhance early cancer detection by improving the accuracy of medical imaging analysis. When integrated with tools like magnetic resonance imaging (MRI) or computed tomography (CT) scans, it aids in identifying malignant tissues more effectively, enabling timely intervention. For example, it could improve tumor detection in breast cancer through better image segmentation and classification of abnormal tissue.

    The structure of the paper is arranged as follows. The basic notations and concepts utilized in this study are explained in Section 2. Section 3 provides a summary of various existing estimators. The proposed classes of estimators are thoroughly discussed in Section 4. There is a rigorous mathematical comparison in Section 5 between existing and new proposed estimators. A description of a simulation conducted to produce some populations from various distributions is presented in Section 6, which is intended to evaluate the theoretical conclusions from Section 5. This section also presents numerical examples that illustrate the practical implications of the theoretical findings. Finally, Section 7 summarizes the overall conclusions and suggests some ideas for new research.

    A finite population can be represented as Ω=(Ω1,Ω2,Ω3,,ΩN), where N denotes the total number of units. In this context, let xi represent the value of the independent variable X, ri indicate the ranks associated with the independent variable R, and yi denote the value of the response variable Y for the ith unit.

    This research explores the impact of the variable X and presents two new estimators developed to obtain the population mean Y. The two-phase sampling scheme is described by the following definition.

    (1) We collect a sample without replacement of size m1 in the initial stage in order to accurately obtain the population mean, represented as ˉX. For accurate results, we must make sure that the sample size m1 remains under N.

    (2) A second sample (without replacement) size of m2 observation (where m2<m1) is selected to determine the variables y and x in the second phase.

    The following are the formulas for the population means, which are calculated as

    ˉX=Ni=1xiN,
    ˉR=Ni=1riN,

    and

    ˉY=Ni=1yiN.

    Let the population variances for variables (X,R,Y) be defined as follows without replacement sampling:

    S2x=Ni=1(xiˉX)2N1,
    S2r=Ni=1(riˉR)2N1,
    S2y=Ni=1(yiˉY)2N1,

    while the population coefficients of variations for these variables are defined as:

    Cx=SxˉX,
    Cr=SrˉR,

    and

    Cy=SyˉY,

    respectively. Furthermore, the relationships among the variables (Y,X), (Y,R), and (X,R) are described through their population correlation coefficients as follows:

    ρyx=SyxSySx,
    ρyr=SyrSySr,

    and

    ρxr=SxrSxSr,

    where

    Syx=Ni=1(yiˉY)(xiˉX)N1,
    Syr=Ni=1(yiˉY)(riˉR)N1,

    and

    Sxr=Ni=1(xiˉX)(riˉR)N1,

    are the population co-variances, respectively.

    Now, we define the first-phase sample means and variances base on m1 observations associated with the variables X and R, which are presented in the following manner:

    ˉx=m1i=1xim1,
    ˉr=m1i=1rim1,
    ˊs2x=m1i=1(xiˉx)2m11,

    and

    ˊs2r=m1i=1(riˉr)2m11.

    To calculate the sample means and variances for the second phase, a random sample without replacement of size m2 observations is chosen based on the first phase (m2<m1), which are defined as:

    ˉx=m2i=1xim2,
    ˉr=m2i=1rim2,
    ˉy=m2i=1yim2,
    s2x=m2i=1(xiˉx)2m21,
    x2r=m2i=1(riˉr)2m21,

    and

    s2y=m2i=1(yiˉy)2m21.

    The analysis of the mean squared errors and biases associated with the estimators used to determine the population mean is given in this section. These findings are then compared with those of the proposed classes of estimators to identify areas for improvement.

    The conventional unbiased estimator is presented below:

    ˉyd=1nni=1yi. (3.1)

    The variance of the conventional unbiased estimator ˉyd is defined by:

    V(ˉyd)=θˉY2C2y, (3.2)

    where

    θ=(1m21N)

    is the sampling correction term for the second phase.

    The following is an expression for a ratio-type estimator ˉydR according to [4]:

    ˉydR=ˉyˉxˉx. (3.3)

    The following formulas are used to express the bias and MSE of ˉydR:

    Bias(ˉydR)θˉY(C2xCyx), (3.4)
    MSE(ˉydR)ˉY2(θC2y+θC2x2θCyx), (3.5)

    where

    θ=(1m11N),

    and

    θ=(1m21m)

    represent the sampling correction terms for the first and the intermediate stages.

    The product estimator ˉydP is expressed as:

    ˉydP=ˉyˉxˉx. (3.6)

    The following formulas are used to express the bias and MSE of ˉydP:

    Bias(ˉydP)ˉYθCyx, (3.7)

    and

    MSE(ˉydP)ˉY2(θC2y+θC2x+2θCyx). (3.8)

    The standard regression estimator is denoted as ˉydlr, which is generally represented by:

    ˉydlr=ˉy+byx(ˉxˉx), (3.9)

    where byx denotes the regression coefficient from the sample.

    The following formulas are used to express the bias and MSE of ˉydlr:

    Bias(ˉydlr)θβ(μ12Syxμ03S2x), (3.10)
    MSE(ˉydlr)ˉY2C2y(θθρ2yx), (3.11)

    where

    β=SyxS2x,

    and

    μrs=Ni=1(yiˉY)r(xiˉX)sN.

    According to [23], the definitions of the exponential ratio and product type estimators are given by

    ˉydRMD=ˉyexp(ˉxˉxˉx+ˉx), (3.12)

    and

    ˉydPMD=ˉyexp(ˉxˉxˉx+ˉx). (3.13)

    The expressions for MSE of ˉydRMD and ˉydPMD are, respectively, expressed as follows:

    MSE(ˉydRMD)ˉY2[θC2y+θC2x(14C)], (3.14)

    and

    MSE(ˉydPMD)ˉY2[θC2y+θC2x(14+C)], (3.15)

    where

    C=(ρyxCx)Cy.

    The authors of [24] proposed a double sampling estimator is defined as

    ˉydRPˉy{sˉxˉx+(1s)ˉxˉx}, (3.16)

    where

    s=1+C2.

    The following formulas are used to express the bias and MSE of ˉydRP:

    Bias(ˉydRP)minθ2ˉYC2x[1+C(12C)], (3.17)

    and

    MSE(ˉydRP)minˉY2C2y[θ(1ρ2yx)+θρ2yx]. (3.18)

    This section, inspired by the methodologies discussed in [25,26,27], presents some improved classes of estimators. These improved estimators employ both the smallest and largest observations of the independent variables and their ranks in a double-phase design to improve estimation precision. The proposed estimators are defined below:

    ˉydU=[k1ˉy(ˉxˉx)α1+k2(ˉxˉx)α2]exp[a1(ˉxˉx)a1(ˉx+ˉx)+2a2], (4.1)

    and

    ˉyde=ˉyexp[k3{(ˉxˉx)(ˉxˉx)+2b1}]exp[k4{(ˉrˉr)(ˉr+ˉr)+2b2}], (4.2)

    where the scalar quantities (α1,α2) can assume the values (0,1,1). It is important to determine the unknown constant values of (k1,k2) by using the scalar quantities (α1,α2) so that the biases and mean squared errors can be minimized. Meanwhile, b1=XMXm represents the difference between extreme observations of the independent variable, and b2=RMRm represents the difference between the highest and lowest ranks of the independent variable. In addition, a1 and a2 are the various transformation parameter values. A detailed description of the subsets of the proposed estimator-Ⅰ can be found in Table 1.

    Table 1.  Classification of estimators under the improved estimator-Ⅰ.
    Subsets of ˉydU α1 α2 a1 a2
    ˉydU1=[k1ˉy(ˉxˉx)+k2(ˉxˉx)]L 1 -1 β2(x) xMxm
    ˉydU2=[k1ˉy(ˉxˉx)+k2(ˉxˉx)]L -1 1 cx xMxm
    ˉydU3=[k1ˉy(ˉxˉx)+k2(ˉxˉx)]L -1 -1 xMxm cx
    ˉydU4=[k1ˉy+k2(ˉxˉx)]L 0 1 xMxm β2(x)
    ˉydU5=[k1ˉy(ˉxˉx)+k2(ˉxˉx)]L 1 1 xMxm β2(x)
    ˉydU6=[k1ˉy+k2(ˉxˉx)]L 0 -1 β2(x) xMxm
    ˉydU7=[k1ˉy(ˉxˉx)+k2]L 1 0 xMxm cx
    ˉydU8=[k1ˉy(ˉxˉx)+k2]L -1 0 cx xMxm

     | Show Table
    DownLoad: CSV

    where

    L=exp[a1(ˉxˉx)a1(ˉx+ˉx)+2a2].

    To determine the mathematical properties of different estimators, the relative error terms are utilized:

    e0=(ˉyˉYˉY),e1=(ˉxˉXˉX),e2=(ˉxˉXˉX),e3=(ˉrˉRˉR),e4=(ˉrˉRˉR),

    such that E(ei)=0.

    Moreover,

    E(e20)=θC2y,E(e21)=θC2x,E(e22)=θC2x,E(e23)=θC2r,E(e24)=θC2r,E(e0e1)=θCyx,
    E(e0e2)=θCyx,E(e0e3)=θCyr,E(e0e4)=θCyr,E(e1e2)=θC2x,E(e1e3)=θCxr,
    E(e1e4)=θCxr,E(e2e3)=θCxr,E(e2e4)=θCxr,E(e3e4)=θC2r.

    In order to investigate the characteristics of the first improved estimator, we rewrite (4.1) using the error terms:

    ˉydU=[k1ˉY(1+e0)(1+e1)α1(1+e2)α1+k2(1+e1)α2(1+e2)α2]×exp[g1(e2e1)2(1+g12(e2+e1))1], (4.3)

    where

    g1=a1ˉXa1ˉX+a2.

    By performing a first-order Taylor series expansion and extending the right-hand sides of (4.3), excluding terms where ei>2, we get the following expression:

    ˉydUˉYˉY+k1ˉY[1+e0e1(α1+g12)+e2(α1+g12)+e21(α1g12+3g218+α1(α1+1)2)+e22(α1g12g218+α1(α11)2)e0e1(α1+g12)e0e2(α1+g12)e1e2(α1+g12)2]+k2[1e1(α2+g12)+e2(α2+g12)+e21(α2g12+3g218+α2(α2+1)2)+e22(α2g12g218+α2(α21)2)e1e2(α2+g12)2]. (4.4)

    Using (4.4), the bias of ˉydU is given by

    Bias(ˉydU)[ˉY+k1ˉYD+k2G], (4.5)

    where

    D=[1+θ{C2x(4α1(α1+1+g1)+3g218)Cyx(2α1+g12)}+θ{C2x{4α1(α1+g1+1)3g212}+Cyx(2α1+g12)}],

    and

    G=[1+θC2x(4α2(α2+g1+1)+3g218)+θC2x(2α2(α2+g11)3g214)].

    By applying expectation after squaring both sides of (4.4), we derive the corresponding equation that expresses the MSE of ˉydU.

    MSE(ˉydU)[ˉY2+ˉY2k21A+k22B2ˉY2k1D2ˉYk2G+2ˉYk1k2F], (4.6)

    where

    A=[1+θ{C2y+C2x{(α1+g12)2+(α1g1+3g214+α1(α1+1)2)}4Cyx(α1+g12)}+θ{C2x{(α1+g12)2+(α1g1g214+α1(α11))4(α1+g12)}+4Cyx(α1+g12)}],
    B=[1+θC2x{(α2+g12)2+(α2g1+3g214+α2(α2+1))}+θC2x{(α2+g12)2+(α2g1g214+α2(α21))4(α2+g12)2}],

    and

    F=[1+θ{C2x{(α1g12+3g218+α1(α1+1)2)+(α1+g12)(α2+g12)+(α2g12+3g218+α2(α2+1)2)Cyx(α1+α2+g1)}+θ{C2x{(α1g12g218+α1(α11)2)(α1+g12)(α2+g12)+(α2g12g218+α2(α21)2)(α1+g12)2(α2+g12)2}+Cyx(α1+α2+g1)}].

    By minimizing Eq (4.6), the optimal values for k1 and k2 can be determined, as shown below

    k1(opt)=BDFGABF2,

    and

    k2(opt)=ˉY(AGDF)ABF2.

    The minimum values for bias and MSE of ˉydU are determined by replacing the optimum k1 and k2 into (4.5) and (4.6). The resulting expressions are given below:

    Bias(ˉydU)minˉY2[1(AG2+BD22DFG)ABF2], (4.7)

    and

    MSE(ˉydU)minˉY2[1(AG2+BD22DFG)ABF2]. (4.8)

    To assess the behavior of the proposed estimator, we reformulate (4.2) using relative errors, which allows the computation of the Eqs (4.13) and (4.14), i.e.,

    ˉyde=ˉY(1+e0)exp[k3{g2(e2e1)2(1+g22(e1+e2))1}]exp[k4{g3(e4e3)2(1+g32(e3+e4))1}], (4.9)

    where

    g2=ˉXˉX+b1,

    and

    g3=ˉRˉR+b2.

    By performing a first-order Taylor series expansion and extending the right-hand sides of (4.9), excluding terms where ei>2, we get the following expression:

    ˉydeˉYˉY[e0k3g22(e1e2)k4g32(e3e4)+(k3g224+k23g228)e21(k3g224k23g228)e22+(k4g234k24g238)e23(k4g234k24g238)e24k3g22e0e1+k3g22e0e2k4g32e0e3+k4g32e0e4k23g222e1e2+k3k4g2g34e1e3k3k4g2g34e1e4k3k4g2g34e2e3+k3k4g2g34e2e4k24g232e3e4]. (4.10)

    Using (4.10), the bias of ˉyde is given by:

    Bias(ˉyde)θˉY[(k23g228+k3g224)C2x+(k24g238+k4g234)C2rk3g22Cyxk4g32Cyr+k3k4g2g32Cxr]θˉY[(k23g228+k3g224)C2x+(k24g238+k4g234)C2rk3g22Cyxk4g32Cyr+k3k4g2g32Cxr]. (4.11)

    By applying expectation after squaring both sides of (4.10), we derive the corresponding equation that expresses the MSE of ˉyde.

    MSE(ˉyde)θˉY2[C2y+k23g224C2x+k24g234C2rk3g2Cyxk4g3Cyr+k3k4g2g32Cxr]θˉY2[k23g224C2x+k24g234C2rk3g2Cyxk4g3Cyr+k3k4g2g32], (4.12)

    To express the bias and MSE for ˉyde, we substitute the known constants k3 and k4 into the formulas in (4.11) and (4.12). After simplifying the expressions, we get the following results:

    Bias(ˉyde)θˉY[38(g22C2x+g23C2r)12(g2Cyx+g3Cyr+g2g3Cxr)], (4.13)

    and

    MSE(ˉyde)ˉY2[θC2y+θ4(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr)]. (4.14)

    In this section, we provide the efficiency conditions by using the mean squared error equation of the proposed estimators with the mean squared error equations of existing estimators, such as ˉyd, ˉydR, ˉydlr, ˉydRMD, ˉydPMD, and ˉydRP.

    Condition (ⅰ): We derived the following results from (3.2) and (4.8) to ensure that the suggested estimator is at least as efficient:

    V(ˉyd)>MSE(ˉydU)min,ifθC2y+(AG2+BD22DFGABF2)>1.

    Condition (ⅱ): The following expression obtained from (3.5) and (4.8), provides the required condition for the proposed estimator-Ⅰ to be more efficient:

    MSE(ˉydR)>MSE(ˉydU)min,if(θC2y+θC2x2θCyx)+(AG2+BD22DFGABF2)>1.

    Condition (ⅲ): The following condition obtained from (3.11) and (4.8), which provides that proposed estimator-Ⅰ has a smaller mean squared error:

    MSE(ˉydlr)>MSE(ˉydU)min,ifC2y(θθρ2yx)+(AG2+BD22DFGABF2)>1.

    Condition (ⅳ): The required condition for the proposed estimator-Ⅰ, determined from (3.14) and (4.8), shows that the proposed estimator-Ⅰ is more efficient:

    MSE(ˉydRMD)>MSE(ˉydU)min,if[θC2y+θC2x(14C)]+(AG2+BD22DFGABF2)>1.

    Condition (ⅴ): The resulting expression derived from (3.15) and (4.8) defines the necessary condition for the proposed estimator-Ⅰ to show better efficiency:

    MSE(ˉydPMD)>MSE(ˉydU)min,if[θC2y+θC2x(14+C)]+(AG2+BD22DFGABF2)>1.

    Condition (ⅵ): The following inequality, derived from (3.18) and (4.8), indicates the effectiveness of the proposed estimator-Ⅰ:

    MSE(ˉydRP)min>MSE(ˉydU)min,ifC2y[θ(1ρ2yx)+θρ2yx]+(AG2+BD22DFGABF2)>1.

    Condition (ⅶ): From (3.2) and (4.14), we derive:

    V(ˉyd)>MSE(ˉyde),if
    (θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr)<0.

    If θ is greater than ˊθ or, equivalently, θ>θ, the following inequality is satisfied:

    (g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr)<0. (5.1)

    In a similar way, when θ is less than ˊθ, the following inequality holds true:

    (g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr)>0. (5.2)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉyd) when either (5.1) or (5.2) holds.

    Condition (ⅷ): From (3.5) and (4.14), we derive:

    MSE(ˉydR)>MSE(ˉyde),if4(θθ)(C2x2Cyx)>(θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr).

    If θ is greater than ˊθ or, equivalently, θ>θ, the following inequality is satisfied:

    4(C2x2Cyx)>(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.3)

    In a similar way, when θ is less than ˊθ, the subsequent inequality holds true:

    4(C2x2Cyx)<(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.4)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉydR) when either (5.3) or (5.4) holds.

    Condition (ⅸ): From (3.11) and (4.14), we derive

    MSE(ˉydlr)>MSE(ˉyde),if4(θθ)ρ2yx<(θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr).

    If θ is greater than ˊθ or, equivalently, θ>θ, the following inequality is satisfied:

    4ρ2yx>(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.5)

    In a similar way, when θ is less than ˊθ, the following inequality holds true:

    4ρ2yx<(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.6)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉydlr) when either (5.5) or (5.6) holds.

    Condition (ⅹ): From (3.14) and (4.14), we derive

    MSE(ˉydRMD)>MSE(ˉyde)
    4(θθ)C2x(14C)>(θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr).

    In a similar way, when θ is less than ˊθ, the following inequality holds true:

    C2x(14C)>(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.7)

    In a similar way, when θ is less than ˊθ, the subsequent inequality holds true:

    C2x(14C)<(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.8)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉydRMD) when either (5.7) or (5.8) holds.

    Condition (xi): From (3.15) and (4.14), we derive

    MSE(ˉydPMD)>MSE(ˉyde),if
    4(θθ)C2x(14+C)>(θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr).

    If θ is greater than ˊθ or, equivalently, θ>θ, the following inequality is satisfied:

    C2x(1+4C)>(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.9)

    In a similar way, when θ is less than ˊθ, the following inequality holds true:

    C2x(1+4C)<(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.10)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉydPMD) when either (5.9) or (5.10) holds.

    Condition (xii): From 3.18 and (4.14), we derive:

    MSE(ˉydRP)min>MSE(ˉyde),if
    4(θθ)ρ2yx<(θθ)(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr).

    If θ is greater than ˊθ or, equivalently, θ>θ, the following inequality is satisfied:

    4ρ2yx>(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.11)

    In a similar way, when θ is less than ˊθ, the following inequality holds true:

    4ρ2yx<(g22C2x+g23C2rg2Cyxg3Cyr+2g2g3Cxr). (5.12)

    The proposed estimator ˉyde shows better performance relative to MSE(ˉydRP) when either (5.11) or (5.12) holds.

    In this section, we compare the mean squared errors of the improved classes of estimators with those of existing estimators in order to provide a comprehensive analysis. This analysis relies on both simulated data and distinct real datasets. By analyzing the MSE across these simulated and different datasets, we aim to provide a thorough assessment of the performance and reliability of the new proposed estimators.

    The simulation method from [28] is in this section, where the auxiliary variable X is collected from six populations, each associated with a different probability distribution.

    ● Data-1: XGamma(η1=4,η2=6),

    ● Data-2: XGamma(η1=8,η2=10),

    ● Data-3: XUniform(ν1=3,ν2=5),

    ● Data-4: XUniform(ν1=0,ν2=1),

    ● Data-5: XExponential(μ=5),

    ● Data-6: XExponential(μ=10).

    Afterwards, by utilizing the correlation coefficient's fixed value

    ryx=0.79,

    and the random error term

    eN(0,1),

    the dependent variable Y is computed by using the following equation:

    Y=ryx×X+e.

    On the basis of each distribution and correlation setting, we examined the MSEs of various suggested estimators and existing estimators to assess the robustness and efficiency.

    Step 1: In order to generate 1000 observations, we start to obtain a population based on the different probability distributions mentioned above.

    Step 2: In the first phase, simple random sampling without replacement (SRSWOR) is used to choose a sub-sample m1 from a whole population of size N.

    Step 3: From the first-phase sample, the SRSWOR technique is again applied to draw a second-phase sub-sample with size m2.

    Step 4: The total population, as well as the extreme observations of independent variables and their rankings, are calculated using the processes described above. We also determine the optimal values for the improved classes of estimators by considering unknown constants.

    Step 5: Different sample sizes are computed for each population using the SRSWOR method.

    Step 6: For each sample size under consideration, the MSE are calculated for all the estimators examined in this paper.

    Step 7: After 45,000 repetitions of Steps 5 and 6; the MSE values for all the estimators are calculated on the basis of the formula defined as follows:

    MSE(ˉydk)min=45000g=1(ˉydkgˉY)245000,k=R,lr,RMD,PMD,RP,e,U1,U2,,U8.

    Step 8: Finally, Table 2 reports the MSE results for artificial populations.

    Table 2.  MSE computation for various estimators with artificial populations.
    Estimator Gam(4,6) Gam(8,10) Uni(0,1) Uni(3,5) Exp(5) Exp(10)
    ˉyd 1.92e-2 2.28e-2 8.98e-3 9.90e-2 8.26e-2 9.95e-2
    ˉydR 1.73e-2 2.29e-2 4.70e-3 8.09e-2 7.66e-2 9.48e-2
    ˉydlr 1.68e-2 1.91e-2 3.10e-3 7.58e-2 7.01e-2 9.33e-2
    ˉydRMD 1.48e-2 1.72e-2 2.98e-3 7.24e-2 6.79e-2 9.07e-2
    ˉydPMD 2.29e-2 2.48e-2 9.25e-3 1.04e-1 9.25e-2 1.01e-1
    ˉydRP 7.62e-3 1.59e-2 2.79e-3 6.28e-2 6.09e-2 8.57e-2
    ˉyde 7.18e-3 1.40e-2 2.41e-3 5.60e-2 5.51e-2 7.54e-2
    ˉydU1 3.33e-3 4.42e-3 8.60e-4 3.28e-3 2.86e-3 5.58e-3
    ˉydU2 2.45e-3 4.56e-3 4.00e-4 3.05e-3 2.16e-3 1.54e-2
    ˉydU3 3.62e-3 5.34e-3 1.73e-3 3.88e-3 4.53e-3 2.77e-2
    ˉydU4 3.16e-3 6.63e-3 6.20e-4 3.13e-3 4.18e-3 3.00e-2
    ˉydU5 2.17e-3 4.85e-3 1.58e-3 3.55e-3 2.46e-3 4.14e-3
    ˉydU6 9.90e-4 1.63e-3 2.60e-4 1.20e-4 1.33e-3 2.05e-3
    ˉydU7 6.90e-4 1.74e-3 2.20e-4 3.70e-4 8.70e-4 3.48e-3
    ˉydU8 9.12e-6 6.32e-5 2.42e-6 6.10e-6 1.67e-5 6.30e-4

     | Show Table
    DownLoad: CSV

    We assessed the performance of different estimators by calculating the mean squared errors across three separate datasets. The aim of this analysis was to check the accuracy of the proposed estimators. Detailed descriptions of the datasets, along with their summary statistics, are provided below.

    Dataset 1. (Source: [29], p. 226)

    Y: Information on the number of employees in each department for the year 2012, which indicates the size of the workforce across different industries.

    X: The number of factories each department officially authorized in 2012b by providing information on the productivity of the department and the existence of the industry.

    R: The departments are arranged according to the count of factories in 2012, used to compare relative industrial activity.

    Dataset 2. (Source: [29], p. 135)

    Y: Denotes the overall count of students enrolled in educational departments in 2012;

    X: Denotes the total number of schools funded by the government in 2012;

    R: Denotes the ranks of the total number of schools funded by the government in 2012 according to the number of schools.

    Dataset 3. (Source: [1], p. 24)

    Y: The food costs that the families paid for, which are directly related to their jobs;

    X: The total weekly income that the families made, which shows their financial resources for that time;

    R: the ranking of families according to their weekly income, which shows how much money they made compared with each other.

    The above data sets are summarized in the following summary statistics given in Table 3, and mean squared comparison between new proposed estimators and existing estimators are displayed in Table 4.

    Table 3.  Summary statistics for different data-sets.
    Parameters Dataset 1 Dataset 2 Dataset 3 Parameters Dataset 1 Dataset 2 Dataset 3
    N 36 36 33 ˉY 52432 148718 27.49
    m1 15 15 15 ˉX 335.78 1054.39 72.55
    m2 6 6 6 ˉR 18.51 18.50 17.00
    XM 2055 2370 95 Sy 178201 182315 10.13
    Xm 24 39 58 Sx 451.136 402.61 10.58
    RM 36 36 33 Sr 10.53 10.54 9.64
    Rm 1 1 1 Cy 3.40 1.23 0.37
    Cx 1.34 0.38 0.15 Cr 0.57 0.56 0.57
    ρyx 0.39 0.17 0.25 ρyr 0.36 0.19 0.20
    ρxr 0.75 0.94 0.98 θ 0.14 0.14 0.14
    θ 0.10 0.10 0.10 θ 0.04 0.04 0.04

     | Show Table
    DownLoad: CSV
    Table 4.  MSE computation for various estimators with real populations.
    Estimator Dataset 1 Dataset 2 Dataset 3
    ˉyd 4410841204 4616729172 13.97
    ˉydR 3934722123 4566840232 13.53
    ˉydlr 3934517862 4509295381 13.32
    ˉydRMD 4048702250 4511078294 13.35
    ˉydPMD 5021109648 4883614743 15.40
    ˉydRP 3180960973 4018903439 11.39
    ˉydUe 3025382610 3918150768 11.29
    ˉydU1 354493387 422589708 2.66
    ˉydU2 2175126 7346603 0.05
    ˉydU3 350163172 214844883 0.72
    ˉydU4 212556223 352690922 2.04
    ˉydU5 275131170 403622267 2.64
    ˉydU6 377505122 216979294 0.74
    ˉydU7 117242958 280960015 1.51
    ˉydU8 1345452 6937497 0.04

     | Show Table
    DownLoad: CSV

    To identify the performance and quality of the newly improved estimators, we conducted simulations and analyzed three real-life datasets. The performance comparison was made using the MSE as the key criterion. The MSE values for both the new and existing estimators are listed in Table 2, while the results for the real datasets are shown in Table 4. From these evaluations, the following conclusions can be drawn.

    ● Findings from both the generated simulations and the actual datasets indicate that the newly introduced estimators consistently yield lower MSE values compared with the overall existing approaches that have already been discussed in the literature. This pattern is clearly reflected in Tables 2 and 4.

    ● Additionally, the MSE values for the proposed estimators are consistently lower than those for the existing ones, as illustrated by the declining trend of the graph lines in Figures 1 and 3, both for the simulation experiments and real-life data. This observation suggests that the proposed estimators outperform the other ones, with the MSE values for the new estimators showing an inverse pattern compared with those of the existing estimators.

    Figure 1.  A visual representation of the mean squared errors (MSEs) results for all suggested and existing estimators using artificial data.

    Figure 2 presents a grouped bar chart comparing the MSE of the proposed estimator across various artificial populations generated from gamma, uniform, and exponential distributions. Each group on the x-axis represents a distribution type, while the two bars within each group correspond to different parameter settings for that distribution (e.g., Gam(4,6) and Gam(8,10) for the gamma distribution). The y-axis shows the MSE values, with lower bars indicating better performance for that estimator. This visualization helps highlight how the estimator performs under different distributional shapes and scales, offering a clear comparison of its efficiency across varied population structures.

    Figure 2.  Visual representation of the mean squared errors (MSEs) for all suggested and existing estimators using artificial data.

    Figure 4 shows a grouped bar chart of the MSE values for 15 estimators across three real datasets. The x-axis indexes the estimators from 1 to 15, while the y-axis uses a logarithmic scale to handle the large variation in MSE values particularly highlighting the contrast between the relatively large MSE values of Dataset 1 and Dataset 2 and the smaller values from Dataset 3. Each group of bars represents an estimator, with different colors indicating the datasets. This visualization allows for easy comparison of estimator performance across datasets, highlighting how some estimators perform consistently better, especially on Dataset 3. From this plot, it is evident that certain estimators (such as Estimators 14 and 15) consistently yield lower MSE values.

    Figure 3.  A visual representation of the mean squared errors (MSEs) results for all suggested and existing estimators using real data.
    Figure 4.  Visual representation of the mean squared errors (MSEs) for all suggested and existing estimators using real data.

    This research introduced innovative sets of efficient newly enhanced estimators for determining the population mean, utilizing the minimum and largest values of the independent variable along with its known highest and lowest rank values. Section 5 outlines specific theoretical foundations to compare the proposed estimators with existing methods, demonstrating their effectiveness. To support these findings, simulation experiments were performed, along with an analysis of real-life datasets. The results show that the new estimators consistently perform better than the existing estimators, especially by having a lower mean squared error (MSE), as displayed in Table 2. These findings are also supported by the results in Table 4, which validates the theoretical conditions given in Section 5.

    The simulation results and empirical studies clearly show that the suggested estimators ˉydi (i=e,U1,U2,,U8) are more efficient than the other estimators being considered. We noticed that all the proposed estimators, ˉydU8 proves to be the most efficient option within the suggested group of estimators, making it strongly recommended.

    This study introduced an enhanced method for population mean estimation by utilizing the smallest and largest values of the independent variables along with their ranks. The main advantage of this approach lies in its ability to significantly reduce the MSE, particularly in the presence of extreme values, offering improved precision over traditional methods. This method is valuable in fields such as economics, healthcare, public policy, and education, where accurate population estimates are essential. However, the method has some limitations. Its performance may vary, depending on the characteristics of the data, such as distribution and correlation, and the two-phase sampling design may add complexity and cost.

    Moreover, our study examined the qualities and efficiency of the newly enhanced estimators by adopting a sampling two-phase technique. This approach allowed us to compare the performance of the enhanced estimators against traditional methods more effectively. Future research may explore the application of these methods in various sampling designs, including stratified and systematic approaches, and investigate optimal sample size allocation strategies to further improve efficiency. The use of additional auxiliary variables and assessment on large-scale datasets may provide deeper insights. Exploring the combination of these estimators with machine learning for adaptive sampling or real-time data processing can enhance estimation precision in dynamic environments. This topic offers an interesting direction for further investigation.

    Hleil Alrweili: Conceptualization, methodology, software, validation, formal analysis, investigation, resources, data curation, writing-original draft preparation, writing-review and editing, visualization, project administration; Fatimah A. Almulhim: Conceptualization, methodology, validation, formal analysis, investigation, data curation, writing-original draft preparation, writing-review and editing, visualization, supervision, funding acquisition. All authors have read and agreed to the published version of the manuscript.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This project was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project. The authors, therefore, acknowledge with thanks Princess Nourah bint Abdulrahman University Researchers Supporting funds, Project number (PNURSP2025R515), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

    The authors declare no conflict of interest.



    [1] W. G. Cochran, Sampling techniques, 2 Eds., Hoboken: John Wiley and Sons, 1963.
    [2] C. E. Särndal, Sample survey theory vs. general statistical theory: estimation of the population mean, Int. Stat. Rev., 40 (1972), 1–12. https://doi.org/10.2307/1402700 doi: 10.2307/1402700
    [3] S. Y. Yang, D. B. Meng, H. F. Yang, C. Q. Luo, X. Y. Su, Enhanced soft Monte Carlo simulation coupled with support vector regression for structural reliability analysis, P. I. Civil Eng.-Transp., 2024 (2024), 1–16. https://doi.org/10.1680/jtran.24.00128 doi: 10.1680/jtran.24.00128
    [4] B. V. Sukhatme, Some ratio-type estimators in two-phase sampling, J. Am. Stat. Assoc., 57 (1962), 628–632. https://doi.org/10.2307/2282400 doi: 10.2307/2282400
    [5] A. Y. Erinola, R. V. K. Singh, A. Audu, T. James, Modified class of estimator for finite population mean under two-phase sampling using regression estimation approach, Asian Journal of Probability and Statistics, 14 (2021), 52–64. https://doi.org/10.9734/ajpas/2021/v14i430338 doi: 10.9734/ajpas/2021/v14i430338
    [6] S. Guha, H. Chandra, Improved estimation of finite population mean in two-phase sampling with subsampling of the nonrespondents, Math. Popul. Stud., 28 (2021), 24–44. https://doi.org/0.1080/08898480.2019.1694325
    [7] A. Kamal, N. Amir, H. Dastagir, Some exponential type predictive estimators of finite population mean in two-phase sampling, Journal of Statistics, Computing and Interdisciplinary Research, 2 (2020), 51–57. https://doi.org/10.52700/scir.v2i1.10 doi: 10.52700/scir.v2i1.10
    [8] T. Zaman, C. Kadilar, New class of exponential estimators for finite population mean in two-phase sampling, Commun. Stat.-Theor. M., 50 (2019), 874–889. https://doi.org/10.1080/03610926.2019.1643480 doi: 10.1080/03610926.2019.1643480
    [9] S. Mohanty, J. Sahoo, A note on improving the ratio method of estimation through linear transformation using certain known population parameters, Sankhya: The Indian Journal of Statistics, 57 (1995), 93–102.
    [10] M. Khan, J. Shabbir, Some improved ratio, product, and regression estimators of finite population mean when using minimum and maximum values, Sci. World J., 2013 (2013), 431868. https://doi.org/10.1155/2013/431868 doi: 10.1155/2013/431868
    [11] M. Khan, Improvement in estimating the finite population mean under maximum and minimum values in double sampling scheme, J. Stat. Appl. Pro. Lett., 2 (2015), 115–121.
    [12] G. S. Walia, H. Kaur, M. K. Sharma, Ratio type estimator of population mean through efficient linear transformation, American Journal of Mathematics and Statistics, 5 (2015), 144–149. https://doi.org/10.5923/j.ajms.20150503.06 doi: 10.5923/j.ajms.20150503.06
    [13] U. Daraz, J. Shabbir, H. Khan, Estimation of finite population mean by using minimum and maximum values in stratified random sampling, J. Mod. Appl. Stat. Meth., 17 (2018), eP2548. https://doi.org/10.22237/jmasm/1532007537 doi: 10.22237/jmasm/1532007537
    [14] U. Daraz, M. Khan, Estimation of variance of the difference-cum-ratio-type exponential estimator in simple random sampling, RMS Res. Math. Stat., 8 (2021), 1899402. https://doi.org/10.1080/27658449.2021.1899402 doi: 10.1080/27658449.2021.1899402
    [15] U. Daraz, M. A. Alomair, O. Albalawi, Variance estimation under some transformation for both symmetric and asymmetric data, Symmetry, 16 (2024), 957. https://doi.org/10.3390/sym16080957 doi: 10.3390/sym16080957
    [16] A. S. Alghamdi, H. Alrweili, A comparative study of new ratio-type family of estimators under stratified two-phase sampling, Mathematics, 13 (2025), 327. https://doi.org/10.3390/math13030327 doi: 10.3390/math13030327
    [17] U. Daraz, J. B. Wu, D. Agustiana, W. Emam, Finite population variance estimation using Monte Carlo simulation and real life application, Symmetry, 17 (2025), 84. https://doi.org/10.3390/sym17010084 doi: 10.3390/sym17010084
    [18] U. Daraz, D. Agustiana, J. B. Wu, W. Emam, Twofold auxiliary information under two-Phase sampling: An improved family of double-transformed variance estimators, Axioms, 14 (2025), 64. https://doi.org/10.3390/axioms14010064 doi: 10.3390/axioms14010064
    [19] C. X. Long, W. X. Chen, R. Yang, D. S. Yao, Ratio estimation of the population mean using auxiliary information under the optimal sampling design, Probab. Eng. Inform. Sc., 36 (2022), 449–460. https://doi.org/10.1017/S0269964820000625 doi: 10.1017/S0269964820000625
    [20] H. O. Cekim, H. Cingi, Some estimator types for population mean using linear transformation with the help of the minimum and maximum values of the auxiliary variable, Hacet. J. Math. Stat., 46 (2017), 685–694. https://doi.org/10.15672/HJMS.201510114186 doi: 10.15672/HJMS.201510114186
    [21] S. Chatterjee, A. S. Hadi, Regression analysis by example, 5 Eds., Hoboken: John Wiley & Sons, 2013.
    [22] A. S. Alghamdi, H. Alrweili, New class of estimators for finite population mean under stratified double phase sampling with simulation and real-life application, Mathematics, 13 (2025), 329. https://doi.org/10.3390/math13030329 doi: 10.3390/math13030329
    [23] H. P. Singh, G. K. Vishwakarma, Modified exponential ratio and product estimators for finite population mean in double sampling, Aust. J. Stat., 36 (2007), 217–225.
    [24] H. P. Singh, M. R. Espeio, Double sampling ratio-product estimator of a finite population mean in sample surveys, J. Appl. Stat., 34 (2007), 71–85. https://doi.org/10.1080/02664760600994562 doi: 10.1080/02664760600994562
    [25] U. Daraz, J. B. Wu, O. Albalawi, Double exponential ratio estimator of a finite population variance under extreme values in simple random sampling, Mathematics, 12 (2024), 1737. https://doi.org/10.3390/math12111737 doi: 10.3390/math12111737
    [26] U. Daraz, J. B. Wu, M. A. Alomair, L. A. Aldoghan, New classes of difference cum-ratio-type exponential estimators for a finite population variance in stratified random sampling, Heliyon, 10 (2024), e33402. https://doi.org/10.1016/j.heliyon.2024.e33402 doi: 10.1016/j.heliyon.2024.e33402
    [27] M. A. Alomair, U. Daraz, Dual transformation of auxiliary variables by using outliers in stratified random sampling, Mathematics, 12 (2024), 2839. https://doi.org/10.3390/math12182839 doi: 10.3390/math12182839
    [28] U. Daraz, M. A. Alomair, O. Albalawi, A. S. Al Naim, New techniques for estimating finite population variance using ranks of auxiliary variable in two-stage sampling, Mathematics, 12 (2024), 2741. https://doi.org/10.3390/math12172741 doi: 10.3390/math12172741
    [29] Punjab Development Statistics, Bureau of Statistics, Government of the Punjab: Lahore, Pakistan, 2013. Available from: https://bos.punjab.gov.pk.
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(304) PDF downloads(41) Cited by(0)

Figures and Tables

Figures(4)  /  Tables(4)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog