Our study aimed to compare ordered ranked set sampling with moving extremes ranked set sampling in the context of type Ⅱ censoring. We focused on deriving Bayesian estimations and predictions using the linear exponential model. This analysis included various loss functions, such as squared error, Al-Bayyati, and general entropy. To evaluate the efficiency of the estimators we produced, we assessed their mean squared error and relative absolute bias. Additionally, we provide Bayesian point and interval predictions for the ordered future lifetime, considering both squared error and general entropy loss functions. To ensure the accuracy and effectiveness of these estimation and prediction methods, we conducted numerical tests using Monte Carlo simulations. Finally, we illustrated these theoretical concepts with a practical example that utilized real-world medical data.
Citation: Haidy A. Newer, Bader S Alanazi. Bayesian estimation and prediction for linear exponential models using ordered moving extremes ranked set sampling in medical data[J]. AIMS Mathematics, 2025, 10(1): 1162-1182. doi: 10.3934/math.2025055
Related Papers:
[1]
Haiping Ren, Ziwen Zhang, Qin Gong .
Estimation of Shannon entropy of the inverse exponential Rayleigh model under progressively Type-Ⅱ censored test. AIMS Mathematics, 2025, 10(4): 9378-9414.
doi: 10.3934/math.2025434
[2]
Abdulhakim A. Al-Babtain, Amal S. Hassan, Ahmed N. Zaky, Ibrahim Elbatal, Mohammed Elgarhy .
Dynamic cumulative residual Rényi entropy for Lomax distribution: Bayesian and non-Bayesian methods. AIMS Mathematics, 2021, 6(4): 3889-3914.
doi: 10.3934/math.2021231
[3]
Baria A. Helmy, Amal S. Hassan, Ahmed K. El-Kholy, Rashad A. R. Bantan, Mohammed Elgarhy .
Analysis of information measures using generalized type-Ⅰ hybrid censored data. AIMS Mathematics, 2023, 8(9): 20283-20304.
doi: 10.3934/math.20231034
[4]
Hassan Okasha, Mazen Nassar, Saeed A. Dobbah .
E-Bayesian estimation of Burr Type XII model based on adaptive Type-Ⅱ progressive hybrid censored data. AIMS Mathematics, 2021, 6(4): 4173-4196.
doi: 10.3934/math.2021247
[5]
Refah Alotaibi, Mazen Nassar, Zareen A. Khan, Wejdan Ali Alajlan, Ahmed Elshahhat .
Entropy evaluation in inverse Weibull unified hybrid censored data with application to mechanical components and head-neck cancer patients. AIMS Mathematics, 2025, 10(1): 1085-1115.
doi: 10.3934/math.2025052
[6]
Magdy Nagy, Khalaf S. Sultan, Mahmoud H. Abu-Moussa .
Analysis of the generalized progressive hybrid censoring from Burr Type-Ⅻ lifetime model. AIMS Mathematics, 2021, 6(9): 9675-9704.
doi: 10.3934/math.2021564
[7]
Essam A. Ahmed, Laila A. Al-Essa .
Inference of stress-strength reliability based on adaptive progressive type-Ⅱ censing from Chen distribution with application to carbon fiber data. AIMS Mathematics, 2024, 9(8): 20482-20515.
doi: 10.3934/math.2024996
[8]
Amal S. Hassan, Najwan Alsadat, Oluwafemi Samson Balogun, Baria A. Helmy .
Bayesian and non-Bayesian estimation of some entropy measures for a Weibull distribution. AIMS Mathematics, 2024, 9(11): 32646-32673.
doi: 10.3934/math.20241563
[9]
Abdullah Ali H. Ahmadini, Amal S. Hassan, Ahmed N. Zaky, Shokrya S. Alshqaq .
Bayesian inference of dynamic cumulative residual entropy from Pareto Ⅱ distribution with application to COVID-19. AIMS Mathematics, 2021, 6(3): 2196-2216.
doi: 10.3934/math.2021133
[10]
Bing Long, Zaifu Jiang .
Estimation and prediction for two-parameter Pareto distribution based on progressively double Type-II hybrid censored data. AIMS Mathematics, 2023, 8(7): 15332-15351.
doi: 10.3934/math.2023784
Abstract
Our study aimed to compare ordered ranked set sampling with moving extremes ranked set sampling in the context of type Ⅱ censoring. We focused on deriving Bayesian estimations and predictions using the linear exponential model. This analysis included various loss functions, such as squared error, Al-Bayyati, and general entropy. To evaluate the efficiency of the estimators we produced, we assessed their mean squared error and relative absolute bias. Additionally, we provide Bayesian point and interval predictions for the ordered future lifetime, considering both squared error and general entropy loss functions. To ensure the accuracy and effectiveness of these estimation and prediction methods, we conducted numerical tests using Monte Carlo simulations. Finally, we illustrated these theoretical concepts with a practical example that utilized real-world medical data.
1.
Introduction
Cost-effective sampling is a vital consideration in certain experiments, particularly when measuring a characteristic of interest that is expensive or time-consuming. Ranked set sampling (RSS) offers an efficient strategy to achieve observational economy by enhancing precision per sampling unit. Initially introduced by McIntyre [30], RSS was designed to improve the accuracy of the sample mean as an estimator of the population mean. Chen et al. [17] explored the theory and applications of RSS, while other studies by Chen et al. [18,19] identified various parametric estimations using RSS. Ali et al. [6] applied different inference methods for linear exponential distribution based on extreme RSS. Mohie El-Din et al. [22,23,24,25] conducted Bayesian estimation and prediction based on ordered RSS under type Ⅱ censored samples, demonstrating that the estimations and predictions derived from ordered RSS (ORSS) outperform those based on simple random samples (SRS). Recently, Bhushan et al. [11,12] investigated modified classes of estimators using RSS, providing valuable insights into the development of new estimation techniques. Additionally, Newer et al. [31] utilized various applications of RSS to derive inferences for the parameters of the Nadarajah-Haghighi distribution. However, the effectiveness of an estimator relies heavily on the accuracy of the ranking process. To address this issue, Al-Odat and Al-Saleh [3] introduced moving extremes ranked set sampling (MERSS), which aims to minimize ranking errors while preserving the optimality of the original ranked set method. The MERSS scheme can be outlined as follows:
1 Select m1 SRSs of sizes 1, 2, ..., m1.
2 Rank the elements of each sample through visual inspection or another cost-effective method, without directly measuring the characteristic of interest.
3 Accurately measure the maximum ordered observation from each set: the first set, the second set, the third set, and so forth.
4 Repeat steps 1–3 for an additional m2 sets of sizes 1, 2, ..., m2, this time measuring the minimum ordered observations instead of the maximum ones.
The total sample size for MERSS is n=m1+m2. The measured MERSS units are denoted as ωMERSS=(ω1:1,ω2:2,…,ωm1:m1,ω1:1,ω1:2,…,ω1:m2). Balakrishnan and Li [9] introduced the concept of ORSS by utilizing order statistics derived from independent and non-identically distributed (IND) random variables. They established optimal linear inference based on ORSS, where the values ωi:i are arranged in increasing order. For convenience, we denote the first r observations from type Ⅱ censored ordered MERSS (OMERSS) as zOMERSS=(z1≤z2≤…≤zr). The literature is abundant with studies focused on parameter estimation for various distributions using MERSS. For instance, Abu-Dayyeh and Al Sawi [1] examined the modified maximum likelihood estimator (MLE) of the mean for the exponential distribution under MERSS. Similarly, Al-Omari and Al-Hadhrami [2] investigated the properties of Bayesian estimators for the population mean of the normal distribution using MERSS. Furthermore, the authors in [4,5] analyzed the MLE of the mean for both exponential and normal distributions under MERSS, demonstrating that the MLEs derived from MERSS consistently outperformed those obtained through SRS in quantitative terms. For a more thorough understanding of MERSS, refer to the works of [17,18].
In applied statistics and reliability analysis, linear exponential distribution (LED) finds numerous applications. Broadbent [13] and Carbone et al. [15] utilized LED to examine survival patterns of patients with plasmacytic myeloma. Also known as the linear failure rate distribution, LED is well-regarded for modeling lifetime data in both reliability and medical studies, particularly effective in representing processes characterized by an increasing linear failure rate. In survival analysis, models featuring bathtub-shaped failure rates are highly sought after. However, LED falls short in providing an appropriate parametric fit for phenomena exhibiting decreasing, nonlinearly increasing, or non-monotonic failure rates—characteristics commonly observed in firmware reliability modeling and biological studies. For further insights into these topics, refer to the works of Lai and Xie [27], Lai et al. [28] and Zhang et al. [32]. The two-scale parameter LED, denoted as (α,λ), is defined by the following probability density function (pdf) and cumulative distribution function (cdf):
f(y)=(α+λy)exp(−αy−λ2y2),y≥0,(α,λ≥0),
(1.1)
F(y)=1−exp(−αy−λ2y2),y≥0,(α,λ≥0).
(1.2)
Notably, LED represents the distribution of the minimum of two independent random variables, v and w, where v follows an exponential distribution with parameter α, and w follows a Rayleigh distribution with parameter λ. Consequently, LED encompasses both exponential and Rayleigh distributions as special cases:
- When λ=0, LED simplifies to an exponential distribution.
- When α=0, it reduces to a Rayleigh distribution.
Testing how products fail under normal operating conditions is essential for reliability analysis. However, thorough testing can be extremely costly, especially for products known for their high reliability. When testing becomes too expensive and the product shows a strong level of reliability, it may not be practical to use complete samples for statistical analysis. To address these challenges of time and cost, statisticians have introduced various censoring methods. In life or quality tests, a random sample y1,y2,…,yn is taken from a distribution with cdf F(y). Instead of waiting for all n samples to fail, the experiment is stopped after the rth failure, where 1<r<n. The order statistics of the data can be arranged as y1:n≤y2:n≤⋯≤yr:n. This approach is known as type Ⅱ censored data, where only the smallest observed values are recorded. Type Ⅱ censoring is particularly beneficial when waiting for all n individuals to fail would take a long time, helping to save both time and resources. The number of censored samples is decided before the experiment starts. For more in-depth discussions, you can refer to sources [16,29,30] and other cited works within those references.
In many real-world scenarios, the ability to predict future observations is crucial. This is especially important in industrial applications and survival studies, where predicting ordered random variables helps assess the likelihood of producing defective items. In lifetime testing, both interval and point predictions are vital for identifying the most effective censoring methods. By analyzing the number of failed items during a life-test experiment, we can estimate the failure times of the remaining objects. Additionally, these predictions can guide decisions on whether to accelerate the life test and when the failure of a future item might signal the end of the test. The statistical literature shows that prediction problems have garnered significant interest from researchers in both theoretical and practical areas, as highlighted in the works of Mohie El-Din et al. [22,24,25].
Our study aims to compare ORSS with OMERSS under a type Ⅱ censoring scheme to derive Bayesian estimations for the parameters of the linear exponential model. We will also develop Bayesian point and interval predictions for the ordered future lifetime, taking into account both symmetric and asymmetric loss functions. The estimation and prediction results obtained from ORSS will be compared with those derived from OMERSS. To illustrate these theoretical concepts, we will use a specific example from the medical field that incorporates real-life data. The structure of this paper is organized as follows: Section 2 presents essential preliminary results relevant to our study. Section 3 details the derivation of Bayesian estimators for the parameters under various loss functions. Section 4 focuses on establishing Bayesian prediction bounds for LED using OMERSS. Section 5 includes a Monte Carlo simulation and a discussion to evaluate the accuracy of the estimation and prediction methods, along with an application to a real dataset. Section 6 concludes the study.
2.
Preliminary results
Here, let Z=(Z1,Z2,…,Zr), where z1≤z2≤…≤zr, represent the informative type Ⅱ censored sample consisting of the first r observations (out of n≥r) from OMERSS. Utilizing results for order statistics from IND random variables (refer to Balakrishnan [8]), the joint density function (JDF) of z can be expressed as:
Assuming that Zi follows the same distribution as the ith order statistic ωi:j, where i,j=1,2,…,mς and ς=1,2 with i≤j for for an SRS of size mς, the pdf and cdf of Zj are given by (see [7,20]):
By substituting the expressions from (2.4) and (2.5) into (2.1), it follows that the likelihood function (LF) for type Ⅱ censored OMERSS can be represented as
In this section, we illustrate how to derive the Bayesian estimators for the unknown parameters α and λ. We will explore Bayesian estimation methods under the assumption that the random variable (ℑ,ℵ) follows an exponential prior distribution. Using the realizations (α,λ), we will estimate the posterior density of (α,λ) based on the observed sample z. The prior distribution can be determined as follows:
ℏ(α,λ)=ηρexp(−αη−λρ),α,λ>0,(η,ρ>0).
(3.1)
The hyperparameters (η,ρ) were selected to reflect our prior knowledge of (α,λ). It is important to highlight that we chose an exponential family prior due to its flexibility and simplicity, which allows it to capture a wide range of the experimenter's prior beliefs. For more details, see [10]. The posterior density of (α,λ) can be expressed as follows, utilizing the likelihood function from Eq (2.10) and the prior density from Eq (3.1):
Choosing an appropriate loss function is crucial in Bayesian analysis. To comprehensively evaluate Bayesian estimates, we consider several types of loss functions: squared error (SR), linear exponential (LINEX), Al-Bayyati (AB), and general entropy (GE). The definitions of the SR, AB, and GE loss functions for the model parameter Θ are as follows:
ℓSR(Θ,˜Θ)=(Θ−˜Θ)2,
ℓLINEX(Θ,˜Θ)=ec(˜Θ−Θ)−c(˜Θ−Θ)−1,c≠0,
ℓAB(Θ,˜Θ)=Θa(Θ−˜Θ)2,a∈R,
ℓGE(Θ,˜Θ)=(˜ΘΘ)a−aln(˜ΘΘ)−1,
where ˜Θ is the estimate of Θ, and a in the AB and GE loss functions indicates the degree of asymmetry. In Bayesian analysis, defining an appropriate loss function is essential for determining the optimal estimate of an unknown parameter. Numerous loss functions have been developed in the literature to accommodate various loss structures. For further information, see Calabria and Pulcini [14] and EL-Sagheer et al. [21]. When a>0, a positive error (overestimation) (˜Θ>Θ) results in more significant consequences than a negative error (underestimation), leading to a highly skewed asymmetric loss function. Conversely, if a<0, the opposite is true. When a is close to zero, the estimates derived from the asymmetric loss function and the squared error (SR) loss function tend to be quite similar. As a result, asymmetric loss functions are particularly suitable for lifetime modeling. For instance, overestimating the survival function and failure rate function is often more detrimental than underestimating them. The Bayesian estimate of the model parameter T=T(Θ), based on the SR, AB, LINEX, and GE loss functions, can generally be expressed as follows:
˜TSR=E(T(Θ)|z)=∫ΘT(Θ)ℜ(Θ|z)dΘ,
where ℜ(Θ|z) is the posterior distribution of Θ given the observed data z,
where a is the asymmetry parameter for the GE loss function. These expressions represent the Bayesian estimates under various loss functions, taking into account the posterior distribution ℜ(Θ|z) of the unknown parameter Θ based on the observed data.
Under the SR loss function, the Bayes estimators of the unknown parameters (α,λ) are given by their respective posterior means. Specifically:
Based on the observed OMERSS in the two sample cases, we present the Bayesian prediction distribution for future order statistics.
4.1. Bayesian prediction bounds
Let ys, for s=1,…,r, represent a future independent type Ⅱ censored sample from the same population. Assume that y1≤y2≤…≤yr constitutes a type Ⅱ censored sample of size r obtained from a life test involving n items. Our objective is to develop a method for providing a Bayesian prediction for the sth ordered lifetime ys of a future sample of size r, based on the observed sample z from the same population. The pdf of ys is expressed as follows:
where r,n∑t_,ν_,ε,ι,ς=r,n∑t_,ν_,εs−1∑ι=01∑ς=0, Ω∗t_,ν_,ε,ι,ς(i_,n)=Ωt_,ν_,ε(i_)aι,s(n)Γ(r−ε−ς+2)Γ(ε+ς+1), and ϱ=(n+ι−s+1).
To establish the prediction boundaries for ys, it is essential to determine the predictive survival function p[ys>υ|z]. Therefore, based on Eq (4.3), we have:
where Φ(o,c)t_,ν_,ε,ι,ς(υ)=∫∞υe(ς+o)logys−cys(η+Ψt_,ν_(z)+ϱys)−(r−ε−ς+2)(ρ+Ψt_,ν_(z22)+ϱy2s2)−(ε+ς+1)dys.
A two-sided 100π% predictive interval for ys(1≤s≤r) is expressed as p[K(z)<ys<H(z)]=π, where K(z) and H(z) represent the lower and upper confidence limits, respectively. These limits satisfy the conditions: p[K(z)<ys]=1+π2 and p[ys>H(z)]=1−π2. In this context, obtaining analytical solutions for these limits is also challenging, which necessitates the use of an appropriate simulation method.
4.2. Bayesian predictor of ys
With ϕ∗s:n(ys|z) provided by (4.3), the two-sample Bayesian predictive pdf of ys under the SR function can be expressed as
To conduct Bayesian estimation and derive two-sample Bayesian prediction intervals based on type Ⅱ censoring for the ORSS and OMERSS procedures, a simulation study is carried out following these steps:
1) Set the hyperparameter values (η,ρ)=(0.5,0.3) to generate α=0.4999 and λ=0.2998 from ℏ(α,λ).
2) Generate n(=3,4,5) SRSs from the LED using the transformation: Zi:j=√(αλ)2−2λln(1−Ui:j)−αλ, for i,j=1,…,n, where Ui:j follows a uniform distribution on the interval (0,1). Next, apply the procedures for RSS as described in [31] and for MERSS with m1(=1,2,3) and m2(=2) as outlined in Section 1. This process will yield the RSS and MERSS, which can then be ordered to obtain the ORSS and OMERSS, respectively.
3) Calculate the Bayes estimates of α and λ using the SR, AB, LINEX, and GE loss functions as outlined in Section 3.
4) Repeat Steps 2 and 3 m=1000 times to calculate the average estimates (AV(ˆϑ)=1m∑mi=1ˆϑi), the mean squared error (MSE(ˆϑ)=1m∑mi=1(ϑ−ˆϑi)2), and the relative absolute bias (RAB(ˆϑ)=1m∑mi=1|ˆϑi−ϑ|ϑ), where ˆϑ represents an estimate of ϑ=(α,λ).
5) Display the computational results of the Bayes estimates derived based on the ORSS and OMERSS methods in Table 1.
Table 1.
Bayesian estimates of α and λ, along with their associated AVs, MSEs, and RABs, using both ORSS (Case Ⅰ) and OMERSS (Case Ⅱ).
6) For the two-sample Bayesian prediction of yı, where ı=1,⋯,s, set the hyperparameter values (η,ρ)=(1.5,1.3) to generate α=1.5005 and λ=1.3052 from ℏ(α,λ). Subsequently, obtain ORSS and OMERSS samples of size n=3,4,5 from LED.
7) Use the results from Section 3 to construct (95%) two-sample Bayesian prediction intervals and the Bayes predictive estimate for yı, where ı=1,⋯,s, under the SR, GE, and LINEX functions. The results are summarized in Table 2 for ORSS and OMERSS, respectively.
Table 2.
Bayesian predictive estimators and bounds for ys, s=1,⋯,r, using ORSS (Case Ⅰ) and OMERSS (Case Ⅱ).
˜ys
SR
LINEX
GE
Case
n
r
s
c=0.1
c=−0.1
a=−1.6
a=−0.6
ϵL
ϵU
Ⅰ
3
2
1
0.2671
0.2631
0.2712
0.3400
0.2186
0.1671
0.3671
2
0.7324
0.7165
0.7496
0.8502
0.6551
0.6324
0.8324
4
2
1
0.2345
0.2316
0.2375
0.2967
0.1927
0.1345
0.3345
2
0.6518
0.6398
0.6646
0.7524
0.5850
0.5518
0.7518
3
1
0.2978
0.2936
0.3022
0.3705
0.2475
0.1978
0.3978
2
0.6702
0.6593
0.6816
0.7604
0.6086
0.5702
0.7702
3
1.2880
1.2590
1.3200
1.4180
1.2030
1.1880
1.3880
5
2
1
0.1702
0.1687
0.1718
0.2155
0.1399
0.0702
0.2702
2
0.4896
0.4827
0.4969
0.5664
0.4386
0.3896
0.5896
3
1
0.1386
0.1376
0.1397
0.1752
0.1141
0.0386
0.2354
2
0.3354
0.3323
0.3386
0.3856
0.3017
0.5993
0.2386
3
0.6993
0.6893
0.7098
0.7790
0.6468
0.4354
0.7993
4
1
0.3052
0.3019
0.3086
0.3640
0.2621
0.2052
0.5011
2
0.6011
0.5948
0.6076
0.6611
0.5586
0.8554
1.4150
3
0.9554
0.9446
0.9667
1.0210
0.9103
0.4052
0.7011
4
1.5150
1.4910
1.5410
1.6070
1.4560
1.0550
1.6150
Ⅱ
3
2
1
0.5255
0.5129
0.5391
0.6498
0.4406
0.4255
0.6255
2
1.2980
1.2560
1.3460
1.4790
1.1800
1.1980
1.3980
4
2
1
0.4187
0.4100
0.4279
0.5243
0.3474
0.3187
0.5187
2
1.0860
1.0540
1.1210
1.2470
0.9802
0.9855
1.1860
3
1
0.3111
0.3059
0.3166
0.3944
0.2556
0.2111
0.4111
2
0.7109
0.6965
0.7264
0.8210
0.6385
0.6109
0.8109
3
1.3860
1.3460
1.4330
1.5520
1.2800
1.2860
1.4860
5
2
1
0.2911
0.2867
0.2958
0.3681
0.2395
0.1911
0.3911
2
0.7981
0.7801
0.8176
0.9216
0.7166
0.6981
0.8981
3
1
0.3746
0.3687
0.3807
0.4569
0.3168
0.2746
0.4746
2
0.8109
0.7966
0.8262
0.9098
0.7441
0.7109
0.9109
3
1.5020
1.4640
1.5440
1.6450
1.4100
1.4020
1.6020
4
1
0.2564
0.2533
0.2596
0.3181
0.2139
0.1564
0.3564
2
0.5463
0.5391
0.5538
0.6189
0.4970
0.4463
0.6463
3
0.9225
0.9080
0.9380
1.0120
0.8637
0.8225
1.0230
4
1.5570
1.5210
1.5980
1.6910
1.4720
1.4570
1.6570
Notice that Lϵ=|ϵU−ϵL|=0.2 is the length prediction bound.
1) The AVs, APRs, MSEs, and RABs for ORSS and OMERSS decrease as r and n increase.
2) For fixed r and n, the AVs, MSEs, and RABs of the GE and LINEX loss functions increase with increasing a and c.
3) The Bayes estimates of λ and α from OMERSS outperform those from ORSS in terms of AVs, APRs, MSEs, and RABs.
4) The Bayes estimates derived from the LINEX, AB, and GE (with c=−0.1,a=−0.1,−0.6) loss functions for λ and α are superior to those obtained using the LINEX, AB, and GE (with c=0.1,a=0.3,−1.6) loss functions, based on AVs, MSEs, and RABs.
5) The Bayes estimates for λ from both ORSS and OMERSS are more accurate than those for α in terms of AVs, APRs, MSEs, and RABs.
6) According to Table 2, the prediction estimators ˜ys and their prediction intervals increase as s rises for fixed n and r but decrease as n and r increase for fixed s, applicable to both ORSS and OMERSS.
7) The predictive intervals from OMERSS demonstrate better results compared to those from ORSS in all scenarios analyzed.
8) The Bayes predictive estimates for ˜ys, s=1,…,r, based on the LINEX and GE (with c=−0.1,a=−0.6) loss functions outperform those from LINEX and GE (with c=0.1,a=−1.6) and from the SR loss function.
9) The Bayes predictive estimates obtained from OMERSS are more effective than those derived from ORSS.
10) The choice of small sample sizes n=3,4,5 was deliberate and based on several considerations:
- Practical relevance: In many real-world applications, especially in medical and reliability studies, obtaining large sample sizes can be challenging due to cost, time, and ethical constraints. Our choice of smaller sample sizes reflects the practical limitations often encountered in such fields.
- Demonstration of methodology: The primary goal of our simulation study is to demonstrate the effectiveness and applicability of our Bayesian estimation and prediction methods under type Ⅱ censoring. Using smaller sample sizes allows us to clearly illustrate the methodology and its performance in a controlled setting.
- Computational efficiency: Smaller sample sizes enable us to conduct a more extensive simulation study with a larger number of replications (1000 replications in our case). This ensures that our results are statistically robust and reliable, even with smaller sample sizes.
- Comparative analysis: By using smaller sample sizes, we can effectively compare the performance of our methods (ORSS and OMERSS) under different conditions. This comparative analysis is crucial for understanding the strengths and limitations of our approaches.
11) The series involved in the derivation of Bayesian estimators under various loss functions (e.g., squared error, linear exponential, Al-Bayyati, and general entropy) have been verified for convergence. This ensures that the estimators are well-defined and reliable.
12) The series used in the calculation of Bayesian prediction bounds and predictive distributions have also been checked for convergence. This ensures that the prediction intervals and estimates are accurate and meaningful.
13) In our simulation study, we have ensured that the series used in the calculations are convergent for the sample sizes and parameters considered. This guarantees the robustness and reliability of our simulation results.
Based on the observations mentioned, we suggest utilizing the Bayes GE (with a=−0.6) loss function for estimating the parameters, as well as the Bayes GE (with a=−0.6) loss function for predicting future ordered observations in the context of OMERSS under type Ⅱ censoring.
The traditional sampling methods such as SRS may not be as efficient as RSS and MERSS in terms of reducing the mean squared error and relative absolute bias (Al-Saleh and Al-Hadrami [4]; Chen et al. [17]). Maximum likelihood estimation, while widely used, can be less robust in the presence of censored data compared to our Bayesian approaches (Lai and Xie [27]). Other Bayesian methods may not incorporate the same loss functions or prior distributions, which can affect the accuracy and reliability of the estimates (Berger [10]). Our methods, particularly under type Ⅱ censoring, demonstrate superior performance in these aspects.
The SR loss function is commonly used due to its simplicity and ease of mathematical tractability. The LINEX loss function is chosen for its ability to handle asymmetric losses, which is particularly useful in scenarios where overestimation and underestimation have different costs. The AB loss function is selected for its flexibility in incorporating different degrees of asymmetry, making it suitable for a wide range of applications. The GE loss function is chosen for its robustness and ability to handle different types of data distributions. We have compared these loss functions with alternative ones, such as the absolute error loss function. The absolute error loss function is simpler but less flexible in handling asymmetric losses compared to the LINEX and AB loss functions. The SR loss function, while widely used, may not be as robust as the GE loss function in certain scenarios. Our selection of loss functions is justified by their ability to provide more accurate and reliable estimates in the context of our study. All series involved in the equations and calculations presented in this manuscript have been checked for convergence. This ensures the validity and reliability of the Bayesian estimators and prediction bounds derived in our study.
5.3. Application of LED distribution to real data
In this section, we employ a real dataset from the medical domain to demonstrate the theoretical results presented in earlier sections. The dataset consists of the times reported by a group of 20 patients who received an analgesic and subsequently felt better. The data, derived from Gross and Clark [26], includes the following values: 1.1, 1.4, 1.3, 1.7, 1.9, 1.8, 1.6, 2.2, 1.7, 2.7, 4.1, 1.8, 1.5, 1.2, 1.4, 3.0, 1.7, 2.3, 1.6, and 2.0. To evaluate whether LED distribution with cdf (1.2) is a suitable fit for the data, we perform a Kolmogorov-Smirnov (K-S) test and calculate the associated p-value. From the data and cdf (1.2), we estimate the scale parameters λ and α by maximizing the likelihood function, resulting in ˆλ=0.0240 and ˆα=0.4900. The K-S test statistic is computed to be 0.2340, with a corresponding p-value of 0.3307. The computed p-value of over 0.05 suggests a strong fit between the actual dataset and LED with cdf (1.2). This is further illustrated in Figure 1, which displays the histogram, empirical cdf, and pdf (1.1) along with cdf (1.2) of the dataset, indicating a good correspondence between the fitted LED and the empirical data. We have explored the potential application of our findings to other types of distributions. For instance, the Weibull distribution is widely used in reliability and survival analysis due to its flexibility in modeling different types of failure rates. The gamma distribution is commonly applied in fields such as queuing theory and hydrology, while the log-normal distribution is useful in modeling skewed data in economics and biology. By adapting our Bayesian estimation and prediction methods to LED, we can enhance the precision and reliability of statistical analyses in these areas.
Figure 1.
Histogram and empirical cdf (in red) compared to the pdf and cdf (in blue) of the LED for the given dataset.
In the RSS method (Scheme Ⅰ), we begin by randomly selecting n=6 SRSs, each of equal size. The items within each sample are ranked based on the variable of interest. The smallest observation is measured from the first set. Subsequently, a second SRS of size 6 is randomly selected without replacement from the remaining data, and the second smallest observation is recorded from this set. This process continues until the sixth SRS of size 6 is selected without replacement. As a result, we obtain the RSS values: (1.2, 1.5, 1.7, 2.2, 2.7, 2.0). For the MERSS method (Scheme Ⅱ), we first randomly select m1=4 SRSs of sizes 1, 2, ..., 4. The maximum ordered observation from each of these sets is accurately measured. We then repeat this procedure for an additional m2=2 sets of sizes 1 and 2, this time measuring the minimum ordered observations instead of the maximum ones. The total sample size for MERSS is n=6, yielding the MERSS values: (1.2, 1.5, 1.7, 2.2, 1.4, 1.3). Consequently, we can derive ORSS and OMERSS by arranging the RSS and MERSS values in ascending order. Table 4 presents the Bayesian estimates of the parameters α and λ for both the ORSS (Case Ⅰ) and OMERSS (Case Ⅱ) methods. The estimates are provided for different censoring levels r=2,3,4 and sample size n=6. The table includes estimates under various loss functions: LINEX, AB, and GE. The results indicate that the OMERSS method generally provides more accurate estimates (lower MSE and RAB) compared to the ORSS method, especially for higher censoring levels. This suggests that OMERSS is more robust and reliable for parameter estimation in the context of the linear exponential model. Table 5 presents the Bayesian predictive estimators and bounds for the real dataset using both the ORSS (Case Ⅰ) and OMERSS (Case Ⅱ) methods. The table includes predictive estimates and bounds for different censoring levels r and sample size n=6. The results show that the OMERSS method generally provides more accurate predictive estimates and tighter prediction bounds compared to the ORSS method. This is evident from the lower values of ϵL and ϵU for OMERSS, indicating better predictive performance. The predictive estimates under different loss functions (GE and LINEX) also demonstrate the robustness of the OMERSS method in predicting future observations.
Table 4.
Bayesian estimates based on the real dataset for n=6, η=0.5, and ρ=0.3, using both ORSS (Case Ⅰ) and OMERSS (Case Ⅱ).
The performance of Bayes estimations and predictions based on ORSS and OMERSS for the linear exponential distribution has been examined. OMERSS offers several practical advantages, including improved estimation accuracy, reduced mean squared error, and enhanced robustness in the presence of censored data. OMERSS can be more efficient than traditional sampling methods such as SRS and RSS, particularly in scenarios where the data is skewed or follows a complex distribution. The MERSS technique allows for more flexible and adaptive sampling, which can lead to better performance in various applications. Additionally, OMERSS can provide more reliable estimates in situations where the data is subject to type Ⅱ censoring, making it a valuable tool for statistical inference in a wide range of fields. Under type Ⅱ censoring, the Bayes estimates of the unknown parameters α and λ—using SR, AB, LINEX, and GE loss functions—and the predictive estimates for future observations have been established. A real dataset from the medical field has been utilized to illustrate the theoretical concepts discussed in this study. Additionally, a Monte Carlo simulation was performed to assess the effectiveness and accuracy of the estimation and prediction methods. Based on the numerical findings, we recommend using the Bayes method for estimation and prediction based on OMERSS in the context of the specified censoring scenarios.
Author contributions
Haidy A. Newer: Writing–original draft, Writing–review and editing, Validation, Investigation, Formal analysis, Methodology, Software, Visualization, Supervision; Bader S Alanazi: Funding acquisition. All authors have approved the final version of the manuscript for publication.
Use of Generative-AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
The authors extend their appreciation to the Deanship of Scientific Research at Northern Border University, Arar, KSA, for funding this research work through the project number "NBU-FPEJ-2025-1253-01".
Conflict of interest
The authors declare no conflict of interest.
References
[1]
W. Abu-Dayyeh, E. Al Sawi, Modified inference about the mean of the exponential distribution using moving extreme ranked set sampling, Stat. Papers, 50 (2009), 249–259. https://doi.org/10.1007/s00362-007-0072-5 doi: 10.1007/s00362-007-0072-5
[2]
A. Al-Omari, S. Al-Hadhrami, Bayes estimation of the mean of normal distribution using moving extreme ranked set sampling, Pak. J. Stat. Oper. Res., 8 (2012), 21–30. https://doi.org/10.18187/pjsor.v8i1.227 doi: 10.18187/pjsor.v8i1.227
[3]
M. T. Al-Odat, M. F. Al-Saleh, A variation of ranked set sampling, Journal of Applied Statistical Science, 10 (2001), 137–146.
[4]
M. F. Al-Saleh, S. A. Al-Hadhrami, Estimation of the mean of the exponential distribution using moving extremes ranked set sampling, Stat. Papers, 44 (2003), 367–382. https://doi.org/10.1007/s00362-003-0161-z doi: 10.1007/s00362-003-0161-z
[5]
M. F. Al-Saleh, S. A. Al-Hadrami, Parametric estimation for the location parameter for symmetric distributions using moving extremes ranked set sampling with application to trees data, Environmetrics, 14 (2003), 651–664. https://doi.org/10.1002/env.610 doi: 10.1002/env.610
[6]
H. S. Ali, M. M. Mohie El-Din, S. M. Elarishy, H. A. Newer, Inference for linear exponential distribution based on extreme ranked set sampling, Pak. J. Stat., 39 (2023), 415–432.
[7]
B. C. Arnold, N. Balakrishnan, H. N. Nagaraja, A first course in order statistics, Society for Industrial and Applied Mathematics, 2008. https://doi.org/10.1137/1.9780898719062
[8]
N. Balakrishnan, Permanents, order statistics, outliers, and robustness, Revista Matemática Complutense, 20 (2007), 7–107.
[9]
N. Balakrishnan, T. Li, Ordered ranked set samples and applications to inference, J. Stat. Plan. Infer., 138 (2008), 3512–3524. https://doi.org/10.1016/j.jspi.2005.08.050 doi: 10.1016/j.jspi.2005.08.050
S. Bhushan, A. Kumar, S. Shahab, S. A. Lone, S. A. Almutlak, Modified class of estimators using ranked set sampling, Mathematics, 10 (2022), 3921. https://doi.org/10.3390/math10213921 doi: 10.3390/math10213921
[12]
S. Bhushan, A. Kumar, S. A. Lone, On some novel classes of estimators under ranked set sampling, Alex. Eng. J., 61 (2022), 5465–5474. https://doi.org/10.1016/j.aej.2021.11.001 doi: 10.1016/j.aej.2021.11.001
R. Calabria, G. Pulcini, Point estimation under asymmetric loss functions for left-truncated exponential samples, Commun. Stat.-Theor. Meth., 25 (1996), 585–600. https://doi.org/10.1080/03610929608831715 doi: 10.1080/03610929608831715
[15]
P. P. Carbone, L. E. Kellerhouse, E. A. Gehan, Plasmacytic myeloma: a study of the relationship of survival to various clinical manifestations and anomalous protein type in 112 patients, The American Journal of Medicine, 42 (1967), 937–948. https://doi.org/10.1016/0002-9343(67)90074-5 doi: 10.1016/0002-9343(67)90074-5
W. Chen, M. Xie, M. Wu, Parametric estimation for the scale parameter for scale distributions using moving extremes ranked set sampling, Stat. Probabil. Lett., 83 (2013), 2060–2066. https://doi.org/10.1016/j.spl.2013.05.015 doi: 10.1016/j.spl.2013.05.015
[19]
W. Chen, M. Xie, M. Wu, Modified maximum likelihood estimator of scale parameter using moving extremes ranked set sampling, Commun. Stat.-Simul. Comput., 45 (2016), 2232–2240. https://doi.org/10.1080/03610918.2014.904520 doi: 10.1080/03610918.2014.904520
R. M. EL-Sagheer, M. S. Eliwa, K. M. Alqahtani, M. EL-Morshedy, A. Sajid, Asymmetric randomly censored mortality distribution: Bayesian framework and parametric bootstrap with application to COVID-19 data, J. Math., 2022 (2022), 8300753. https://doi.org/10.1155/2022/8300753 doi: 10.1155/2022/8300753
[22]
M. M. M. El-Din, M. S. Kotb, E. F. Abd-Elfattah, H. A. Newer, Bayesian inference and prediction of the Pareto distribution based on ordered ranked set sampling, Commun. Stat.-Theor. Meth., 46 (2017), 6264–6279. https://doi.org/10.1080/03610926.2015.1124118 doi: 10.1080/03610926.2015.1124118
[23]
M. M. M. El-Din, M. S. Kotb, H. A. Newer, Inference for linear exponential distribution based on record ranked set sampling, Journal of Statistics Applications & Probability, 10 (2021), 512–524. https://doi.org/10.18576/jsap/100219 doi: 10.18576/jsap/100219
[24]
M. M. M. El-Din, M. S. Kotb, H. A. Newer, Bayesian estimation and prediction of the Rayleigh distribution based on ordered ranked set sampling under type-Ⅱ doubly censored samples, Journal of Statistics Applications & Probability Letters, 8 (2021), 83–95. https://doi.org/10.18576/jsapl/080202 doi: 10.18576/jsapl/080202
[25]
M. M. M. El-Din, M. S. Kotb, H. A. Newer, Bayesian estimation and prediction for Pareto distribution based on ranked set sampling, Journal of Statistics Applications & Probability, 4 (2015), 1–11.
[26]
A. J. Gross, V. A. Clark, Survival distributions: reliability applications in the biomedical sciences, New York: Wiley, 1975.
[27]
C. D. Lai, M. Xie, Bathtub shaped failure rate life distributions, In: Stochastic ageing and dependence for reliability, New York, NY: Springer, 2006, 71–107. https://doi.org/10.1007/0-387-34232-X_3
[28]
C. D. Lai, M. Xie, D. N. P. Murthy, Bathtub-shaped failure rate life distributions, In: Handbook of statistics Vol. 20, Elsevier, 2001, 69–104. https://doi.org/10.1016/S0169-7161(01)20005-4
G. A. McIntyre, A method for unbiased selective sampling. using ranked sets, Australian Journal of Agricultural Research, 3 (1952), 385–390. https://doi.org/10.1071/AR9520385 doi: 10.1071/AR9520385
[31]
H. A. Newer, M. M. M. El-Din, H. S. Ali, I. Al-Shbeil, W. Emam, Statistical inference for the Nadarajah-Haghighi distribution based on ranked set sampling with applications, AIMS Math., 8 (2023), 21572–21590. https://doi.org/10.3934/math.20231099 doi: 10.3934/math.20231099
[32]
T. Zhang, M. Xie, L. C. Tang, S. H. Ng, Reliability and modeling of systems integrated with firmware and hardware, Int. J. Reliab. Qual. Safety Eng., 12 (2005), 227–239. https://doi.org/10.1142/S021853930500180X doi: 10.1142/S021853930500180X
This article has been cited by:
1.
Haidy A. Newer, Bader S Alanazi,
Bayesian inference for the Rayleigh distribution using ordered extreme k-records ranked set sampling with random sample sizes,
2025,
125,
11100168,
214,
10.1016/j.aej.2025.03.081
2.
Haidy A. Newer,
Statistical inference for the generalized exponential distribution using ordered lower k-record ranked set sampling with random sample sizes,
2025,
15,
2045-2322,
10.1038/s41598-025-01995-z
Haidy A. Newer, Bader S Alanazi. Bayesian estimation and prediction for linear exponential models using ordered moving extremes ranked set sampling in medical data[J]. AIMS Mathematics, 2025, 10(1): 1162-1182. doi: 10.3934/math.2025055
Haidy A. Newer, Bader S Alanazi. Bayesian estimation and prediction for linear exponential models using ordered moving extremes ranked set sampling in medical data[J]. AIMS Mathematics, 2025, 10(1): 1162-1182. doi: 10.3934/math.2025055