1.
Introduction
In 1995, Kamps presented the notion of generalized order statistics (GOS), which is the unification of different models of ascendingly ordered random variables (RVs). The GOS incorporates significant and well-known concepts that have been discussed individually in the statistical literature. Many models of ascendingly ordered RVs, such as sequential order statistics, progressive Type-Ⅱ (PT-Ⅱ) censored order statistics, ordinary order statistics (OOS), record values, and Pfeifer's record model, are theoretically contained in the GOS model.
Assume F(.) to be an arbitrary continuous cumulative distribution function (CDF) with probability density function (PDF) f(.). Assume also k>0, n∈N, and ˜m=(m1,m2,⋯,mn−1)∈ℜn−1 to be the parameters such that γn=k and γi=k+n−i+Mi, for i=1,⋯,n−1, where Mi=∑n−1ι=imι. Then, the RVs Xi:n,˜m,k,i=1,⋯,n, are said to be GOS, if their joint PDF is given by
where ˉF(.)=1−F(.) and F−1(0)<x1≤⋯≤xn<F−1(1).
Several models of arranged RVs can be considered special instances of GOS. m1=m2=⋯=mn−1=m; γi=k+(n−i)(m+1), i=1,⋯,n corresponds to m-generalized order statistics (m-GOS), γi=n−i+1 (mi=0,k=1) corresponds to OOS, and mi=−1;γi=k, i=1,⋯,n, k∈N corresponds to k-recored values. Also, for mi=Ri, n=m0+∑m0ν=1Rν, Rν∈N, and γi=n−∑i−1ν=1Rν−i+1, 1≤i≤m0, where m0 denotes the fixed number of failure of units to be observed, the model reduces to PT-Ⅱ censored order statistics.
Under the condition γi≠γj, i,j=1,⋯,n−1,i≠j, Kamps and Cramer [23] derived the PDF of Xr:n,˜m,k, 1≤r≤n as
and the joint PDF of Xr:n,˜m,k and Xs:n,˜m,k, r,s=1,⋯,n,r<s as
where Cr−1=∏ri=1γi, ai,r=∏rı=1ı≠i1γı−γi,1≤i≤r≤n, and a(r)j,s=∏sȷ=r+1ȷ≠j1γȷ−γj, r+1≤j≤s≤n.
It can be shown that for m1=m2=⋯=mn−1=m≠−1 (Khan and Khan [24]),
and
Therefore, the PDF of Xr:n,˜m,k given in (1.2) reduces to
and the joint PDF of Xr:n,˜m,k and Xs:n,˜m,k given in (1.3) reduces to
where Cr−1=∏ri=1γi, γi=k+(n−i)(m+1), hm(x)={−1m+1(1−x)m+1,m≠−1,−ln(1−x),m=−1, and gm(x)=hm(x)−hm(0), x∈[0,1) (see Kamps [22]).
David [8] introduced the concept of concomitants of order statistics (COS), but Yang [47] described the general theory of COS. Concomitants are important in selection and prediction issues, ranked set sampling, parameter estimation, and the characterization of parent bivariate distributions. For a brief overview of the uses of the concomitants of ordered RVs, see Veena and Thomas [46] and the references therein. For a review of fundamental findings on COS, see Daivd and Nagaraja [9]. Furthermore, for some of the recent works on COS, we refer to Philip and Thomas [36,37,38], Kumar et al. [29], Barakat et al. [3], and Koshti and Kamalja [28].
Several authors have investigated the concomitants of GOS (CGOS), including Ahsanullah and Nevzorov [1], Beg and Ahsanullah [4], El-Din et al. [13], Domma and Giordano [11], Hanif and Shahbaz [15], Shahbaz and Shahbaz [40], Tahmasebi et al. [44], Alawady et al. [2], and Kamal et al. [20]. Let (Xi,Yi), i=1,⋯,n be a random sample from a bivariate distribution function FX,Y(x,y). When the X-variates are ordered in ascending order as X1:n,˜m,k≤X2:n,˜m,k≤X3:n,˜m,k≤⋯≤Xn:n,˜m,k, then Y-variates paired (not necessarily in ascending order) with these GOS are called the CGOS and are indicated by Y[r:n,˜m,k], r=1,⋯,n. The PDF of Y[r:n,˜m,k] is given by (Ahsanullah and Nevzorov, [1])
where f(y|x) is the conditional PDF of Y given X and fr:n,˜m,k(x) is defined in (1.2).
Moreover, the joint PDF of Y[r:n,˜m,k] and Y[s:n,˜m,k], r,s=1,⋯,n,r<s is given by
where fr,s:n,˜m,k(x1,x2) is given in (1.3).
One of the most notable applications of COS is in ranked set sampling (RSS). RSS is considered a beneficial sampling strategy for improving estimation efficiency and precision if the variable under study is expensive to measure or difficult to get, yet inexpensive and simple to rank. RSS was proposed by McIntyre [31] and then supported by Takahasi and Wakimoto [45] through mathematical theory. The procedure for RSS is described as follows:
1) Randomly choose n2 units from the population under study, then divide them into n sets of n units.
2) Order the elements of each set without making actual measurements.
3) Choose and quantify the ith minimum from the ith set, i=1,⋯,n, to create a new set of size n, known as the RSS.
4) If a large sample size is required, repeat the above three steps d times (cycles) until a sample of size nd is obtained.
For a comprehensive review of the theory and applications of RSS, see Chen et al. [7]. In some practical applications, the study variable, say, Y, is more difficult to measure, whereas an auxiliary variable X associated with Y is easily quantifiable and may be precisely arranged. In this situation, Stokes [42] created another RSS technique, which is as follows:
1) At random, choose n independent bivariate sets of size n.
2) Take note of the value of the auxiliary variable on each of these units.
3) From the ith set of size n, choose the variable Y associated with the ith smallest X, i=1,⋯,n.
The resulting set of n units is known as the RSS. Consider (X(i:n)i,Y[i:n]i), i=1,⋯,n to be the pair chosen from the ith set, where X(i:n)i is the ith order statistics of the auxiliary variate in the ith set and Y[i:n]i is the measurement made on the Y variate associated with X(i:n)i. Y[i:n]i is obviously the concomitant of the ith order statistics resulting from the ith sample. Numerous authors in the literature have considered the estimation of parameters of the various bivariate distributions using RSS and its modifications. Some work in this area is by Chacko and Thomas [5], Philip and Thomas [36,37], Koshti and Kamalja [26], Irshad et al. [16,17], and Dong et al. [12].
COS and higher moments of multivariate distributions have received a lot of attention in recent years. Most of the literature on concomitants is concentrated on symmetric distributions such as multivariate normal (Sheikhi et al., [41]; Chaumette and Vincent, [6]) or multivariate elliptical (Jamalizadeh and Balakrishnan, [18]). Skewed distributions have gained a lot of interest recently in the literature since many datasets encountered in reality have some degree of skewness. In this regard, the distribution theory of COS from skew distributions has been investigated by several authors, including Hanif and Shahbaz [15], Shahbaz and Shahbaz [40], Tahmasebi et al. [44], Shahbaz et al. [39], and Kamal et al. [20]. In this article, we consider the bivariate generalized Weibull (BGW) distribution and the CGOS arising from it. There are numerous reasons for considering this particular bivariate distribution. Due to the presence of four parameters, the joint PDF of the BGW distribution is quite flexible and can take on various shapes depending on the shape parameter. The joint PDF, joint CDF, and conditional PDF for the BGW distribution are all in closed forms, making them appropriate for usage in practice. The univariate marginals of this distribution are able to analyze various types of hazard rates. In addition, it can be utilized for modeling bivariate lifetime data in a variety of scenarios. So far, no results on CGOS arising from the BGW distribution have been found in the literature. Thus, the current study aims to develop the distribution theory of CGOS originating from the BGW distribution and apply it to associated inference problems.
The article is structured as follows: In Section 2, we provide a brief overview of the BGW distribution and some of its properties. In Section 3, we present the marginal PDF as well as the explicit expressions for the single moments of CGOS from the BGW distribution. The joint PDF of CGOS from the BGW distribution is also obtained in Section 3. Furthermore, the explicit expressions for the product moments of CGOS are derived. Section 4 presents the best linear unbiased (BLU) estimator of the parameter of the study variable contained in the BGW distribution using Stokes's RSS and some of the other modified RSS schemes. In Section 5, we apply the results to a real dataset. In Section 6, conclusions are provided.
2.
BGW distribution
A bivariate RV (X,Y) is said to follow a BGW distribution if its PDF is given by (Pathak et al. [34])
where x,y≥0,α,β1,β2>0, 0<θ≤1, ω(x,y;ϕ)=xαβ1+yαβ2, and ϕ=(α,β1,β2). The BGW distribution includes the bivariate generalized exponential distribution (refer to Mirhosseini et al. [32]) and the bivariate generalized Rayleigh distribution (refer to Pathak and Vellaisamy [35]) as sub-models. The conditional PDF of Y given X=x is (Pathak et al. [34])
The RV X∼EW(α,β1,θ) is a member of the exponentiated Weibull (EW) distribution with PDF
and CDF
Similarly, Y∼EW(α,β2,θ). A series expansion of the PDF of the BGW distribution is given by
Pathak et al. [34] showed that the product moments of the BGW distribution are
If we make the transformation, U=Xβ⋆1 and V=Yβ⋆2, β⋆i=β1/αi,i=1,2, the standard BGW distribution has the joint PDF as
It is clear that the variables U and V have the standard EW distribution as marginal functions with PDFs are, respectively, given by
3.
Distribution theory of CGOS from BGW distribution
In this part, we obtain the distributions and moments of CGOS arising from the BGW distribution.
3.1. Marginal PDF and single moments of CGOS
Suppose (Xi,Yi) and (Ui,Vi) are random samples of size n each originating from the BGW distribution and the standard BGW distribution, with PDFs provided by (2.1) and (2.7), respectively. Let V[r:n,˜m,k] be the concomitant of the rth GOS Ur:n,˜m,k. Then, the PDF and the pth moments of V[r:n,˜m,k], r=1,⋯,n are given by the following two theorems:
Theorem 1 If V[r:n,˜m,k] is the concomitant of the rth GOS from the standard BGW distribution, then the PDF of V[r:n,˜m,k], for r=1,⋯,n, is given by
where δθ(j,γi)=(θj)∑γi−1τ=0(−1)j+τ+1(γi−1τ)B(j,θτ+1) and B(.,.) is the complete beta function.
Proof. Using the PDF of Ur:n,˜m,k (1.2) in (1.6), the PDF of the rth CGOS V[r:n,˜m,k] is given as
In view of (2.7) and (2.8), we get
where z=uα. Now, by using Eq (3.3121) in Gradshteyn and Ryzhik [14] to compute the integral in (3.3), we obtain the result given in (3.1). □
Corollary 1. Taking m1=m2=⋯=mn−1=m≠−1 in (3.1), the PDF of the rth concomitant of m-GOS from the standard BGW distribution is given by
where γi=k+(n−i)(m+1).
Remark 1. When m=0 and k=1 in (3.4), we get the PDF of the rth COS from the standard BGW distribution as
where Cr:n=n!(r−1)!(n−r)!.
Theorem 2. Under the conditions of Theorem 1, the pth moment of V[r:n,˜m,k] is
Proof. Using (3.1), the pth moment of V[r:n,˜m,k] is given as
where z=vα. Then, after integration, we get (3.6). □
Corollary 2. Taking m1=m2=⋯=mn−1=m≠−1 in (3.6), the pth moment of the concomitant of m-GOS is given by
Remark 2. Let m=0 and k=1 in (3.8), then the pth moment of COS is
3.2. Joint PDF and product moments of CGOS
Let V[r:n,˜m,k] and V[s:n,˜m,k], r,s=1,⋯,n,r<s be the concomitants of the rth and sth GOS from the standard BGW distribution. Then, the joint PDF and the product moments of V[r:n,˜m,k] and V[s:n,˜m,k] are given by the following two theorems:
Theorem 3. The joint PDF of concomitants V[r:n,˜m,k] and V[s:n,˜m,k], r,s=1,⋯,n,r<s is given by
where
where 3F2(a1,a2,a3;b1,b2;x) denotes the hypergeometric function defined by
and (c)ℓ=c(c+1)⋯(c+ℓ−1) is the ascending factorial.
Proof. Using (1.3) in (1.7), the joint PDF of the rth and sth CGOS V[r:n,˜m,k] and V[s:n,˜m,k] is given as
In view of (2.7) and (2.8), we get
where
where w=e−uα1 and Bw(.,.) denotes the incomplete beta function defined by Bw(a1,a2)=∫w0xa1−1(1−x)a2−1dx.
Now, putting the value of I(u1) in (3.12), we get
where z=uα1. We know that Bw(a1,a2)=wa1a12F1(a1,1−a2;a1+1;w) (see Mathai and Saxena, [30]), and
Therefore,
where t=e−z. Now, using (3.15), we get the result of (3.10). □
Corollary 3. At m1=m2=⋯=mn−1=m≠−1 in (3.10), the joint PDF of concomitants V[r:n,m,k] and V[s:n,m,k] of the rth and sth m-GOS for the standard BGW distribution is given by
where γi=k+(n−i)(m+1).
Remark 3. For m=0 and k=1 in (3.16), we obtain the joint PDF of the rth and sth COS from the standard BGW distribution as
where Cr,s:n=n!(r−1)!(s−r−1)!(n−s)!.
Theorem 4. The product moments of two concomitants V[r:n,˜m,k] and V[s:n,˜m,k] are given by
Proof. Using (3.10), the pth and qth moments of V[r:n,˜m,k] and V[s:n,˜m,k] are given as
where zi=vαi,i=1,2. Then, after integration, we get (3.18). □
Corollary 4. Setting m1=m2=⋯=mn−1=m≠−1 in (3.18), we can get the product moments of two concomitants of m-GOS of the standard BGW distribution.
Remark 4. When m=0 and k=1 in (3.18), we get the product moments of COS as
Remark 5. At mi=Ri,n=m0+∑m0i=1Ri, and γi=n−∑i−1ν=1Rν−i+1, 1≤i≤m0 in Theorems 1–4, the results for PT-Ⅱ censored order statistics can be obtained.
Remark 6. From (3.9), the expressions of means and variances of the COS Y[i:n],i=1,⋯,n, arising from the BGW distribution, are obtained as follows:
where Var[V[i:n]]=μ(2)[i:n]−(μ[i:n])2. The expression of the covariances between Y[i:n] and Y[j:n] is given, using (3.9) and (3.20), by
where Cov(V[i:n],V[j:n])=μ[i,j:n]−μ[i:n]μ[j:n],1≤i<j≤n.
The means and variances of the COS of the standard BGW distribution for n=1,⋯,5 and different values of the parameters α and θ are calculated in Tables 1 and 2. It can be noted that the condition ∑nr=1μj[r:n]=nμj1:1,j=1,2 is satisfied (see David and Nagaraja, [10]). In Tables 3–6, we have computed the means and variances of the concomitants of PT-Ⅱ censored order statistics. From Tables 1–6, one can observe that the variances are decreasing with respect to α.
4.
BLU estimator of the parameter β⋆2 of BGW distribution
In this part, we obtain the BLU estimator of β⋆2 involved in the BGW distribution using Stoke's RSS. Assume that n sets of units, each of size n, are taken from the BGW distribution with the PDF given in (2.1). Let X(i:n)i, i=1,⋯,n represent the observation made on the auxiliary variable X in the ith unit of the RSS, and Y[i:n]i represent the measurement performed on the Y variable in the same unit. It is obvious that Y[i:n]i has the same distribution as Y[i:n], the concomitant of the ith order statistics (see David and Nagraja, [10], p. 145). From Remark 6, the mean and the variance of Y[i:n]i are given as E[Y[i:n]i]=β⋆2μ[i:n], and Var[Y[i:n]i]=β⋆22δi,i:n,1≤i≤n. Because the two measurements Y[i:n]i and Y[j:n]j (i≠j) of Y are based on two independent samples, we have Cov[Y[i:n]i,Y[j:n]j]=0.
Let Y[n]=(Y[1:n]1,Y[2:n]2,⋯,Y[n:n]n)′ denote the column vector of COS. Then, the mean vector and the variance-covariance matrix of Y[n] can be written as
and
where μ=(μ[1:n],⋯,μ[n:n])′ and Λ=diag(δ1,1:n,δ2,2:n,⋯,δn,n:n). If the parameters α and θ are known, then the combination of (4.1) and (4.2) allow us to apply the generalized Gauss-Markov theorem (see David and Nagraja, [10], p. 185). Hence, the BLU estimator ^β⋆2 of β⋆2 is given as
where ai=μ[i:n]/δi,i:n∑ni=1μ2[i:n]/δi,i:n, and the variance of ^β⋆2 is given by
We have calculated the coefficients ai of Y[i:n]i,i=1,⋯,n in ^β⋆2 and Var[^β⋆2]/β⋆22 for n=1,⋯,5, and different values of the parameters α and θ are presented in Tables 7 and 8.
A modified RSS approach is presented by Stokes [43], wherein only the largest or smallest judgment ranked unit is selected for quantification. Let n random samples each of size n be drawn from the BGW distribution. From each of the n samples, choose the unit for which the measurement on the auxiliary variable X is the smallest (largest) and measure the Y variable associated with it. Then, we call the collection of observations Y[1:n]1,Y[1:n]2,⋯,Y[1:n]n (Y[n:n]1,Y[n:n]2,⋯,Y[n:n]n) as the lower RSS (LRSS) (upper RSS (URSS)).
Based on LRSS and URSS, the BLU estimators ˜β⋆2,LRSS and ˜β⋆2,URSS of β⋆2 are
and their variances are
The efficiencies e1 of ˜β⋆2,LRSS and e2 of ˜β⋆2,URSS relative to ^β⋆2 are given by
see, for example, Koshti and Kamalja [27] and Philip and Thomas [37]. We have computed the efficiencies e1 and e2 for n=2,⋯,5, α=1,2, and θ=0.50,0.90, which are presented in Table 9. From Table 9, it can be observed that:
● The efficiency e1 is less than one for all selected values of α,θ, and n. So, ^β⋆2 is relatively more efficient than ˜β⋆2,LRSS.
● The efficiency e1 decreases as α increases, and for a fixed pair (n,α), e1 increases as θ increases.
● The efficiency e2 is greater than one for all selected values of α,θ, and n. Thus, ˜β⋆2,URSS is relatively more efficient than ^β⋆2.
● The efficiency e2 increases as α increases, and for a fixed pair (n,α), e2 decreases as θ increases.
5.
Real data application
For illustration purposes, we have considered the American Football League dataset given in Jamalizadeh and Kundu [19]. The bivariate dataset represents the game time to the first points scored by kicking the ball between goal posts (X) and the 'game time' by moving the ball into the end zone (Y). Pathak et al. [34] demonstrated that the BGW distribution fits this data better than other real-life time models. Here, we generate random samples of size five using forty-two pairs of observations. The samples under RSS schemes are displayed in Table 10.
The estimator of β⋆2 under various RSS schemes is a function of α and θ, which are unknown in this case. Thus, the method of moment estimation can be taken (see Kamalja and Koshti [21], for example). To obtain the moment estimators of α and θ, we use the moment equations based on the moments of Y-observations and the moment equation based on the correlation between (X,Y). These give ˆα=3.39821 and ˆθ=0.24259. Table 11 shows the estimates of β⋆2 under the RSS, LRSS, and URSS schemes. The results show that ˜β⋆2,URSS has the smallest variance. This is consistent with the findings of the efficiency performance study in Section 4.
6.
Conclusions
In this paper, we have considered the CGOS from the BGW distribution. We have derived the PDFs and moments of CGOS from the BGW distribution. Similar results for order statistics and PT-Ⅱ censored order statistics are presented as special instances. Finally, we have obtained the BLU estimator of the parameter associated with the study variable based on Stoke's RSS. Moreover, a real dataset is used for illustration purposes. The results for higher joint moments can be used to create skewness or kurtosis matrices (Kollo, [25]), which have important applications in both independent component analysis and invariant coordinate selection. This could be an interesting topic for future research. It will also be interesting to discuss the problem of predicting intervals for future order statistics and record values using concomitants of order statistics and record values arising from BGW distribution; see, for example, Muraleedharan and Chacko [33]. In addition, some information measures, such as the Shannon entropy and extropy, for CGOS can also be investigated.
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
The author would like to acknowledge the Deanship of Graduate Studies and Scientific Research, Taif University for funding this work.
Conflict of interest
The author declares that no conflict of interest exist.