Processing math: 90%
Research article

On a new modeling strategy: The logarithmically-exponential class of distributions

  • Received: 28 January 2021 Accepted: 07 May 2021 Published: 18 May 2021
  • MSC : 60E05, 62E15, 62F30

  • In this paper, a promising modeling strategy for data fitting is derived from a new general class of univariate continuous distributions. This class is governed by an original logarithmically-exponential one-parameter transformation which has the ability to enhance some modeling capabilities of any parental distribution. In relation to the current literature, it appears to be a "limit case" of the well-established truncated generalized Fréchet generated class. In addition, it offers a natural alternative to the famous odd inverse exponential generated class. Some special distributions are presented, with particular interest in a novel heavy-tailed three-parameter distribution based on the Lomax distribution. Functional equivalences, modes analysis, stochastic ordering, functional expansions, moment measures, information measures and reliability measures are derived. From generic or real data, our modeling strategy is based on the new class combined with the maximum likelihood approach. We apply this strategy to the introduced modified Lomax model. The efficiency of the three parameter estimates is validated by a simulation study. Subsequently, two referenced real data sets are adjusted according to the rules of the art; the first one containing environmental data and the second one, financial data. In particular, we show that the proposed model is preferable to four concurrents also derived from the Lomax model, including the odd inverse exponential Lomax model.

    Citation: Abdulhakim A. Al-Babtain, Ibrahim Elbatal, Christophe Chesneau, Mohammed Elgarhy. On a new modeling strategy: The logarithmically-exponential class of distributions[J]. AIMS Mathematics, 2021, 6(7): 7845-7871. doi: 10.3934/math.2021456

    Related Papers:

    [1] Fiaz Ahmad Bhatti, G. G. Hamedani, Mashail M. Al Sobhi, Mustafa Ç. Korkmaz . On the Burr XII-Power Cauchy distribution: Properties and applications. AIMS Mathematics, 2021, 6(7): 7070-7092. doi: 10.3934/math.2021415
    [2] Refah Alotaibi, Hassan Okasha, Hoda Rezk, Abdullah M. Almarashi, Mazen Nassar . On a new flexible Lomax distribution: statistical properties and estimation procedures with applications to engineering and medical data. AIMS Mathematics, 2021, 6(12): 13976-13999. doi: 10.3934/math.2021808
    [3] Qin Gong, Laijun Luo, Haiping Ren . Unit compound Rayleigh model: Statistical characteristics, estimation and application. AIMS Mathematics, 2024, 9(8): 22813-22841. doi: 10.3934/math.20241110
    [4] Abdulaziz S. Alghamdi, Muhammad Ahsan-ul-Haq, Ayesha Babar, Hassan M. Aljohani, Ahmed Z. Afify . The discrete power-Ailamujia distribution: properties, inference, and applications. AIMS Mathematics, 2022, 7(5): 8344-8360. doi: 10.3934/math.2022465
    [5] Ehab M. Almetwally, Ahlam H. Tolba, Dina A. Ramadan . Bayesian and non-Bayesian estimations for a flexible reduced logarithmic-inverse Lomax distribution under progressive hybrid type-Ⅰ censored data with a head and neck cancer application. AIMS Mathematics, 2025, 10(4): 9171-9201. doi: 10.3934/math.2025422
    [6] H. M. Barakat, M. A. Alawady, I. A. Husseiny, M. Nagy, A. H. Mansi, M. O. Mohamed . Bivariate Epanechnikov-exponential distribution: statistical properties, reliability measures, and applications to computer science data. AIMS Mathematics, 2024, 9(11): 32299-32327. doi: 10.3934/math.20241550
    [7] Wael W. Mohammed, Kalpasree Sharma, Partha Jyoti Hazarika, G. G. Hamedani, Mohamed S. Eliwa, Mahmoud El-Morshedy . Zero-inflated discrete Lindley distribution: Statistical and reliability properties, estimation techniques, and goodness-of-fit analysis. AIMS Mathematics, 2025, 10(5): 11382-11410. doi: 10.3934/math.2025518
    [8] Abdulhakim A. Al-Babtain, Rehan A. K. Sherwani, Ahmed Z. Afify, Khaoula Aidi, M. Arslan Nasir, Farrukh Jamal, Abdus Saboor . The extended Burr-R class: properties, applications and modified test for censored data. AIMS Mathematics, 2021, 6(3): 2912-2931. doi: 10.3934/math.2021176
    [9] Mohamed Said Mohamed, Najwan Alsadat, Oluwafemi Samson Balogun . Continuous Tsallis and Renyi extropy with pharmaceutical market application. AIMS Mathematics, 2023, 8(10): 24176-24195. doi: 10.3934/math.20231233
    [10] Aliyu Ismail Ishaq, Abdullahi Ubale Usman, Hana N. Alqifari, Amani Almohaimeed, Hanita Daud, Sani Isah Abba, Ahmad Abubakar Suleiman . A new Log-Lomax distribution, properties, stock price, and heart attack predictions using machine learning techniques. AIMS Mathematics, 2025, 10(5): 12761-12807. doi: 10.3934/math.2025575
  • In this paper, a promising modeling strategy for data fitting is derived from a new general class of univariate continuous distributions. This class is governed by an original logarithmically-exponential one-parameter transformation which has the ability to enhance some modeling capabilities of any parental distribution. In relation to the current literature, it appears to be a "limit case" of the well-established truncated generalized Fréchet generated class. In addition, it offers a natural alternative to the famous odd inverse exponential generated class. Some special distributions are presented, with particular interest in a novel heavy-tailed three-parameter distribution based on the Lomax distribution. Functional equivalences, modes analysis, stochastic ordering, functional expansions, moment measures, information measures and reliability measures are derived. From generic or real data, our modeling strategy is based on the new class combined with the maximum likelihood approach. We apply this strategy to the introduced modified Lomax model. The efficiency of the three parameter estimates is validated by a simulation study. Subsequently, two referenced real data sets are adjusted according to the rules of the art; the first one containing environmental data and the second one, financial data. In particular, we show that the proposed model is preferable to four concurrents also derived from the Lomax model, including the odd inverse exponential Lomax model.



    In many applied fields, such as biology, medicine, insurance, engineering, economics, physics and finance, the modeling of the observed phenomenon from experimental data is crucial. The more appropriate and precise the model, the more we are able to understand the phenomenon in great detail. One area of study is the development of flexible probability distributions, which are frequently based on these models. For this aim, the most standard method is based on a parametric transformation of existing distributions. Such a transformation aims to increase the adaptability of a reference distribution for broader perspectives in terms of data analysis. Modern computer technology has contributed to the applicability of these distributions, allowing the processing of complex functions, accessibility of efficient optimization techniques and achievement of a high degree of numerical precision. The most famous and useful classes of distributions are described in detail in the surveys of [19,23,38]. Post-2020 classes include the transmuted Muth generated class by [5], Box-Cox gamma generated class by [6], Topp-Leone odd Fréchet generated class by [7], exponentiated power generalized Weibull power series generated class by [8], truncated Cauchy power generated class by [9], exponentiated truncated inverse Weibull generated class by [10], transmuted odd Fréchet generated class by [13], Marshall-Olkin exponentiated generalized generated class by [41], exponentiated M generated class by [16], type II power Topp-Leone generated class by [17], modified T-X generated class by [12], modified odd Weibull generated class by [22], type II general inverse exponential generated class by [27], odd generalized gamma generated class by [32], and truncated generalized Fréchet generated (TGF-G) class by [42].

    However, modern classes of distributions based on simple, quite original and motivated transformation of the reference distribution remain rare. In this article, such a candidate is proposed; we introduce the logarithmically-exponential generated (LE-G) class defined by the following cumulative distribution function (cdf):

    F(x;α,ω)=1ln(1eα)ln(1eαG(x;ω)1),xR, (1.1)

    where G(x;ω) denotes the cdf of a reference continuous distribution with ω as parameter vector. One can notice that F(x;α,ω)=Tα[G(x;ω)], where Tα(y) denotes the logarithmically-exponential transformation: Tα(y)=ln(1eαy1)/ln(1eα), with y(0,1). Three major facts behind the LE-G class are formulated below.

    1. We insist on the simplicity and originality of the logarithmically-exponential definition of F(x;α,ω), with only one transformation-parameter, that has never been considered before to the best of our knowledge.

    2. The LE-G class can be viewed as a natural "limit class" of the TGF-G class introduced by [42]; the cdf of TGF-G class tends to Equation (1.1) when β tends to 0.

    3. When α tends to +, the following equivalence result holds:

    F(x;α,ω)eα(G(x;ω)11)=eα(1G(x;ω))/G(x;ω),

    which corresponds to the cdf defining the odd inverse exponential generated (OIE-G) class formerly introduced in [28] with α=1. See, also, [26] and [33], for further proceedings in this direction. A deeper relation between the LE-G and OIE-G classes will be revealed in this study. The respective successes of the TGF-G and OIE-G classes in statistics motivate further investigations into the LE-G class, which is the aim of this study. The first part is devoted to the important functions of the class, and especially, the probability density function (pdf), hazard rate function (hrf) and quantile function (qf). Then, as concrete examples, special members of the LE-G class are exposed. In particular, we intend to show the importance of the new class by focusing on a new heavy right-tailed distribution with three parameters derived to the reference Lomax (Lom) distribution. This choice is motivated by the curvature flexibility of the corresponding pdf and hrf, demonstrating rarely observed shapes for a three-parameter heavy-tailed distribution. Especially, we see decreasing and unimodal shapes for the pdf, and increasing, decreasing and unimodal shapes for the hrf. The general properties of the LE-G class are examined, describing equivalences of the cdf, pdf and hrf at the boundaries, modes analysis, some stochastic ordering structure, functional expansions of the cdf and the exponentiated pdf, diverse moment measures, information measures related to the entropy concept and a crucial reliability measure. The properties of the modified Lom distribution are also specified, and supported by numerical and graphical illustrations. Then the practice of the LE-G class is developed for the purpose of data fitting. The parameter estimation of a special model is executed with the famous maximum likelihood method. In particular, by considering the proposed modified Lom model, a short Monte Carlo simulation is performed to assess the efficiency of the obtained estimates. Next, our modeling strategy is applied to two real heavy-tailed data sets. Well-referenced statistical measures are in favor of the proposed modified Lom model, surpassing four competitors also derived from the Lom model.

    Section 2 refines the presentation of the LE-G class. Its theoretical properties are developed in Section 3. Our statistical approach is described in Section 4. Applications are given in Section 5. We end the paper with concluding remarks in Section 6.

    Here, we refine the presentation of the LE-G class with functions of interest and examples of specific distributions.

    Based on Eq (1.1), the pdf of the LE-G class is obtained upon differentiation (in the almost everywhere sense). It is defined by

    f(x;α,ω)=αln(1eα)g(x;ω)G(x;ω)2eαG(x;ω)11eαG(x;ω)1,xR, (2.1)

    where g(x;ω) denotes the pdf associated to G(x;ω). As usual, if we consider a random variable X with such a pdf, for any a and b with a<b, the distribution of X is characterized by P(aXb)=baf(t;α,ω)dt. The cdf of X at the point x is rediscovered by taking a= and b=x.

    As a crucial reliability function, the corresponding hrf is derived as

    h(x;α,ω)=f(x;α,ω)1F(x;α,ω)=αln(1eα)ln(1eαG(x;ω)1)g(x;ω)G(x;ω)2eαG(x;ω)11eαG(x;ω)1,xR. (2.2)

    The hrf can be increasing or decreasing, non-monotonic, or discontinuous in the broadest sense. In survival analysis, it is informative about the underlying mechanism of failure of a lifetime system. See, for instance, [1].

    The qf of the LE-G class is defined as the inverse function of Eq (1.1). After elementary operations, we arrive at

    Q(u;α,ω)=QG[α{ln[1(1eα)u]}1,ω],u(0,1), (2.3)

    where QG(u;ω) denotes the qf associated to G(x;ω), that is, QG(x;ω)=G1(u;ω) with u(0,1). This function is fundamental in defining various parameters, measures and statistical models of the LE-G class. About this, all the information is available in [25].

    The above three functions are fundamental for further statistical processing of the LE-G class, and their relative simplicity motivates more in this regard.

    In terms of heterogeneous and motley distributions, the LE-G class is rich. Based on useful referenced distributions, Table 1 lists some special distributions of the class which have various domains and numbers of parameters.

    Table 1.  Special distributions of the LE-G class based on some reference distributions.
    LE-G Reference Domain G(x;ω) (α,ω) F(x;α,ω)
    LES Sine (0,1) sin(π2x) (α) 1ln(1eα)ln(1eα{sin[(π/2)x]}1)
    LEU Uniform (0,b) xb (α,b) 1ln(1eα)ln(1eαbx1)
    LETP Topp-Leone (0,1) xb(2x)b (α,b) 1ln(1eα)ln(1eαxb(2x)b)
    LEK Kumaraswamy (0,1) 1(1xa)b (α,a,b) 1ln(1eα)ln(1eα[1(1xa)b]1)
    LEE Exponential (0,+) 1eθx (α,θ) 1ln(1eα)ln(1eα(1eθx)1)
    LEIE Inverse exponential (0,+) eθ/x (α,θ) 1ln(1eα)ln(1eαeθ/x)
    LEL Lindley (0,+) 1θ+1+θxθ+1eθx (α,θ) 1ln(1eα)ln(1eα[1(θ+1+θx)eθx/(θ+1)]1)
    LEW Weibull (0,+) 1eθxλ (α,λ,θ) 1ln(1eα)ln(1eα(1eθxλ)1)
    LELom Lomax (0,+) 1(1+xρ)θ (α,ρ,θ) 1ln(1eα)ln(1eα[1(1+x/ρ)θ]1)
    LEGu Gumbel R exp(ebx) (α,b) 1ln(1eα)ln(1eαexp(ebx))
    LELog Logistic R (1+ebx)1 (α,b) 1ln(1eα)ln(1eα(1+ebx))
    LEC Cauchy R 1πarctan(bx)+12 (α,b) 1ln(1eα)ln(1eα[arctan(bx)/π+1/2]1)

     | Show Table
    DownLoad: CSV

    All of these special distributions have their own mathematical and practical characteristics; they can be studied independently for various purposes. The next part is devoted to the LELom distribution, defined with the Lom distribution as distribution of reference.

    In order to motivate the LELom distribution, a retrospective on the Lom distribution is necessary. First, the cdf and pdf of the Lom distribution are specified by

    G(x;ρ,θ)=1(1+xρ)θ,x>0

    and G(x;ρ,θ)=0 for x0, and

    g(x;ρ,θ)=θρ(1+xρ)θ1,x>0

    and g(x;ρ,θ)=0 for x0, respectively. Thus, the Lom distribution has a decreasing pdf which tends to 0 when x tends to + with a polynomial decay rate. Moreover, it is skewed to the right with a heavy tail. It is known to be one of the simplest distributions of this type in the literature. For this reason, the Lom distribution has served as a model in various contexts of applied sciences dealing with heavy-tailed data. More information on this topic can be found in [2,3,4,14,15,20].

    Despite undeniable qualities, the Lom distribution has some modeling limitations, including a pdf which is "desperately only decreasing", and a hrf which lacks flexibility; it can not be increasing or have N shapes, among others. In order to overpass these limitations, several modifications or extensions of the Lom distribution have been proposed, such as the power lomax (PLom) distribution by [35], odd inverse exponential Lomax (OInLom) distribution by [26] and exponential Lomax (ExLom) distribution by [24], to cite a few. The LELom distribution, briefly presented in Table 1, constitutes a new modification of the Lom distribution. As a preliminary statement, it may be more adapted to some data sets than the existing modifications. First, the Lelom distribution is defined with the cdf given as

    F(x;α,ρ,θ)=1ln(1eα)ln(1eα[1(1+x/ρ)θ]1),x>0,

    and F(x;α,ρ,θ)<0 for x0. From Equation (2.1), standard operations give the following pdf:

    f(x;α,ρ,θ)=αθρln(1eα)(1+x/ρ)θ1[1(1+x/ρ)θ]2eα[1(1+x/ρ)θ]11eα[1(1+x/ρ)θ]1,x>0,

    and f(x;α,ρ,θ)<0 for x0. From Equation (2.2), the hrf of the LELom distribution is obtained as

    h(x;α,ρ,θ)=αθρ[ln(1eα)ln(1eα[1(1+x/ρ)θ]1)](1+x/ρ)θ1[1(1+x/ρ)θ]2×eα[1(1+x/ρ)θ]11eα[1(1+x/ρ)θ]1,x>0,

    and h(x;α,ρ,θ)<0 for x0. In order to communicate on the flexibility of the LELom distribution, various curvatures of f(x;α,ρ,θ) and h(x;α,ρ,θ) are shown in Figure 1.

    Figure 1.  Panel of shapes of the (i) pdf and (ii) hrf of the LELom distribution for several values of the parameters.

    In Figure 1 (i), we see that the pdf of the LELom distribution is unimodal and asymmetric on the right with a possibly heavy tail; the mathematical heavy-tailed nature of the LELom distribution will be discussed later. Also, a short left tail is sometimes observed. This detail is more important than it seems at first glance; it can be of interest for the modeling of certain lifetime phenomena presenting low values before a numerical peak. Two concrete examples of this kind will be the subject of all the attention in the applications part of this study (see Section 5). The corresponding hrf is revealed to be very flexible; in Figure 1 (ii), we see that it possesses increasing, decreasing and unimodal (upside-down) shapes, with more or less plateness. The modeling qualities of LELom distributions are thus proven.

    We end this part by presenting the qf of the LELom distribution. Based on Equation (2.3), it is given as

    Q(u;α,ρ,θ)=ρ{(1α{ln[1(1eα)u]}1)1/θ1},u(0,1). (2.4)

    The analytical expression of this function allows a direct quantile analysis of the LELom distribution, including the expressions of the quantile density function, hazard quantile function, median and the two other quartiles, quantile asymmetry measures and quantile plateness measures. The importance of such a manageable quantile function in survival studies can be found in [25] and [31]. This is also a plus for the LELom distribution.

    This section is devoted to some theoretical properties of the LE-G class, with applications to the LELom distribution at all stages.

    The functional equivalences of the cdf, pdf and hrf of the LE-G class are now discussed, with the cases G(x;ω)0 and G(x;ω)1 distinguished. First, when G(x;ω)0, a direct analysis gives

    F(x;α,ω)1ln(1eα)eαG(x;ω)1,f(x;α,ω)αln(1eα)g(x;ω)G(x;ω)2eαG(x;ω)1.

    This last equivalence function also holds for h(x;α,ω). We see that α affects the decay rate of the exponential term for the pdf and hrf; it plays an important role in this regard.

    For the complementary case G(x;ω)1, by putting SG(x;ω)=1G(x;ω) and using standard equivalence results, we get

    F(x;α,ω)=1+α(eα1)ln(1eα)SG(x;ω),f(x;α,ω)α(eα1)ln(1eα)g(x;ω)

    and h(x;α,ω)h(x;ω), where h(x;ω)=g(x;ω)/SG(x;ω) is the hrf associated to G(x;ω). Here, the role of α is minor; it is only involved in the proportional constants only. However, these constants can be consequent for a fixed x such that G(x;ω)1.

    As an example of application, if we focus on the LELom distribution, for x0, we get

    F(x;α,ρ,θ)1ln(1eα)eα[1(1+x/ρ)θ]10

    and

    f(x;α,ρ,θ)αρθln(1eα)x2eα[1(1+x/ρ)θ]10.

    Thus, in all circumstances, f(x;α,ρ,θ) tends to 0 when x tends to 0, and with an exponential decay rate. Note that, in this case, α[1(1+x/ρ)θαθx/ρ, meaning that all the parameters affect this convergence in the exponential sense. As already stated, the asymptotic behavior of h(x;α,ρ,θ) corresponds to the one of f(x;α,ρ,θ). On the other hand, when x+, by invoking the standard equivalence results, we get

    F(x;α,ρ,θ)=1+αρθ(eα1)ln(1eα)xθ1,

    and

    f(x;α,ρ,θ)αθρθρ(eα1)ln(1eα)xθ10.

    Thus, in all cases, f(x;α,ρ,θ) tends to 0 with a polynomial decay rate when x tends to +, more or less rapidly, depending on θ. Also, by this equivalence and the integral Riemann criteria, we have +0etxf(x;α,ρ,θ)dx=+ for any t>0. Therefore, the LELom distribution is a true heavy-tailed distribution in the mathematical sense. Concerning the hrf, when x+, we have h(x;α,ρ,θ)θx10. Thus, only the parameter θ affects the equivalence function as a proportional constant.

    The mode analysis of a distribution consists in identifying the mode(s) or maximal point(s) of the corresponding pdf. For the LE-G class, a maximal point for f(x;α,ω) can be determined via the the study of df(x;α,ω)/dx, or more simply, the following derivative:

    ddxln[f(x;α,ω)]=dg(x;ω)/dxg(x;ω)2g(x;ω)G(x;ω)+αg(x;ω)G(x;ω)2eαG(x;ω)1eαG(x;ω)11.

    Thus, a maximal point of f(x;α,ω) satisfy the following non-linear equation: dln[f(x;α,ω)/dx=0, corresponding to

    dg(x;ω)/dxg(x;ω)+αg(x;ω)G(x;ω)2eαG(x;ω)1eαG(x;ω)11=2g(x;ω)G(x;ω).

    If only one maximum is obtained, the LE-G is unimodal, if more, it is multimodal, depending on the definition of the reference distribution. If the above equations are too complicated from the analytical point of view, numerical or graphical approaches are more convenient for the mode analysis.

    Concerning the LELom distribution, this aspect has already been commented; it is mainly unimodal and this can be observed in Figure 1, and the mode satisfies the following non-linear equation:

    (θ+1)(1+x/ρ)1+αθ(1+x/ρ)θ1[1(1+x/ρ)θ]2eα[1(1+x/ρ)θ]1eα[1(1+x/ρ)θ]11=2θ(1+x/ρ)θ11(1+x/ρ)θ.

    Obviously, only mathematical software has the ability to evaluate it numerically. Mathematica, Python or R can be used in this regard.

    The following proposition demonstrates an important first order stochastic dominance result between the LE-G and OIE-G classes.

    Proposition 1. The LE-G class first order stochastically dominates the OIE-G class. That is, we have

    F(x;α,ω)F(x;α,ω),

    where F(x;α,ω)=eα(G(x;ω)11).

    Proof. First, let us set q(x)=ln(1x)/x for x(0,1). Then, its derivative with respect to x is dq(x)/dx=[ln(1x)+x/(1x)]/x2, which is negative since ln(1y)y/(1y) for y<1, the equality being acheived for y=0 only. Therefore, q(x) is a decreasing function. Thus, for any (u,v)(0,1)2 with uv, we have q(v)q(u), and, since ln(1v)<0, it is equivalent to

    ln(1u)ln(1v)uv.

    By applying this inequality with u=eαG(x;ω)1 and v=eα satisfying uv with (u,v)(0,1)2, we get

    F(x;α,ω)=ln(1eαG(x;ω)1)ln(1eα)eαG(x;ω)1eα=F(x;α,ω).

    This ends the proof of Proposition 1.

    Proposition 1 proves that the OIE-G class is not only a "limit case" of the LE-G class; a well-identified stochastic dominance exists. Roughly speaking, the LE-G models can achieve certain objectives that the OIE-G models can not, and vice versa. Also, it motivates the LE-G models as alternatives of the OIE-G models.

    In addition, the LE-G class satisfies the following monotone likelihood ratio property.

    Proposition 2. For α1α2, the following ratio function

    κ(x;α1,α2,ω)=f(x;α1,ω)f(x;α2,ω),xR,

    is decreasing with respect to x.

    Proof. First, after some simplifications, we get

    κ(x;α1,α2,ω)=α1ln(1eα2)α2ln(1eα1)eα2G(x;ω)11eα1G(x;ω)11.

    Therefore

    ddxκ(x;α1,α2,ω)=α1ln(1eα2)α2ln(1eα1)g(x;ω)G(x;ω)2α1eα1G(x;ω)1[eα2G(x;ω)11]α2eα2G(x;ω)1[eα1G(x;ω)11](eα1G(x;ω)11)2.

    Now, let us study the positivity of the numerator term. Let us consider the following intermediary function: q(α)=αeαy1/[eαy11] with y=G(x;ω). Then, we have

    ddαq(α)=eαy1y(eαy11)αy(eαy11)2.

    By the general inequality ez1+z for any zR, we get y(eαy11)αyαy1α=0. Therefore, dq(α)/dα0, implying that q(α) is an increasing function with respect to α. Therefore, α1α2 implies that q(α1)q(α2), which is equivalent to

    α1eα1G(x;ω)1[eα2G(x;ω)11]α2eα2G(x;ω)1[eα1G(x;ω)11]0.

    It follows that dκ(x;α1,α2,ω)/dx0, and thus κ(x;α1,α2,ω) is a decreasing function with respect to x. This ends the proof of Proposition 2.

    The demonstrated stochastic ordering properties involve many other properties described in [37]. We leave the interested reader to this reference for more information in this direction.

    The following expansion investigates a linear relation between F(x;α,ω) and basic functions of the reference distribution.

    Proposition 3. The following expansion holds:

    F(x;α,ω)=+k=1,,m=0βk,,mSG(x;ω)m,

    where

    βk,,m=1ln(1eα)(m)(1)+m+11k1!αk, (3.1)

    (m) refers to the general binomial coefficient at m and , and SG(x;ω)=1G(x;ω) defining the survival function associated to G(x;ω).

    Proof. Based on the logarithmic and exponential series expansions, and the generalized binomial theorem, we have

    F(x;α,ω)=1ln(1eα)+k=11keαkG(x;ω)1=1ln(1eα)+k=11k+=01!(αk)G(x;ω)=1ln(1eα)+k=11k+=01!(αk)+m=0(m)(1)mSG(x;ω)m=+k=1,,m=0βk,,mSG(x;ω)m.

    Proposition 3 is proved.

    Proposition 3 applied to the LELom distribution gives the following simple expansion:

    F(x;α,ρ,θ)=+k=1,,m=0βk,,m(1+xρ)θm,x>0.

    The following expansion investigates a linear relation between exponentiated f(x;α,ω) and functions of the reference distribution.

    Proposition 4. Let δ>0. Then, the following expansion holds:

    f(x;α,ω)δ=+k,,m=0τ[δ]k,,mv[δ]m(x;ω),

    where

    τ[δ]k,,m=1[ln(1eα)]δ(δk)(2δm)αδ+1!(1)k++m(k+δ) (3.2)

    and

    v[δ]m(x;ω)=g(x;ω)δSG(x;ω)m.

    Proof. We have

    f(x;α,ω)δ=αδ[ln(1eα)]δg(x;ω)δG(x;ω)2δeαδG(x;ω)1[1eαG(x;ω)1]δ,xR.

    The general binomial theorem is the key of the proof. By applying it three times in a row, it comes

    f(x;α,ω)δ=αδ[ln(1eα)]δg(x;ω)δG(x;ω)2δeαδG(x;ω)1+k=0(δk)(1)keαkG(x;ω)1=αδ[ln(1eα)]δg(x;ω)δG(x;ω)2δ+k=0(δk)(1)keα(k+δ)G(x;ω)1=αδ[ln(1eα)]δg(x;ω)δ+k=0(δk)(1)k+=01![α(k+δ)]G(x;ω)2δ=αδ[ln(1eα)]δg(x;ω)δ+k=0(δk)(1)k+=01![α(k+δ)]+m=0(2δm)(1)mSG(x;ω)m=+k,,m=0τ[δ]k,,mv[δ]m(x;ω).

    Proposition 4 is proved.

    Notice that

    (i) A direct application of Proposition 4 with δ=1 gives the following expansion for the pdf of the LE-G class:

    f(x;α,ω)=+k,,m=0τ[1]k,,mv[1]m(x;ω),

    where v[1]m(x;ω)=g(x;ω)SG(x;ω)m corresponds to the pdf of the minimum of m+1 independent random variables with the reference distribution as common distribution when it is multiplied by the constant m+1. Thus, it is well identified for most of the standard distributions of the literature.

    In particular, in the context of the LELom distribution, we have

    f(x;α,ρ,θ)=+k,,m=0τ[1]k,,mv[1]m(x;ρ,θ),

    where

    v[1]m(x;ρ,θ)=θρ(1+xρ)θ(m+1)1,x>0,

    understanding that v[1]m(x;ρ,θ)=0 for x0. One can notice that v[1]m(x;ρ,θ)=g(x;ρ,θ(m+1))/(m+1), meaning that f(x;α,ρ,θ) is expressed as an infinite linear combination of pdfs of the Lom distribution.

    (ii) Proposition 3 applied to the LELom distribution gives the following simple expansion for f(x;α,ρ,θ)δ:

    f(x;α,ρ,θ)δ=+k,,m=0τ[δ]k,,mv[δ]m(x;ρ,θ),

    where

    v[δ]m(x;ρ,θ)=(θρ)δ(1+xρ)δ(θ+1)θm,x>0,

    understanding that v[δ]m(x;ρ,θ)=0 for x0.

    These expansion results are useful to obtain manageable analytical expressions of various measures of importance, such as different types of moment, entropy and reliability measures. Some of them will be the subject of further development in the next subsections.

    Now, let us introduce a random variable X whose distribution belongs to the LE-G class and E be the standard expectation. Then, if the rth order moment about the origin of X converges in the mathematical sense, it is given by

    μr=E(Xr)=+xrf(x;α,ω)dx=αln(1eα)+xrg(x;ω)G(x;ω)2eαG(x;ω)11eαG(x;ω)1dx.

    The chances of finding a simple expression for this integral are slim. For the evaluation of μr, two standard solutions can be studied: (i) a numerical approach by applying approximation integral procedure and (ii) an expansion of μr involving manageable coefficients. This last approach requires the result in Proposition 4 applied with δ=1; subject to the mathematical validity on the interchanges of the sums and integral signs, we have

    μr=+k,,m=0τ[1]k,,mξ[r]m, (3.3)

    where ξ[r]m=+xrv[1]m(x;ω)dx. Since v[1]m(x;ω) is a well identified function for most of the standard distributions, ξ[r]m is often easily calculable via standard integration techniques or can be extracted in published works. That is, μr can be obtained by summing well-identified coefficients. A correct approximation of μr follows by replacing the infinite bound by any large integer; in most practical situations, the integer 40 gives satisfaction.

    Let us now study the moments about the origin of a random variable following the LELom distribution, starting by establishing their convergence. First, at the neighborhood of x=0, we recall that the following function equivalence holds: f(x;α,ρ,θ){αρ/(θln(1eα)]}x2eα[1(1+x/ρ)θ]1 implying that xrf(x;α,ρ,θ)=o(x1/2) for any r0 and, by the Riemann integrability criterion, the integral of xrf(x;α,ρ,θ) over (0,ϵ) with ϵ>0 converges. At the neighborhood of x=+, we recall that f(x;α,ρ,θ){αθρθ/[ρ(eα1)ln(1eα)]}xθ1 and, by the Riemann integrability criterion, the integral of xrθ1 over (ϵ,+) with ϵ>0 converges if and only if r<θ. Hence, the condition r<θ is necessary to guarantee the convergence of μr; the LELom distribution does not admit moments about the origin of all orders. Thus, under the condition r<θ, numerical approximations of μr have sense. Also, for an evaluation of μr, the complementary expansion approach described in Eq (3.3) gives

    μr=+k,,m=0τ[1]k,,mξ[r]m,

    where, by introducing the beta function defined by B(a,b)=10ta1(1t)b1dt with a>0 and b>0,

    ξ[r]m=+xrv[1]m(x;ρ,θ)dx=θρ+0xr(1+xρ)θ(m+1)1dx=θρr+0u(r+1)1(1+u)(r+1)+θ(m+1)rdu=θρrB(r+1,θ(m+1)r).

    It comes

    μr=+k,,m=0τ[1]k,,mθρrB(r+1,θ(m+1)r).

    As sketched previously, a precise approximation of μr follows by replacing the infinite bound with 40. As an example of numerical work, some values for the mean and variance of X are collected in Table 2. In Table 2, we see some monotonic behavior of the mean and variance of X, being more or less small for the considered values.

    Table 2.  Values of mean and variance of the LELom distribution for diverse values of the parameters.
    θ α=2, ρ=3 θ α=0.5, ρ=1.5
    mean variance mean variance
    5 1.301 1.36 5 0.368 0.197
    5.5 1.152 0.991 5.5 0.328 0.146
    6.5 0.938 0.589 6.5 0.269 0.09
    7 0.858 0.473 7 0.247 0.073
    7.5 0.79 0.388 7.5 0.228 0.061
    8 0.733 0.324 8 0.212 0.051
    8.5 0.683 0.274 8.5 0.198 0.044

     | Show Table
    DownLoad: CSV

    In order to complete the study above, one can mention that the moment generating function of X is specified by

    M(t)=E(etX)=+etxf(x;α,ω)dx=αln(1eα)+etxg(x;ω)G(x;ω)2eαG(x;ω)11eαG(x;ω)1dx,

    with tR such that the integral converges. Subject to the logical validity of the interchanges of the sums and integral signs, the consequence of Proposition 4 with δ=1 is

    M(t)=+k,,m=0τ[1]k,,mυm(t),

    where υm(t)=+etxv[1]m(x;ω)dx. This last term is already available in the litterature for a wide panel of parental distributions. If we deal with the parental Lom distribution as in the LELom distribution, the moment generating function exists at least for t0. It is however quite complex, but two analytical results involving special functions are provided in [29,Section 5].

    Alternatively, if X admits moments of all orders, which is not the case for the LELom distribution, then we can use the following formula:

    M(t)=+k=0tkk!μk.

    The characteristic function can be dettermined through the following relationship: φ(t)=M(it), where i2=1.

    In addition, the incomplete version of the rth moment about the origin of X can be treated in a similar way. From it, one can define various measures of deviations, and diverse types of residual and reversed residual lifetime functions. Further developments on this topic can be found in [23].

    In order to measure the variation or uncertainty of distributions belonging to the LE-G class, several information measures can be used. The most well-known are described in [11]. Here, we focus our attention on the Rényi and Tsallis entropy measures introduced by [36,39], respectively. First, if the Rényi entropy measure of the LE-G class converges in the mathematical sense, it can be defined as

    I[p]R=11pln[+f(x;α,ω)pdx]=p1plnαp1pln[ln(1eα)]+11pln[+g(x;ω)pG(x;ω)2peαpG(x;ω)1[1eαG(x;ω)1]pdx],

    with p>0 and p1. By arguing similarly as for moments about the origin, the integral term can be evaluated numerically by any integral approximation procedure. A more direct approach, which is "exact" in the mathematical sense, consists of giving a simple expansion of I[p]R. Owing to Proposition 4, subject to the mathematical validity on the interchanges of the sums and integral signs, we have

    +f(x;α,ω)pdx=+k,,m=0τ[p]k,,mϕ[p]m, (3.4)

    where ϕ[p]m=+v[p]m(x;ω)dx, which is quite manageable for most of the standard distributions.

    In the setting of the LELom distribution, let us discuss the convergence of I[p]R deduced to the convergence of the integral term. First, at the neighborhood of x=0, we have f(x;α,ρ,θ)p{αρ/(θln(1eα)]}px2peαp[1(1+x/ρ)θ]1 implying that f(x;α,ρ,θ)p=o(x1/2). By the Riemann integrability criterion, the integral of f(x;α,ρ,θ)p over (0,ϵ) with ϵ>0 converges. At the neighborhood of x=+, we have f(x;α,ρ,θ)p{αθρθ/[ρ(eα1)ln(1eα)]}pxp(θ+1) and, by the Riemann integrability criterion, the integral of xp(θ+1) over (ϵ,+) with ϵ>0 converges if and only if p(θ+1)>1. Therefore, we must have p(θ+1)>1 to ensure the convergence of I[p]R. Under this condition, numerical approximations of I[p]R are possible. Also, owing to Equation (3.4), one can write

    +f(x;α,ρ,θ)pdx=+k,,m=0τ[p]k,,mϕ[p]m,

    where

    ϕ[p]m=+v[p]m(x;ρ,θ)dx=(θρ)p+0(1+xρ)p(θ+1)θmdx=θpρ1p1p(θ+1)+θm1.

    Therefore, the following formula is valid:

    I[p]R=11pln[+k,,m=0τ[p]k,,mθpρ1p1p(θ+1)+θm1],

    as well as the following practical approximation:

    I[p]R11pln[40k,,m=0τ[p]k,,mθpρ1p1p(θ+1)+θm1].

    Similarly, the Tsallis entropy measure has an integral of the form +f(x;α,ω)qdx as main term; it can be defined as

    I[q]T=1q1[1+f(x;α,ω)qdx],

    where q>0 and q1. A mathematical treatment similar to the one applied to I[p]R can be performed. In particular, for the LELom distribution, I[q]T converges if and only if q(θ+1)>1. Moreover, the following result holds:

    I[q]T=1q1[1+k,,m=0τ[q]k,,mθqρ1q1q(θ+1)+θm1]1q1[140k,,m=0τ[q]k,,mθqρ1q1q(θ+1)+θm1].

    As an example of numerical work, some values for the above entropy measures are presented in Table 3.

    Table 3.  Values of the Rényi and Tsallis entropy measures of the LELom distribution for diverse values of the parameters.
    α=0.5, θ=2.0, ρ=1.5 α=ρ=p=q=0.5
    p Rényi entropy q Tsallis entropy θ Rényi entropy Tsallis entropy
    0.4 3.448 0.4 11.529 2 2.378 4.568
    0.5 2.378 0.5 4.568 2.5 1.8 2.918
    0.6 1.894 0.6 2.832 3 1.412 2.052
    0.7 1.595 0.7 2.046 3.5 1.123 1.507
    1.2 0.925 1.2 0.845 4 0.894 1.128
    1.5 0.743 1.5 0.62 4.5 0.705 0.846

     | Show Table
    DownLoad: CSV

    From Table 3, we see that the amount of uncertainty of the LELOm distribution is versatile, reaching small or large values. This aspect also reveals the statistical richness and complexity of the LELom distribution.

    The stress-strength model concerns the service life of a component exposed to a certain stress and having strength (or resistance). The stress and the strength are modeled by two random variables, say Y and X, respectively. It is assumed that if the stress exceeds the strength, the component can not live. The reliability of such a random system is expressed by the following measure: R=P(Y<X). Here, we investigate the expression of R in the context of the LE-G family. More precisely, we suppose that X and Y are independent variables, and come from LE-G family in the following scenario; X is with the cdf F(x;α1,ω) and pdf f(x;α1,ω), and Y is with the cdf F(x;α2,ω). Thus, the parameter α can differ from one distribution to the other, but the reference distribution is the same. In this setting, we have

    R=+F(x;α2,ω)f(x;α1,ω)dx=α1ln(1eα2)ln(1eα1)+ln(1eα2G(x;ω)1)g(x;ω)G(x;ω)2eα1G(x;ω)11eα1G(x;ω)1dx=α1ln(1eα2)ln(1eα1)+1ln(1eα2u)eα1u1eα1udu.

    The obtained integral remains manageable; its numerical computation is quite possible. Alternatively, by virtue of Propositions 3 and 4, subject to the mathematical validity on the interchanges of the sums and integral signs, we obtain

    R=+k=1,,m=0+k,,m=0β(α2)k,,mτ[1];(α1)k,,m+SG(x;ω)mv[1]m(x;ω)dx,

    where β(α2)k,,m is defined by Eq (3.1) with α=α2 and $ \tau^{[1]; (\alpha_1)}_{k', \ell', m'} isdefinedbyEq(3.2)with \delta = 1 and \alpha = \alpha_1 $ and

    ψm,m=+SG(x;ω)mv[1]m(x;ω)dx=+g(x;ω)SG(x;ω)m+mdx=10(1u)m+mdu=1m+m+1.

    Therefore, we have

    R=+k=1,,m=0+k,,m=0β(α2)k,,mτ[1];(α1)k,,m1m+m+140k=1,,m=040k,,m=0β(α2)k,,mτ[1];(α1)k,,m1m+m+1.

    Thus, it can be approximated via standard computation techniques. Discussions and applications on the role of R in various settings can be found in [18].

    An important objective of the LE-G class is to produce flexible parametric models for concrete problems dealing with data. Here, we present a common strategy to construct accurate LE-G models for data fitting. Let x1,,xn be n values supposed to be generated from a given LE-G distribution. Then, we can estimate the unknown parameters of the corresponding LE-G model by the maximum likelihood approach. Specifically, by denoting (α,ω) the vector of unknown parameters, its maximum likelihood estimate (MLE) is given as

    (ˆα,ˆω)=argmax(α,ω)L(x1,,xn;α,ω),

    where L(x1,,xn;α,ω) denotes the likelihood function defined by

    L(x1,,xn;α,ω)=ni=1f(xi;α,ω)=ni=1[αln(1eα)g(xi;ω)G(xi;ω)2eαG(xi;ω)11eαG(xi;ω)1].

    The components of (ˆα,ˆω) are called MLEs of those of (α,ω), respectively. Subject to mathematical validity, one can find the MLEs through the maximization of the log-likelihood function defined by

    (x1,,xn;α,ω)=nlnαnln[ln(1eα)]+ni=1ln[g(xi;ω)]2ni=1ln[G(xi;ω)]ni=1ln(eαG(xi;ω)11).

    The desired MLEs are solutions of the following system of equations:

    α(x1,,xn;α,ω)=nαn(eα1)ln(1eα)ni=1eαG(xi;ω)1G(xi;ω)[eαG(xi;ω)11]=0

    and, by denoting ω=(ω1,,ωp), for j=1,,p,

    ωj(x1,,xn;α,ω)=ni=1g(xi;ω)/ωjg(xi;ω)2ni=1G(xi;ω)/ωjG(xi;ω)+αni=1[G(xi;ω)/ωj]eαG(xi;ω)1G(xi;ω)2[eαG(xi;ω)11]=0.

    For the great majority of the referenced distributions, nice expressions of these solutions are impossible. Therefore, the use of a numerical procedure is necessary to obtain an acceptable approximation of (ˆα,ˆω). For the sake of clarity, the existence and uniqueness of MLEs are not proven here, but it is of course an essential aspect of the method. The established theory of the maximum likelihood approach is in effect; the constructions of confidence intervals or likelihood ratio tests for the parameters of LE-G models are entirely possible. We may refer to [21] for the generalities.

    The above theory holds for the LELom model, under the following configuration: ω=(ρ,θ), for x>0,

    G(x;ρ,θ)=1(1+xρ)θ,g(x;ρ,θ)=θρ(1+xρ)θ1,
    ρG(x;ρ,θ)=θρ2x(1+xρ)θ1,ρg(x;ρ,θ)=θρ3(1+xρ)θ2(θxρ),

    and

    θG(x;ρ,θ)=(1+xρ)θln(1+xρ),θg(x;ρ,θ)=(1+xρ)θ1ρ+x[1θln(1+xρ)].

    Then, precise numerical approximations of the MLEs of (α,ρ,θ) can be obtained. We illustrate this claim by conducting a Monte Carlo simulation study with respect to the sample size n. The R software is used (see [34]) and we proceed as follows:

    1. We draw N=5000 samples of size n=100, 200, 300 and 1000 from the LELom distribution under the following sets of parameters: Set1 =(α=0.5,ρ=0.5,θ=0.5), Set2 =(α=1.5,ρ=0.5,θ=0.8), Set3 =(α=1.2,ρ=0.5,θ=0.5) and Set4 =(α=1.2,ρ=0.5,θ=0.8).

    2. We calculate the MLEs of α, ρ and θ for each of the N samples, say ˆαj, ˆρj and ˆθj, respectively, for j=1,,N.

    3. We compute the average MLEs (AMLEs) and mean-squared errors (MSEs) for c=α, ρ and θ through the following formulas:

    AMLE=1NNj=1ˆcj,MSE=1NNj=1(ˆcjc)2.

    4. The obtained numerical results are indicated in Table 4 for Sets 1 and 2, and Table 5 for Sets 3 and 4.

    Table 4.  AMLEs and MSEs of the LELom model parameters for Set1 and Set2.
    n Set1 Set2
    AMLE MSE AMLE MSE
    100 0.7670 0.2798 1.8867 1.0433
    0.4495 0.1524 0.8046 0.7718
    0.5190 0.0059 0.8857 0.0148
    200 0.5958 0.1583 1.8090 0.8734
    0.5419 0.0930 0.7320 0.4211
    0.5104 0.0045 0.8686 0.0083
    300 0.5489 0.1035 1.6867 0.4326
    0.4724 0.0760 0.6046 0.1718
    0.4989 0.0019 0.8357 0.0048
    1000 0.5317 0.0912 1.5399 0.3495
    0.5007 0.0547 0.5926 0.0806
    0.5127 0.0009 0.8236 0.0032

     | Show Table
    DownLoad: CSV
    Table 5.  AMLEs and MSEs of the LELom model parameters for Set3 and Set4.
    n Set3 Set4
    AMLE MSE AMLE MSE
    100 0.7140 0.2910 1.7910 1.5110
    0.7964 0.2267 0.7286 0.2649
    0.4564 0.0054 0.8505 0.0344
    200 0.9254 0.2573 1.2968 0.5961
    0.7204 0.1465 0.6183 0.1950
    0.4654 0.0021 0.8301 0.0102
    300 0.9573 0.2451 1.2183 0.5758
    0.6250 0.1093 0.5931 0.1057
    0.4704 0.0017 0.7965 0.0038
    1000 1.1425 0.1476 1.2010 0.1748
    0.5142 0.0310 0.5788 0.0893
    0.5197 0.0011 0.8091 0.0027

     | Show Table
    DownLoad: CSV

    From Tables 4 and 5, the following comments can be formulated. As the value of n increases, the AMLEs tend to the corresponding true values of the parameters. The MSEs decrease to 0 as the value of n increases. These two observations are consistent with the theoretical first-order properties of the MLEs. Therefore, the LELom model combined with the maximum likelihood estimation method seems to be a good solution for fitting data with a heavy tail, and a possible short left tail, a claim that will be confirmed in the next section.

    Here, the accuracy of the fit of the LELom model is compared to that of some competing models such as the three-parameter power lomax (PLom) model by [35], two-parameter odd inverse exponential Lomax (OInLom) model by [26], three-parameter exponential Lomax (ExLom) model by [24], and standard two-parameter Lomax (Lom) model.

    Based on a given data set, the following fit measures are taken into account: Cramér-von Mises (W), Anderson-Darling (A) and Kolmogorov-Smirnov (KS) statistics with the p-value of the related KS statistical test. In addition, the following criteria based on the maximum likelihood approach are considered: Akaike information criterion (AIC) and Bayesian information criterion (BIC), both defined from the estimated log-likelihood function denoted as ˆ. For such a data set, the model with the smallest values for W, A, KS, AIC, and BIC, and the largest p-value is considered the best model.

    The results are obtained using the R software.

    The first data set was originally reported by [40]. It represents the Maximum Annual Flood Discharges (MAFD) of the North Saskachevan River over a period of 47 years. It contains 47 observations, one per year, and the unit is 1000 cubic feet per second. The data set is given as {19.885, 20.940, 21.820, 23.700, 24.888, 25.460, 25.760, 26.720, 27.500, 28.100, 28.600, 30.200, 30.380, 31.500, 32.600, 32.680, 34.400, 35.347, 35.700, 38.100, 39.020, 39.200, 40.000, 40.400, 40.400, 42.250, 44.020, 44.730, 44.900, 46.300, 50.330, 51.442, 57.220, 58.700, 58.800, 61.200, 61.740, 65.440, 65.597, 66.000, 74.100, 75.800, 84.100,106.600,109.700,121.970,121.970,185.560}.

    The second data set includes the actual monthly tax revenue in Egypt from January 2006 to November 2010. It was originally provided by [30]. The data set is presented as {5.9, 20.4, 14.9, 16.2, 17.2, 7.8, 6.1, 9.2, 10.2, 9.6, 13.3, 8.5, 21.6, 18.5, 5.1, 6.7, 17.0, 8.6, 9.7, 39.2, 35.7, 15.7, 9.7, 10.0, 4.1, 36.0, 8.5, 8.0, 9.2, 26.2, 21.9, 16.7, 21.3, 35.4, 14.3, 8.5, 10.6, 19.1, 20.5, 7.1, 7.7, 18.1, 16.5, 11.9, 7.0, 8.6, 12.5, 10.3, 11.2, 6.1, 8.4, 11.0, 11.6, 11.9, 5.2, 6.8, 8.9, 7.1, 10.8}.

    Let us begin with the analysis of the first data set. The MLEs related to the considered models are indicated in Table 6, along with their related standard errors (SEs).

    Table 6.  Values of the MLEs related to the models with the related SEs (in parentheses) for the first data set.
    Models MLEs and SEs (in parentheses)
    LELom
    (α,ρ,θ)
    30.1240
    (0.0078)
    24.4863
    (0.7501)
    3.8502
    (0.9975)
    PLom
    (α,β,λ)
    1.5941
    (0.5966)
    1.3786
    (0.1462)
    248.2217
    (5.0557)
    ExLom
    (α,β,λ)
    1.7866
    (0.2061)
    2.4702
    (1.1485)
    0.0032
    (0.0009)
    OInLom
    (α,ρ,θ)
    7.8195
    (2.1153)
    127.1995
    (5.1333)
    8.4659
    (0.2254)
    Lom
    (ρ,θ)
    112.9830
    (7.7611)
    2.7932
    (0.8682)
    -
    -

     | Show Table
    DownLoad: CSV

    In order to measure the accuracy of the fits of the models, Table 7 gives the values of ˆ, AIC, BIC, W, A and KS (statistic and p-value) for the considered models.

    Table 7.  Values of ˆ, AIC, BIC, W, A, KS and the KS p-value of the models for the first data set.
    Models ˆ AIC BIC W A KS KS p-value
    LELom 216.5707 439.1414 444.7550 0.0280 0.1990 0.0793 0.9234
    PLom 235.2859 476.5718 482.1854 0.0960 0.6430 0.2979 0.0004
    ExLom 226.5981 459.1963 464.8099 0.28960 1.80388 0.1552 0.1976
    OInLom 216.6007 439.2014 444.8150 0.0537 0.3709 0.0882 0.8484
    Lom 243.1155 490.2310 493.9734 0.1149 0.7643 0.3641 5.907×1006

     | Show Table
    DownLoad: CSV

    Based on the results in Table 7, we see that the LELom model is the best with AIC =439.1414, BIC =444.7550, W =0.0280, A =0.1990, KS =0.0793 and the p-value of the KS test given as p-value =0.9234. We can notice the proximity of this last value to the optimal unit. The second best model is the OInLom model. We illustrate that in Figure 2 by plotting the estimated pdfs and cdfs of the models over the histogram and empirical cdf, respectively.

    Figure 2.  Plots of the (i) estimated pdfs of the models over the histogram and (ii) estimated cdfs of the models over the empirical cdf for the first data set.

    Thanks to its small left tail and peakness properties, the estimated functions of the LELom model have well captured the characteristics of the data, contrary to the other models.

    Figure 3 completes the previous study by providing the probability-probability (P-P) and quantile-quantile (Q-Q) plots of the LELom model.

    Figure 3.  Graphics of the (i) P-P plot and (ii) Q-Q plot of the LELom model for the first data set.

    This figure shows that the black lines in the P-P and Q-Q plots well adjust the corresponding scatter plots, demonstrating the good fit of the LELom model to the data.

    We now use our methodology for the second data set. The MLEs related to the considered models are indicated in Table 8, along with their related SEs.

    Table 8.  Values of the MLEs related to the models with the related SEs (in parentheses) for the second data set.
    Models MLEs and SEs (in parentheses)
    LELom
    (α,ρ,θ)
    27.6902
    (0.0243)
    5.4759
    (1.8775)
    3.4196
    (0.7343)
    PLom
    (α,β,λ)
    1.1901
    (0.4637)
    2.2445
    (0.2773)
    253.0403
    (7.6449)
    ExLom
    (α,β,λ)
    1.8875
    (0.2739)
    1.4853
    (1.3272)
    0.0103
    (0.0108)
    OInLom
    (α,ρ,θ)
    4.2666
    (0.6823)
    142.7981
    (3.8372)
    25.0416
    (1.8329)
    Lom
    (ρ,θ)
    143.6553
    (9.6621)
    11.4152
    (6.2237)
    -
    -

     | Show Table
    DownLoad: CSV

    Table 9 provides the values of AIC, BIC, W, A and KS (statistic and p-value) for the considered models.

    Table 9.  Values of ˆ, AIC, BIC, W, A, KS and the KS p-value of the models for the second data set.
    Models ˆ AIC BIC W A KS KS p-value
    LELom 192.3681 390.7361 396.9687 0.0433 0.2617 0.0823 0.8189
    PLom 195.8858 397.6563 403.8889 0.1145 0.6521 0.1546 0.1192
    ExLom 199.5225 405.0450 411.2776 0.3220 2.0531 0.1418 0.1858
    OInLom 192.5344 391.0688 397.3014 0.0908 0.5380 0.0955 0.6538
    Lom 214.3048 432.5045 436.6596 0.1764 1.0830 0.3175 0.0013

     | Show Table
    DownLoad: CSV

    From Table 9, it is noted that the best model is the LELom model with AIC =390.7361, BIC =396.9687, W =0.0433, A =0.2617, KS =0.0823 and p-value =0.8189. The second best model is the OInLom model. However, it remains far from the performance of the LELom model on the basis of the benchmarks considered.

    The main estimated functions of the models over the corresponding empirical objects are presented in Figure 4.

    Figure 4.  Plots of the (i) estimated pdfs of the models over the histogram and (ii) estimated cdfs of the models over the empirical cdf for the second data set.

    We see that the small left tail and peak properties of the LELom model made the difference with the fits of the other models.

    Figure 5 concludes the previous study by showing the P-P and Q-Q plots of the LELom model.

    Figure 5.  Graphics of the (i) P-P plot and (ii) Q-Q plot of the LELom model for the second data set.

    The black lines in the P-P and Q-Q plots adjust the scatter plots nicely, demonstrating the good fit of the LELom model to the data.

    In view of the above results, we can recommend the LELom model for the analysis of data similar to those considered.

    In this paper, we have motivated a modeling strategy based on a class of distributions defined from an original and simple logarithmically-exponential transformation. A new flexible modification of the Lomax model with three parameters is derived, allowing the fit of data having a heavy right tail and an eventual short left tail. The proposed class is the subject of in-depth work on its probabilistic and statistical properties, exploring the functional equivalences, modes analysis, stochastic ordering, functional expansions, moment measures, information measures and reliability measures. The new Lomax model is then applied to two real-life data sets, one related to the environment and the other to finance. It is shown to provide a near perfect fit, outperforming some existing models also based on the reference Lomax model, and with the same number of parameters. It is understood that only a special distribution of the new class has been applied; others distributions with different features can be used quite efficiently for applications in medicine, economics, insurance, reliability, and life testing, among others.

    The authors declare that the publication of this paper does not involve any conflicts of interest.

    The authors are grateful to the reviewers for their constructive comments.

    This work was supported by King Saud University (KSU). The first author, therefore, gratefully acknowledges the KSU for technical and financial support.

    This project is supported by Researchers Supporting Project number (RSP-2020/156) King Saud University, Riyadh, Saudi Arabia.



    [1] M. V. Aarset, How to identify bathtub hazard rate, IEEE Trans. Reliab., 36 (1987), 106–108.
    [2] M. A. Abdullah, H. A. Abdullah, Estimation of Lomax parameters based on generalized probability weighted moment, J. King Saud Univ. Sci., 22 (2010), 171–184.
    [3] A. Afaq, S. P. Ahmad, A. Ahmed, Bayesian Analysis of shape parameter of Lomax distribution under different loss functions, Int. J. Stat. Math., 2 (2010), 55–65.
    [4] M. Ahsanullah, Record values of Lomax distribution, Stat. Neerl., 41 (1991), 21–29.
    [5] A. A. Al-Babtain, I. Elbatal, C. Chesneau, F. Jamal, The transmuted Muth generated class of distributions with applications, Symmetry, 12 (2020), 1677. doi: 10.3390/sym12101677
    [6] A. A. Al-Babtain, I. Elbatal, C. Chesneau, F. Jamal, Box-Cox gamma-G family of distributions: Theory and applications, Mathematics, 8 (2020), 1801. doi: 10.3390/math8101793
    [7] S. Al-Marzouki, F. Jamal, C. Chesneau, M. Elgarhy, Topp-Leone odd Fréchet generated family of distributions with applications to Covid-19 data sets, Comput. Model Eng. Sci., 125 (2020), 437–458.
    [8] M. A. Aldahlan, F. Jamal, C. Chesneau, I. Elbatal, M. Elgarhy, Exponentiated power generalized Weibull power series family of distributions: Properties, estimation and applications, PloS One, 15 (2020), 1–25.
    [9] M. A. Aldahlan, F. Jamal, C. Chesneau, M. Elgarhy, I. Elbatal, The truncated Cauchy power family of distributions with inference and applications, Entropy, 22 (2020), 1–25.
    [10] A. M. Almarashi, M. Elgarhy, F. Jamal, C. Chesneau, The exponentiated truncated inverse Weibull generated family of distributions with applications, Symmetry, 12 (2020), 1–21.
    [11] J. M. Amigo, S. G. Balogh, S. Hernandez, A brief review of generalized entropies, Entropy, 20 (2018), 813. doi: 10.3390/e20110813
    [12] M. Aslam, Z. Asghar, Z. Hussain, S. F. Shah, A modified T-X family of distributions: classical and Bayesian analysis, J. Taibah Univ. Sci., 14 (2020), 254–264. doi: 10.1080/16583655.2020.1732642
    [13] M. A. Badr, I. Elbatal, F. Jamal, C. Chesneau, M. Elgarhy, The transmuted odd Fréchet-G family of distributions: Theory and applications, Mathematics, 8 (2020), 1–20.
    [14] N. Balakrishnan, M. Ahsanullah, Relations for single and product moments of record values from Lomax distribution, Sankhya B, 56 (1994), 140–146.
    [15] A. Balkema, L. de Haan, Residual life time at great age, Ann. Probab., 2 (1974), 792–804.
    [16] R. A. R. Bantan, C. Chesneau, F. Jamal, M. Elgarhy, On the analysis of new Covid-19 cases in Pakistan using an exponentiated version of the M family of distributions, Mathematics, 8 (2020), 1–20.
    [17] R. A. R. Bantan, F. Jamal, C. Chesneau, M. Elgarhy, Type II Power Topp-Leone generated family of distributions with statistical inference and applications, Symmetry, 12 (2020), 1–24.
    [18] M. A. Beg, On the estimation of P(Y<X) for the two parameter exponential distribution, Metrika, 27 (1980), 29–34. doi: 10.1007/BF01893574
    [19] C. C. R. Brito, L. C. Rêgo, W. R. Oliveira, F. Gomes-Silva, Method for generating distributions and classes of probability distributions: The univariate case. Hacettepe J. Math. Stat., 48 (2019), 897–930.
    [20] M. C. Bryson, Heavy-tailed distributions: Properties and tests, Technometrics, 16 (1974), 61–68. doi: 10.1080/00401706.1974.10489150
    [21] G. Casella, R. L. Berger, Statistical Inference, Brooks/Cole Publishing Company: Bel Air, CA, USA, 1990.
    [22] C. Chesneau, T. El Achi, Modified odd Weibull family of distributions: Properties and applications, J. Indian Soc. Prob. Stat., 21 (2020), 259–286. doi: 10.1007/s41096-020-00075-x
    [23] G. M. Cordeiro, R. B. Silva, A. D. C. Nascimento, Recent Advances in Lifetime and Reliability Models, Bentham Sciences Publishers, Sharjah, UAE, 2020.
    [24] A. El-Bassiouny, N. Abdo, H. Shahen, Exponential Lomax distribution, Int. J. Comput. Appl., 121 (2015), 24–29.
    [25] W. Gilchrist, Statistical Modelling with Quantile Functions, CRC Press, Abingdon, 2000.
    [26] M. A. Haq, M. Elgarhy, The odd Fréchet-G family of probability distributions, J. Stat. Appl. Prob., 7 (2018), 189–203. doi: 10.18576/jsap/070117
    [27] F. Jamal, C. Chesneau, M. Elgarhy, Type II general inverse exponential family of distributions, J. Stat. Manage. Syst., 23 (2020), 617–641.
    [28] D. Kumar, U. Singh, S. K. Singh, Life time distribution: Derived from some minimum guarantee distribution, Sohag J. Math., 4 (2017), 7–11. doi: 10.18576/sjm/040102
    [29] A. J. Lemonte, G. M. Cordeiro, An extended Lomax distribution, Statistics, 47 (2013), 800–816. doi: 10.1080/02331888.2011.568119
    [30] M. Mead, A new generalization of Burr XII distribution, J. Stat.: Adv. Theo. Appl., 12 (2014), 53–71.
    [31] N. U. Nair, P. G. Sankaran, Quantile based reliability analysis, Commun. Stat. Theory Methods, 38 (2009), 222–232. doi: 10.1080/03610920802187430
    [32] M. A. Nasir, M. H. Tahir, C. Chesneau, F. Jamal, M. A. A. Shah, The odd generalized gamma-G family of distributions: Properties, regressions and applications, Statistica, 80 (2020), 3–38.
    [33] S. Nasiru, Extended odd Fréchet-G family of distributions, J. Probab. Stat., 1 (2018), 1–12.
    [34] D. Bates, J. Chambers, P. Dalgaard, S. Falcon, R Development Core Team, R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, ISBN 3-900051-07-0, Available from: http://www.R-project.org, 2005.
    [35] E. A. Rady, W. A. Hassanein, T. A. Elhaddad, The power Lomax distribution with an application to bladder cancer data, SpringerPlus, 5 (2016), 1838. doi: 10.1186/s40064-016-3464-y
    [36] A. Rényi, On measures of entropy and information, Proc. 4th Berkeley symposium on mathematical statistics and probability, 1 (1960), 47–561.
    [37] M. Shaked, J. G. Shanthikumar, Stochastic Orders, Wiley, New York, N.Y., USA, 2007.
    [38] M. H. Tahir, G. M. Cordeiro, Compounding of distributions: A survey and new generalized classes, J. Stat. Distrib. Appl., 3 (2016), 1–35.
    [39] C. Tsallis, Possible generalization of Boltzmann-Gibbs statistics, J. Stat. Phys., 52 (1988), 479–487. doi: 10.1007/BF01016429
    [40] M. A. J. Van Montfort, On testing that the distribution of extremes is of type I when type II is the alternative, J. Hydrol., 11 (1970), 421–427. doi: 10.1016/0022-1694(70)90006-5
    [41] H. M. Yousof, M. Rasekhi, M. Alizadeh, G. G. Hamedani, The Marshall-Olkin exponentiated generalized G family of distributions: properties, applications and characterizations, J. Nonlinear Sci. Appl., 13 (2020), 34–52.
    [42] R. A. ZeinEldin, C. Chesneau, F. Jamal, M. Elgarhy, A. M. Almarashi, S. Al-Marzouki, Truncated generalized Fréchet generated family of distributions with applications, Comput. Model Eng. Sci., 126 (2021), 791–819.
  • This article has been cited by:

    1. H.E. Semary, Zawar Hussain, Walaa A. Hamdi, Maha A. Aldahlan, Ibrahim Elbatal, Vasili B.V. Nagarjuna, Alpha–beta-power family of distributions with applications to exponential distribution, 2024, 100, 11100168, 15, 10.1016/j.aej.2024.05.024
    2. Mohammed Elgarhy, Mohamed Kayid, Ibrahim Elbatal, Mustapha Muhammad, A new extended Fréchet model with different estimation methods and applications, 2024, 10, 24058440, e36348, 10.1016/j.heliyon.2024.e36348
    3. Mohammed Elgarhy, Mohamed Kayid, Arne Johannssen, Mahmoud Elsehetry, Survival analysis based on an enhanced Rayleigh-inverted Weibull model, 2024, 10, 24058440, e35851, 10.1016/j.heliyon.2024.e35851
    4. Mahmoud M. Abdelwahab, Ibrahim Elbatal, Mohammed Elgarhy, Anis Ben Ghorbal, H.E. Semary, Ehab M. Almetwally, A new extension of inverse length biased exponential model with statistical inference under joint censored samples and application, 2024, 99, 11100168, 377, 10.1016/j.aej.2024.04.068
    5. Ahmed W. Shawki, Mohamed Kayid, Oluwafemi Samson Balogun, Tamer S. Helal, Modeling to radiotherapy, environmental and engineering data: Using a new approach to generating family of distributions, 2025, 18, 16878507, 101317, 10.1016/j.jrras.2025.101317
    6. Amir Mushtaq, Ehab M. Almetwally, M.H. Tahir, Safar M. Alghamdi, A new exponentiated exponential distribution with application to real data sets, 2025, 18, 16878507, 101502, 10.1016/j.jrras.2025.101502
    7. Ibrahim Elbatal, Joshin Joseph, Jiju Gillariose, Shibli K, Farrukh Jamal, Anis Ben Ghorbal, Confluent hypergeometric inverse exponential distribution with application to climate and economic data, 2025, 18, 16878507, 101516, 10.1016/j.jrras.2025.101516
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2454) PDF downloads(72) Cited by(7)

Figures and Tables

Figures(5)  /  Tables(9)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog